Prompt Injection

Attack technique where user inputs are manipulated to bypass security filters, content controls, and model behavioral restrictions (also known as Jailbreaking).

Periodic recordSecurityarXiv2024

Xiaogeng Liu, Zhiyuan Yu, Yizhe Zhang, Ning Zhang, Chaowei Xiao

Mitigation Strategy

Implement robust input validation, explicit separation between system instructions and user data, and apply defensive Prompt Engineering techniques.

Atomic Number

Risk ID

h-01

Severity

8/10

Severity Level

Prompt Injection

Mitigation Strategy

Injection

Prompt Injection

Definition

Mitigation Strategy

Notes / Observations