IP information in prompt
Copyrighted information or other intellectual property might be included as a part of the prompt that is sent to the model.
ENTITY
3 - Other
INTENT
2 - Unintentional
TIMING
2 - Post-deployment
Risk ID
mit1296
Domain lineage
2. Privacy & Security
2.1 > Compromise of privacy by leaking or correctly inferring sensitive information
Mitigation strategy
1. Employ Adversarial and Guidance-based Prompt Engineering Strategies, such as Negative Prompting and detailed Task Instruction Prompting, which explicitly instruct the model to steer the generative process away from reproducing identifiable copyrighted or proprietary content. 2. Apply Concept Erasure Techniques to the model weights to localize and eliminate the generative capacity associated with known protected intellectual property concepts, thereby mitigating the risk of inadvertent reproduction. 3. Deploy an Infringement Detection Module within the generation pipeline to identify and flag potentially copyrighted material in the output, coupled with a validation layer that verifies generated content against known IP databases.