Hate
Hate Speech
Use of AI systems to automatically generate or amplify toxic content and hate speech, enabling targeted harassment campaigns at scale.
Sayar Ghosh Roy, Ujwal Narayan, Tathagata Raha, Zubair Abid, Vasudeva Varma
Mitigation Strategy
Implementation of continuously updated toxicity filters, hybrid human-AI moderation, early brigading detection systems, and clear consequences for abuse.
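As a rough illustration of how two of these measures might fit together, the sketch below routes content by a toxicity score (hybrid human-AI moderation) and flags possible brigading when many distinct accounts target the same user within a short window. All names, thresholds, and the window length are hypothetical and chosen only for illustration. The toxicity score is assumed to come from an external classifier, and timestamps are assumed to arrive in non-decreasing order.

```python
from collections import defaultdict, deque

# Hypothetical thresholds, chosen only for illustration.
BLOCK_AT = 0.9             # scores at or above this are blocked automatically
REVIEW_AT = 0.5            # scores in [REVIEW_AT, BLOCK_AT) go to a human
BRIGADE_WINDOW_S = 600     # sliding window length in seconds
BRIGADE_MIN_SENDERS = 5    # distinct senders needed to raise a flag


def route(toxicity_score: float) -> str:
    """Hybrid moderation: block clear cases, send uncertain ones to humans."""
    if toxicity_score >= BLOCK_AT:
        return "block"
    if toxicity_score >= REVIEW_AT:
        return "human_review"
    return "allow"


class BrigadingDetector:
    """Flags a target when many distinct senders hit it within a short window."""

    def __init__(self, window_s=BRIGADE_WINDOW_S, min_senders=BRIGADE_MIN_SENDERS):
        self.window_s = window_s
        self.min_senders = min_senders
        self._events = defaultdict(deque)  # target -> deque of (timestamp, sender)

    def observe(self, target: str, sender: str, timestamp: float) -> bool:
        """Record one message and report whether the target looks brigaded."""
        events = self._events[target]
        events.append((timestamp, sender))
        # Drop events that have fallen out of the sliding window.
        while events and events[0][0] < timestamp - self.window_s:
            events.popleft()
        distinct_senders = {s for _, s in events}
        return len(distinct_senders) >= self.min_senders
```

In a setup like this, a brigading flag would typically escalate the affected thread to human review rather than trigger automatic penalties, since a burst of distinct senders can also be benign.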
Atomic Number
49
Hz
Risk ID
in-49
Severity
7/10
Severity Level