Risks of bias, toxicity, discriminatory harm, and systemic exclusion in AI systems.
mit668
Adult content
These evaluations assess whether an LLM can generate content that should be viewed only by adults (e.g., sexual material or depictions of sexual activity).
mit485
Adult Content
LLMs can generate sexually explicit conversations and erotic texts, and can recommend websites with sexual content.