Ethics and Morality Issues
LMs need to pay more attention to universally accepted societal values at the level of ethics and morality, including the judgement of right and wrong, and its relationship with social norms and laws.
ENTITY
2 - AI
INTENT
3 - Other
TIMING
2 - Post-deployment
Risk ID
mit65
Domain lineage
7. AI System Safety, Failures, & Limitations
7.3 > Lack of capability or robustness
Mitigation strategy
1. Implement Continuous Ethical Monitoring and Feedback Loop. Establish a robust, real-time monitoring system to track and log instances of the LM generating content that violates predefined ethical guidelines, laws, or social norms post-deployment. Critically, integrate a confidential, low-latency feedback mechanism (e.g., an internal reporting system or "ethics canary" system) to allow rapid human review and immediate behavioral correction (e.g., targeted fine-tuning or guardrail activation) to address the lack of moral and ethical robustness when exposed to real-world inputs. 2. Embed Ethical Decision-Making Frameworks (EDMF) for Alignment. Architecturally integrate an Ethical Decision-Making Framework into the LM using advanced alignment techniques, such as Constitutional AI or targeted Reinforcement Learning from Human Feedback (RLHF) on ethical dilemmas. The EDMF must explicitly prompt the model to assess its output against formal rules (laws, regulations) and informal rules (societal values and moral judgment of right/wrong) and anticipate consequences before generation, thereby enhancing its inherent capability to navigate complex ethical space. 3. Formalize Stakeholder-Informed Ethical Governance and Review. Institute a formal, periodic governance process that involves diverse external stakeholders and ethics committees to review the LM's current ethical performance, the scope of its embedded societal values, and the adequacy of its regulatory compliance. Use the insights from this review to proactively update and retrain the model's ethical knowledge base and the organizational Code of Ethics to ensure continuous alignment with evolving societal values, ethical principles, and new legal requirements.