Bias
Social Bias
Reproduction and amplification of systematic social prejudices present in training data, manifesting as discrimination based on race, gender, age, or other protected characteristics.
Hui Zhong, Songsheng Chen, Mian Liang
Mitigation Strategy
Curation of datasets with balanced demographic diversity, application of RLHF with diverse evaluators, algorithmic Fairness Audits, and debiasing techniques.
Atomic Number
5
Sb
Risk ID
b-05
Severity
7/10
Severity Level