Back to the MIT repository
6. Socioeconomic and Environmental1 - Pre-deployment

Unrepresentative risk testing

Testing is unrepresentative when the test inputs are mismatched with the inputs that are expected during deployment.

Source: MIT AI Risk Repositorymit1322

ENTITY

1 - Human

INTENT

2 - Unintentional

TIMING

1 - Pre-deployment

Risk ID

mit1322

Domain lineage

6. Socioeconomic and Environmental

262 mapped risks

6.5 > Governance failure

Mitigation strategy

1. Conduct a comprehensive **Test Data Representativeness Audit** to systematically verify that the statistical properties (e.g., feature distribution, class balance, demographic composition) of the final test set accurately mirror the anticipated inputs and operational context of the deployment environment. This audit is essential for establishing non-biased generalization capability. 2. Enforce a **Strict Data Partitioning and Separation Protocol** by establishing and formally locking an immutable test dataset prior to the commencement of model training and hyperparameter tuning. This methodological separation prevents data leakage and ensures that the performance metrics derived from the test set serve as an unbiased estimate of real-world efficacy. 3. Institute **Continuous Monitoring for Data and Concept Drift** in the production environment. Establish performance indicators that trigger an alert when the characteristics of live input data or the relationship between inputs and outputs diverge significantly from the data used for testing, necessitating the creation of a refreshed, representative test set and subsequent model re-validation.