Sc

Scraping

Severity 5/10

Illegal Scraping

Large-scale extraction of data from websites for model training, in disregard of robots.txt, terms of service, and data ownership rights.

Periodic record • Legal • arXiv • 2025

Chung Peng Lee, Rachel Hong, Harry H. Jiang, Aster Plotnik, William Agnew, Jamie Morgenstern

Mitigation Strategy

Develop and adopt AI-specific web consent protocols (e.g., an extended robots.txt), establish legal consequences for unauthorized scraping, and respect opt-out requests.
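
As a rough illustration of what respecting an opt-out could look like on the crawler side, the sketch below checks a robots.txt policy before fetching a page, using Python's standard `urllib.robotparser`. The robots.txt content, the crawler name `ExampleAIBot`, and the `example.org` URLs are all hypothetical, and the card does not specify what an "extended" robots.txt would contain, so this covers only the existing, advisory protocol.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: opts out of a made-up AI crawler entirely,
# and keeps /private/ off limits for every other agent.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been retrieved."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True only if the policy allows this agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(ROBOTS_TXT)

# A compliant crawler would skip any URL for which may_fetch is False.
for agent, url in [
    ("ExampleAIBot", "https://example.org/articles/1"),
    ("GenericBrowser", "https://example.org/articles/1"),
    ("GenericBrowser", "https://example.org/private/data"),
]:
    print(agent, url, may_fetch(parser, agent, url))
```

A check like this only helps when the crawler chooses to run it, which is presumably why the strategy pairs protocol adoption with legal consequences.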

Atomic Number

74

Sc

Risk ID

w-74

Severity

5/10

RiesgosIA.org • Periodic Table of AI Risks