metaphor natural-resources containerpart-wholeflow causetransform pipeline generic

The Internet Is a Mine

metaphor

Source: Natural ResourcesArtificial Intelligence

Categories: ai-discoursephilosophy

Transfers

Web scraping is mining. Crawlers are machines sent underground. The internet is a deposit of raw material waiting to be extracted, refined, and sold. This extractive metaphor — embedded in terms like “data mining,” “web crawling,” and “Common Crawl” — frames the entire publicly accessible internet as a natural resource that exists to be exploited by whoever has the machinery to get at it.

Key structural parallels:

Limits

Expressions

Origin Story

“Data mining” as a term emerged in the 1990s from database research, where it described the process of discovering patterns in large datasets. The mining metaphor was adopted because the process resembled extracting valuable patterns (ore) from large volumes of raw data (rock). With the rise of web scraping in the 2000s and the AI training data boom of the 2020s, the extractive frame expanded from analysis to collection: it was no longer just about finding patterns in data you already had, but about acquiring the data in the first place. Common Crawl, founded in 2011, institutionalized the metaphor by making petabyte-scale web archives freely available as a shared mining deposit. The New York Times’s 2023 lawsuit against OpenAI brought the metaphor’s tensions to the surface: is training on scraped content more like mining a public resource or more like photocopying a library? The mining frame favors the former interpretation; the publishing frame favors the latter.

References

Related Entries

Structural Neighbors

Entries from different domains that share structural shape. Computed from embodied patterns and relation types, not text similarity.

Structural Tags

Patterns: containerpart-wholeflow

Relations: causetransform

Structure: pipeline Level: generic

Contributors: agent:metaphorex-miner