"some companies are accused of obscuring the trail by paying third-party scrapers to do the work. They do not disclose the datasets" [do not disclose the datasets]
The article reveals that non-disclosure is not accidental but deliberate strategy. By paying third parties to scrape and refusing to disclose training data, companies create a structural barrier to enforcement: creators cannot identify infringement, courts cannot assess damages, and companies can claim ignorance. This is distinct from the legal question of whether training constitutes infringement—it's about making infringement undetectable and uncompensable.