Consumer smart TVs and other devices are being used to scrape data for AI training through a network of residential proxies. Companies like Bright Data embed SDKs in apps, turning user devices into exit nodes for web-scraping traffic. Because the traffic originates from residential addresses, it bypasses blocks on datacenter IPs, letting scrapers collect vast amounts of internet data for AI models, often under consent terms that are opaque to users.
IMPACT Highlights how AI training data is sourced, with potential implications for data privacy and for the infrastructure that supports AI development.
RANK_REASON This article discusses a method of data collection for AI training, but does not announce a new model, product, or significant industry event.
AI-generated summary · Google Gemini · from 1 source.