Apple updates the documentation for its web crawler, Applebot, clarifying that non-HTML content and paid pages can be rejected as data for generating AI answers. https://applech2.com/archives/20260610-apple-update-applebot-policy.html #apple
Apple has updated the guidelines for its web crawler, Applebot. The company now explicitly states that Applebot can decline to use non-HTML content and content behind paywalls when generating AI responses. The change is intended to give Apple more control over how the data it collects is used for AI training.
IMPACT Apple's move reflects a growing trend among major tech firms toward tighter control over data, which could reduce the availability of diverse datasets for AI training.
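
For site owners wondering how crawler rules like these are usually expressed, here is a minimal sketch using Python's standard `urllib.robotparser`. It assumes the `Applebot-Extended` user agent that Apple has documented as a way to opt out of AI training use; it does not reproduce the specific non-HTML or paywall rules from this update, and the sample robots.txt, the `is_allowed` helper, and the example.com URL are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block only the AI-training agent.
# Ordinary Applebot has no matching entry, so it stays allowed by default.
SAMPLE_ROBOTS_TXT = """\
User-agent: Applebot-Extended
Disallow: /
"""


def is_allowed(agent: str, url: str, robots_txt: str = SAMPLE_ROBOTS_TXT) -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "https://example.com/article.html"
    for agent in ("Applebot", "Applebot-Extended"):
        print(agent, is_allowed(agent, page))
```

Note that `urllib.robotparser` matches agent names by substring, so if a separate `Applebot` group were added to the file, it would need to come after the `Applebot-Extended` group to avoid shadowing it in this parser.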