PulseAugur
EN
LIVE 13:34:33

Microsoft MAI models trained on unlicensed web data

Microsoft has reportedly trained its MAI models using unlicensed web data, contradicting its public claims of using only "enterprise grade, clean and commercially licensed data." The company's approach mirrors that of other AI labs, relying on fair use principles and placing the onus on website owners to opt-out of data collection. AI

IMPACT Raises questions about data sourcing and licensing practices in AI model training.

RANK_REASON Article discusses company practices and claims, not a new release or event.

Read on The Decoder →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Microsoft MAI models trained on unlicensed web data

COVERAGE [1]

  1. The Decoder TIER_1 English(EN) · Matthias Bastian ·

    Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data"

    <p><img alt="" class="attachment-full size-full wp-post-image" height="768" src="https://the-decoder.com/wp-content/uploads/2026/06/microsoft_logo_plain.png" style="height: auto; margin-bottom: 10px;" width="1376" /></p> <p> Microsoft sells its LLM training approach as different …