MosaicML has released MPT-7B, an open-source transformer model trained on one trillion tokens of text and code that matches LLaMA-7B's quality and is licensed for commercial use. MosaicML also open-sourced its LLM Foundry codebase used for training and evaluation, alongside three fine-tuned variants of MPT-7B. One of these, MPT-7B-StoryWriter-65k+, is specialized for long-form storytelling: it handles a 65,000-token context window and has demonstrated generations as long as 84,000 tokens, far exceeding the 2,048-token limit of models like GPT-3.
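As a minimal sketch of how one might try the long-context variant: the model IDs and the MPT config's `max_seq_len` field below follow MosaicML's published Hugging Face model cards, while the dtype and generation settings are illustrative assumptions.

```python
import torch
import transformers

# Raising max_seq_len beyond the 65k training length relies on MPT's
# ALiBi attention, which can extrapolate to longer sequences at inference.
config = transformers.AutoConfig.from_pretrained(
    "mosaicml/mpt-7b-storywriter", trust_remote_code=True
)
config.max_seq_len = 83968  # total budget for prompt + generated tokens

model = transformers.AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-7b-storywriter",
    config=config,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,  # MPT ships a custom model class with the checkpoint
)
# MPT-7B models use the EleutherAI/gpt-neox-20b tokenizer.
tokenizer = transformers.AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

inputs = tokenizer("Once upon a time", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Generating at the full 84k budget requires substantial GPU memory; shorter sequences run on a single modern accelerator.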
Summary written by gemini-2.5-flash-lite from 1 source.