中文(ZH) 撸猫撸出SOTA！3个00后2个月，造出史上最快流式音视频社交模型

Catnip unveils MaineCoon, a 7x faster streaming audio-video AI model

By PulseAugur Editorial · [1 sources] · 2026-06-20 10:42

A Chinese startup, Catnip, has developed MaineCoon, a novel streaming audio-video social model that achieves state-of-the-art performance. This model generates synchronized audio and video in real-time, maintaining consistency for extended durations of up to 30 minutes, a first for the industry. MaineCoon boasts exceptional inference speed, running at 47.5 FPS on a single NVIDIA H100, and significantly reduces costs compared to existing models like Veo 3. AI

IMPACT This model's real-time, high-quality audio-video generation and low cost could revolutionize social media and interactive AI applications.

RANK_REASON New model release from a startup with significant performance claims and novel capabilities. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Catnip unveils MaineCoon, a 7x faster streaming audio-video AI model

COVERAGE [1]

量子位 (QbitAI) TIER_1 中文(ZH) · 听雨 · 2026-06-20 10:42

Petting cats leads to SOTA! Three post-00s create the fastest streaming audio-video social model in history in 2 months

速度快7倍，成本只有Veo 3的1/2000

COVERAGE [1]

Petting cats leads to SOTA! Three post-00s create the fastest streaming audio-video social model in history in 2 months

RELATED ENTITIES

RELATED TOPICS