MiniCPM-o 4.5 is a new 9B parameter omni-modal large language model designed for real-time, full-duplex interaction. It can simultaneously process and generate audio, video, and text, enabling proactive behaviors and continuous environmental understanding. The model utilizes the Omni-Flow framework for time-aligned processing and is optimized for efficient inference, allowing it to run on edge devices with less than 12GB of RAM. AI
影响 Enables real-time, full-duplex omni-modal interaction on consumer hardware, lowering the barrier for advanced AI applications.
排序理由 Release of a technical report and open-source model with performance claims and new framework.
- M1 Max
- CosyVoice2
- Gemini 2.5 Flash
- llama.cpp
- M5 Pro
- MiniCPM-o 4.5
- MiniCPM-V
- Omni-Flow
- OpenBMB
- Qwen3-Omni-30B-A3B
- RTX 5070
- THUNLP
- Tsinghua University
AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →