ENTITY Moshi

PulseAugur coverage of Moshi — every cluster mentioning Moshi across labs, papers, and developer communities, ranked by signal.

Total · 30d: 7 (7 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 5 (5 over 90d)

SENTIMENT · 30D

2 days with sentiment data

RECENT · 7 TOTAL

TOOL · CL_104760 · Jun 20 · 09:59

New dialogue system integrates real-time facial generation with speech

Researchers have developed Moshi-Face, a full-duplex spoken dialogue system that integrates facial generation with audio processing. The system uses a VQ-VAE to encode facial data into discrete tokens and a F…

RESEARCH · CL_90870 · Jun 12 · 15:01

BayLing-Duplex enables native full-duplex speech dialogue with single LLM

Researchers have developed BayLing-Duplex, a full-duplex speech language model that listens and speaks simultaneously without relying on external turn-taking modules. This single autoregressive LLM can m…

RESEARCH · CL_41844 · May 19 · 18:11

Moshi dialogue models show synchronized internal states and predict turn-taking

Researchers have explored how full-duplex speech dialogue models coordinate their internal representations during interaction. By simulating dialogues between two instances of the Moshi model, they observed strong repre…

TOOL · CL_27309 · May 11 · 20:53

Thinking Machines previews interaction models for real-time AI collaboration

Thinking Machines has introduced a research preview of interaction models designed for native, real-time collaboration. These models process audio, video, and text simultaneously, allowing for continuous thought, respon…

RESEARCH · CL_79773 · May 4 · 00:00

New methods boost full-duplex speech models for better interaction

Researchers have developed new methods to enhance full-duplex speech models, enabling more natural and interactive conversations. One approach focuses on improving interactivity axes like pause handling and turn-taking …

RESEARCH · CL_13577 · May 3 · 07:47

Sakana AI's KAME architecture injects LLM knowledge into speech AI without latency

Sakana AI has developed KAME, a novel tandem architecture for speech-to-speech AI that aims to combine the speed of direct systems with the knowledge depth of LLM-based approaches. KAME operates with two asynchronous co…

RESEARCH · CL_06621 · Apr 28 · 04:00

Josh Talks launches first full-duplex Hindi conversational AI model

Researchers have developed the first open and reproducible full-duplex spoken dialogue system for the Hindi language. This system, named Human-1, adapts the Moshi architecture and was trained on over 26,000 hours of rea…