PulseAugur
EN
LIVE 12:34:10
ENTITY Moshi

Moshi

PulseAugur coverage of Moshi — every cluster mentioning Moshi across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
7
7 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
5
5 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL
  1. TOOL · CL_104760 ·

    New dialogue system integrates real-time facial generation with speech

    Researchers have developed Moshi-Face, a novel full-duplex spoken dialogue system that integrates facial generation with audio processing. This system utilizes a VQ-VAE to encode facial data into discrete tokens and a F…

  2. RESEARCH · CL_90870 ·

    BayLing-Duplex enables native full-duplex speech dialogue with single LLM

    Researchers have developed BayLing-Duplex, a novel full-duplex speech language model that enables simultaneous listening and speaking without relying on external turn-taking modules. This single autoregressive LLM can m…

  3. RESEARCH · CL_41844 ·

    Moshi dialogue models show synchronized internal states and predict turn-taking

    Researchers have explored how full-duplex speech dialogue models coordinate their internal representations during interaction. By simulating dialogues between two instances of the Moshi model, they observed strong repre…

  4. TOOL · CL_27309 ·

    Thinking Machines previews interaction models for real-time AI collaboration

    Thinking Machines has introduced a research preview of interaction models designed for native, real-time collaboration. These models process audio, video, and text simultaneously, allowing for continuous thought, respon…

  5. RESEARCH · CL_79773 ·

    New methods boost full-duplex speech models for better interaction

    Researchers have developed new methods to enhance full-duplex speech models, enabling more natural and interactive conversations. One approach focuses on improving interactivity axes like pause handling and turn-taking …

  6. RESEARCH · CL_13577 ·

    Sakana AI's KAME architecture injects LLM knowledge into speech AI without latency

    Sakana AI has developed KAME, a novel tandem architecture for speech-to-speech AI that aims to combine the speed of direct systems with the knowledge depth of LLM-based approaches. KAME operates with two asynchronous co…

  7. RESEARCH · CL_06621 ·

    Josh Talks launches first full-duplex Hindi conversational AI model

    Researchers have developed the first open and reproducible full-duplex spoken dialogue system for the Hindi language. This system, named Human-1, adapts the Moshi architecture and was trained on over 26,000 hours of rea…