Hindi

PulseAugur coverage of Hindi — every cluster mentioning Hindi across labs, papers, and developer communities, ranked by signal.

Total · 30d: 26 (26 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 22 (22 over 90d)

TIER MIX · 90D

research 15
tool 10
commentary 1

SENTIMENT · 30D

11 days with sentiment data

RECENT · PAGE 1/2 · 26 TOTAL

TOOL · CL_111729 · Jun 26 · 04:00

New neural diarization model excels on low-resource Nepali-Hindi speech

Researchers have developed a new approach to speaker diarization, the process of identifying who spoke when in an audio recording, specifically for low-resource languages like Nepali-Hindi. They trained two neural netwo…
RESEARCH · CL_111587 · Jun 25 · 11:34

New technique SamaVaani audits and debiases clinical ASR for Indian languages

A new research paper introduces SamaVaani, a method for auditing and debiasing multilingual clinical Automatic Speech Recognition (ASR) systems for Indian languages. The study evaluated eight state-of-the-art ASR models…
RESEARCH · CL_107785 · Jun 23 · 17:10

New Marathi POS Tagging Dataset and BERT Models Released

Researchers have introduced L3Cube-MahaPOS, a new dataset for Marathi Part-of-Speech (POS) tagging, addressing the scarcity of annotated resources for the language. The dataset contains over 32,000 manually annotated se…
TOOL · CL_104068 · Jun 22 · 17:31

Amazon pilots Hindi Alexa+ in India with beta testing program

Amazon is piloting a Hindi-language version of its advanced AI assistant, Alexa+, in India. The company is inviting select customers to test the feature through a beta program, with feedback aimed at refining its capabi…
RESEARCH · CL_99664 · Jun 17 · 22:36

AI models struggle with multilingual mental health data generation

A new research paper explores the limitations of using persona-based localization to create multilingual mental health datasets. The study found that simply adjusting nationality and language parameters in synthetic per…
COMMENTARY · CL_94021 · Jun 16 · 04:58

AI Infrastructure Gap: Storage, Not Just GPUs, Dictates Performance

The AI industry is facing a significant infrastructure gap where organizations are investing heavily in GPUs but neglecting the underlying data storage and networking architecture. This imbalance leads to underutilized …
RESEARCH · CL_95877 · Jun 16 · 01:04

New N-VSSM Model Outperforms Claude Opus 4.5 in Long-Form Narrative Consistency

Researchers have developed NarrativeWorldBench, a new benchmark designed to evaluate large language models (LLMs) on their ability to maintain narrative consistency in long-form audio dramas. Current frontier LLMs strug…
RESEARCH · CL_82089 · Jun 9 · 15:50

LLMs struggle with cultural translation in math problems

A new study analyzed how large language models like Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro translate math word problems across various languages and cultures. The research found that while models often agree on the …
TOOL · CL_76556 · Jun 7 · 20:32

Hindi speakers gain voice interface access via speech-to-UI tech

A developer aimed to bridge the gap for Hindi speakers in voice interfaces, noting that most current systems overlook this significant population. To address this, they developed a method to translate Hindi speech direc…
RESEARCH · CL_79140 · Jun 6 · 17:37

New dataset AgriGov boosts AI for Indian farmers

Researchers have developed AgriGov, a new multilingual dataset aimed at improving AI tools for Indian farmers. The dataset focuses on government schemes and welfare policies, initially covering 50 schemes across English…
RESEARCH · CL_72535 · Jun 4 · 11:32

Researchers enable English-to-Prakrit translation via multilingual model adaptation

Researchers have developed a method for English-to-Prakrit machine translation, a low-resource language pair not supported by existing models like IndicTrans2. By mapping Prakrit to the Hindi language tag within a multi…
TOOL · CL_65545 · Jun 2 · 04:00

LLMs show geographic bias in medical triage recommendations

A new study using Gemini 3.5 Flash found that large language models provide different medical triage recommendations based on the language of the patient's prompt, even when symptoms are identical. The model recommended…
TOOL · CL_56360 · May 28 · 04:00

New Benchmark Evaluates Multilingual VLMs on Bengali Culture and Dialects

Researchers have developed BanglaVerse, a new benchmark designed to evaluate the cultural understanding of multilingual vision-language models (VLMs) within the context of Bengali culture. This benchmark, comprising 1,1…
TOOL · CL_56160 · May 28 · 04:00

New AI translation methods struggle with gender preservation in Hindi

A new research paper explores the challenge of maintaining gender information in English-to-Hindi machine translation. The study found that current generative translation systems frequently erase explicit gender cues, p…
RESEARCH · CL_56328 · May 27 · 13:04

New ASR Error Analysis Tool Breaks Script Barriers

Researchers have developed a new automated alignment mechanism designed to improve the analysis of Automatic Speech Recognition (ASR) errors, particularly for languages that do not use the Latin script. This method is l…
RESEARCH · CL_51292 · May 23 · 12:44

Hubness hinders multilingual AI retrieval; Amharic needs in-language tuning

Research indicates that cross-lingual retrieval in multilingual embedding models is hindered by "hubness," a geometric pathology in embedding spaces, rather than anisotropy. Studies using models like Gemini, Mistral, an…
RESEARCH · CL_41788 · May 20 · 05:09

SCRIBE framework improves ASR for Indic languages with new error analysis

Researchers have introduced SCRIBE, a new diagnostic framework designed to improve automatic speech recognition (ASR) for Indic languages. Unlike traditional metrics such as Word Error Rate (WER), SCRIBE categorizes errors…
RESEARCH · CL_30789 · May 13 · 06:55

New benchmark tackles ASR bias in Indic languages

Researchers have developed Vividh-ASR, a new benchmark designed to evaluate automatic speech recognition (ASR) models for Indic languages, specifically Hindi and Malayalam. This benchmark categorizes audio into four tie…
TOOL · CL_29391 · May 12 · 15:11

LLMs improve multilingual speech correction by tuning for fluency

Researchers have developed a new method for correcting disfluencies in multilingual speech transcripts using large language models (LLMs). The pipeline first identifies disfluent tokens and then uses these signals to fi…
RESEARCH · CL_15889 · May 4 · 06:20

LLMs show unreliable calibration in multilingual clinical diagnosis, study finds

A new research paper explores the reliability of large language models (LLMs) for multilingual orthopedic diagnosis, particularly in low-resource settings. The study found that while LLMs demonstrate strong linguistic c…