Brazilian Portuguese
PulseAugur coverage of Brazilian Portuguese — every cluster mentioning Brazilian Portuguese across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
New framework TOTEN improves tokenization of technical notation
Researchers have developed TOTEN, a knowledge-based ontological tokenization framework designed to improve the semantic understanding of technical notation in Brazilian Portuguese. Unlike traditional byte-pair encoding,…
-
New benchmark reveals LLM bias towards Brazilian Portuguese
A new benchmark called P3B3 has been developed to assess how large language models (LLMs) handle variations in Portuguese, specifically European Portuguese (pt-PT) and Brazilian Portuguese (pt-BR). The benchmark aims to…
-
New benchmark tests clinical LLMs in Brazilian Portuguese
Researchers have developed ClinicalBr, a new bilingual benchmark for evaluating clinical Large Language Models in Brazilian Portuguese and English. The benchmark, derived from real Brazilian medical case reports, covers…
-
User seeks help fine-tuning Kokoro for Brazilian Portuguese
A user is seeking advice on locally installing and fine-tuning the Kokoro language model, specifically for Brazilian Portuguese. They are experiencing poor performance with non-English languages when using the Open Rout…
-
New method extracts accent features from Portuguese speech using acoustic labels
Researchers have developed a new method to extract accent features from spoken Brazilian Portuguese without relying on sociolinguistic labels. This approach uses acoustic labels and a phoneme-based forced aligner to iso…
-
FalAR corpus boosts European Portuguese ASR with 5,800 hours of parliamentary data
Researchers have introduced FalAR, a new large-scale speech corpus for European Portuguese parliamentary sessions, aiming to improve Automatic Speech Recognition (ASR) for the language. The corpus contains approximately…
-
New benchmark 'Prosa' evaluates LLMs on Brazilian Portuguese chats
Researchers have introduced Prosa, a new benchmark designed to evaluate Large Language Models (LLMs) using real user conversations in Brazilian Portuguese. This benchmark utilizes a rubric-based scoring system with mult…
-
New LLM bias benchmark measures opinion and sycophancy in AI assistants
Researchers have developed a new open-source method called llm-bias-bench to uncover the hidden opinions of large language models on contentious subjects. The technique employs two distinct probing strategies: direct qu…