ENTITY Brazilian Portuguese

Brazilian Portuguese

PulseAugur coverage of Brazilian Portuguese — every cluster mentioning Brazilian Portuguese across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

8 over 90d

Releases · 30d

0 over 90d

Papers · 30d

7 over 90d

TIER MIX · 90D

research 4
tool 3
meme 1

TOPICS

SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL

RESEARCH · CL_99667 · Jun 17 · 22:06

New framework TOTEN improves tokenization of technical notation

Researchers have developed TOTEN, a knowledge-based ontological tokenization framework designed to improve the semantic understanding of technical notation in Brazilian Portuguese. Unlike traditional byte-pair encoding,…
RESEARCH · CL_93392 · Jun 15 · 14:10

New benchmark reveals LLM bias towards Brazilian Portuguese

A new benchmark called P3B3 has been developed to assess how large language models (LLMs) handle variations in Portuguese, specifically European Portuguese (pt-PT) and Brazilian Portuguese (pt-BR). The benchmark aims to…
TOOL · CL_79845 · Jun 9 · 04:00

New benchmark tests clinical LLMs in Brazilian Portuguese

Researchers have developed ClinicalBr, a new bilingual benchmark for evaluating clinical Large Language Models in Brazilian Portuguese and English. The benchmark, derived from real Brazilian medical case reports, covers…
MEME · CL_77437 · Jun 8 · 04:58

User seeks help fine-tuning Kokoro for Brazilian Portuguese

A user is seeking advice on locally installing and fine-tuning the Kokoro language model, specifically for Brazilian Portuguese. They are experiencing poor performance with non-English languages when using the Open Rout…
TOOL · CL_62862 · Jun 1 · 04:00

New method extracts accent features from Portuguese speech using acoustic labels

Researchers have developed a new method to extract accent features from spoken Brazilian Portuguese without relying on sociolinguistic labels. This approach uses acoustic labels and a phoneme-based forced aligner to iso…
RESEARCH · CL_53583 · May 26 · 14:14

FalAR corpus boosts European Portuguese ASR with 5,800 hours of parliamentary data

Researchers have introduced FalAR, a new large-scale speech corpus for European Portuguese parliamentary sessions, aiming to improve Automatic Speech Recognition (ASR) for the language. The corpus contains approximately…
RESEARCH · CL_15870 · May 5 · 04:00

New benchmark 'Prosa' evaluates LLMs on Brazilian Portuguese chats

Researchers have introduced Prosa, a new benchmark designed to evaluate Large Language Models (LLMs) using real user conversations in Brazilian Portuguese. This benchmark utilizes a rubric-based scoring system with mult…
RESEARCH · CL_02961 · Apr 23 · 11:34

New LLM bias benchmark measures opinion and sycophancy in AI assistants

Researchers have developed a new open-source method called llm-bias-bench to uncover the hidden opinions of large language models on contentious subjects. The technique employs two distinct probing strategies: direct qu…

New framework TOTEN improves tokenization of technical notation

New benchmark reveals LLM bias towards Brazilian Portuguese

New benchmark tests clinical LLMs in Brazilian Portuguese

User seeks help fine-tuning Kokoro for Brazilian Portuguese

New method extracts accent features from Portuguese speech using acoustic labels

FalAR corpus boosts European Portuguese ASR with 5,800 hours of parliamentary data

New benchmark 'Prosa' evaluates LLMs on Brazilian Portuguese chats

New LLM bias benchmark measures opinion and sycophancy in AI assistants