Punjabi
PulseAugur coverage of Punjabi — every cluster mentioning Punjabi across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New benchmark reveals Vision-Language Models struggle with script consistency
A new benchmark, PuMVR, has been developed to evaluate Vision-Language Models (VLMs) on their ability to handle multiple scripts within a single language. The benchmark, comprising 1,000 parallel image-text instances ac…
-
LLMs struggle with cultural translation in math problems
A new study analyzed how large language models like Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro translate math word problems across various languages and cultures. The research found that while models often agree on the …
-
LLMs show unreliable calibration in multilingual clinical diagnosis, study finds
A new research paper explores the reliability of large language models (LLMs) for multilingual orthopedic diagnosis, particularly in low-resource settings. The study found that while LLMs demonstrate strong linguistic c…