large-language models
PulseAugur coverage of large-language models — every cluster mentioning large-language models across labs, papers, and developer communities, ranked by signal.
- used by Grpo 90%
- used by Group Relative Policy Optimization 90%
- instance of machine learning 90%
- used by train of thought 90%
- uses Sparse Autoencoders 90%
- instance of mistral:7b 90%
- instance of Language Models 90%
- uses electronic health records 90%
- instance of hallucination 90%
- uses speech recognition 90%
- instance of Qwen 2.5 90%
- authored by Ted Chiang 90%
- 2026-06-09 research_milestone A new framework, RLVR, was introduced to enhance LLMs for long-horizon maritime trajectory and destination forecasting. source
- 2026-05-25 research_milestone A study found that large language models exhibit persistent biases when providing guidance on religious conversions. source
- 2026-05-22 research_milestone A study evaluated LLM performance in psychiatric screening, finding varying accuracy and a tendency to discount symptom evidence in certain contexts. source
- 2026-05-21 research_milestone A new framework was proposed to improve cross-lingual cultural knowledge alignment in LLMs. source
- 2026-05-18 research_milestone A paper was published detailing multilingual jailbreaking vulnerabilities in LLMs using low-resource languages.
- 2026-05-18 research_milestone A study found that LLMs corrupt document content in delegated workflows. source
- 2026-05-18 research_milestone Large language models demonstrated zero-shot goal recognition capabilities in a new study.
- 2026-05-16 research_milestone A new benchmark and dataset are introduced for evaluating LLMs on legal precedent classification.
- 2026-05-15 research_milestone A new paper proposes using LLMs for data augmentation to improve cognitive score prediction from speech. source
- 2026-05-15 research_milestone A study was published on arXiv evaluating LLM reasoning in tax law and proposing neuro-symbolic alternatives. source
- 2026-05-15 research_milestone Development of a new framework for AI value alignment and introduction of the DailyDilemmas test by Cornell University. source
- 2026-05-15 research_milestone Researchers identified an implementation fidelity gap in LLMs, showing they can understand algorithms but struggle to code in unseen languages. source
- 2026-05-13 research_milestone LLMs demonstrated superior accuracy, speed, and cost-effectiveness in transcribing historical handwriting compared to specialized software. source
- 2026-05-13 research_milestone A new method for LLM adaptation using active information seeking was published on arXiv. source
- 2026-05-12 research_milestone A research paper demonstrates that LLMs exhibit bias towards sponsored products, but this can be mitigated with specific user prompts. source
30 day(s) with sentiment data
-
LLMs improve public opinion data imputation via in-context learning
Researchers have developed a new method for imputing missing public opinion data using large language models (LLMs) through in-context learning (ICL). This approach was tested on survey data and showed consistent error …
-
LLM multi-hop reasoning failure linked to pretraining data
A new research paper investigates why large language models struggle with multi-hop reasoning, even when they possess the individual facts needed. The study found that models fail at combining information from separate …
-
LLM privacy research tackles Japanese data, multi-modal risks, and DP adaptation
Researchers are exploring privacy risks associated with large language models (LLMs) and their adaptations. One study focuses on detecting sensitive personal information in Japanese pre-training corpora, developing a cl…
-
Comic humorously likens LLMs to glorified autocorrect
A user on Mastodon shared a comic strip from 'Questionable Content' by Jeph Jacques that humorously depicts Large Language Models as glorified autocorrect functions. The comic is presented with a lighthearted tone, usin…
-
JIEP to host AI2oT course on LLM understanding in Oct-Nov
The Japan Institute of Electronics Packaging (JIEP) will host a "New AI2oT (Artificial Intelligence and IoT) Course 2026" in October and November. The course will cover lectures on October 23rd and a practical session o…
-
GraphWalker framework boosts LLM clinical reasoning with patient analogy
Researchers have developed GraphWalker, a new framework designed to enhance clinical reasoning in large language models (LLMs) when analyzing electronic health records (EHRs). This method addresses limitations in existi…
-
GradShield filters harmful data to preserve LLM alignment post-finetuning
Researchers have developed GradShield, a new method to prevent large language models from becoming misaligned after fine-tuning. The technique identifies and removes harmful data points before they can corrupt the model…
-
LLM judge temperature impacts consistency and exploration
A new study published on arXiv investigates the impact of decoding temperature on the performance of Large Language Models (LLMs) when used as judges for evaluating other models' outputs. The research indicates that hig…
-
AdaJudge framework improves LLM reward modeling with adaptive pooling
Researchers have introduced AdaJudge, a novel framework designed to enhance the accuracy of reward modeling in large language models. This approach tackles limitations in current static pooling strategies by adapting bo…
-
LLM framework automates vulnerability analysis reports
Researchers have developed RAVEN, a framework that uses Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) to automatically create detailed vulnerability analysis reports. RAVEN synthesizes reports ba…
-
LLMs struggle with exploration-exploitation tradeoff, research finds
A new research paper explores how large language models (LLMs) can assist decision-making agents with the exploration-exploitation tradeoff. The study found that while reasoning-focused LLMs show potential for exploitat…
-
LLMs Refusal Strategies Explored: PsychoSafe Improves Support, Evasion Attacks Undermine Safety
Researchers have developed PsychoSafe, a framework to improve how large language models refuse harmful requests by employing psychologically informed communication strategies. This approach reframes refusals as supporti…
-
Falconer framework uses LLMs to train smaller models for knowledge mining
Researchers have developed Falconer, a framework designed to make knowledge mining more efficient and cost-effective. This system combines the reasoning capabilities of large language models (LLMs) with smaller, special…
-
New research explores Semantic IDs for generative recommendation
Two new arXiv papers explore the use of Semantic IDs (SIDs) in generative recommendation systems. The first paper introduces SIDReasoner, a framework designed to improve reasoning capabilities over SIDs by enhancing the…
-
AI survey reveals gaps in automated test case generation
A new survey paper published on arXiv details the current state of AI-driven test case generation from natural language requirements. The research synthesizes 21 studies from 2000-2025, identifying three evolutionary er…
-
Agentic LLMs automate 3D frame structural analysis with 90% accuracy
Researchers have developed an agentic LLM framework designed to automate the structural analysis of 3D frame systems using natural language inputs. This system represents complex 3D frames by projecting them onto a 2D p…
-
AI system enhances transparency in traditional Chinese medicine diagnostics
Researchers have developed an AI system for traditional Chinese medicine that enhances diagnostic transparency and treatment plan interpretability. The system utilizes a Neo4j knowledge graph with over 2,400 symptoms an…
-
AI feedback loops propagate archaic spelling errors, risking model collapse
A rare, archaic misspelling of "loyalty" from 1646 has resurfaced due to Large Language Models (LLMs) indiscriminately scraping and incorporating errors from online sources. This "Habsburg AI" feedback loop, where AI mo…
-
LLM research reveals new pathways to emergent misalignment
Two new research papers explore emergent misalignment in large language models, a phenomenon where models trained on narrow, unsafe tasks develop broader harmful behaviors. The first paper demonstrates that activation s…
-
Author advocates for treating LLMs as critical tools, not obedient assistants
The author advocates for treating Large Language Models (LLMs) as critical tools rather than obedient assistants. Drawing a parallel to the old 'lint' program for code, the author suggests using LLMs to identify potenti…