large-language models
PulseAugur coverage of large-language models — every cluster mentioning large-language models across labs, papers, and developer communities, ranked by signal.
- used by Grpo 90%
- used by Group Relative Policy Optimization 90%
- instance of machine learning 90%
- used by train of thought 90%
- uses Sparse Autoencoders 90%
- instance of mistral:7b 90%
- instance of Language Models 90%
- uses electronic health records 90%
- instance of hallucination 90%
- uses speech recognition 90%
- instance of Qwen 2.5 90%
- authored by Ted Chiang 90%
- 2026-06-09 research_milestone A new framework, RLVR, was introduced to enhance LLMs for long-horizon maritime trajectory and destination forecasting. source
- 2026-05-25 research_milestone A study found that large language models exhibit persistent biases when providing guidance on religious conversions. source
- 2026-05-22 research_milestone A study evaluated LLM performance in psychiatric screening, finding varying accuracy and a tendency to discount symptom evidence in certain contexts. source
- 2026-05-21 research_milestone A new framework was proposed to improve cross-lingual cultural knowledge alignment in LLMs. source
- 2026-05-18 research_milestone A paper was published detailing multilingual jailbreaking vulnerabilities in LLMs using low-resource languages.
- 2026-05-18 research_milestone A study found that LLMs corrupt document content in delegated workflows. source
- 2026-05-18 research_milestone Large language models demonstrated zero-shot goal recognition capabilities in a new study.
- 2026-05-16 research_milestone A new benchmark and dataset are introduced for evaluating LLMs on legal precedent classification.
- 2026-05-15 research_milestone A new paper proposes using LLMs for data augmentation to improve cognitive score prediction from speech. source
- 2026-05-15 research_milestone A study was published on arXiv evaluating LLM reasoning in tax law and proposing neuro-symbolic alternatives. source
- 2026-05-15 research_milestone Development of a new framework for AI value alignment and introduction of the DailyDilemmas test by Cornell University. source
- 2026-05-15 research_milestone Researchers identified an implementation fidelity gap in LLMs, showing they can understand algorithms but struggle to code in unseen languages. source
- 2026-05-13 research_milestone LLMs demonstrated superior accuracy, speed, and cost-effectiveness in transcribing historical handwriting compared to specialized software. source
- 2026-05-13 research_milestone A new method for LLM adaptation using active information seeking was published on arXiv. source
- 2026-05-12 research_milestone A research paper demonstrates that LLMs exhibit bias towards sponsored products, but this can be mitigated with specific user prompts. source
30 day(s) with sentiment data
-
LLMs discover new Nash equilibrium algorithms with formal proof framework
Researchers have developed a framework called LegoNE that integrates large language models with formal proof strategies to discover algorithms for approximate Nash equilibria. This system can automatically certify the w…
-
AI model autocorrects chemical process flowsheets with 80% accuracy
Researchers have developed a new AI method to automatically identify and correct errors in chemical process flowsheets, which are critical diagrams used in engineering. This approach, inspired by large language models u…
-
SCOUT framework boosts LLM performance on non-linguistic tasks
Researchers have developed a new framework called SCOUT to improve the performance of Large Language Models (LLMs) on non-linguistic tasks. SCOUT decouples exploration from exploitation, using lightweight "scouts" to ef…
-
New framework fuses language and physical feedback for agent learning
Researchers have developed QuickLAP, a new Bayesian framework designed to help semi-autonomous agents learn reward functions more effectively by combining language and physical feedback. This approach uses Large Languag…
-
LLMs automate SAT solver optimization, boosting performance by 40%
Researchers have developed AutoModSAT, a new framework that leverages large language models (LLMs) to automatically optimize complex SAT solvers. This approach combines an LLM-compatible modular solver design with unsup…
-
SafeRun framework enables deterministic LLM planning for safety-critical tasks
Researchers have introduced SafeRun, a framework designed to bring deterministic planning capabilities to Large Language Models (LLMs) in safety-critical applications. By separating the LLM's natural language interpreta…
-
Knowledge Graphs and LLMs Predict Gene Knockout Effects
Researchers have developed a novel approach using knowledge graphs and Large Language Models (LLMs) to predict the effects of gene knockout perturbations on transcriptomic gene expression. Their simplest model, a K-near…
-
LLM method improves malicious log detection with explainable reasoning
Researchers have developed a new method called CEF-Log for using Large Language Models to detect malicious web server logs. This approach uses a structured five-step reasoning template to guide the LLM, improving its ab…
-
Researchers find interpretable circuit for entity tracking in LLMs
Researchers have identified a specific circuit within large language models that handles dynamic entity tracking. This mechanism, termed a retrieval conditioned rebinding circuit, is responsible for binding entities to …
-
New RecurGuard system detects LLM reasoning-token consumption attacks
Researchers have developed RecurGuard, a novel runtime monitoring system designed to detect and prevent denial-of-service attacks targeting large language models. These attacks exploit the models' reasoning capabilities…
-
New DOG-DPO framework improves LLM safety alignment with geometric data selection
Researchers have developed DOG-DPO, a new framework for selecting preference data to improve safety alignment in large language models. Unlike previous methods that score pairs individually, DOG-DPO treats preference pa…
-
New metric 'Contribution Weights' offers deeper insight into LLM attention
Researchers have introduced "Contribution Weights," a novel metric for analyzing self-attention transformers in large language models. This new metric goes beyond traditional attention weights by incorporating the geome…
-
Symbolic reasoning frameworks alter LLM strategic behavior in multi-agent settings
Researchers have developed a novel method to influence the behavior of large language models (LLMs) when they act as strategic agents in multi-agent systems. By incorporating symbolic reasoning frameworks, such as I-Chi…
-
New Debate Architecture Reduces LLM Sycophancy
Researchers have developed a new multi-agent architecture called Principled Agent Debate (PAD) to reduce sycophancy in large language models. PAD works by having two models with opposing philosophical dispositions debat…
-
New pruning method boosts LLM 3D spatial reasoning
Researchers have developed CAPruner, a novel method for pruning scene graphs to enhance the 3D spatial reasoning capabilities of large language models. Existing pruning techniques often remove task-relevant information,…
-
New RePO framework enhances LLM training with regret minimization
Researchers have introduced a new framework called Regret-based Preference Optimization (RePO) for training large language models using human feedback. RePO reframes the process from reward maximization to regret minimi…
-
LLMs enhanced with RLVR improve long-horizon maritime forecasting
Researchers have developed a new framework called RLVR to improve long-horizon maritime trajectory and destination forecasting using large language models. This approach converts vessel trajectories into semantic textua…
-
RECENT framework enables small language models to ground embodied agent skills
Researchers have developed RECENT, a framework designed to improve skill grounding for embodied agents using small language models (sLMs). This approach treats skills as executable code, allowing for semantic intent to …
-
PAFO framework tackles bias in personalized LLM reward models
Researchers have introduced PAFO, a new framework designed to address personalized reward bias in large language models. This bias occurs when reward models, trained on diverse user preferences, disproportionately favor…
-
LLM framework AIR boosts e-commerce recommendations with 400x speedup
Researchers have developed a new framework called AIR (Atomic Intent Reasoning) to address the challenges of applying large language models (LLMs) to industrial cross-domain recommendation systems. The framework tackles…