large language model
PulseAugur coverage of "large language model": every cluster mentioning large language models across labs, papers, and developer communities, ranked by signal.
-
Pion optimizer preserves spectrum for stable LLM training
Researchers have introduced Pion, a novel spectrum-preserving optimizer designed for training large language models. Unlike traditional additive optimizers such as Adam, Pion uses orthogonal transformations to update w…
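The update rule itself sits behind the truncation above; purely as orientation on why multiplicative orthogonal updates are spectrum-preserving, the sketch below (step size and names are illustrative, not Pion's actual algorithm) builds an orthogonal update from a gradient via the Cayley transform, which leaves the weight matrix's singular values exactly unchanged:

```python
import numpy as np

def cayley_orthogonal(G, lr=1e-2):
    """Turn a gradient G into an orthogonal update via the Cayley transform:
    for skew-symmetric A, Q = (I + A)^{-1} (I - A) is exactly orthogonal."""
    A = 0.5 * lr * (G - G.T)                 # skew-symmetric generator
    I = np.eye(A.shape[0])
    return np.linalg.solve(I + A, I - A)

rng = np.random.default_rng(0)
W = rng.standard_normal((8, 8))              # toy weight matrix
G = rng.standard_normal((8, 8))              # stand-in loss gradient

W_new = cayley_orthogonal(G) @ W             # multiplicative, not additive

# Left-multiplying by an orthogonal matrix leaves singular values intact.
print(np.allclose(np.linalg.svd(W, compute_uv=False),
                  np.linalg.svd(W_new, compute_uv=False)))   # True
```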
-
Local LLM Setup Guide: Ollama and LM Studio for Private AI
This guide details how to set up a private, local Large Language Model (LLM) using Ollama and LM Studio. It provides setup instructions updated for 2026, emphasizing privacy and local control over AI models.
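For readers following along: Ollama serves a local HTTP API on port 11434 by default, so a minimal Python client needs only the standard library. The model name below is an example and must be pulled beforehand (e.g. `ollama pull llama3`):

```python
import json
import urllib.request

# Ollama's default local endpoint; requests never leave the machine.
OLLAMA_URL = "http://localhost:11434/api/generate"

def ask_local_llm(prompt: str, model: str = "llama3") -> str:
    payload = json.dumps({
        "model": model,        # must be pulled first with `ollama pull`
        "prompt": prompt,
        "stream": False,       # return one JSON object instead of a stream
    }).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

print(ask_local_llm("In one sentence, why does local inference help privacy?"))
```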
-
New LLM unlearning method targets minor components for better security
Researchers have identified a key vulnerability in current large language model (LLM) unlearning techniques, where models can quickly recover forgotten information through relearning attacks. This fragility stems from e…
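The summary cuts off before defining "minor components"; one plausible reading is the small-singular-value directions of a weight matrix. A toy illustration of editing only that part of the spectrum, which is not the paper's method, would be:

```python
import numpy as np

def dampen_minor_components(W: np.ndarray, k: int, scale: float = 0.0):
    """Rescale the k smallest singular directions of W, leaving the
    dominant directions (which carry most general capability) untouched."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)  # s sorted descending
    s[-k:] *= scale                                   # edit only the tail
    return U @ np.diag(s) @ Vt

rng = np.random.default_rng(1)
W = rng.standard_normal((16, 16))
W_edited = dampen_minor_components(W, k=4)
print(np.linalg.svd(W_edited, compute_uv=False)[-4:])  # zeroed tail
```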
-
AI agents exhibit "Bystander Effect," sacrificing reasoning for conformity
Researchers have identified a "Bystander Effect" in multi-agent systems where collaboration can lead to reduced reasoning quality, a phenomenon termed "cognitive loafing." Through analysis of 22,500 trajectories across …
-
Autonomous agent automates system identification using LLMs
Researchers have developed ASIA, an Autonomous System Identification Agent that uses a large language model to automate the process of system identification. This agent can autonomously select model classes, training al…
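The excerpt doesn't show ASIA's control loop; a skeletal stand-in, with the LLM replaced by a placeholder heuristic and a toy ARX fit (the real agent also chooses model classes and training algorithms, per the summary), could look like:

```python
import numpy as np

rng = np.random.default_rng(2)

def fit_arx(u, y, order):
    """Least-squares fit of a toy ARX model: y[t] from past outputs/inputs.
    Returns the mean squared residual as a fit diagnostic."""
    rows = [np.concatenate([y[t - order:t], u[t - order:t]])
            for t in range(order, len(y))]
    X, target = np.asarray(rows), y[order:]
    theta, *_ = np.linalg.lstsq(X, target, rcond=None)
    return float(np.mean((X @ theta - target) ** 2))

def llm_propose_order(history):
    """Stand-in for the LLM: in ASIA this decision would come from a
    language model reading the diagnostics; here, a sweep heuristic."""
    tried = {order for order, _ in history}
    return min(set(range(1, 6)) - tried, default=None)

# Synthetic data from a 2nd-order system the agent must identify.
u = rng.standard_normal(300)
y = np.zeros(300)
for t in range(2, 300):
    y[t] = 0.6 * y[t-1] - 0.2 * y[t-2] + 0.5 * u[t-1] \
           + 0.05 * rng.standard_normal()

history = []
while (order := llm_propose_order(history)) is not None:
    history.append((order, fit_arx(u, y, order)))

print(min(history, key=lambda pair: pair[1]))  # best (order, mse)
```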
-
LLMs enable novel data compression by recreating content from prompts
A novel approach to data sharing uses a local, deterministic Large Language Model (LLM) as an extreme form of compression. By sending only a textual prompt to another party running the same LLM, it's poss…
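Under the summary's assumptions (byte-identical models on both ends and fully deterministic decoding), the protocol reduces to something like the sketch below. The Ollama-style endpoint and its seed/temperature options are one way to pin down sampling, not necessarily the author's setup:

```python
import json
import urllib.request

def generate(prompt: str, model: str = "llama3") -> str:
    """Deterministic generation: temperature 0 and a fixed seed, so the
    same model on both machines reproduces the same output bytes."""
    payload = json.dumps({
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": {"temperature": 0, "seed": 42},
    }).encode("utf-8")
    req = urllib.request.Request(
        "http://localhost:11434/api/generate", data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

def compress(document: str, candidate_prompt: str):
    """Sender side: ship the short prompt only if it provably
    regenerates the document verbatim on the shared model."""
    return candidate_prompt if generate(candidate_prompt) == document else None

# Receiver side is simply: document = generate(received_prompt)
```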
-
New method measures gap between AI user simulators and real behavior
Researchers have developed a new method to quantify the differences between simulated and real user behaviors in AI assistants. This technique analyzes conversational data to measure how well user simulators replicate t…
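The paper's metric is truncated above; one generic way to quantify such a simulator-reality gap is a divergence between feature distributions of the two conversation corpora, e.g. Jensen-Shannon divergence over dialogue acts (illustrative only, not the paper's measure):

```python
import numpy as np
from collections import Counter

def js_divergence(p: np.ndarray, q: np.ndarray) -> float:
    """Jensen-Shannon divergence between two discrete distributions (bits)."""
    m = (p + q) / 2
    def kl(a, b):
        mask = a > 0
        return float(np.sum(a[mask] * np.log2(a[mask] / b[mask])))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def action_distribution(dialogues, vocab):
    counts = Counter(act for d in dialogues for act in d)
    total = sum(counts.values())
    return np.array([counts[a] / total for a in vocab])

# Toy dialogue-act sequences; real work would use richer features.
real = [["ask", "clarify", "thank"], ["ask", "ask", "complain"]]
sim  = [["ask", "thank"], ["ask", "clarify", "thank"]]
vocab = ["ask", "clarify", "thank", "complain"]

gap = js_divergence(action_distribution(real, vocab),
                    action_distribution(sim, vocab))
print(f"simulator-reality gap (JSD): {gap:.3f}")
```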
-
PA-Bridge framework enhances LLM conversation starters with active user expression modeling
Researchers have developed a new framework called PA-Bridge to improve conversation starter recommendations in Large Language Model (LLM)-driven conversational search. This approach addresses the limitations of traditio…
-
Ten Python Libraries Streamline Large Language Model Application Development
This cluster contains two identical Mastodon posts linking to a KDnuggets article. The article lists ten Python libraries useful for building applications that use Large Language Models.
-
LLMs steer text embedding projections for intent-driven analysis
Researchers have developed a new method called LLM-augmented semantic steering to improve the visualization of text embeddings. This technique allows analysts to guide the spatial organization of projected text data bas…
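As a rough sketch of the idea (not the paper's algorithm): an analyst-supplied semantic axis, here derived from a stubbed embedding call, serves as one projection dimension while the remaining variance fills the other:

```python
import numpy as np

def steered_projection(X: np.ndarray, axis: np.ndarray) -> np.ndarray:
    """Project embeddings onto an analyst-chosen semantic axis (x) and
    the top principal component of the residual (y)."""
    axis = axis / np.linalg.norm(axis)
    x = X @ axis
    residual = X - np.outer(x, axis)           # remove the steered direction
    residual -= residual.mean(axis=0)
    _, _, Vt = np.linalg.svd(residual, full_matrices=False)
    return np.column_stack([x, residual @ Vt[0]])

# `embed` is a stub standing in for a real text-embedding call; the axis
# runs between two intent descriptions an LLM might supply.
rng = np.random.default_rng(3)
embed = lambda text: rng.standard_normal(64)   # illustration only

axis = embed("customer complaint") - embed("customer praise")
X = np.stack([embed(f"doc {i}") for i in range(100)])
print(steered_projection(X, axis).shape)       # (100, 2)
```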
-
New benchmark evaluates LLMs on Indian financial regulations
Researchers have introduced IndiaFinBench, a new benchmark designed to evaluate how well large language models perform on Indian financial regulatory texts. This benchmark addresses a gap in existing resources, which pr…
-
Stanford offers LLM reasoning lesson with LinkedIn summary
Stanford University has released a lecture on Large Language Model (LLM) reasoning. The lecture, shared via a LinkedIn post, offers insights into the capabilities and complexities of LLM reasoning. Further details and r…
-
High-speed vision boosts zero-shot action understanding, research shows
Researchers have explored how temporal resolution impacts zero-shot semantic understanding of human actions, particularly for rapid movements. Their study, using kendo as a test case, found that higher frame rates signi…
-
LLM evaluation frameworks may mislead without prompt optimization
A new paper from Nicholas Sadjoli argues that current Large Language Model (LLM) evaluation frameworks are misleading because they use static prompts for all models. The research demonstrates that prompt optimization (P…
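The mechanics are easy to sketch: give each model a small prompt search on a dev split before scoring, instead of one static template for all models. The toy harness below (all names illustrative) shows how prompt choice alone can flip a comparison:

```python
def evaluate(model, template, dataset):
    """Score a model on (question, answer) pairs under one prompt template;
    `model` is any callable str -> str, scoring is exact match."""
    hits = sum(model(template.format(q=q)).strip() == a for q, a in dataset)
    return hits / len(dataset)

def evaluate_with_prompt_search(model, templates, dev, test):
    """Select the best template on a dev split, then score on test, so each
    model competes under its own optimized prompt rather than a static one."""
    best = max(templates, key=lambda t: evaluate(model, t, dev))
    return evaluate(model, best, test), best

# Stub model that only answers when asked politely, to show how prompt
# choice alone can reorder a leaderboard.
model = lambda p: "4" if p.startswith("Please") else "?"
templates = ["Q: {q} A:", "Please answer: {q}"]
dev = test = [("2+2", "4")]

print(evaluate(model, templates[0], test))                       # 0.0, static
print(evaluate_with_prompt_search(model, templates, dev, test))  # (1.0, ...)
```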
-
New MILD algorithm tackles expert imbalance in LLM routing tasks
Researchers have developed a new approach called MILD (Margin-based Imbalanced Learning to Defer) to address the expert imbalance problem in two-stage learning to defer systems. This method reframes deferral loss optimi…
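MILD's actual loss is behind the truncation; purely as orientation, a naive margin-based defer rule with an imbalance-aware threshold, the kind of baseline such work builds on rather than the paper's formulation, might read:

```python
import numpy as np

def defer_decisions(probs: np.ndarray, expert_load: np.ndarray,
                    base_threshold: float = 0.2) -> np.ndarray:
    """Defer when the classifier's top-1/top-2 margin is small, raising the
    bar for experts that are already over-assigned. A crude heuristic; MILD
    instead reframes this trade-off inside the deferral loss itself."""
    top2 = np.sort(probs, axis=1)[:, -2:]
    margin = top2[:, 1] - top2[:, 0]
    expert = probs.argmax(axis=1)               # expert routed per example
    threshold = base_threshold * (1 + expert_load[expert])
    return margin < threshold                   # True = defer to the expert

rng = np.random.default_rng(4)
probs = rng.dirichlet(np.ones(3), size=5)       # toy routing probabilities
load = np.array([0.9, 0.1, 0.0])                # expert 0 is over-used
print(defer_decisions(probs, load))
```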
-
Skills-Coach framework enhances LLM agent skills via training-free optimization
Researchers have developed Skills-Coach, an automated framework aimed at improving the self-evolution of skills within Large Language Model (LLM) agents. The system features four modules for task generation, skill optim…
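The four modules aren't detailed in this excerpt; a skeletal training-free loop in their spirit (task generation, attempt, critique, skill rewrite, with `llm` as any chat-completion callable) could be:

```python
def skills_coach_loop(llm, skill: str, rounds: int = 3) -> str:
    """Training-free skill refinement: no weights change, only the skill
    text the agent carries between rounds. `llm` is any str -> str call."""
    for _ in range(rounds):
        task = llm(f"Propose a task that stresses this skill:\n{skill}")
        attempt = llm(f"Using the skill below, solve the task.\n"
                      f"Skill: {skill}\nTask: {task}")
        critique = llm(f"Critique this attempt at the task:\n{attempt}")
        skill = llm(f"Rewrite the skill to address the critique.\n"
                    f"Skill: {skill}\nCritique: {critique}")
    return skill

# Example wiring with an echo stub; swap in a real chat API to run usefully.
print(skills_coach_loop(lambda p: p.splitlines()[-1], "Always cite sources."))
```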
-
Hierarchical Long-Term Semantic Memory for LinkedIn's Hiring Agent
Researchers have developed a Hierarchical Long-Term Semantic Memory (HLTM) framework to enhance the capabilities of Large Language Model (LLM) agents. This framework addresses challenges in scalability, retrieval speed,…
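Structurally, such a memory is a tree of summaries with raw records only at the leaves, so retrieval cost tracks tree depth rather than total record count. A minimal sketch, not LinkedIn's implementation, follows:

```python
from dataclasses import dataclass, field

@dataclass
class MemoryNode:
    """One level of a hierarchical long-term memory: a summary covering
    its children, with raw memories only at the leaves."""
    summary: str
    children: list["MemoryNode"] = field(default_factory=list)
    records: list[str] = field(default_factory=list)

def retrieve(node: MemoryNode, query: str, score) -> list[str]:
    """Walk top-down, following the child whose summary best matches."""
    if not node.children:
        return [r for r in node.records if score(query, r) > 0]
    best = max(node.children, key=lambda c: score(query, c.summary))
    return retrieve(best, query, score)

# Toy relevance score: keyword overlap; real systems use embeddings.
score = lambda q, text: len(set(q.lower().split()) & set(text.lower().split()))

root = MemoryNode("candidate history", children=[
    MemoryNode("interview feedback", records=["strong python interview"]),
    MemoryNode("outreach messages", records=["replied to recruiter email"]),
])
print(retrieve(root, "python interview notes", score))
```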
-
Speech Representation Models outperform LLMs in pediatric speech disorder classification
Researchers have developed a hierarchical approach using Speech Representation Models (SRMs) for classifying Speech Sound Disorders (SSD) in children, outperforming current Large Language Model (LLM) based methods. The …
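The pipeline's first stage, turning audio into a fixed-size SRM embedding for a downstream classifier, might be assembled as below; the wav2vec 2.0 checkpoint is an assumed example, since the excerpt doesn't name the paper's model:

```python
import numpy as np
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

# One plausible SRM choice; not necessarily the paper's model.
NAME = "facebook/wav2vec2-base"
extractor = Wav2Vec2FeatureExtractor.from_pretrained(NAME)
srm = Wav2Vec2Model.from_pretrained(NAME).eval()

def srm_embedding(waveform: np.ndarray, sr: int = 16_000) -> np.ndarray:
    """Mean-pooled hidden states: a fixed-size acoustic embedding a
    lightweight classifier can consume, with no text or LLM in the loop."""
    inputs = extractor(waveform, sampling_rate=sr, return_tensors="pt")
    with torch.no_grad():
        hidden = srm(**inputs).last_hidden_state    # (1, frames, dim)
    return hidden.mean(dim=1).squeeze(0).numpy()

# A downstream SSD classifier would be e.g. logistic regression fit on
# these embeddings, stacked hierarchically per the summary above.
fake_audio = np.random.randn(16_000).astype(np.float32)  # 1 s of noise
print(srm_embedding(fake_audio).shape)  # (768,)
```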
-
LLMs measure parliamentary discourse's epistemic orientation, linking it to democracy
Researchers have developed a new method called the Evidence-Minus-Intuition (EMI) score to measure epistemic orientation in political discourse. This score, derived from large language model ratings and semantic similar…
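The exact construction is truncated above; schematically, EMI combines an LLM rating and a semantic-similarity signal for each pole and takes the difference, something like the sketch below (all rubrics, anchors, weights, and stubs are illustrative):

```python
import numpy as np

def emi_score(rate, embed, speech: str) -> float:
    """Evidence-Minus-Intuition sketch: blend an LLM rating with a
    semantic-similarity signal per pole, then subtract. `rate` and
    `embed` are hypothetical LLM / embedding callables."""
    def similarity(text, anchor):
        a, b = embed(text), embed(anchor)
        return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

    evidence = 0.5 * rate(speech, "relies on data, studies, statistics") \
             + 0.5 * similarity(speech, "according to the evidence")
    intuition = 0.5 * rate(speech, "relies on gut feeling, common sense") \
              + 0.5 * similarity(speech, "everyone knows in their heart")
    return evidence - intuition

# Stubs so the sketch runs; swap in real rating / embedding calls.
rng = np.random.default_rng(5)
rate = lambda text, rubric: float(rubric.split()[-1] in text)  # toy 0/1
embed = lambda text: rng.standard_normal(32)
print(round(emi_score(rate, embed, "the statistics show a clear trend"), 3))
```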
-
Paper distinguishes three models for RLHF annotation: extension, evidence, and authority
A new paper proposes three distinct models for how human annotator judgments shape large language model behavior through Reinforcement Learning from Human Feedback (RLHF). These models are 'extension,' where annotators …