ENTITY LongMemEval

LongMemEval

PulseAugur coverage of LongMemEval — every cluster mentioning LongMemEval across labs, papers, and developer communities, ranked by signal.

Total · 30d

14

14 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

12

12 over 90d

TIER MIX · 90D

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

6 day(s) with sentiment data

RECENT · PAGE 1/1 · 14 TOTAL

TOOL · CL_87172 · Jun 12 · 06:29

VEKTOR Slipstream beats GPT-4 on local memory benchmark

VEKTOR Slipstream, a local agent memory framework, achieved a 79% score on the LongMemEval benchmark, outperforming full-context GPT-4 by 12 points. This benchmark specifically tests real-world memory retrieval failures…
TOOL · CL_86406 · Jun 11 · 23:48

LLM Memory Systems Outperform Full Context on Long Histories

A new benchmark, LongMemEval, has demonstrated that retrieval-based memory systems outperform full-context baselines for LLM agents dealing with long conversation histories. While full context remains competitive for sh…
TOOL · CL_82494 · Jun 10 · 04:00

Regimes system improves AI agent reliability with auditable improvement loops

Researchers have developed a new system called Regimes that enhances the trustworthiness of autonomous AI improvement loops. This system uses an event-sourced agent runtime to log all changes, allowing for auditable dia…
RESEARCH · CL_81266 · Jun 9 · 15:19

AI Memory Systems Can Harm Performance, Research Finds

New research indicates that AI memory systems, while intended to improve user experience and task completion, can paradoxically degrade model performance and foster sycophantic tendencies. Studies show that these system…
TOOL · CL_69502 · Jun 3 · 19:21

New Operator AI model specializes in precise KMP protocol actions

A new compact AI model named Operator has been developed to specialize in executing precise actions within the Kernel Memory Protocol (KMP). This model is designed to handle the strict operational requirements of KMP, s…
TOOL · CL_64907 · Jun 2 · 04:19

BECOMER API offers token-free memory for AI agents

A new open-source memory API called BECOMER has been developed to enhance AI agent performance by providing persistent memory without incurring LLM token costs for recalls. This API achieves a 94.4% score on the LongMem…
RESEARCH · CL_65549 · Jun 2 · 04:00

New benchmarks and methods tackle LLM long-context and memory challenges

Researchers are developing new methods to improve how large language models handle long conversation histories and complex documents. Several papers introduce novel architectures and benchmarks designed to overcome the …
RESEARCH · CL_48844 · May 21 · 21:58

Study shows training data curriculum fine-tunes RL agent specialization

A new study on arXiv explores how different training data curricula impact the performance of reinforcement learning (RL) agents designed to work with large language models (LLMs) and external memory banks. The research…
TOOL · CL_42226 · May 21 · 06:34

Grep tool matches vector retrieval accuracy in agentic search

A new study titled "Is Grep All You Need?" challenges the default reliance on vector retrieval for agentic search by comparing it against the traditional grep tool. Experiments using the LongMemEval benchmark showed tha…
RESEARCH · CL_41825 · May 20 · 05:26

New frameworks enhance AI dialogue memory and retrieval benchmarks

Researchers have developed new frameworks for improving long-term dialogue agents and evaluating conversational retrieval. MGRetrieval enhances memory retrieval by grounding reflective processes in historical memory str…
TOOL · CL_15465 · May 5 · 04:00

HyMem architecture boosts LLM agent memory efficiency by 92.6%

Researchers have developed HyMem, a novel hybrid memory architecture designed to improve the efficiency and effectiveness of large language model (LLM) agents in long-context scenarios. HyMem utilizes a dual-granular st…
RESEARCH · CL_10103 · Apr 30 · 04:00

AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

Multiple research papers released on arXiv propose novel frameworks for enhancing the memory capabilities of Large Language Model (LLM) agents. These approaches aim to overcome limitations in handling long-term conversa…
RESEARCH · CL_05046 · Apr 23 · 21:34

New AI agent memory systems leverage visual and semantic approaches for long-horizon tasks

Two new research papers propose novel memory architectures for autonomous AI agents to handle long-horizon tasks. OCR-Memory leverages visual representations of agent experience to store extensive histories with minimal…
RESEARCH · CL_02982 · Apr 23 · 04:59

MemPalace AI memory system praised for innovation, criticized for overstated claims

A new paper critically analyzes MemPalace, an open-source AI memory system that uses spatial metaphors inspired by the method of loci. While MemPalace achieved high retrieval performance and rapid adoption on GitHub, th…