New framework tackles LLM context limits in long shopping data

By PulseAugur Editorial · [1 sources] · 2026-06-06 06:22

Researchers have developed a new framework called Customer-Agent to handle extremely long customer shopping trajectories, which often exceed the context window limitations of current large language models. This framework utilizes a Reinforcement Learning with Verifiable Rewards (RLVR) approach, enabling agents to autonomously retrieve and parse trajectory data through code interpreter interactions. A new benchmark, ShopTrajQA, was also introduced to evaluate model performance on these long-context datasets, with variants up to 64k tokens. AI

IMPACT This research could enable more personalized e-commerce experiences by allowing LLMs to process extensive customer histories.

RANK_REASON The cluster contains an academic paper introducing a new framework and benchmark for LLMs. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Bing Yin · 2026-06-06 06:22

Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR

Understanding customer shopping trajectories is essential for enabling personalized shopping experiences. However, shopping records (i.e., customer's search, clicks, purchases, etc.) often span long time horizons over multiple years, resulting in extremely long trajectories that …

COVERAGE [1]

Customer-Agent: Overcoming Context Limitations in Ultra-Long Shopping Trajectories via Tool-Augmented Agents and RLVR

RELATED ENTITIES

RELATED TOPICS