arXiv:2606.14574v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed as planners for autonomous agents in household environments. While existing benchmarks evaluate whether LLM-generated plans execute successfully, they overlook a critical type…
arXiv:2606.14589v1 Announce Type: cross Abstract: LLM agent systems increasingly run as long-lived autonomous runtimes: scheduling jobs, calling tools, maintaining memory, and pushing results to humans. We present a longitudinal study of silent failures in one such system: a pers…
LLM agent systems increasingly run as long-lived autonomous runtimes: scheduling jobs, calling tools, maintaining memory, and pushing results to humans. We present a longitudinal study of silent failures in one such system: a personal-assistant agent runtime in continuous product…
Large language models (LLMs) are increasingly deployed as planners for autonomous agents in household environments. While existing benchmarks evaluate whether LLM-generated plans execute successfully, they overlook a critical type of failure: latent failures. Unlike immediate fai…
<h4>The latency walls, memory rot, reflection spirals, prompt injection patterns, and evaluation work that hit you after you deploy.</h4><p>The problems that show up only after you ship are never the ones in the diagram. They are the latency cliffs, the memory drift, the reflecti…
dev.to — LLM tag
TIER_1English(EN)·hhhfs9s7y9-code·
<h1> Why Retry Is Not Self-Healing: A Technical Deep-Dive for LLM APIs </h1> <p>When your LLM API call fails in production, what is your first instinct?</p> <p>Most developers reach for a retry loop. Exponential backoff, max attempts, maybe a circuit breaker.</p> <p>I thought the…
dev.to — LLM tag
TIER_1English(EN)·hhhfs9s7y9-code·
<h2> LLM API Reliability: The Reality Nobody Talks About </h2> <p>If you have run more than a few thousand LLM calls in production, you have seen the pattern: things work perfectly in development, then fall apart under load.</p> <h2> The Numbers </h2> <div class="table-wrapper-pa…