中文(ZH) Artificial Analysis放榜：千问3.7问鼎国产模型冠军，全球前五

新的“错误归因差距”攻击针对人工智能记忆层

作者 PulseAugur 编辑部 · [22 个来源] · 2026-05-21 06:40

一篇题为《错误归因差距》的新研究论文，将“语义规范漂移”（SND）作为一种针对代理式人工智能系统的新型攻击向量。该攻击利用记忆层，使其难以与模型失准区分。SND 涉及将策略文档注入向量存储，丢失出处，并作为可信上下文重新出现，导致代理行为不当。该论文还提出了“反事实组合测试”和“记忆持久信息流控制”作为防御机制，声称在识别攻击来源和阻止相当比例的攻击方面具有高准确率。 AI

影响新研究突显了人工智能记忆系统的漏洞，可能影响代理式人工智能应用程序的安全性和可靠性。

排序理由该集群主要包含一篇详细介绍人工智能系统新攻击向量和防御机制的研究论文，以及关于人工智能记忆和代理能力的讨论。

在量子位 (QbitAI) 阅读 →

AI 生成摘要 · Google Gemini · 来自 22 个来源。我们如何撰写摘要 →

报道来源 [22]

arXiv cs.AI TIER_1 English(EN) · Tanzim Ahad, Ismail Hossain, Md Jahangir Alam, Sai Puppala, Syed Bahauddin Alam, Sajedul Talukder · 2026-05-25 04:00

误归因鸿沟：在代理式AI系统中，记忆中毒如何看起来像模型故障

arXiv:2605.22842v1 Announce Type: cross Abstract: Multi-agent AI pipelines typically assume that agent misconduct originates from model misalignment. We identify a structural failure in this assumption, the \emph{Misattribution Gap}, where memory-layer attacks produce behaviors i…
量子位 (QbitAI) TIER_1 中文(ZH) · 量子位的朋友们 · 2026-05-21 09:16

人工智能分析榜单：Qwen3.7 夺国内模型冠军，全球排名前五

Qwen3.7-Max即将上线阿里云百炼对外提供API服务
36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-21 06:43

国际资本持续流出印度股市，今年以来全球投资者已从印度股市撤资约230亿美元。

据彭博社报道，国际资本持续流出印度股市，进一步加大卢比贬值压力。数据显示，今年以来，全球投资者已从印度股市总计撤出约230亿美元。据路透社报道，这一数字超过去年全年印度股市的外资流出总量。（央视财经）
36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-21 06:40

ArtificialAnalysis：Qwen3.7 夺国内模型桂冠，全球排名前五

36氪获悉，5月21日，三方机构ArtificialAnalysis公布了最新的全球大模型榜单，阿里新发布的旗舰模型Qwen3.7-Max得分56.6分，性能接近GPT、Claude、Gemini的最强模型，位列全球第五、国产第一。据了解，Qwen3.7-Max即将上线阿里云百炼对外提供API服务。
dev.to — Claude Code tag TIER_1 English(EN) · Michel Faure · 2026-05-27 11:54

为什么你的AI不应独自决定：三选项模式

<h2> Why your AI shouldn't decide alone: the 3-options pattern </h2> Tuesday afternoon, May 16th. Catherine calls from the Maisons-Laffitte branch. The line opens with her usual phrasing, "hum it bugs but it's quickly fixed," slipped into the receiver before she even …
Medium — Claude tag TIER_1 English(EN) · Mahesh Nandam · 2026-05-29 08:19

第四天：✅ Claude上下文 - 为您的AI提供机构记忆

<div class="medium-feed-item"><a href="https://maheshnandam.medium.com/day4-claude-context-give-your-ai-institutional-memory-eafff9ba72a1?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1266/1*feCKiIGl1JYI8q_16SpY2w.png" width=…
Medium — Claude tag TIER_1 English(EN) · Rahul Ahir · 2026-05-29 03:26

3个Claude AI记忆技巧，每个高阶用户都应掌握

<div class="medium-feed-item"><a href="https://medium.com/@ahirlog/3-claude-ai-memory-tricks-every-power-user-should-learn-a6cf2ebbd6ab?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1915/1*u4erK9fruXjnjnW7JEv75g.png" width="1…
Towards AI TIER_1 English(EN) · Vektor Memory · 2026-05-23 04:31

您的人工智能有记忆。它只是不知道该记住什么。

<h4>Why the next frontier of AI isn’t more data — it’s smarter forgetting.</h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*cLIZ1ww7t56SW4GeZlVMrw.jpeg" /></figure>A 12-minute read — Vektor MemoryYour AI assistant…
dev.to — MCP tag TIER_1 English(EN) · Frank Brsrk · 2026-05-21 16:04

我为 LLM 代理构建了一个推理工具。代理调用它时会收到什么。

Most LLM agent failures aren't model failures. They're shape-of-reasoning failures. Sycophancy. Drift under multi-turn pressure. Doubling down on hallucinations. Ignoring a critical RAG document. These aren't bugs that a model update fixes. They're structural properties…
dev.to — LLM tag TIER_1 English(EN) · Nicolas Dabene · 2026-05-28 20:15

MIRROR与Engram：AI如何学会思考和记忆

Advances in LLM memory and reasoning directly impact the capabilities of AI agents deployed in e-commerce. ## When AI Forgets Your Name Three Messages Later Have you ever had that frustrating conversation with ChatGPT or Claude? You mention an important detail at the be…
r/LocalLLaMA TIER_1 English(EN) · /u/hulk14 · 2026-05-28 20:01

本地模型是否足以满足AI会议记忆需求？

<div class="md">I’ve been trying to move more of my workflow local, but meeting memory is the one thing I still can’t really replace. Right now I’m using Bluedot with Claude because being able to search old meetings, transcripts, summaries, action items, recordi…
dev.to — LLM tag TIER_1 English(EN) · ClawBase · 2026-05-28 06:57

我测试了 33 款 AI 记忆引擎 — 哪些真正有效

6 months ago, I asked my AI agent what we'd been working on last week. It had no idea. Not because it couldn't remember — ChatGPT has memory, Claude has memory — but because I couldn't see what it stored, couldn't query it, couldn't tell it what to forget. A black box with a t…
dev.to — LLM tag TIER_1 English(EN) · Self-Correcting Systems · 2026-05-26 21:22

我测试了三种AI记忆检索策略。最糟糕的失败是语义上的

A deterministic test on 10 scenarios with 21 memory objects. After writing about AI memory as judgment infrastructure, I wanted to turn the idea into something more inspectable. Not a benchmark. Not a claim of generalization. Just a small art…
dev.to — LLM tag TIER_1 English(EN) · Self-Correcting Systems · 2026-05-25 23:57

AI 记忆应决定允许做什么上下文

Retrieval gets you the records. A mature memory system must also decide what each record is permitted to do. Long-running AI systems eventually retrieve multiple valid but conflicting memories: <ul> <li>an old summary,</li> <li>a current source file,</li> <…
dev.to — LLM tag TIER_1 English(EN) · Self-Correcting Systems · 2026-05-25 23:36

AI 记忆需要权威策略，而非仅仅更多上下文

When records conflict, the agent needs explicit rules for which one is allowed to steer the answer. Long-running AI systems eventually retrieve conflicting but individually valid memories: <ul> <li>an old summary,</li> <li>a newer source file,</li> <li>a re…
dev.to — LLM tag TIER_1 English(EN) · Self-Correcting Systems · 2026-05-25 22:36

我的AI记忆系统测试中的三个失败之处——以及它暴露出的自身缺陷

This is not proof. It is early, messy evidence from my own workflow: three failures, one small comparison, and one schema bug I missed. I'd spent a week arguing, in public, that AI memory should be built on discipline before infrastructure: preserve corrections…
dev.to — LLM tag TIER_1 English(EN) · mr_miou · 2026-05-25 15:42

为什么大多数AI在IDOR上会失败（以及AMAS如何通过因果推理来解决它）

<h2> The problem no one talks about </h2> Large language models are great at pattern matching. Show them enough “vulnerable” examples, and they learn the words – not the reason. That’s why they struggle with logical vulnerabilities<…
Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-05-24 23:04

在柏林与两位朋友一起搭建小型网络工作室。为中小企业提供固定价格的网站，1-3周交付。我们开源的副业项目：内部 Mac AI 助手

Building a small web studio in Berlin with two friends. Fixed-price websites for SMBs, 1–3 week delivery. Side project we open-sourced: internal Mac AI assistant — wake-word, screen vision, multi-provider routing (Claude/GPT/Gemini). MIT. Happy to chat about either if anyone's cu…
dev.to — LLM tag TIER_1 English(EN) · Keniel Maldonado · 2026-05-23 21:31

大多数AI的记忆都会衰退。例外是犯错的记忆。

By 2026 the question stopped being whether your AI can remember you. It can. Memory went from research demo to commodity infrastructure in about a year — managed services, a dozen frameworks, benchmark suites, drop-in integrations by the score. Soon every assistant and every a…
dev.to — LLM tag TIER_1 English(EN) · Self-Correcting Systems · 2026-05-23 21:31

大多数人工智能的记忆都会遗忘。例外是犯错的记忆。

By 2026 the question stopped being whether your AI can remember you. It can. Memory went from research demo to commodity infrastructure in about a year — managed services, a dozen frameworks, benchmark suites, drop-in integrations by the score. Soon every assistant and every a…
r/MachineLearning TIER_1 English(EN) · /u/Commercial-Kale-5271 · 2026-05-23 07:47

个性化AI记忆是值得解决的问题，还是我只是在自我安慰[D]

<div class="md">genuine question for this community every time i use claude or chatgpt i have to re-explain myself. and even their memory feature is shallow it remembers facts about me, not how i actually think. the idea i've been sitting on is dif…
dev.to — LLM tag TIER_1 English(EN) · Thousand Miles AI · 2026-05-22 02:28

Cola DLM — 先规划后写作的文本生成

On May 7, 2026, ByteDance Seed released a 2B-parameter language model that does not generate text one token at a time. Cola DLM — short for Continuous Latent Diffusion Language Model — plans the whole passage in a continuous latent space, then decodes those latents ba…

报道来源 [22]

相关实体

相关话题