text-embedding-3-large
PulseAugur coverage of text-embedding-3-large — every cluster mentioning text-embedding-3-large across labs, papers, and developer communities, ranked by signal.
3 days with sentiment data
-
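Author warns: default RAG chunk overlap hurts performance
Many retrieval-augmented generation (RAG) pipelines mistakenly use a default chunk overlap of 200 tokens, a setting popularized by early LangChain tutorials. The default is convenient for generic examples, but it can reduce recall and increase storage costs, especially for structured documents that do not need overlap. The author proposes a simple ablation study, which can be completed in an hour, to determine the best chunk size and overlap for a specific corpus, improving RAG performance and efficiency.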
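As a rough illustration of the kind of ablation this summary describes, the sketch below sweeps chunk size and overlap over a token list and reports chunk count (a proxy for storage cost) and recall@k. It is not the author's procedure: the function names `chunk_tokens`, `recall_at_k`, and `ablate` are hypothetical, tokens are plain whitespace-split words, and ranking uses word overlap as a toy stand-in for embedding search.

```python
from itertools import product


def chunk_tokens(tokens, size, overlap):
    """Split a token list into windows of `size` tokens sharing `overlap` tokens."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + size])
        if start + size >= len(tokens):
            break  # the last window already reaches the end of the text
    return chunks


def recall_at_k(chunks, queries, k=1):
    """Fraction of (question, answer) pairs whose answer phrase sits intact
    in one of the top-k chunks. Ranking by word overlap is an assumption
    made for this sketch; a real run would use an embedding model."""
    hits = 0
    for question, answer in queries:
        q_words = set(question.lower().split())
        ranked = sorted(chunks, key=lambda c: -len(q_words & {t.lower() for t in c}))
        if any(answer in " ".join(c) for c in ranked[:k]):
            hits += 1
    return hits / len(queries)


def ablate(tokens, queries, sizes, overlaps, k=1):
    """Return (size, overlap, chunk_count, recall) for every valid setting."""
    rows = []
    for size, overlap in product(sizes, overlaps):
        if overlap >= size:
            continue  # skip settings where the window would never advance
        chunks = chunk_tokens(tokens, size, overlap)
        rows.append((size, overlap, len(chunks), recall_at_k(chunks, queries, k)))
    return rows
```

Comparing rows with equal recall but different chunk counts shows where overlap only adds storage without helping retrieval on that corpus.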
-
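LLM evaluation harness updated to support production data and adversarial testing
A new approach to evaluating large language models (LLMs) is proposed to address the failure of static evaluation tools to detect model regressions. It refreshes the evaluation dataset weekly with real production traces, using stratified sampling by intent cluster to keep it representative. In addition, a permanent adversarial dataset, curated from real customer support tickets that indicate model failures, is weighted heavily during evaluation to prioritize real-world performance.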
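The two mechanisms in this summary, sampling by intent cluster and up-weighting an adversarial set, can be sketched as follows. Everything here is an assumption for illustration: the trace format (dicts with an `intent` key), the equal per-cluster quota, the function names, and the default weight of 3.0 do not come from the source.

```python
import random
from collections import defaultdict


def stratified_sample(traces, per_cluster, seed=0):
    """Draw up to `per_cluster` traces from each intent cluster.

    An equal quota per cluster is one possible design; sampling in
    proportion to cluster size would be another."""
    rng = random.Random(seed)
    by_intent = defaultdict(list)
    for trace in traces:
        by_intent[trace["intent"]].append(trace)
    sample = []
    for intent in sorted(by_intent):  # sorted for a reproducible order
        group = by_intent[intent]
        sample.extend(rng.sample(group, min(per_cluster, len(group))))
    return sample


def weighted_pass_rate(fresh_results, adversarial_results, adversarial_weight=3.0):
    """Pass rate where each adversarial case counts `adversarial_weight` times.

    Both arguments are lists of booleans (True means the case passed)."""
    total = len(fresh_results) + adversarial_weight * len(adversarial_results)
    if total == 0:
        raise ValueError("no results to score")
    passed = sum(fresh_results) + adversarial_weight * sum(adversarial_results)
    return passed / total
```

With this weighting, a single failing adversarial case lowers the score as much as several failing production cases, which is one way to make known failure modes dominate the regression signal.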
-
VectorSmuggle attack hides data in AI embeddings; VectorPin offers defense
Researchers have identified a new steganographic attack vector called VectorSmuggle, which allows attackers to hide data within embeddings stored in vector databases used by RAG systems. This method exploits the lack of…
-
Healthcare RAG AI fails, retrieving wrong patient data and causing $850K HIPAA fine
A healthcare AI system using Retrieval-Augmented Generation (RAG) mistakenly provided treatment recommendations for one patient to another due to similar names and medical terminology. The system, which used OpenAI's te…
-
Vector databases explained: how they work and what they can't do
This article explains the fundamental concepts behind vector databases, which are crucial for AI agents that require memory and accurate information retrieval. It details how content is transformed into numerical vector…
-
OpenAI launches new embedding models with price cuts and performance boosts
OpenAI has released new embedding models, text-embedding-3-small and text-embedding-3-large, offering significant improvements in performance and efficiency over previous models like text-embedding-ada-002. These new mo…