text-embedding-3-large
PulseAugur coverage of text-embedding-3-large — every cluster mentioning text-embedding-3-large across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
New SCAR method enhances RAG recall with adaptive chunking
Researchers have developed SCAR (Semantic Continuity-Aware Retrieval), a novel method to improve Retrieval-Augmented Generation (RAG) systems. SCAR addresses the issue of fixed-length chunking by adaptively expanding ne…
-
SQL Server 2025 integrates native AI with vector data types and external model support
Microsoft is integrating advanced AI capabilities directly into SQL Server 2025 and Azure SQL Database, transforming them into first-class AI platforms. Key features include a native VECTOR data type with DiskANN indexi…
-
RAG chunk overlap default harms performance, author warns
Many Retrieval-Augmented Generation (RAG) pipelines incorrectly use a default chunk overlap of 200 tokens, a setting popularized by early LangChain tutorials. This default, while convenient for generic examples, can lea…
-
LLM evaluation harness updated with production data and adversarial testing
A new approach to evaluating Large Language Models (LLMs) has been proposed to address the issue of static evaluation harnesses failing to detect model regressions. This method involves refreshing evaluation datasets we…
-
VectorSmuggle attack hides data in AI embeddings; VectorPin offers defense
Researchers have identified a new steganographic attack vector called VectorSmuggle, which allows attackers to hide data within embeddings stored in vector databases used by RAG systems. This method exploits the lack of…
-
Healthcare RAG AI fails, retrieving wrong patient data and causing $850K HIPAA fine
A healthcare AI system using Retrieval-Augmented Generation (RAG) mistakenly provided treatment recommendations for one patient to another due to similar names and medical terminology. The system, which used OpenAI's te…
-
Vector databases explained: how they work and what they can't do
This article explains the fundamental concepts behind vector databases, which are crucial for AI agents that require memory and accurate information retrieval. It details how content is transformed into numerical vector…
-
OpenAI launches new embedding models with price cuts and performance boosts
OpenAI has released new embedding models, text-embedding-3-small and text-embedding-3-large, offering significant improvements in performance and efficiency over previous models like text-embedding-ada-002. These new mo…