Faiss
PulseAugur coverage of Faiss — every cluster mentioning Faiss across labs, papers, and developer communities, ranked by signal.
11 天有情绪数据
-
AI系统利用OCR、RAG和LangGraph自动化合同审查
本文详细介绍了如何构建一个AI驱动的合同智能系统,自动化从各种文档格式中提取关键条款。该系统结合了PaddleOCR的光学字符识别(OCR)、FAISS和BM25等混合检索方法,以及LangGraph管道中的GPT-4o模型。这种方法旨在将非结构化的合同数据转化为结构化报告,解决漏报、财务损失和合规风险等问题。
-
LangGraph 模板指导 AI 代理开发
多篇 dev.to 文章详细介绍了如何使用 LangChain 的工作流系统 LangGraph 来构建 AI 代理。这些文章提供了常见代理模式的模板,包括用于文档查询的检索增强生成 (RAG)、能够规划和执行任务的多工具代理,以及需要用户审查的人机协作工作流。这些模板通过节点、边和状态管理来说明 LangGraph 的架构,用于创建复杂、有状态的 AI 应用程序。
-
RAG pipeline struggles with citations, developer proposes fix
A developer detailed a sophisticated Parent-Child RAG pipeline on GitHub, which, despite its advanced components like hybrid vector stores and LangGraph, suffered from inaccurate citations and hallucinations. The core i…
-
Recursive chunking excels in Khmer agricultural document RAG
Researchers evaluated four text chunking strategies for a Retrieval-Augmented Generation (RAG) framework using Khmer agricultural documents. The study found that a character-based Recursive chunking method, with a chunk…
-
Turbovec offers Rust vector index with Python bindings for efficient AI
Turbovec is a new open-source vector index library written in Rust with Python bindings, designed to reduce the memory footprint of vector embeddings for AI applications. It utilizes Google's TurboQuant algorithm, a dat…
-
Developer builds VORTEXRAG to fix RAG failures
A developer spent six months debugging a Retrieval-Augmented Generation (RAG) system for document Q&A, identifying two key failure modes: semantic drift in query reformulation and context poisoning by irrelevant but sim…
-
AI search engine built on 20GB laptop, no cloud needed
An individual developed a production-grade AI-powered e-commerce search engine that operates entirely on a consumer laptop with 20GB of RAM, eliminating the need for cloud services. This system addresses the limitations…
-
KernelMind project details code retrieval improvements and evaluation methods
The KernelMind project is detailing its development process, focusing on improving its code retrieval and evaluation capabilities. Early versions struggled with subjective evaluation, prompting the creation of a benchma…
-
GraphRAG通过检索连接知识来减少LLM令牌使用量
使用TigerGraph的GraphRAG方法开发的两个项目展示了其在减少令牌使用量和提高大型语言模型答案质量方面的有效性。这两个系统一个专注于网络安全,另一个专注于生物医学,将GraphRAG与传统的纯LLM和基础RAG方法进行了比较。通过利用知识图谱检索连接的实体和关系,GraphRAG为LLM提供了更集中的上下文,从而在保持准确性的同时降低了成本和延迟。
-
RAG systems enhance LLMs with external knowledge retrieval
Retrieval Augmented Generation (RAG) is a system design pattern that enhances Large Language Models (LLMs) by incorporating external knowledge. Instead of relying solely on the model's training data, RAG systems retriev…
-
Vector RAG vs. LLM Wiki: Study reveals trade-offs in research synthesis
A new research paper compares Vector Retrieval-Augmented Generation (RAG) against an LLM-compiled wiki for answering questions over a small corpus of 24 research papers. While the wiki excelled at synthesizing informati…
-
Developer uses SHA-256 to optimize offline RAG knowledge base updates
A developer created GridMind, an offline RAG assistant designed for low-resource environments, to address the challenge of efficiently updating knowledge bases. The solution involves using SHA-256 hashes to fingerprint …
-
本地文档AI需要OCR、RAG和本地推理
构建一个完全本地化的文档AI系统,需要的不仅仅是在本地机器上运行一个语言模型。它需要一个完整的管道,包括用于文档解析的光学字符识别(OCR)、用于搜索和选择相关信息的检索系统(RAG),以及用于生成响应的本地推理。如果没有强大的OCR和解析能力,检索系统可能无法找到准确的信息,导致本地LLM给出错误的答案。许多被宣传为“本地AI”的系统是不完整的,它们依赖外部服务来完成OCR或嵌入等关键步骤,从而损害了真正的本地运行。
-
RAG systems enhance LLMs by integrating external data retrieval
Retrieval-Augmented Generation (RAG) systems are a crucial technique for enhancing Large Language Models (LLMs) by allowing them to access and utilize external, up-to-date information. RAG addresses LLM limitations such…
-
RAG integrates private documents with LLMs using vector databases for semantic search
This article explains Retrieval-Augmented Generation (RAG) and the role of Vector Databases. RAG involves breaking down private documents into chunks, which are then processed by an embedding model to generate multi-dim…
-
Build AI Q&A Generator with LangChain, Groq, and FAISS
This project details how to build a Generative AI Question & Answer generator using Python, LangChain, Groq LLMs, Hugging Face Embeddings, and FAISS. The application takes a PDF, extracts content, splits it into managea…
-
Sentra-Guard system achieves 99.96% detection rate against adversarial LLM prompts
Researchers have developed Sentra-Guard, a real-time system designed to defend against adversarial prompts targeting large language models. The system employs a hybrid approach combining semantic embeddings with transfo…
-
Local RAG assistant offers sovereign access to cultural event data
A new open-source project offers a local Retrieval-Augmented Generation (RAG) assistant for exploring cultural event data. This assistant utilizes OpenAgenda data, FAISS for indexing, and the Mistral model to enable nat…
-
New Geodesic Semantic Search method improves citation graph navigation
Researchers have developed Geodesic Semantic Search (GSS), a novel retrieval system that navigates citation graphs using learned, node-specific Riemannian metrics. Unlike traditional methods relying on fixed Euclidean d…
-
FGDM: Reasoning Aware Multi-Agentic Framework for Software Bug Detection using Chain of Thought and Tree of Thought Prompting
Researchers have developed a new framework called FGDM for detecting and repairing software bugs. This multi-agent system leverages Large Language Models (LLMs) with Chain-of-Thought and Tree-of-Thoughts prompting to un…