实体 Royal Galician Academy

Royal Galician Academy

PulseAugur coverage of Royal Galician Academy — every cluster mentioning Royal Galician Academy across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

106

90 天内 106

发布 · 30天

90 天内 0

论文 · 30天

90 天内 69

层级分布 · 90 天

research 23
tool 73
commentary 10

关系

情绪 · 30 天

7 天有情绪数据

最近 · 第 1/6 页 · 共 106 条

COMMENTARY · CL_35855 · May 17 · 19:59

Blogger shares LLM chunking strategies for long MDX articles

A technical blogger details strategies for managing token limits when feeding long MDX articles to Large Language Models. The author explains that exceeding a model's context window can lead to errors or incomplete proc…
RESEARCH · CL_35736 · May 17 · 17:44

GraphRAG通过检索连接知识来减少LLM令牌使用量

使用TigerGraph的GraphRAG方法开发的两个项目展示了其在减少令牌使用量和提高大型语言模型答案质量方面的有效性。这两个系统一个专注于网络安全，另一个专注于生物医学，将GraphRAG与传统的纯LLM和基础RAG方法进行了比较。通过利用知识图谱检索连接的实体和关系，GraphRAG为LLM提供了更集中的上下文，从而在保持准确性的同时降低了成本和延迟。
TOOL · CL_35737 · May 17 · 17:34

CyberGraph RAG uses TigerGraph to improve LLM cybersecurity analysis

Researchers developed CyberGraph RAG, a system designed to improve how large language models handle cybersecurity data by leveraging graph databases. Unlike traditional RAG which struggles with the relational nature of …
TOOL · CL_35652 · May 17 · 14:38

Agentic RAG fixes 40% retrieval failure in LLM pipelines

A new approach called Agentic RAG addresses significant retrieval failures in standard RAG pipelines, which are shown to fail up to 40% of the time in production. Unlike standard RAG, Agentic RAG uses an agent to dynami…
TOOL · CL_35358 · May 17 · 08:29

LLM Wiki 在摄取时合成知识，优于 RAG

LLM Wiki 是一种新颖的知识管理方法，它在摄取时合成信息，而不是像传统的 RAG 系统那样按需检索片段。这种方法旨在主动构建结构化知识，并阐述何时这种预合成策略比查询时检索更有效。
RESEARCH · CL_35211 · May 17 · 03:57

GraphRAG benchmarks show efficiency gains over RAG and LLM-only

Two developers built benchmarking platforms to compare Large Language Model (LLM) inference pipelines during the TigerGraph Hackathon. Their work aimed to demonstrate how GraphRAG, a method incorporating graph-based ret…
TOOL · CL_35086 · May 17 · 00:01

LLM Fine-Tuning Explained: SFT, RAG, and Data Preparation

This blog post explains the process and necessity of fine-tuning large language models (LLMs) for specific tasks. It differentiates fine-tuning from Retrieval-Augmented Generation (RAG), stating that fine-tuning is best…
TOOL · CL_34862 · May 16 · 18:22

Spartans-GraphRAG uses knowledge graphs to cut LLM token costs

A new system called Spartans-GraphRAG has been developed to make Large Language Model (LLM) inference more efficient, particularly for complex tasks like cybersecurity threat intelligence. This system leverages knowledg…
TOOL · CL_33147 · May 15 · 15:06

RAG pipeline failures stem from embedding normalization drift

Production RAG systems often fail to return results for user queries due to embedding normalization drift, a problem not typically encountered in tutorial settings. This occurs when the preprocessing applied to user que…
COMMENTARY · CL_32404 · May 15 · 00:12

生成式AI通过基于代币的交易重新定义软件经济

随着生成式AI的出现，软件开发的经济模式发生了根本性转变，将每个提示都变成了金融交易。与成本可预测的传统软件不同，LLM的交互会消耗代币，使得每个架构决策都成为成本管理问题。这种新范式要求关注AI FinOps，其中高效的代币使用和智能的模型路由对于可持续扩展至关重要。那些掌握经济上可行架构的组织，而不仅仅是拥有最智能模型的组织，将处于领先地位。
COMMENTARY · CL_30235 · May 13 · 16:31

原始HTML阻碍LLM性能，Markdown更受青睐

原始HTML通常包含过多的样板代码和结构噪音，这会阻碍大型语言模型（LLM）和AI代理。直接将原始HTML输入LLM会导致令牌浪费、内容重要性被误解，以及在RAG系统中的检索性能下降。作者提倡将HTML转换为更干净的格式，如Markdown，后者能更好地保留关键内容，同时丢弃无关的布局和导航元素，最终提高LLM的输出质量和代理行为。
TOOL · CL_28736 · May 12 · 16:18

Developer uses SHA-256 to optimize offline RAG knowledge base updates

A developer created GridMind, an offline RAG assistant designed for low-resource environments, to address the challenge of efficiently updating knowledge bases. The solution involves using SHA-256 hashes to fingerprint …
RESEARCH · CL_28377 · May 12 · 10:10

RAG pipelines gain precision with multi-stage reranker models

Implementing a reranker layer in Retrieval-Augmented Generation (RAG) pipelines is crucial for improving answer precision, as initial retrieval stages may surface relevant documents but bury the best answer among less o…
RESEARCH · CL_27949 · May 12 · 07:41

Qwen 2.5 驱动多轮检索系统荣登 SemEval 排行榜

研究人员开发了一个用于多轮对话的三阶段检索系统，提高了信息检索任务的准确性。该系统首先使用微调的 Qwen 2.5 7B 模型优化上下文相关的查询，生成独立的问句。然后，它采用结合了 BM25 和密集向量检索的混合搜索，并与倒数排名融合（Reciprocal Rank Fusion）相结合，最后由一个交叉编码器模型对结果进行重新排序以提高精度。这种方法在最近的 SemEval 任务中取得了显著的 nDCG@5 分数，优于许多其他系统。
TOOL · CL_27950 · May 12 · 07:39

RAG agents use self-query, corrective, and adaptive retrieval

This article explores advanced Retrieval-Augmented Generation (RAG) techniques that enhance how large language models retrieve and utilize information. It details three patterns: Self-Query RAG, which optimizes search q…
COMMENTARY · CL_27458 · May 12 · 00:57

AI工程师角色围绕LLM堆栈、Python和RAG固化

对3449份AI工程师职位发布的2026年分析显示，该角色已围绕LLM堆栈固化，需要Python、LLM、检索增强生成（RAG）和云平台技能。虽然Python和LLM被认为是必备技能，但RAG和LangChain等框架现已普及。AI工程师的美国基本薪资中位数为146,000美元，分布式系统和数据平台技能可带来显著溢价。
RESEARCH · CL_26873 · May 11 · 17:01

AI agents break RAG; new architectures like GraphRAG emerge

Retrieval-augmented generation (RAG), a popular AI architecture for chatbots, is facing limitations as AI agents become more complex. Pinecone, a leading vector database provider, has acknowledged a design flaw where ag…
TOOL · CL_26871 · May 11 · 16:31

Local LLM users find lower quantization cuts latency with minimal quality loss

Running large language models locally can be optimized by understanding quantization's impact on latency and quality. While Q4_K_M is a common default, lower quantization levels like Q3_K_S can significantly reduce late…
COMMENTARY · CL_26681 · May 11 · 14:31

RAG 系统在生产环境中失败是由于工程缺陷，而非设计缺陷

本文认为，检索增强生成（RAG）系统本身并无固有缺陷，其在生产环境中的失败源于糟糕的工程实践。文章以一个银行聊天机器人失败的真实案例为例，指出了诸如分块大小过小、嵌入模型不匹配以及重排不足等问题。文章提供了一个优化 RAG 管道的指南，涵盖了从分块到评估的各个层面，旨在提高生产环境中的性能、降低成本并增强可信度。
TOOL · CL_27549 · May 11 · 09:10

New framework guides LLMs to choose between RAG and long-context processing

Researchers have developed a new framework called Pre-Route to help large language models decide whether to use retrieval-augmented generation (RAG) or long-context (LC) processing for document understanding. This proac…

Blogger shares LLM chunking strategies for long MDX articles

GraphRAG通过检索连接知识来减少LLM令牌使用量

CyberGraph RAG uses TigerGraph to improve LLM cybersecurity analysis

Agentic RAG fixes 40% retrieval failure in LLM pipelines

LLM Wiki 在摄取时合成知识，优于 RAG

GraphRAG benchmarks show efficiency gains over RAG and LLM-only

LLM Fine-Tuning Explained: SFT, RAG, and Data Preparation

Spartans-GraphRAG uses knowledge graphs to cut LLM token costs

RAG pipeline failures stem from embedding normalization drift

生成式AI通过基于代币的交易重新定义软件经济

原始HTML阻碍LLM性能，Markdown更受青睐

Developer uses SHA-256 to optimize offline RAG knowledge base updates

RAG pipelines gain precision with multi-stage reranker models

Qwen 2.5 驱动多轮检索系统荣登 SemEval 排行榜

RAG agents use self-query, corrective, and adaptive retrieval

AI工程师角色围绕LLM堆栈、Python和RAG固化

AI agents break RAG; new architectures like GraphRAG emerge

Local LLM users find lower quantization cuts latency with minimal quality loss

RAG 系统在生产环境中失败是由于工程缺陷，而非设计缺陷

New framework guides LLMs to choose between RAG and long-context processing