QA
PulseAugur coverage of QA — every cluster mentioning QA across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
New TRACE framework detects RAG poisoning attacks via token influence
Researchers have developed a new framework called TRACE to detect poisoning attacks in retrieval-augmented generation (RAG) systems. These attacks manipulate RAG models by inserting malicious documents into their retrie…
-
CalVerT enhances LLM agents with calibrated telemetry for improved QA
Researchers have introduced CalVerT, a new method to improve the performance of Large Language Model (LLM) agents in knowledge-intensive question-answering tasks. CalVerT addresses common failure modes where agents eith…
-
New research models optimal scheduling for paid QA forums
A new paper explores optimal scheduling strategies for question-answering forums staffed by paid knowledge workers. The research models these forums as queuing systems, calculating the capacity for handling requests whi…
-
CalVerT enhances LLM agents with telemetry for better QA performance
Researchers have introduced CalVerT, a novel method to enhance Large Language Model (LLM) agents in knowledge-intensive question answering tasks. CalVerT augments agents with calibrated self-confidence and grounding ver…
-
CacheWeaver optimizes RAG inference by improving cache efficiency
Researchers have developed CacheWeaver, a new method to optimize retrieval-augmented generation (RAG) inference by improving cache efficiency. This technique reorders evidence sequences to maximize the reuse of token pr…
-
AI tools to automate QA tasks for testers
This article provides a guide to AI tools that can automate tasks for Quality Assurance (QA) professionals. It focuses on practical prompt templates and specific tools, aiming to help junior and middle-level QA speciali…
-
Latent Memory cuts QA token use by 3x-10x
Researchers have developed a new method called Latent Memory to improve question answering systems for resource-constrained environments. This approach compresses multimodal evidence, such as text and images, into singl…
-
AI coding tools shift bottlenecks to QA, requiring pipeline transformation
The integration of AI coding tools has accelerated developer output, but this speed increase often shifts bottlenecks rather than eliminating them. As developers produce code faster, the quality assurance (QA) and testi…
-
AI-generated code raises quality concerns amid shrinking QA teams
The increasing use of AI for code generation is raising concerns about software quality and testing. With many companies relying on developers to test their own code, the question arises whether traditional QA processes…
-
New LLMs unify audio and language processing for full-duplex and medical applications
Researchers have developed UAF, a novel unified audio front-end LLM designed for full-duplex speech interaction. This model integrates diverse audio front-end tasks like voice activity detection and turn-taking into a s…
-
S2G-RAG framework improves multi-hop QA by judging evidence sufficiency
Researchers have introduced S2G-RAG, an iterative framework designed to improve retrieval-augmented question answering, particularly for multi-hop queries. The system features a controller called S2G-Judge that determin…