Brief

last 24h

[2/2] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · Mastodon — mastodon.social English(EN) · 4h

is this big enough to kill the #AI hype? Deep-Research Agents Can Be Poisoned via User-Generated Content https://arxiv.org/pdf/2605.24245 #science #technolo

A new research paper details a vulnerability in deep-research agents, which can be compromised through user-generated content. The study, available on arXiv, explores how malicious input can poison these AI systems. The post sharing it asks whether the finding is significant enough to deflate the current hype around AI.

IMPACT Highlights a critical security flaw in AI agents, potentially impacting the reliability and safety of AI systems that process user-generated content.

TOOL · arXiv cs.CL English(EN) · 1w

When the Gold Standard Isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content

Researchers have identified significant challenges in evaluating the translation of user-generated content (UGC) due to its inherent non-standard language. They developed a taxonomy of twelve non-standard phenomena and five translation actions to analyze how different datasets handle UGC, revealing a spectrum of standardness in reference translations. The study found that large language models' translation scores are sensitive to specific instructions and improve when aligned with dataset guidelines, advocating for guideline-aware evaluation frameworks.

IMPACT Highlights the need for more nuanced evaluation metrics for LLMs handling diverse language inputs.
- user-generated content
- Lydia Nishimwe
