Brief

last 24h

[3/3] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · arXiv cs.AI English(EN) · 5d · [3 sources]

Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models

Researchers have developed an automated system to classify psychiatric diagnoses using Natural Language Processing (NLP) and Machine Learning (ML). The study evaluated various text representation methods, including classical models and Large Language Models (LLMs) like e5_large, BioLORD, and Llama-3-8B, on a dataset of over 145,000 Spanish psychiatric descriptions. The findings indicate that transformer-based embeddings significantly outperform traditional methods, with the fine-tuned e5_large model achieving a top F1 score of 0.866. This work highlights the importance of adapting LLMs to specialized clinical language for accurate diagnosis coding. AI

IMPACT Demonstrates LLMs' potential to reduce administrative burden in healthcare by automating complex diagnostic coding.
RESEARCH · arXiv cs.CL English(EN) · 3d · [2 sources]

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

A new paper benchmarks Google Embeddings 2 (GE2) against several open-source models for multilingual dense retrieval and RAG systems. GE2 achieved top performance across multiple tasks, including BEIR and an Italian RAG corpus, but exhibited significantly higher latency compared to local models. Multilingual-E5-large (mE5-L) offered comparable performance on Italian retrieval with much lower latency, making it a more practical choice for applications with strict response time requirements. AI

IMPACT Highlights trade-offs between cutting-edge performance and latency in retrieval models, guiding practical deployment choices.
TOOL · arXiv cs.IR (Information Retrieval) English(EN) · 1w

Temporal Decay of Co-Citation Predictability: A 20-Year Statute Retrieval Benchmark from 396M Ukrainian Court Citations

Researchers have developed a new benchmark, UA-StatuteRetrieval, to assess the stability of co-citation predictability in legal information systems over time. Analyzing 396 million Ukrainian court citations from 2007 to 2026, they found a significant decay in retrieval performance, with predictability dropping by up to 47%. While high-frequency articles and criminal procedure maintained stability, mid-frequency articles and civil law showed notable degradation, partly explained by a 2017 judicial reform and a 4.3% semantic shift in article citation patterns. AI

IMPACT Reveals temporal decay in legal information retrieval, suggesting a need for dynamic models beyond static co-citation analysis.

Brief

Automated ICD Classification of Psychiatric Diagnoses: From Classical NLP to Large Language Models

Benchmarking Google Embeddings 2 against Open-Source Models for Multilingual Dense Retrieval and RAG Systems

Temporal Decay of Co-Citation Predictability: A 20-Year Statute Retrieval Benchmark from 396M Ukrainian Court Citations