Finqa
PulseAugur coverage of Finqa — every cluster mentioning Finqa across labs, papers, and developer communities, ranked by signal.
1 天有情绪数据
-
New benchmarks and agentic RAG enhance LLM financial analysis
Researchers have developed FINESSE-Bench, a new benchmark suite designed to hierarchically evaluate the financial domain knowledge and technical analysis capabilities of large language models. This suite includes specia…
-
Fin-PRM model enhances LLM financial reasoning with specialized reward signals
Researchers have developed Fin-PRM, a specialized process reward model designed to improve financial reasoning in large language models. Unlike general-purpose models, Fin-PRM focuses on the structured and fact-sensitiv…
-
New benchmarks reveal LLMs struggle with Arabic and symbolic financial reasoning
Researchers have introduced SAHM, a new benchmark designed to evaluate Arabic financial and Shari'ah-compliant reasoning capabilities in large language models. The benchmark includes over 14,000 expert-verified instance…
-
TaNOS framework boosts numerical reasoning in tables, outperforming GPT-5
Researchers have developed TaNOS, a new framework designed to improve numerical reasoning in AI models when dealing with tabular data. This approach uses anonymized headers, operation sketches for structural cues, and s…