ENTITY LAD-bench

LAD-bench

PulseAugur coverage of LAD-bench — every cluster mentioning LAD-bench across labs, papers, and developer communities, ranked by signal.

Total · 30d

1

1 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

1

1 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL

RESEARCH · CL_96086 · Jun 16 · 02:32

New LAD-bench benchmark reveals logical reasoning flaws in vision-language models

Researchers have introduced LAD-bench, a new benchmark designed to evaluate the logical reasoning capabilities of large vision-language models (VLMs). The benchmark consists of over 1,000 synthetic images featuring logi…