Researchers have introduced TimeVista, a new framework that utilizes Vision-Language Models (VLMs) to evaluate time series forecasting. This approach leverages VLMs' ability to interpret time series plots alongside textual information, offering a more human-aligned judgment than traditional point-wise metrics. The TimeVista benchmark includes 5563 time series samples and has demonstrated that VLMs provide reliable and consistent evaluations, outperforming conventional methods in aligning with human preferences. This framework has been used to assess recent Time Series Foundation Models, revealing VLMs as robust and interpretable judges for this domain. AI
IMPACT VLMs offer a more human-aligned evaluation standard for time series forecasting, potentially improving model development and comparison.
RANK_REASON The cluster describes a new research paper introducing a novel framework and benchmark for evaluating time series forecasting models using VLMs. [lever_c_demoted from research: ic=1 ai=1.0]
- alphaXiv
- arXiv
- CatalyzeX
- DagsHub
- Gotit.pub
- Hugging Face
- LLM-as-a-Judge
- ScienceCast
- Time Series Foundation Models
- TimeVista
- vision-language model
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →