Researchers have introduced the Polish Massive Text Embedding Benchmark (PL-MTEB), a new evaluation suite designed to assess text embedding models specifically for the Polish language. This benchmark includes 30 diverse NLP tasks across five categories such as classification, clustering, and information retrieval. The study evaluated 30 publicly available text embedding models, analyzing their performance across different task types and sizes, with all datasets and code made publicly accessible. AI
排序理由 This is a research paper introducing a new benchmark for evaluating text embedding models in a specific language.
AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →