Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing category, color, material, style, and occasion tags, which is directly usable by recommendation and retrieval systems. In evaluations, Fashion Florence outperformed GPT-4o-mini and Gemini 2.5 Flash in category and style tag accuracy, while also demonstrating high JSON output validity and efficiency with its 0.77B parameters. AI
影响 Enables direct programmatic use of fashion attributes for recommendation and retrieval systems, improving e-commerce operations.
排序理由 The cluster describes a fine-tuned model release based on an existing architecture, with performance benchmarks and deployment details. [lever_c_demoted from research: ic=1 ai=1.0]
在 Hugging Face Daily Papers 阅读 →
- Fashion Florence
- Florence-2
- Gemini 2.5 Flash
- GPT-4o-mini
- Hugging Face
- iMaterialist Fashion dataset
- Loom
- LoRA
AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →