PulseAugur
实时 07:09:39

Fashion Florence model extracts structured clothing attributes

Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing category, color, material, style, and occasion tags, which is directly usable by recommendation and retrieval systems. In evaluations, Fashion Florence outperformed GPT-4o-mini and Gemini 2.5 Flash in category and style tag accuracy, while also demonstrating high JSON output validity and efficiency with its 0.77B parameters. AI

影响 Enables direct programmatic use of fashion attributes for recommendation and retrieval systems, improving e-commerce operations.

排序理由 The cluster describes a fine-tuned model release based on an existing architecture, with performance benchmarks and deployment details. [lever_c_demoted from research: ic=1 ai=1.0]

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

Fashion Florence model extracts structured clothing attributes

报道来源 [1]

  1. Hugging Face Daily Papers TIER_1 English(EN) ·

    Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

    We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, style tags, and occasion tags, st…