Fashion Florence model extracts structured clothing attributes

By PulseAugur Editorial · [1 sources] · 2026-05-11 00:04

Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing category, color, material, style, and occasion tags, which is directly usable by recommendation and retrieval systems. In evaluations, Fashion Florence outperformed GPT-4o-mini and Gemini 2.5 Flash in category and style tag accuracy, while also demonstrating high JSON output validity and efficiency with its 0.77B parameters. AI

IMPACT Enables direct programmatic use of fashion attributes for recommendation and retrieval systems, improving e-commerce operations.

RANK_REASON The cluster describes a fine-tuned model release based on an existing architecture, with performance benchmarks and deployment details. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Daily Papers →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Fashion Florence model extracts structured clothing attributes

COVERAGE [1]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-05-11 00:04

Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, style tags, and occasion tags, st…

COVERAGE [1]

Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

RELATED ENTITIES

RELATED TOPICS