PulseAugur
LIVE 04:13:53
tool · [1 source] ·
0
tool

Fashion Florence model extracts structured clothing attributes

Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing category, color, material, style, and occasion tags, which is directly usable by recommendation and retrieval systems. In evaluations, Fashion Florence outperformed GPT-4o-mini and Gemini 2.5 Flash in category and style tag accuracy, while also demonstrating high JSON output validity and efficiency with its 0.77B parameters. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Enables direct programmatic use of fashion attributes for recommendation and retrieval systems, improving e-commerce operations.

RANK_REASON The cluster describes a fine-tuned model release based on an existing architecture, with performance benchmarks and deployment details. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Hugging Face Daily Papers →

COVERAGE [1]

  1. Hugging Face Daily Papers TIER_1 ·

    Fashion Florence: Fine-Tuning Florence-2 for Structured Fashion Attribute Extraction

    We present Fashion Florence, a Florence-2 vision-language model fine-tuned with LoRA to extract structured fashion attributes from clothing images. Given a single photograph, the model generates a JSON object containing category, color, material, style tags, and occasion tags, st…