FireRed-Image-Edit-1.0 Technical Report
Researchers have introduced FireRed-Image-Edit, a diffusion transformer model designed for instruction-based image editing. The model leverages a massive 1.6 billion sample training corpus, meticulously curated and filtered to over 100 million high-quality pairs for both image generation and editing tasks. FireRed-Image-Edit employs a multi-stage training pipeline and introduces novel techniques for data efficiency and optimization, including Asymmetric Gradient Optimization and a differentiable Consistency Loss. Its performance is validated on the newly established REDEdit-Bench, a benchmark covering 15 editing categories, where it demonstrates competitive results against existing systems. AI
IMPACT Introduces a new benchmark and model for instruction-based image editing, potentially improving performance and offering new evaluation standards.