MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset
Researchers have introduced MONET, a new open dataset designed to facilitate text-to-image model training. The dataset comprises approximately 104.9 million image-text pairs, meticulously curated through stages of filtering, deduplication, and re-captioning. MONET aims to lower the barriers for large-scale, reproducible research in text-to-image generation by providing a high-quality, enriched corpus.
AI IMPACT: Provides a large, open dataset to accelerate research and development in text-to-image generation models.