PulseAugur / Brief
EN
LIVE 19:39:42

Brief

last 24h
[1/1] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling

    Researchers have developed a new method called SemiPrune for efficiently pruning large datasets used in deep learning. This technique addresses the limitation of existing methods that require fully labeled data, which is often costly to obtain. SemiPrune leverages a small set of labeled data combined with semi-supervised learning to generate pseudo-labels for unlabeled data, enabling the application of supervised pruning methods. The approach estimates example difficulty from the training dynamics derived from these pseudo-labels, leading to more accurate coreset selection and state-of-the-art performance on various specialized datasets. AI

    IMPACT Offers a more cost-effective way to prepare large datasets for deep learning training, potentially accelerating research and development by reducing computational and storage requirements.