Brief

last 24h

[22/22] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI English(EN) · 6h

Certified Robustness from Approximate Gaussian Mixture Structures in Pretrained Latent Spaces

Researchers have developed a new framework to create certifiably robust deep learning classifiers by leveraging the latent structure within data representations. Their method proves that even approximate Gaussian mixture structures in pretrained models can yield robust classifiers with explicit bounds on accuracy degradation. This approach allows for the practical use of existing pretrained models without strict distributional assumptions, achieving competitive certified accuracy on benchmarks like CIFAR-10 and ImageNet while maintaining strong clean performance. AI

IMPACT Enhances formal guarantees for AI safety in critical applications by enabling robust classifiers with existing models.
TOOL · arXiv cs.AI English(EN) · 6h

PILOT: Policy-Informed Learned Optimization for Adaptive Deep Network Training

Researchers have developed PILOT, a novel adaptive optimizer for deep learning that adjusts its update strategy during training. Unlike traditional optimizers with fixed update rules, PILOT uses gradient-direction agreement to gauge training stability and modifies its approach based on whether gradients are stable, noisy, or inconsistent. Experiments on datasets like FashionMNIST and CIFAR-10 demonstrated that PILOT achieved superior accuracy compared to other optimizers across various convolutional neural network architectures. AI

IMPACT Introduces a novel adaptive optimization technique that could lead to more efficient and accurate deep learning model training.
- CIFAR-10
TOOL · MarkTechPost English(EN) · 13h

Step by Step Guide to Build and Compare FedAvg and FedProx Federated Learning on Non-IID CIFAR-10 with NVIDIA FLARE

This tutorial demonstrates how to implement and compare the FedAvg and FedProx federated learning algorithms using NVIDIA FLARE. The experiment utilizes a non-IID CIFAR-10 dataset, simulated by partitioning data with a Dirichlet distribution to mimic realistic label imbalance across clients. The guide details setting up the NVFlare environment, defining client-side scripts for local training and model exchange, and visualizing the global model's accuracy progression over training rounds. AI

IMPACT Provides a practical guide for researchers and developers to implement and compare federated learning algorithms, highlighting differences in performance on imbalanced data.
- FedAvg
- CIFAR-10
- FedProx
TOOL · arXiv cs.LG English(EN) · 1d

Nonlinear Transformations Against Unlearnable Datasets

Researchers have developed a new nonlinear transformation framework that can effectively learn from data previously considered unlearnable by deep learning models. This framework demonstrates significant improvements, ranging from 0.34% to 249.59%, in breaking various "unlearnable" datasets generated by twelve different data protection approaches. The findings suggest that current methods for preventing unauthorized data use are insufficient, highlighting an urgent need for more robust protection mechanisms. AI

IMPACT Challenges existing methods for data protection in AI, suggesting a need for more robust security measures against unauthorized data use.
TOOL · arXiv cs.LG English(EN) · 1d

Low-Cost Hard-Label Adversarial Attack with Theoretical Foundations

Researchers have developed a new framework for adversarial attacks on AI models, focusing on hard-label black-box scenarios where only the top prediction is accessible. Their approach introduces a novel zero-query initialization strategy and a Pattern-Driven Optimization algorithm, grounded in theoretical analysis that links existing methods to gradient sign approximation. This method demonstrates superior efficiency and success rates compared to state-of-the-art attacks across various datasets and model types, including commercial APIs and CLIP models, while also showing robustness against data corruption and specialized tasks like segmentation. AI

IMPACT This research introduces a more efficient and theoretically grounded method for adversarial attacks, potentially impacting AI model security and robustness testing.
- ImageNet
- CIFAR-10
- PathMNIST
- ImageNet-C
- ObjectNet
- Jun Liu
TOOL · arXiv cs.LG English(EN) · 1d

FIRMA: FIbonacci Ring Model Aggregation for Privacy-preserving Federated Learning

Researchers have introduced FIRMA, a novel family of three federated learning protocols designed to enhance privacy and efficiency. The protocols address limitations in existing methods by enabling server-free operation, ensuring permanent privacy for classification heads, and implementing principled asymmetric neighbor weighting. Experiments across various configurations show FIRMA outperforming standard federated learning approaches, particularly in scenarios with label skew and heterogeneity. AI

IMPACT Introduces novel privacy-preserving techniques for distributed model training, potentially improving data security in collaborative AI development.
RESEARCH · arXiv cs.LG English(EN) · 5d · [2 sources]

Adaptive Signal Resuscitation: Channel-wise Post-Pruning Repair for Sparse Vision Networks

Researchers have developed Adaptive Signal Resuscitation (ASR), a novel training-free method to repair sparse vision networks after pruning. ASR addresses the accuracy collapse seen in high-sparsity models by applying corrections at a channel-wise granularity, unlike previous layer-wise approaches. This technique estimates and stabilizes variance-matching corrections for each output channel, significantly improving performance in high-sparsity scenarios. For instance, ASR recovered 55.6% top-1 accuracy on ResNet-50 at 90% sparsity on CIFAR-10, a substantial improvement over existing methods. AI

IMPACT Improves accuracy of pruned vision models, potentially enabling more efficient deployment on resource-constrained devices.
TOOL · arXiv cs.AI English(EN) · 4d

FAIR-Pruner: A Flexible Framework for Automatic Layer-Wise Pruning via Tolerance of Difference

Researchers have developed FAIR-Pruner, a new framework designed for automatic, layer-wise structured pruning of deep neural networks. This method adaptively allocates sparsity across network layers by using both removal-oriented and protection-oriented signals. Experiments across various datasets and model architectures, including vision models and a Qwen1.5-MoE model, demonstrate that FAIR-Pruner achieves strong accuracy-compression trade-offs. The framework is available as an open-source package. AI

IMPACT Enables more efficient deployment of large neural networks by improving compression techniques.
- ImageNet
- CIFAR-10
- CIFAR-100
- DenseNet
- ResNet
- SVHN
- ConvNeXt
- FAIR-Pruner
- Qwen1.5-MoE-A2.7B-Chat
- Chengyao Yu
TOOL · arXiv cs.LG English(EN) · 4d

An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning

Researchers have developed a new optimization algorithm called IAdaPID-ADG, designed to improve the convergence and stability of deep learning models. This novel optimizer integrates concepts from AMSGrad and DiffGrad, specifically a non-increasing effective learning rate and a gradient difference modulation factor, to address limitations inherited from the widely used Adam optimizer. Evaluations on benchmark and real-world datasets demonstrated that IAdaPID-ADG significantly outperforms existing optimizers. AI

IMPACT Introduces a novel optimization algorithm that could lead to faster and more reliable training of deep learning models.
- Adam
- MNIST
- CIFAR10
- AMSGrad
- DiffGrad
- AnnoCerv
- IAdaPID-ADG
TOOL · arXiv cs.AI English(EN) · 4d

Causal Unlearning in Collaborative Optimization: Exact and Approximate Influence Reversal under Adversarial Contributions

Researchers have developed a new method called HF-KCU to efficiently remove a client's data contribution from federated learning models, addressing the computational burden of retraining. This approach approximates the influence function using Krylov subspace iterations, significantly reducing complexity and speeding up the process. A causal weighting mechanism ensures that only clients affected by the data deletion are updated, preserving model quality and enhancing privacy restoration, as demonstrated by membership inference attack success rates matching a retrained model. AI

IMPACT Enables more efficient and privacy-preserving data deletion in federated learning systems.
- MNIST
- CIFAR-10
- Fashion-MNIST
- Federated learning
- ResNet-18
- HF-KCU
- ViT-Lite
TOOL · arXiv cs.LG English(EN) · 4d

Don't Collapse Your Features: Why CenterLoss Hurts OOD Detection and Multi-Scale Mahalanobis Wins

Researchers have developed a new pipeline called GOEN that improves the detection of out-of-distribution inputs in machine learning systems. This method combines multi-scale features, L2 normalization, Mahalanobis distance, and a calibration head trained with real out-of-distribution examples. Their findings indicate that CenterLoss, a common regularizer for feature compactness, actually degrades out-of-distribution detection performance, while GOEN-NoCenterLoss achieved a superior OOD AUROC of 0.9483 on CIFAR-10 benchmarks. AI

IMPACT Enhances AI safety by improving the ability of models to recognize and flag unfamiliar or out-of-distribution data.
- CIFAR-10
- GOEN
- CenterLoss
TOOL · arXiv cs.LG English(EN) · 4d

AutoMCU: Feasibility-First MCU Neural Network Customization via LLM-based Multi-Agent Systems

Researchers have developed AutoMCU, a novel system that leverages LLM-based multi-agent approaches to customize neural networks for microcontroller units (MCUs). This method prioritizes feasibility by integrating vendor toolchain feedback early in the design process, significantly reducing the search cost and time compared to traditional hardware-aware neural architecture search methods. AutoMCU has demonstrated competitive accuracy on benchmark datasets and successful deployment on STM32 microcontrollers, making edge intelligence more accessible. AI

IMPACT Automates neural network deployment on resource-constrained MCUs, enabling more edge AI applications.
TOOL · arXiv cs.LG English(EN) · 4d

How Sparsity Allocation Shapes Label-Free Post-Pruning Recoverability

A new research paper investigates how the allocation of sparsity in neural networks impacts their ability to recover accuracy after pruning, especially when labeled retraining data is unavailable. The study compares different sparsity allocation methods like ERK and LAMP across various datasets and architectures, finding that the choice of allocation significantly affects post-repair accuracy. Researchers identified a critical transition regime where standard repair methods begin to fail, highlighting the need to jointly consider pruning allocation and repair strategies. AI

IMPACT Investigates methods to maintain neural network performance after aggressive pruning, crucial for efficient deployment in resource-constrained environments.
- CIFAR-10
- CIFAR-100
- ResNet-50
- Imagenette
- ResNet-34
- ResNet-18
- ImageNet-100
- DenseNet-121
TOOL · arXiv cs.LG English(EN) · 4d

Optimal Guarantees for Auditing R\'enyi Differentially Private Machine Learning

Researchers have developed a new auditing framework for machine learning algorithms that claim Rényi differential privacy (RDP). This framework uses the Donsker-Varadhan (DV) estimator to directly measure Rényi divergence, providing explicit confidence intervals for RDP auditing. The proposed method achieves information-theoretically optimal sample-complexity guarantees and shows empirical improvements over existing black-box methods, particularly for challenging small and moderate Rényi orders. AI

IMPACT Establishes new optimal guarantees for auditing privacy in ML models, potentially improving trust and security in deployed systems.
TOOL · arXiv cs.LG English(EN) · 4d

Attacking the Spike: On the Transferability and Security of Spiking Neural Networks to Adversarial Examples

Researchers have developed a new adversarial attack method called Mixed Dynamic Spiking Estimation (MDSE) specifically for Spiking Neural Networks (SNNs). This attack demonstrates that the effectiveness of white-box adversarial attacks on SNNs is heavily influenced by the choice of surrogate gradient estimator. The MDSE attack is designed to exploit multiple surrogate gradient estimators simultaneously, enabling it to generate adversarial examples that can fool both SNNs and traditional non-SNN models like Vision Transformers and CNNs. AI

IMPACT Introduces a novel attack that can fool both SNNs and traditional neural networks, highlighting security vulnerabilities in energy-efficient AI models.
RESEARCH · arXiv cs.LG English(EN) · 4d · [2 sources]

EnCAgg: Enhanced Clustering Aggregation for Robust Federated Learning against Dynamic Model Poisoning

Researchers have developed a new method called EnCAgg to improve the robustness of federated learning against dynamic model poisoning attacks. This approach uses a small set of known benign clients as references to accurately identify and filter out malicious gradients. The method incorporates density-based clustering in a low-dimensional space and a gradient generator model to reconnect sparse benign gradients, ultimately allowing more legitimate data to participate in the aggregation process. AI

IMPACT Enhances security for federated learning systems, enabling more reliable collaborative model training.
TOOL · arXiv cs.CV English(EN) · 6d

GAMR: Geometric-Aware Manifold Regularization with Virtual Outlier Synthesis for Learning with Noisy Labels

Researchers have developed a new method called GAMR (Geometric-Aware Manifold Regularization) to improve deep neural network performance when trained on datasets with noisy labels. Unlike existing methods that passively filter data, GAMR actively synthesizes virtual outlier samples to create distinct boundaries between data manifolds. This geometric approach enhances the separation between correctly labeled and mislabeled data, leading to more robust feature representations. The technique has shown state-of-the-art results on benchmarks like CIFAR-10, particularly under challenging noise conditions, and also improves out-of-distribution detection capabilities. AI

IMPACT Enhances model robustness and safety in real-world applications by improving performance on noisy datasets.
- CIFAR-10
- Deep neural networks
TOOL · arXiv cs.LG English(EN) · 5d

A New Framework to Analyse the Distributional Robustness of Deep Neural Networks

Researchers have developed a new framework to analyze the distributional robustness of deep neural networks, a key challenge for real-world AI deployment. The framework models interactions between layer weights and activations using Bernoulli distributions, with class separation serving as a proxy for robustness. Experiments on CIFAR-10 and ImageNet demonstrate that the proposed metrics can differentiate between networks that have memorized training data and those that have not, and show that distributional shifts reduce separation. AI

IMPACT Provides new diagnostic tools for understanding and improving the reliability of AI models when faced with changing data distributions.
RESEARCH · arXiv cs.AI English(EN) · 6d · [5 sources]

Trusted Weights, Treacherous Optimizations? Optimization-Triggered Backdoor Attacks on LLMs

Researchers have identified new vulnerabilities in large language models (LLMs) related to optimization techniques used during deployment. One study reveals that compilation processes, intended for efficiency, can be exploited to implant hidden backdoors that trigger under specific compiled conditions, bypassing standard safety checks and achieving high attack success rates on open-source LLMs. Another theoretical paper explores how, counter-intuitively, stronger triggers in backdoor attacks can sometimes aid defenders in high-dimensional settings, with attack success peaking at a finite trigger strength. AI

IMPACT New research highlights critical security vulnerabilities in LLM deployment pipelines, potentially impacting the safety and reliability of AI systems.
TOOL · arXiv cs.NE (Neural & Evolutionary) English(EN) · 4d

SpikingMoE: SDPrompt-Guided Dynamic Expert Fusion in Spiking Neural Networks

Researchers have introduced SpikingMoE, a novel framework that combines Spiking Neural Networks (SNNs) with a Mixture-of-Experts (MoE) architecture. This approach utilizes a spike-driven prompt (SDprompt) for biologically plausible, input-dependent routing of information to different expert modules. Designed for neuromorphic hardware, SpikingMoE aims to enhance energy efficiency in visual recognition tasks while maintaining competitive performance, achieving high accuracy on CIFAR-10 and CIFAR-100 datasets. AI

IMPACT Introduces a new architecture for energy-efficient visual recognition on neuromorphic hardware, potentially impacting specialized AI applications.
RESEARCH · arXiv cs.LG English(EN) · 1w · [11 sources]

Centralized vs Decentralized Federated Learning: A trade-off performance analysis

Researchers are exploring new methods to improve federated learning, a technique for training models across decentralized data sources while preserving privacy. One approach, "Choose Wisely and Privately," uses mutual information and a Potential Federation Loss to proactively select clients whose data maximizes utility and fairness before training begins. Another study introduces a lightweight geometric signal to detect atypical clients by measuring how their local training diverges from the global model's functional behavior. Additionally, new theoretical work establishes general lower bounds for differentially private federated learning protocols and analyzes the trade-offs between centralized and decentralized federated learning architectures. AI

IMPACT These advancements in federated learning could lead to more efficient and secure collaborative AI model training, particularly in scenarios with sensitive or distributed data.
TOOL · arXiv cs.NE (Neural & Evolutionary) English(EN) · 1w

Dual-axis attribution of zebrafish tectal microcircuits for energy-efficient and robust neurocomputing

Researchers have developed a new method to attribute specific computational functions to microcircuits within biological neural networks, using the zebrafish tectal microcircuit as a model. By analyzing signal propagation and simulating network perturbations, they identified distinct subcircuits responsible for energy-efficient processing and robustness. These attributed functions were then integrated into artificial neural networks, demonstrating improved performance under reduced computation and input noise. AI

IMPACT Provides a framework for designing more efficient and robust artificial neural networks by drawing inspiration from biological circuit organization.
- CIFAR-10
- ResNet18