Brief

last 24h

[12/12] 221 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · arXiv cs.AI English(EN) · 3d · [2 sources]

Learning Through Noise: Why Subliminal Learning Works and When It Fails

Researchers have demonstrated that subliminal learning in neural networks, where knowledge is transferred via task-unrelated data, is primarily governed by compatible output heads rather than shared model initialization. By splitting outputs into auxiliary and class heads, they showed that compatible auxiliary heads facilitate the transfer of teacher signals, improving student model representations. This mechanism allows students trained on noise to achieve performance comparable to teachers, providing a theoretically grounded understanding of subliminal learning and its limitations. AI

IMPACT Explains a novel mechanism for knowledge transfer in neural networks, potentially improving training efficiency and model performance.
- MNIST
- Vincent Christoph Brockers
RESEARCH · arXiv stat.ML English(EN) · 1w · [2 sources]

Federated Martingale Posterior Samping

Researchers have introduced Federated Martingale Posterior (FMP) sampling, a novel protocol for federated Bayesian neural networks. This method addresses the difficulty of specifying priors in large models by using a predictive distribution and refitting. FMP sampling allows clients to upload data embeddings, enabling the server to run the predictive sampler centrally, thus avoiding the need to share local datasets. Experiments on standard datasets demonstrate that FMP closely matches centralized performance and offers improved calibration compared to existing consensus methods. AI

IMPACT Introduces a more efficient and calibrated approach for training Bayesian neural networks in federated settings, potentially improving privacy and accuracy.
TOOL · arXiv cs.AI English(EN) · 1w

An Assessment of Human vs. Model Uncertainty in Soft-Label Learning and Calibration

Researchers have developed a new method to assess the uncertainty of AI models compared to human judgment in soft-label learning. Their work disentangles the benefits of human soft-labels from the correction of mislabeled data, revealing that human soft-labels improve model calibration and promote stable convergence. The study utilized MNIST and a synthetic dataset, demonstrating that models trained with human soft-labels better mirror human uncertainty than those trained with synthetic labels. AI

IMPACT Provides a diagnostic tool for aligning AI uncertainty with human judgment, crucial for developing more trustworthy AI systems.
- AI
- MNIST
- human
TOOL · arXiv cs.LG English(EN) · 3d

An Improved Adaptive PID Optimizer with Enhanced Convergence and Stability for Deep Learning

Researchers have developed a new optimization algorithm called IAdaPID-ADG, designed to improve the convergence and stability of deep learning models. This novel optimizer integrates concepts from AMSGrad and DiffGrad, specifically a non-increasing effective learning rate and a gradient difference modulation factor, to address limitations inherited from the widely used Adam optimizer. Evaluations on benchmark and real-world datasets demonstrated that IAdaPID-ADG significantly outperforms existing optimizers. AI

IMPACT Introduces a novel optimization algorithm that could lead to faster and more reliable training of deep learning models.
- Adam
- MNIST
- CIFAR10
- AMSGrad
- DiffGrad
- AnnoCerv
- IAdaPID-ADG
TOOL · arXiv cs.AI English(EN) · 3d

Causal Unlearning in Collaborative Optimization: Exact and Approximate Influence Reversal under Adversarial Contributions

Researchers have developed a new method called HF-KCU to efficiently remove a client's data contribution from federated learning models, addressing the computational burden of retraining. This approach approximates the influence function using Krylov subspace iterations, significantly reducing complexity and speeding up the process. A causal weighting mechanism ensures that only clients affected by the data deletion are updated, preserving model quality and enhancing privacy restoration, as demonstrated by membership inference attack success rates matching a retrained model. AI

IMPACT Enables more efficient and privacy-preserving data deletion in federated learning systems.
- MNIST
- CIFAR-10
- Fashion-MNIST
- Federated learning
- ResNet-18
- HF-KCU
- ViT-Lite
TOOL · arXiv cs.LG English(EN) · 3d

Optimal Guarantees for Auditing R\'enyi Differentially Private Machine Learning

Researchers have developed a new auditing framework for machine learning algorithms that claim Rényi differential privacy (RDP). This framework uses the Donsker-Varadhan (DV) estimator to directly measure Rényi divergence, providing explicit confidence intervals for RDP auditing. The proposed method achieves information-theoretically optimal sample-complexity guarantees and shows empirical improvements over existing black-box methods, particularly for challenging small and moderate Rényi orders. AI

IMPACT Establishes new optimal guarantees for auditing privacy in ML models, potentially improving trust and security in deployed systems.
TOOL · arXiv cs.LG English(EN) · 3d

Q-PhotoNAS: Hybrid Quantum Neural Architecture Search Framework on Photonic Devices

Researchers have developed Q-PhotoNAS, a novel framework for designing hybrid quantum-classical neural network architectures specifically for photonic devices. This system uses a genetic algorithm to automatically search for optimal configurations, considering both classical and quantum components. When tested on image classification tasks like Digits and MNIST, Q-PhotoNAS achieved high accuracies of 99.44% and 98.78% respectively, with projected fast inference times on photonic hardware. AI

IMPACT Automated architecture search for photonic quantum systems could accelerate the development and deployment of quantum AI applications.
RESEARCH · arXiv stat.ML English(EN) · 6d · [2 sources]

High-dimensional ridge regression with random features for non-identically distributed data with a variance profile

Two recent arXiv preprints explore high-dimensional ridge regression for non-identically distributed data, moving beyond standard assumptions of independent and identically distributed samples. The papers introduce variance profile models to analyze the predictive risk of ridge estimators, particularly focusing on the double descent phenomenon. Researchers used tools from random matrix theory and operator-valued free probability to derive asymptotic equivalents for risk and degrees of freedom, with numerical experiments validating their findings and highlighting how heterogeneous variance profiles can alter generalization behavior. AI

IMPACT These papers advance theoretical understanding of regression models, potentially informing future AI development by clarifying generalization properties under non-standard data distributions.
RESEARCH · arXiv cs.LG English(EN) · 4d · [2 sources]

EnCAgg: Enhanced Clustering Aggregation for Robust Federated Learning against Dynamic Model Poisoning

Researchers have developed a new method called EnCAgg to improve the robustness of federated learning against dynamic model poisoning attacks. This approach uses a small set of known benign clients as references to accurately identify and filter out malicious gradients. The method incorporates density-based clustering in a low-dimensional space and a gradient generator model to reconnect sparse benign gradients, ultimately allowing more legitimate data to participate in the aggregation process. AI

IMPACT Enhances security for federated learning systems, enabling more reliable collaborative model training.
RESEARCH · arXiv cs.LG English(EN) · 1w · [11 sources]

Centralized vs Decentralized Federated Learning: A trade-off performance analysis

Researchers are exploring new methods to improve federated learning, a technique for training models across decentralized data sources while preserving privacy. One approach, "Choose Wisely and Privately," uses mutual information and a Potential Federation Loss to proactively select clients whose data maximizes utility and fairness before training begins. Another study introduces a lightweight geometric signal to detect atypical clients by measuring how their local training diverges from the global model's functional behavior. Additionally, new theoretical work establishes general lower bounds for differentially private federated learning protocols and analyzes the trade-offs between centralized and decentralized federated learning architectures. AI

IMPACT These advancements in federated learning could lead to more efficient and secure collaborative AI model training, particularly in scenarios with sensitive or distributed data.
TOOL · arXiv cs.NE (Neural & Evolutionary) English(EN) · 1w

Thermodynamic Networks: Harnessing Non-Equilibrium Steady States for Computation

Researchers have developed a new framework called thermodynamic networks that uses physics-based computation through non-equilibrium steady states. These networks leverage the exchange of conserved quantities between reservoirs to encode computational problem solutions. The presence of Negative Differential Conductance (NDC) is identified as crucial for computational expressivity, enabling universal function approximation, while its absence limits computations to monotonic functions. The approach has been demonstrated on quantum dot and enzymatic reaction networks, achieving high performance on benchmarks like sine function approximation and MNIST digit classification. AI

IMPACT Introduces a novel physics-based computation method that could lead to new AI hardware architectures.
RESEARCH · arXiv cs.NE (Neural & Evolutionary) English(EN) · 2w · [2 sources]

Energy-Efficient Implementation of Spiking Recurrent Cells on FPGA

Two new research papers detail advancements in energy-efficient Spiking Neural Networks (SNNs) implemented on Field-Programmable Gate Arrays (FPGAs). The first paper introduces SPIKER-LL, an FPGA accelerator designed for adaptive local learning in SNNs, achieving high accuracy with minimal energy consumption. The second paper presents an FPGA implementation of Spiking Recurrent Cells, demonstrating a balance between biological plausibility and hardware efficiency, with results showing competitive accuracy and reduced energy usage. AI

IMPACT These FPGA implementations offer a path to more energy-efficient AI at the edge by optimizing Spiking Neural Networks for hardware.