Brief

last 24h

[4/4] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI Norsk(NO) · 8h

OnDeFog: Online Decision Transformer under Frame Dropping

Researchers have introduced OnDeFog, an advancement in reinforcement learning designed to handle frame dropping, a common issue in real-world applications due to communication delays or sensor failures. This new method integrates the frame-dropping mitigation techniques of DeFog with the online learning capabilities of an online decision transformer (ODT). Experimental results show that OnDeFog surpasses ODT in environments with high frame-dropping rates and outperforms DeFog when dealing with datasets containing substantial amounts of low-reward data. AI

IMPACT Improves reinforcement learning agent performance in scenarios with unreliable data transmission.
RESEARCH · arXiv cs.AI English(EN) · 1d · [2 sources]

Neural Additive and Basis Models with Feature Selection and Interactions

Researchers have developed new neural additive and basis models that incorporate feature selection to improve computational efficiency and model size. These models, proposed by Shinichi Shirakawa, build upon generalized additive models (GAMs) by using neural networks as nonlinear shape functions, offering high interpretability and visualization of feature contributions. The introduction of a feature selection layer addresses the computational bottlenecks previously encountered when dealing with feature interactions or high-dimensional datasets, enabling more efficient training and smaller model sizes while maintaining comparable or better performance than existing GAMs. AI

IMPACT These models offer a more interpretable and computationally efficient approach to deep learning, potentially improving the usability of complex models in various applications.
RESEARCH · arXiv cs.AI English(EN) · 3d · [3 sources]

Post-Hoc Merging is Not Enough: Many-Shot Model Merging with Loss-Gap Balancing

Researchers are developing new methods for optimizing model merging, a technique that combines the capabilities of multiple specialized AI models into a single, more powerful one. One approach focuses on creating surrogate benchmarks to efficiently tune merging hyperparameters, reducing the computational cost associated with large language models. Another method, PACT, addresses limitations in existing task-vector-based merging by preserving critical knowledge embedded in pre-trained weights, leading to improved performance across various benchmarks. A third technique, METIS, tackles information erasure in post-hoc merging by employing an iterative, loss-aware many-shot merging protocol to enhance multi-task performance. AI

IMPACT These advancements in model merging could lead to more efficient and capable AI systems by combining specialized models without extensive retraining.
RESEARCH · arXiv cs.LG English(EN) · 4d · [4 sources]

One-Step Generalization Ratio Guided Optimization for Domain Generalization

Researchers are exploring new methods for domain generalization (DG) and open domain generalization (ODG) in machine learning. One study demonstrates that simple DG methods like CORAL and MMD can be competitive with more complex approaches for ODG, and proposes extensions that maintain performance with lower computational costs. Another paper introduces an anti-causal setting for DG, leveraging unlabeled data by penalizing model sensitivity to covariate variations. Additionally, a new optimizer called GENIE is proposed, which uses the One-Step Generalization Ratio to balance parameter updates and promote learning of domain-invariant features, outperforming existing methods. AI

IMPACT These research papers explore advanced techniques for making AI models more robust to variations in data, potentially leading to more reliable AI systems in diverse real-world scenarios.

Brief

OnDeFog: Online Decision Transformer under Frame Dropping

Neural Additive and Basis Models with Feature Selection and Interactions

Post-Hoc Merging is Not Enough: Many-Shot Model Merging with Loss-Gap Balancing

One-Step Generalization Ratio Guided Optimization for Domain Generalization