Brief

last 24h

[50/76] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

RESEARCH · arXiv cs.LG English(EN) · 6h · [2 sources]

Not All Retrievals are Useful: Cross-Attention for Input-Aware RAG in Time Series Forecasting

Two new research papers explore advancements in retrieval-augmented generation (RAG) for time series forecasting. The first paper introduces SERAF, a framework that uses both time series similarity and textual descriptions for retrieval, demonstrating improved forecasting accuracy across multiple datasets. The second paper, Cross-RAG, addresses the issue of irrelevant retrieved data by employing cross-attention to focus on query-relevant samples, showing improved stability and performance across various RAG methods and forecasting models. AI

IMPACT These papers introduce novel techniques to improve the accuracy and stability of AI models in time series forecasting by enhancing how external knowledge is integrated.
RESEARCH · arXiv cs.AI English(EN) · 6h · [3 sources]

A Comprehensive Survey of Medical Image Segmentation: Challenges, Benchmarks, and Beyond

Three new research papers explore advancements in medical image segmentation, a critical field for clinical diagnostics. The first paper provides a comprehensive survey of the field, detailing datasets, methods based on U-Net, Transformer, and SAM architectures, and challenges. The second introduces K-Prism, a unified framework that integrates semantic priors, few-shot examples, and interactive feedback for universal segmentation across various modalities. The third paper, HadBalance, proposes a plug-and-play framework that uses geometric priors derived from Hadwiger's theorem, balanced with a conflict-aware objective to maintain accuracy on shape-heterogeneous data. AI

IMPACT These advancements in medical image segmentation could lead to more accurate diagnoses and personalized treatment plans.
- U-Net
- Hadwiger Shape Priors
- Bangwei Guo
- ScienceCast
- Gotit.pub
- DagsHub
- CatalyzeX Code Finder for Papers
- alphaXiv
- Hugging Face
- GitHub
- arXiv
- HadBalance
- K-Prism
- SAM
- Transformer
RESEARCH · arXiv cs.CV English(EN) · 6h · [3 sources]

Variational Deep Unfolding with Mamba-Based Nonlocal Modeling for Underwater Image Enhancement

Researchers have developed new methods for enhancing underwater images, addressing issues like poor visibility, color distortion, and blur. One approach utilizes a deep unfolding network incorporating Mamba layers to capture scene similarities and a proximal trajectory loss for consistency. Another method employs transfer learning and physics-based decomposition, leveraging prior knowledge from other vision tasks without requiring paired labels. A third framework uses a dual-branch system to jointly optimize image enhancement and object detection, improving clarity and color accuracy for downstream tasks. AI

IMPACT These advancements in underwater image enhancement could improve the performance of AI systems in marine research, exploration, and surveillance.
- CORE Recommender
- Influence Flower
- ScienceCast
- Hugging Face
- Gotit.pub
- DagsHub
- CatalyzeX Code Finder for Papers
- alphaXiv
- Mamba
- arXiv
- YOLOv8
- UIEB
- URPC
- arXivLabs
RESEARCH · arXiv cs.LG English(EN) · 6h · [2 sources]

Discrimination-free Insurance Pricing with Privatized Sensitive Attributes

Two research papers explore novel approaches to fairness in insurance pricing, addressing the tension between actuarial and solidarity fairness. The first paper introduces an \"alpha-Fair Individual Solvent Premium\" ($\alpha$-FISP) framework, which allows for a tunable continuum between actuarially fair and solidarity-based pricing while ensuring solvency. The second paper focuses on discrimination-free pricing by using privatized sensitive attributes, enabling fair pricing even when direct access to sensitive data like gender or race is restricted due to privacy or regulatory concerns. AI

IMPACT These papers introduce novel algorithmic frameworks for insurance pricing that balance fairness and solvency, potentially influencing future actuarial practices and regulatory approaches.
TOOL · arXiv cs.AI English(EN) · 6h

A Self Consistency Based Reranking for Narrative Question Answering

Researchers have developed a novel self-ensemble framework to improve narrative question answering (NQA) by reranking multiple generated answers. This approach enhances robustness by selecting answers based on semantic agreement, without altering the core model architecture. Experiments on the NarrativeQA dataset showed significant performance gains across various models, including FLAN-T5 and Pegasus-Large, with Pegasus-Large seeing a notable increase of over 14%. AI

IMPACT Enhances robustness of language models for complex question answering tasks.
TOOL · arXiv cs.CL English(EN) · 6h

When the Same Musical Knowledge Forgets Differently: A Clean Probe of Pathway-Dependent Forgetting

Researchers have investigated how the acquisition route of knowledge in multimodal AI models affects its susceptibility to forgetting. Using the musical piece "Für Elise" as a test case, they found that knowledge acquired through text descriptions is forgotten more readily than knowledge acquired through audio input, even under identical adaptation pressures. This phenomenon, termed pathway-dependent forgetting, was observed across various audio-language models and was robust to different experimental controls, suggesting that the input representation, rather than architectural depth, is a key factor. AI

IMPACT Suggests a new dimension for designing multimodal AI systems by considering how knowledge is acquired to improve retention.
TOOL · arXiv cs.CL English(EN) · 6h

Re-feeding Is Not Replaying: Measuring Replay Noise in Counterfactual Token-Credit Estimation

A new paper from arXiv explores the reliability of counterfactual token-credit estimation in language models. The research highlights that re-feeding the transcript prefix as a fresh prompt, a common method, can introduce significant noise compared to resuming from the verified decode-time KV state. This noise can alter credit estimates, particularly at low-margin decision tokens, and impacts the selection of critical tokens. The study suggests that using batch-invariant kernels or resuming decoder state is crucial for more accurate credit estimation, and recommends reporting a replica floor to account for inherent noise in single-sample measurements. AI

IMPACT Highlights potential unreliability in current methods for attributing model outputs to specific tokens, impacting research into model interpretability.
TOOL · arXiv cs.AI English(EN) · 6h

A Multi-level Analysis of Factors Associated with Student Performance: A Machine Learning Approach to the SAEB Microdata

Researchers have developed a multi-level machine learning model to analyze student performance using microdata from Brazil's System of Assessment of Basic Education (SAEB). The study integrated data on student socioeconomic status, teacher profiles, school indicators, and principal management. A Random Forest model achieved 90.2% accuracy and an AUC of 96.7%, outperforming other ensemble algorithms. Explainable AI (XAI) techniques revealed that the school's average socioeconomic level is the most significant predictor of student performance, highlighting the systemic nature of academic achievement. AI

IMPACT Provides an interpretable tool for policymakers to address educational disparities by identifying systemic factors influencing student performance.
TOOL · arXiv cs.AI English(EN) · 6h

An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process

A new study on arXiv investigates the reuse of pre-trained deep learning models (PTMs) within the scientific process, particularly in natural sciences. The research quantifies PTM utilization across 17,718 open-access papers, finding that "Biochemistry, Genetics and Molecular Biology" leads in PTM reuse. The study identifies "adaptation" as the most common reuse pattern and highlights the "testing" stage of the scientific process as most impacted by PTM integration. AI

IMPACT Demonstrates growing reliance on pre-trained models in scientific research, potentially lowering barriers to entry for complex analyses.
TOOL · arXiv cs.AI English(EN) · 6h

Evaluation of Alternative-Based Information Systems for Deliberative Polling using an Agentic Simulator

A new paper introduces the Agentic Bipolar Argumentation Simulator (ABAS) to evaluate information systems for deliberative polling. ABAS uses LLM-based agents to simulate voter behavior, including opinion formation, justification selection, and argumentation linking. The research addresses the 'coverage problem' in ensuring voters encounter a representative sample of arguments, particularly in adversarial scenarios, and proposes a framework to formalize polling as a six-tuple of justifications and relations. AI

IMPACT Introduces a novel simulation framework using LLM agents to address challenges in large-scale deliberative polling.
TOOL · arXiv cs.AI English(EN) · 6h

From Correlation to Causation in Lane Change Prediction for Automated Driving: A Causal Explanation Framework

A new framework for lane-change prediction in automated driving systems has been developed, moving beyond simple correlation to incorporate causal inference. This approach uses deep structural causal modeling and intervention-based analysis to not only predict maneuvers with over 95% F1-score in the seconds leading up to an event but also to explain the causal reasoning behind these predictions. The system identifies direct contributors, their upstream influences, and the causal chains involved, offering a more interpretable mechanism for understanding vehicle behavior. AI

IMPACT Introduces a more interpretable and robust method for autonomous driving systems to predict and explain maneuvers.
TOOL · arXiv cs.AI English(EN) · 6h

Orcheo: A Modular Full-Stack Platform for Conversational Search

Researchers have introduced Orcheo, an open-source platform designed to streamline the development and deployment of conversational search systems. The platform addresses challenges in sharing research contributions and deploying prototypes by offering a modular architecture, production-ready infrastructure with AI coding support, and a starter kit with over 45 pre-built components. Orcheo aims to facilitate reproducibility and ease of use for conversational search research. AI

IMPACT Accelerates research and development in conversational search systems by providing a unified, modular framework.
TOOL · arXiv cs.AI English(EN) · 6h

Parallel Test-Time Scaling with Multi-Sequence Verifiers

Researchers have developed a new method called the Multi-Sequence Verifier (MSV) to improve the performance and reduce the latency of large language models. MSV addresses two key bottlenecks in parallel test-time scaling: accurately selecting the best solution from multiple candidates and the high inference latency. By conditioning each candidate's correctness on the entire set of generated solutions, MSV achieves better calibration, leading to improved answer selection and enabling an early-stopping framework that halves latency while maintaining accuracy on mathematical reasoning benchmarks. AI

IMPACT Enhances LLM efficiency by improving solution selection and reducing inference time on complex reasoning tasks.
TOOL · arXiv cs.AI English(EN) · 6h

Sensor-Conditioned Representation Learning via Scene-Relevant Observation Quotients

Researchers have introduced a new framework called Observation-Quotient Tucker-Structured Autoencoding (OQ-TSAE) for learning representations in intelligent sensing systems. This framework aims to ensure that learned representations accurately reflect scene distinctions supported by sensor data while filtering out variations caused by nuisance factors. Experiments on a benchmark dataset demonstrated that OQ-TSAE improves representation correctness diagnostics compared to existing methods, and a variant of OQ-TSAE also showed competitive downstream utility and robustness in real-world radar experiments. AI

IMPACT Enhances the accuracy and interpretability of AI systems that rely on sensor data by ensuring representations are grounded in observable scene distinctions.
TOOL · arXiv cs.AI English(EN) · 6h

NEXUS: Neural Energy Fields for Physically Consistent Contact-Rich 3D Object Dynamics

Researchers have introduced NEXUS, a novel neural energy-field framework designed to model physically consistent contact-rich 3D object dynamics. Unlike previous methods that often model isolated physical effects, NEXUS composes conservative and non-conservative dynamics by representing objects as structural graphs and constructing dynamic contact graphs. The framework formulates motion through scalar energy and dissipation terms, inspired by Hamiltonian Neural Networks, allowing for additive composition of conservative effects like gravity and elastic deformation, and learned modeling of non-conservative effects such as damping and impact-induced energy loss. NEXUS has demonstrated improved long-horizon accuracy on trajectory benchmarks and shows promise for guiding contact-rich video generation with enhanced physical plausibility and visual quality. AI

IMPACT Introduces a new framework for physically consistent 3D object dynamics, potentially improving realism in physics-grounded video generation.
TOOL · arXiv cs.AI English(EN) · 6h

Visual-Seeker: Towards Visual-Native Multimodal Agentic Search via Active Visual Reasoning

Researchers have introduced Visual-Seeker, a novel agent designed for multimodal deep search that prioritizes visual information. Unlike previous methods that treat vision as static input, Visual-Seeker actively engages with fine-grained visual details throughout the search process. This approach aims to enhance multi-hop, cross-modal reasoning in complex web environments. The system has demonstrated state-of-the-art performance on five multimodal search benchmarks, outperforming some proprietary models. AI

IMPACT Enhances multimodal search capabilities by prioritizing active visual reasoning over static image inputs.
TOOL · arXiv cs.LG English(EN) · 6h

DAL: A Practical Prior-Free Black-Box Framework for Piecewise Stationary Bandits

Researchers have introduced Detection Augmented Learning (DAL), a new framework designed for piecewise stationary bandits that does not require prior knowledge of non-stationarity. DAL functions by integrating any existing stationary bandit algorithm with a change detector, thereby extending its applicability to a wide range of bandit problems. Empirical results across various synthetic and real-world datasets indicate that DAL consistently outperforms current state-of-the-art methods, demonstrating its effectiveness and scalability. AI
TOOL · arXiv cs.AI English(EN) · 6h

Towards Next-Generation Healthcare: A Survey of Medical Embodied AI for Perception, Decision-Making, and Action

A new survey paper published on arXiv explores the integration of embodied artificial intelligence (AI) into healthcare. The paper highlights the limitations of current foundation models in perceiving and interacting with the physical world, which is crucial for real-world clinical applications. It proposes embodied AI as a solution for agents to operate effectively in complex medical environments by coordinating perception, decision-making, and action. The survey also reviews existing medical applications, datasets, and challenges, while outlining future research directions. AI

IMPACT This survey highlights the potential of embodied AI to overcome limitations of current foundation models in physical interaction for healthcare applications.
TOOL · arXiv cs.AI English(EN) · 6h

Constitutional Value Potentials: reading and steering internal priority margins in language models

Researchers have developed a new method called Constitutional Value Potentials (CVP) to read and steer the internal priorities of language models. CVP learns a scalar potential for each value from a model's hidden state, indicating its internal pressure to preserve that value. This allows for the identification of priority margins, which are crucial for understanding how models handle value conflicts. The system predicts conflict violations with high accuracy and can generalize across different model scales, suggesting that these priorities are accessible within the model's activation space rather than solely through output behavior. AI

IMPACT Enables deeper understanding and control over LLM value alignment, potentially improving safety and reliability.
TOOL · arXiv cs.AI English(EN) · 6h

Exploiting Search in Symbolic Numeric Planning with Patterns

Researchers have introduced a novel procedure for numeric planning that leverages Symbolic Pattern Planning (SPP). This method involves dynamically recomputing and refining patterns to guide the search for intermediate states closer to a goal state. The procedure is proven correct and complete under specific conditions, offering different strategies for exploring the search space. AI

IMPACT Introduces a novel procedure for numeric planning that could enhance AI search capabilities in complex problem-solving scenarios.
TOOL · arXiv cs.CL English(EN) · 6h

Calibrated Triage, Not Autonomy: Confidence Estimation for Medical Vision-Language Models

A new research paper explores the effectiveness of confidence estimation for medical vision-language models (LVLMs). The study found that while LVLMs can generate fluent and confident answers, they often do so without accurately using the provided medical images, relying instead on language priors. This can lead to trustworthy-looking but incorrect diagnoses. The research evaluated seven confidence estimators across five open-weight LVLMs on three medical datasets, concluding that a calibrated confidence score is crucial for safe deployment, enabling models to triage cases rather than operate autonomously. The findings suggest that current confidence signals are insufficient for full autonomy, highlighting the need for models to abstain from cases where confidence is low. AI

IMPACT Highlights the critical need for reliable confidence scores in medical AI to ensure safe deployment and prevent autonomous decision-making in high-stakes scenarios.
TOOL · arXiv cs.AI English(EN) · 6h

Human Cognition in Machines: A Unified Perspective of World Models

A new arXiv paper proposes a unified framework for world models in AI, drawing parallels to human cognition. The paper, authored by Timothy Rupprecht, identifies gaps in current research, particularly in motivation and metacognition, and suggests future research directions informed by active inference and global workspace theory. It also introduces a new category of 'epistemic world models' for AI agents involved in scientific discovery. AI

IMPACT Proposes a new taxonomy for AI world models, highlighting under-researched areas like motivation and metacognition.
TOOL · arXiv cs.AI English(EN) · 6h

AgenticRec: A Recommendation-Oriented Agentic Framework with Progressive Tool-Integrated Reasoning Optimization

Researchers have introduced AgenticRec, a new framework designed to enhance recommender agents built on large-language models. This framework addresses the common issue of misalignment between an agent's reasoning processes and recommendation feedback, which can limit its ability to understand nuanced user preferences. AgenticRec employs a two-stage training approach: Recommendation-Oriented Trajectory Activation for optimizing implicit feedback, followed by Progressive Preference Refinement for sharpening preference boundaries through bidirectional reasoning. Experiments indicate that AgenticRec effectively improves recommender agent performance. AI

IMPACT Enhances LLM-based recommender agents by improving preference alignment and reasoning capabilities.
TOOL · arXiv cs.CV English(EN) · 6h

Multi-view feature High-order Fusion for Space Weak Object Detection and Segmentation

Researchers have developed a novel multi-view feature high-order fusion (MHF) method to improve the detection and segmentation of weak objects in space imagery. This approach extends traditional low-order feature fusion to higher orders, enhancing the model's ability to capture complementary information by introducing high-order multi-view feature perception and a recursive task-contribution gated selection mechanism. The MHF method is designed as a flexible, plug-and-play module compatible with various vision models, and has demonstrated state-of-the-art performance on newly constructed space science datasets and an open satellite video dataset. AI

IMPACT This new fusion method could significantly improve the accuracy of object detection and segmentation in space science applications.
TOOL · arXiv cs.LG English(EN) · 6h

Communication-Efficient Distributed Training for Collaborative Flat Optima Recovery in Deep Learning

Researchers have developed a new distributed training algorithm called Distributed Pull-Push Force (DPPF) designed to improve communication efficiency and model generalization in deep learning. DPPF incorporates a novel sharpness measure, Inverse Mean Valley, to encourage collaborative seeking of wide minima in the loss landscape. Empirical results show DPPF outperforms existing communication-efficient methods and achieves superior generalization compared to local gradient and synchronous gradient averaging techniques. AI

IMPACT This new algorithm could lead to more efficient and better-generalizing deep learning models through improved distributed training techniques.
TOOL · arXiv cs.CV English(EN) · 6h

ST-DiffEye: Diffusion-based Continuous Gaze Generation via Joint Scanpath-Trajectory Modeling

Researchers have developed ST-DiffEye, a novel diffusion framework for generating human gaze patterns. This model uniquely integrates both continuous eye-tracking trajectories and discrete scanpaths, treating gaze variability as a core feature rather than noise. The framework utilizes a joint modeling approach by concatenating these modalities as an input channel, requiring minimal architectural changes. An accompanying evaluation framework based on the Continuous Ranked Probability Score (CRPS) is also introduced to assess both accuracy and diversity of generated gaze. AI

IMPACT This research advances generative models for human behavior analysis, potentially impacting fields like HCI and user experience research.
TOOL · arXiv cs.CV English(EN) · 6h

City landscape in sight: A crowdsourced framework for unlocking urban-scale window view perceptions from real estate imagery

Researchers have developed a crowdsourced framework to analyze urban-scale window view perceptions using real estate imagery from Wuhan, China. The study collected over 27,000 pairwise comparisons from 300 participants on six perceptual dimensions, which were then used to train a hybrid neural network model. This model predicts human perceptions and maps their spatial distribution across the city, revealing that floor level and the composition of the view (sky, trees, buildings) significantly influence preferences, with non-linear effects observed. AI

IMPACT Provides a novel method for urban planning and real estate by quantifying visual preferences from residential window views.
TOOL · arXiv cs.CV English(EN) · 6h

Texture-Shape Bias Balancing for Robust Synthetic-to-Real Semantic Segmentation in Automotive NIR Imagery

Researchers have introduced a new framework for improving semantic segmentation in automotive near-infrared (NIR) imagery by addressing the domain gap between synthetic and real-world data. Their approach, called Target Style Adaptation (TSA), uses a fine-tuned latent diffusion model to transform synthetic images into realistic NIR-style variants. Additionally, a Voronoi-based Style Diversification (VSD) strategy is employed to reduce texture bias while preserving geometric information. Experiments demonstrated significant improvements in segmentation robustness, reducing the domain gap by up to 63.6% on exterior data and 28.4% on interior data. AI

IMPACT Enhances robustness of automotive perception systems in challenging lighting conditions.
TOOL · arXiv cs.CV English(EN) · 6h

SiGnature: Explicit Motion Diffusion for Stylized Semantic Gesture

Researchers have introduced SiGnature, a novel framework for generating stylized and semantic gestures that synchronize with speech. This system operates in an explicit joint-rotation space, allowing for the integration of external motion sequences, particularly semantic gestures, directly into the diffusion process without retraining. The framework's Joint Motion Integration (JMI) mechanism identifies active joints for semantic actions while the diffusion backbone handles posture and flow, preserving the speaker's unique style. AI

IMPACT Enables more natural and expressive AI-generated characters by improving the synchronization and semantic meaning of gestures.
TOOL · arXiv cs.LG English(EN) · 6h

Anomaly Detection via Mean Shift Density Enhancement

Researchers have introduced Mean Shift Density Enhancement (MSDE), a novel unsupervised anomaly detection framework designed for robustness across various anomaly types and noisy conditions. MSDE operates by analyzing how samples shift under density enhancement, with normal samples remaining stable while anomalous ones move significantly towards density modes. Evaluations on a benchmark of 46 datasets demonstrated MSDE's consistently strong and balanced performance compared to 13 established baselines, highlighting displacement-based scoring as a robust alternative. AI
TOOL · arXiv cs.AI English(EN) · 6h

Automated 3D Kinematic Monitoring for Circadian Activity and Anomaly Detection in Juvenile Fish

Researchers have developed a novel 3D behavioral phenotyping framework for juvenile fish, integrating deep learning with binocular stereo vision. This system automates non-contact body length estimation and reconstructs precise 3D swimming trajectories, enabling the quantification of true physical swimming speeds for the first time. The framework establishes circadian locomotor baselines and serves as an early warning system for physiological stress in high-density aquaculture environments. AI

IMPACT Enables precise, automated behavioral analysis in aquaculture, potentially improving fish health monitoring and breeding practices.
TOOL · arXiv cs.LG English(EN) · 6h

Discovering Subgroups with Exceptional Survival Characteristics

Researchers have developed Sysurv, a novel non-parametric and fully differentiable method for identifying subgroups with distinct survival characteristics. Unlike existing approaches that rely on restrictive assumptions or pre-discretized features, Sysurv can uncover human-readable rules that select these subgroups. Empirical evaluations, including a case study on cancer data, demonstrate Sysurv's ability to reveal insightful and actionable survival subgroups, surpassing current state-of-the-art methods. AI

IMPACT This new method could enhance predictive modeling in fields like medicine and engineering by identifying specific subgroups with unique survival or failure characteristics.
TOOL · arXiv stat.ML English(EN) · 6h

Optimality in importance sampling: a gentle survey

This paper provides a comprehensive review of optimality within importance sampling techniques, a critical component for the performance of Monte Carlo sampling methods. It explores various frameworks for designing adaptive proposal densities, including marginal likelihood approximation for model selection, the use of multiple proposal densities, and sequences of tempered posteriors. The survey also delves into applications in noisy scenarios such as approximate Bayesian computation and reinforcement learning, offering theoretical and empirical comparisons. AI

IMPACT Provides a theoretical foundation for advanced sampling techniques used in AI research.
RESEARCH · arXiv cs.CV English(EN) · 16h · [2 sources]

MeshLoom: Feed-Forward Non-Rigid Registration of Mesh Sequences

Researchers have introduced MeshLoom, a novel feed-forward registration network designed for non-rigid mesh sequences. This approach bypasses the limitations of traditional methods, such as costly per-instance optimization and restricted object categories, by directly reconstructing vertex deformations. MeshLoom is efficient, processing multiple meshes in seconds, and utilizes a topology-aware encoder-decoder architecture that fuses anchor-mesh topology with frame-specific cues like shape latents and image features. The network achieves state-of-the-art results in non-rigid registration and can also be applied to motion interpolation and mesh morphing. AI

IMPACT Introduces a novel network architecture for efficient non-rigid mesh registration, potentially improving applications in computer vision and graphics.
RESEARCH · arXiv cs.LG English(EN) · 17h · [2 sources]

Task-Error Residual Learning for Real-Robot Five-Ball Juggling

Researchers have developed a novel method called Task-Error Residual Learning to enable robots to perform complex tasks like five-ball juggling. This approach leverages directional task error, which provides more information than standard scalar rewards, to improve sample efficiency. By combining directional feedback with an informative prior, the system can achieve stable juggling with minimal attempts, significantly outperforming the years of practice typically required for humans. AI
RESEARCH · arXiv stat.ML English(EN) · 17h · [2 sources]

Sobolev Approximation by Fixed-Size Neural Networks with Arbitrary Accuracy

Researchers have introduced new activation functions, the Elementary Universal Activation Function (EUAF) and Differentiable Universal Activation Functions (DUAF), designed to enable fixed-size neural networks to achieve arbitrary-accuracy Sobolev approximation. The study demonstrates that functions within $W^{s, ext{inf}}((a,b)^d)$ can be approximated with arbitrary accuracy in the $W^{s-1, ext{inf}}$-norm using networks with these novel activations. Explicit bounds for network width and depth are provided, and sigmoidal variants of DUAF are also explored. AI

IMPACT Introduces novel activation functions that could enhance the approximation capabilities of fixed-size neural networks.
RESEARCH · arXiv stat.ML English(EN) · 18h · [2 sources]

Optimal Multiscale Learning of Linear Operators

Researchers have established the statistical and computational limits for learning bounded linear operators between Sobolev spaces using noisy input-output data. The problem is reframed as an infinite-dimensional matrix regression with a complex multiscale structure. A novel blockwise least-squares estimator has been developed that achieves optimal rates and computational efficiency by adapting sample sizes to different scales. AI
RESEARCH · arXiv cs.AI English(EN) · 19h · [2 sources]

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

Researchers have introduced ATOM-Bench, a new real-world benchmark designed to evaluate the atomic skills and compositional generalization capabilities of robotic manipulation policies. The benchmark includes 30 atomic tasks and 24 held-out compositional tasks, utilizing 3,000 human demonstrations for fine-tuning and evaluation. Initial tests on five representative policies revealed that while current models can grasp basic instruction-grounding, they struggle with fine-grained motor skills and reliably composing learned skills for novel tasks. AI

IMPACT This benchmark aims to improve the real-world generalization of robotic manipulation policies, addressing a key challenge in AI for robotics.
RESEARCH · arXiv cs.IR (Information Retrieval) English(EN) · 19h · [2 sources]

Understanding the Behaviors of Environment-aware Information Retrieval

Researchers have published a paper detailing a new method for improving retrieval-augmented generation (RAG) systems by teaching large language models (LLMs) to adapt their query formulation strategies for different information retrievers. Using reinforcement learning (RL), the study demonstrates that LLMs can learn to tailor queries to specific retriever characteristics, revealing distinct optimal query styles for various retrievers. The research also suggests that performance can be further enhanced by incorporating retriever-specific human guidance and by scaling model size, with a new branching-based rollout technique introduced to improve training stability for multi-retrieval-step trajectories. AI

IMPACT This research offers actionable insights for developing more effective RAG systems by enabling LLMs to better adapt to diverse information retrieval tools.
RESEARCH · arXiv cs.LG English(EN) · 20h · [2 sources]

A Validated LBM Dataset and Pipeline for Surrogate Modeling of Turbulent 3D Obstructed Channel Flows

Researchers have developed a validated dataset and pipeline for training neural operators to model turbulent 3D obstructed channel flows. The lattice Boltzmann solver used in the pipeline has been rigorously verified against experimental measurements, including Strouhal number and drag coefficients. This work aims to enable standardized comparison of surrogate models like Fourier Neural Operator and U-Net variants for tasks such as forecasting and super-resolution, using physics-informed metrics to assess their representation of turbulent energy cascades. AI

IMPACT Enables more rigorous evaluation and comparison of neural operators for complex fluid dynamics simulations.
RESEARCH · arXiv cs.LG English(EN) · 20h · [2 sources]

STAR-NT: Spatiotemporal Acceleration of Real-Time Neural Transparency Rendering

Researchers have developed STAR-NT, a novel framework designed to accelerate real-time neural transparency rendering. This method addresses the high computational costs associated with rendering overlapping transparent surfaces, particularly on less powerful hardware. By employing adaptive quadtree-based screen-space subdivision and temporal frame reuse through depth-based reprojection, STAR-NT significantly reduces rendering overhead while maintaining visual fidelity. AI
TOOL · arXiv cs.LG English(EN) · 6h

Identification and Inference for Algorithmic Frontiers with Selective Labels

This paper introduces a method for identifying and inferring the fairness-accuracy frontier, a concept crucial in econometrics. The proposed techniques allow for hypothesis testing and the construction of confidence sets for this frontier, particularly when outcome data is only available for a subset of individuals. The research provides a characterization of the identification region for the FA-frontier under specific selection processes and loss measurements, with extensions to broader loss functions currently in progress. AI
RESEARCH · arXiv cs.CV English(EN) · 20h · [2 sources]

Text-Vision Co-Instructed Image Editing

Researchers have introduced a novel image editing framework called TV-Edit that combines textual instructions with visual prompts for more precise and intent-faithful manipulation. This approach addresses the limitations of text-only methods, which lack fine-grained spatial control, and visual-only methods, which can suffer from semantic ambiguity. TV-Edit leverages a dataset of over 23,000 video-derived samples to unify semantic intent and spatial guidance, leading to improved structural consistency and performance over existing baselines. AI

IMPACT This research advances image editing capabilities by combining textual and visual inputs, potentially leading to more intuitive and precise user control in creative applications.
RESEARCH · arXiv cs.LG English(EN) · 17h · [2 sources]

Learning the Geometry of Data: A Mathematical Review of Shape Space Analysis

A new review paper published on arXiv, titled "Learning the Geometry of Data: A Mathematical Review of Shape Space Analysis," synthesizes research on shape space analysis. This field provides a mathematical and computational framework for studying geometric data, drawing from differential geometry, statistics, and machine learning. The paper outlines a pipeline for shape representation, metric construction, statistical analysis, and geometry-aware learning methods, highlighting applications in biology, medicine, anthropology, and computer vision. AI

IMPACT This review consolidates geometric data analysis techniques, potentially enabling more sophisticated pattern recognition in complex datasets across various scientific fields.
RESEARCH · arXiv cs.CV English(EN) · 22h · [3 sources]

SUP-MCRL: Subject-aware Unified Pseudo-feature Coded Multimodal Contrastive Representation Learning for EEG Visual Decoding

Two new research papers submitted to arXiv on June 15, 2026, explore advanced methods for decoding electroencephalography (EEG) signals. The first paper introduces subject-specific encoders to improve cross-subject EEG decoding by addressing distribution shifts, showing promise in improving accuracy for most subjects. The second paper, SUP-MCRL, presents a unified framework for EEG visual decoding that integrates semantic awareness, subject robustness, and representation consistency to overcome fidelity degradation in neural visual decoding. AI

IMPACT Advances in subject-aware EEG decoding could improve the accuracy and robustness of brain-computer interfaces for various applications.
RESEARCH · arXiv cs.AI English(EN) · 21h · [2 sources]

From Affect Prediction to Affect Forecasting: Evidence for Distinct Information Sources in Longitudinal Text

A new research paper proposes the Trait-State Affective Prediction (TSAP) framework and its temporal extension E-TSAP to distinguish between predicting current emotional states and forecasting future affective changes from longitudinal text. The study found that while textual semantics are effective for predicting current affect, prior numeric trajectory dynamics are better indicators for forecasting future emotional shifts. The proposed Affective Change Forecaster Hybrid (ACF-Hybrid) model, utilizing these numeric trajectories, achieved significantly higher forecasting accuracy than text-based models. AI

IMPACT This research highlights the distinct information sources required for predicting current emotions versus forecasting future affective changes in text, suggesting improvements for AI models in understanding and predicting human emotional dynamics.
RESEARCH · arXiv cs.AI English(EN) · 22h · [2 sources]

MR-GVNO: A Geometry-Aware Variational Physics-Informed Neural Operator for Mindlin-Reissner Plates on Irregular Domains

Researchers have developed MR-GVNO, a novel geometry-aware variational neural operator designed to accelerate response predictions for Mindlin-Reissner plates on irregular domains. This method utilizes boundary point clouds to represent complex geometries and integrates various input fields through a cross-attention mechanism. Trained using a physics-informed loss derived from the total potential energy, MR-GVNO achieves rapid, full-field inference and demonstrates strong generalization across different plate shapes and loading conditions, significantly outperforming traditional finite element methods in terms of computational cost. AI

IMPACT Accelerates engineering simulations by enabling millisecond-level full-field inference for complex plate structures.
RESEARCH · arXiv cs.CV English(EN) · 23h · [2 sources]

Rotational Symmetry based Object Pose Estimation from Point Clouds in the Absence of Known 3D Models

Researchers have developed a novel method for object pose estimation from point clouds that does not require known 3D models. This approach leverages the rotational symmetry inherent in many industrial objects to overcome challenges posed by confidentiality concerns that limit access to detailed 3D models. The method iteratively refines both the object's pose and the point cloud itself by incorporating a rotational symmetry constraint loss, which is computed using correspondences identified through nearest-neighbor search exploiting this symmetry. AI

IMPACT Enables object pose estimation in scenarios where 3D models are unavailable, potentially expanding applications in robotics and industrial automation.
RESEARCH · arXiv cs.CL English(EN) · 23h · [2 sources]

The BD-LSC Dataset: Facilitating the Benchmarking of Models for Lexical Semantic Change Detection in Slang and Standard Usage

Researchers have introduced the BD-LSC and ST-WSD datasets to benchmark models in detecting lexical semantic change, particularly for words with both slang and standard meanings. These datasets enable the study of sense gain, loss, and stability over time. While GPT-4o demonstrated strong performance in few-shot settings on metrics like Exact Sense Match, overall Macro-F1 scores indicate that identifying rare slang senses remains a significant challenge. AI

IMPACT New datasets may improve LLM understanding of nuanced language, especially slang.
RESEARCH · arXiv cs.CV English(EN) · 23h · [7 sources]

Local-GS: Accelerating 3D Gaussian Splatting via Tile-Local Warp Coherence

Researchers are exploring the application of 2D and 3D Gaussian Splatting techniques to various computer vision tasks, including image dehazing and low-light enhancement. New methods like Dehaze-GaussianImage and Fi-Gaussian leverage Gaussian primitives for more efficient and physically interpretable image processing, aiming to improve reconstruction fidelity and reduce artifacts. Additionally, advancements in accelerating 3D Gaussian Splatting, such as TurboGS and Local-GS, focus on optimizing rendering performance and GPU utilization for real-time applications. AI

IMPACT Advances in Gaussian Splatting are improving efficiency and fidelity in computer vision tasks like image dehazing and rendering.