Brief

last 24h

[50/327] 186 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv cs.AI · 1d

DISC: Decoupling Instruction from State-Conditioned Control via Policy Generation

Researchers have developed a new method called DISC that decouples language instructions from state-conditioned control in robotics. Unlike previous approaches that share network parameters, DISC uses a hypernetwork to generate task-specific policies directly from instructions, preventing observation leakage. This novel approach significantly outperforms existing methods on benchmarks like LIBERO-90 and Meta-World, demonstrating its effectiveness in complex, long-horizon tasks and real-world applications. AI

IMPACT Introduces a novel architecture for language-conditioned robotics that mitigates common failure modes and improves performance on complex tasks.
- $\\pi_0$
- LIBERO-90
TOOL · arXiv cs.AI · 1d

USV: Towards Understanding the User-generated Short-form Videos

Researchers have introduced USV, a new dataset comprising approximately 224,000 user-generated short-form videos. This dataset is designed to advance the understanding of high-level semantic information in videos, moving beyond instance-level recognition. To facilitate research, the paper also establishes topic recognition and video-text retrieval tasks on USV, proposing baseline methods like MMF-Net and VTCL. AI

IMPACT Introduces a new dataset and baseline methods to advance research in understanding user-generated short-form videos.
- MMF-Net
TOOL · arXiv cs.CV · 1d

HyDAR-Pano3D: A Hybrid Disentangled Anatomical Recovery Framework for Panoramic-to-3D Reconstruction

Researchers have developed HyDAR-Pano3D, a novel framework for reconstructing detailed 3D dental anatomy from 2D panoramic radiographs. This two-stage approach disentangles the learning process, first creating a normalized canonical volume using radiographic features and semantic priors from SAM, and then restoring patient-specific variations. The method significantly outperforms existing techniques, achieving high scores in PSNR, SSIM, and Dice for anatomical reconstruction, and enabling accurate downstream segmentation tasks. AI

IMPACT Enables more accurate 3D dental reconstructions from standard 2D X-rays, potentially reducing the need for CBCT scans and improving diagnostic capabilities.
- SAM
- HyDAR-Pano3D
TOOL · arXiv cs.LG · 1d

Beyond Numerical Features: CNN-Driven Algorithm Selection via Contour Plots for Continuous Black-Box Optimization

Researchers have developed a novel method for algorithm selection in continuous black-box optimization that utilizes contour plots instead of traditional numerical features. A Convolutional Neural Network (CNN) analyzes these contour visualizations of probed landscapes to predict the performance of different solvers. This image-based approach demonstrated significant improvements over the single best solver (SBS) on the BBOB 2009 benchmark and showed competitiveness with existing feature-based methods. AI

IMPACT Introduces a novel image-based approach for algorithm selection in optimization, potentially improving efficiency without relying on traditional numerical features.
- CNN
- BBOB 2009
TOOL · arXiv cs.CV · 1d

What Semantics Survive the Connector? Diagnosing VLM-to-DiT Alignment in Video Editing

Researchers have developed a new diagnostic dataset and protocol called TRACE-Edit to evaluate how well semantic information is preserved when Vision-Language Models (VLMs) are used for video editing. Their findings indicate that the alignment process between VLMs and Diffusion Transformer models (DiTs) can significantly degrade fine-grained structural details, challenging the assumption of lossless semantic transfer. This research identifies the VLM-to-DiT alignment as a critical bottleneck and provides a foundation for developing improved multi-modal alignment architectures. AI

IMPACT Identifies a key bottleneck in current video editing models, potentially guiding future research towards more semantically faithful multi-modal alignment.
- VLM
TOOL · arXiv cs.CV · 1d

Diffuse to Detect: Bi-Level Sample Rebalancing with Pseudo-Label Diffusion for Point-Supervised Infrared Small-Target Detection

Researchers have developed a new framework for infrared small-target detection using point supervision, addressing challenges of unstable pseudo-labels and sample imbalance. Their approach utilizes a physics-induced annotation strategy based on heat diffusion to generate reliable pseudo-masks from single-point labels. A bi-level dual-update framework optimizes detector weights, sample weights, and diffusion parameters, enhancing supervision and adapting to sample distribution. AI

IMPACT Introduces a novel method for improving the accuracy and efficiency of infrared small-target detection using physics-informed AI.
- Pseudo-labels
- Point supervision
TOOL · arXiv cs.LG · 1d

ShapeBench: A Scalable Benchmark and Diagnostic Suite for Standardized Evaluation in Aerodynamic Shape Optimization

Researchers have introduced ShapeBench, a new open-source benchmark designed to standardize evaluations in aerodynamic shape optimization. This benchmark includes 103 tasks across eight shape categories, featuring validated surrogates for rapid testing and optional high-fidelity CFD pipelines for verification. ShapeBench aims to enable fair comparisons between various optimization methods, including classical, general-purpose, and LLM-driven approaches, by using a consistent budget metric and highlighting the variance in optimizer performance across different tasks. AI

IMPACT Provides a standardized framework for evaluating and comparing AI-driven methods in aerodynamic shape optimization.
TOOL · arXiv cs.AI · 1d

VBFDD-Agent for Electric Vehicle Battery Fault Detection and Diagnosis: Descriptive Text Modeling of Battery Digital Signals

Researchers have developed VBFDD-Agent, a novel system designed for detecting and diagnosing faults in electric vehicle batteries. This agent utilizes a descriptive text modeling approach, transforming raw battery data into natural language descriptions to create a specialized corpus. By integrating this corpus with maintenance manuals and large language model reasoning, VBFDD-Agent provides structured diagnostic results and actionable maintenance recommendations, enhancing human-AI collaboration in battery health management. AI

IMPACT Introduces a new method for AI-driven diagnostics in electric vehicles, potentially improving safety and maintenance efficiency.
TOOL · arXiv cs.AI · 1d

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

A new paper analyzes the effectiveness of Gated Linear Units (GLU) in large language models, finding that they improve training speed by reshaping the neural tangent kernel (NTK) spectrum. Researchers observed that GLU structures lead to a smaller condition number and faster convergence, a phenomenon sometimes resulting in loss-crossing between GLU and non-GLU models. However, the study also indicated that GLU's benefit is primarily in optimization acceleration rather than reducing the generalization gap. AI

IMPACT Explains a key architectural advantage of modern LLMs, potentially guiding future model design for faster training.
TOOL · arXiv cs.AI · 1d

Conflict-Aware Additive Guidance for Flow Models under Compositional Rewards

Researchers have developed a new method called Conflict-Aware Additive Guidance ($g^ ext{car}$) to improve the control and fidelity of generative models, particularly when dealing with multiple, potentially conflicting, constraints. This technique addresses issues where combining constraints can lead to deviations from the natural data distribution. $g^ ext{car}$ dynamically detects and resolves these gradient conflicts, demonstrating effectiveness across various applications including image editing and decision-making for planning and control, while maintaining efficient computation. AI

IMPACT Enhances control and fidelity in generative models for complex, multi-constraint tasks.
TOOL · arXiv cs.AI · 1d

PACD-Net: Pseudo-Augmented Contrastive Distillation for Glycemic Control Estimation from SMBG

Researchers have developed PACD-Net, a novel self-supervised framework designed to estimate glycemic control metrics from sparse self-monitoring of blood glucose (SMBG) data. This approach uses pseudo-SMBG samples as teacher signals and contrastive learning to ensure consistent representations across different sampling patterns. The model, which employs a hybrid Swin Transformer-CNN backbone, demonstrates superior accuracy and stability compared to existing methods for estimating Time Above Range, Time in Range, and Time Below Range from real-world SMBG data, particularly under extremely sparse conditions. AI

IMPACT Offers a practical tool for interpreting clinical SMBG data and a generalizable method for learning from sparse sensor data.
- PACD-Net
- Swin Transformer-CNN
TOOL · arXiv cs.CV · 1d

STAR-IOD: Scale-decoupled Topology Alignment with Pseudo-label Refinement for Remote Sensing Incremental Object Detection

Researchers have developed STAR-IOD, a new framework designed to improve incremental object detection in remote sensing imagery. This method addresses challenges like intra-class scale variations and missing annotations, which hinder knowledge transfer and preservation in existing detectors. STAR-IOD utilizes a Subspace-decoupled Topology Distillation module for structural knowledge transfer and a Clustering-driven Pseudo-label Generator to accurately distinguish targets from background noise. The framework also introduces two new datasets, DIOR-IOD and DOTA-IOD, and demonstrates superior performance over state-of-the-art approaches. AI

IMPACT Introduces novel techniques for incremental object detection in remote sensing, potentially improving autonomous systems and data analysis in this domain.
TOOL · arXiv cs.LG · 1d

Lowering the Barrier to IREX Participation: Open-Source Algorithms, Toolkit, and Benchmarking for Iris Recognition

Researchers have introduced two novel open-source iris recognition algorithms, TripletIris and ArcIris, designed to lower participation barriers for the IREX X program. The paper details Python and IREX-compliant C++ implementations, enabling broader assessment of open-source solutions. Additionally, it provides open-source tools for iris segmentation and circle estimation, facilitating the development and integration of new recognition methods. AI

IMPACT Provides open-source tools and algorithms that could accelerate research and development in iris recognition systems.
TOOL · arXiv cs.AI · 1d

How to Build Marcus's Algebraic Mind: Algebro-Deterministic Substrate over Galois Fields

Researchers have developed a new hyperdimensional computing architecture called PyVaCoAl/VaCoAl, which is built around the XOR-and-shift operation over Galois Fields. This architecture aims to fulfill Gary Marcus's three core requirements for cognitive architectures: operations over variables, recursively structured representations, and a distinction between individuals and kinds. The system demonstrates reversible variable binding, non-commutative compositional bundling for distinguishing sentence structures, and address-space separation, potentially offering a functional neural substrate that more closely aligns with Marcus's specifications than previous approaches. AI

IMPACT Proposes a novel computational substrate that could enable more sophisticated AI architectures, potentially addressing limitations in current models.
- Gary Marcus
- PyVaCoAl/VaCoAl
TOOL · arXiv cs.CV · 1d

Resolving Long-Tail Ambiguity in Unsupervised 3D Point Cloud Segmentation with Language Priors

Researchers have developed LangTail, a new framework designed to improve unsupervised 3D point cloud segmentation by addressing the issue of long-tail ambiguity. This problem occurs when minor object classes are overlooked in favor of dominant ones during the segmentation process. LangTail integrates semantic knowledge from language models to create a more balanced understanding of categories, which is then used to guide the segmentation, leading to better identification of underrepresented classes. Experiments show significant improvements in mean Intersection over Union (mIoU) scores on benchmark datasets. AI

IMPACT Enhances representation of minority classes in 3D data, potentially improving AI's understanding of complex environments.
- nuScenes
- S3DIS
- LangTail
- ScanNet-v2
TOOL · 404 Media · 7h

This Archivist Has Saved 175,000 Articles from 30 Years of Writing about Magic: The Gathering

Gregor Stocks, a software engineer, has launched the Library of Leng, a searchable database dedicated to preserving articles about the game Magic: The Gathering. The project aims to combat internet churn by archiving old usenet posts, website content, and publisher announcements that are often lost over time. Stocks developed custom tools to parse the varied and often unformatted data from the early internet, and the response from the Magic community and authors has been overwhelmingly positive. AI

IMPACT Niche archival project with minimal direct impact on AI operations.
TOOL · arXiv cs.CL · 1d

Findings of the Fifth Shared Task on Multilingual Coreference Resolution: Expanding Datasets for Long-Range Entities

The Fifth Shared Task on Multilingual Coreference Resolution, held at the CODI-CRAC 2026 workshop, focused on systems that can identify mentions and cluster coreferential chains, particularly those spanning long distances across text. This year's task incorporated five new datasets and two additional languages, utilizing the CorefUD v1.4 collection which spans 19 languages. While traditional systems still outperformed, the ten participating systems, including four LLM-based approaches, showed significant promise for future advancements in the field. AI

IMPACT LLMs show promise in long-range coreference resolution, potentially improving natural language understanding in complex texts.
- CODI-CRAC 2026
- CorefUD
TOOL · arXiv cs.LG · 1d

Classification of Single and Mixed Partial Discharges under Switching Voltage Using an AWA-CNN Framework

Researchers have developed a novel Amplitude-Width-Area (AWA) pattern representation to analyze partial discharge (PD) pulses under switching-voltage excitation. This method maps PD pulses into visual patterns using amplitude, width, and area, enabling the distinction of six different PD source conditions. Convolutional Neural Network (CNN) models, specifically InceptionV3 and ResNet-18, achieved over 96% accuracy in classifying these sources, significantly outperforming a Random Forest baseline. AI

IMPACT Introduces a new visual representation for PD pulses, enabling higher accuracy classification of electrical faults using CNNs.
TOOL · arXiv cs.CV · 1d

Early High-Frequency Injection for Geometry-Sensitive OOD Detection

Researchers have developed a new method called Early High-Frequency Injection (EIHF) to improve out-of-distribution (OOD) detection in computer vision models. EIHF works by injecting high-frequency information into the input data before it's processed by the first convolution layer, without altering the training objective. This approach enhances the model's ability to distinguish between in-distribution and out-of-distribution data, particularly for geometry-sensitive tasks, by reshaping feature geometry and reducing overlap in scores. Experiments on CIFAR-100 and ImageNet-100 datasets showed promising results, including improved false positive rates and area under the receiver operating characteristic curve. AI

IMPACT Improves the robustness of computer vision models to unseen data, potentially enhancing reliability in real-world applications.
- CIFAR-100
- ImageNet-100
- Places
- EIHF
TOOL · arXiv cs.AI · 1d

Data-Efficient Neural Operator Training via Physics-Based Active Learning

Researchers have developed a new active learning technique called physics-based acquisition to improve the efficiency of training neural operators. This method uses the partial differential equation residual to intelligently select the most informative data samples for training. Experiments on the 1D Burgers and 2D Navier-Stokes equations demonstrate that this approach significantly reduces data requirements compared to random sampling and matches state-of-the-art data efficiency while incorporating physics into the model's understanding. AI

IMPACT This method could significantly reduce the computational cost and data requirements for training neural operators, accelerating their adoption in scientific simulations.
TOOL · arXiv cs.CV · 1d

Holistic Reliability Propagation: Decoupling Annotation and Prediction for Robust Noisy-Label

Researchers have developed a new method called Holistic Reliability Propagation (HRP) to improve learning with noisy labels in multimedia classification. HRP decouples the reliability of external annotations from model predictions, estimating separate weights for each. This approach uses bilevel meta-learning to produce two scalars, alpha for given labels and beta for pseudo-labels, which are then routed to different objectives. HRP has demonstrated improved accuracy over existing methods, particularly at high noise rates. AI

IMPACT This research offers a novel approach to enhance the robustness of AI models when trained on imperfect datasets, potentially improving performance in real-world applications with noisy data.
- Holistic Reliability Propagation
TOOL · arXiv cs.AI · 1d

SCRIBE: Diagnostic Evaluation and Rich Transcription Models for Indic ASR

Researchers have introduced SCRIBE, a new diagnostic framework designed to improve automatic speech recognition (ASR) for Indic languages. Unlike traditional Word Error Rate (WER) metrics, SCRIBE categorizes errors into lexical, punctuation, numeral, and domain-entity types, offering a more nuanced evaluation. This framework, along with open-weight rich transcription models for Hindi, Malayalam, and Kannada, aims to make ASR correction more cost-effective and accurate, especially for agglutinative languages. AI

IMPACT Improves ASR accuracy and diagnostic capabilities for under-resourced languages, potentially accelerating their adoption in voice-enabled applications.
TOOL · arXiv cs.LG · 1d

Stimulus symmetries can confound representational similarity analyses

A new research paper highlights how symmetries in network inputs can mislead representational similarity analyses (RSMs). These symmetries can make different network configurations appear functionally equivalent, yet produce distinct RSMs that reflect different representational geometries. The study demonstrates this issue in networks trained on image data, where latent symmetries can lead to sparse, drifting codes and consequently, drifting RSMs. The findings underscore the difficulties in comparing nonlinear neural codes when functionally equivalent representations are not simply rotational. AI

IMPACT Highlights potential pitfalls in analyzing neural network representations, impacting research methodology.
- arXiv
- Farhad Pashakhanloo
TOOL · arXiv cs.AI · 1d

SAVER: Selective As-Needed Vision Evidence for Multimodal Information Extraction

Researchers have developed SAVER, a novel framework designed to improve multimodal information extraction from social media posts. This system selectively uses visual evidence only when necessary, preventing computational waste and the amplification of misleading visual cues. SAVER employs a Conformal Groundability Gate to determine the relevance of images and a submodular selector to choose the most pertinent subset for analysis, ultimately enhancing accuracy while reducing processing load and latency. AI

IMPACT This research introduces a more efficient approach to multimodal information extraction, potentially improving the accuracy and speed of AI systems analyzing social media content.
- Conformal Groundability Gate
- Set Transformer
TOOL · SCMP — Tech · 7h

US jobless claims fall as lay-offs remain low despite economic uncertainty

New jobless claims in the US decreased to 209,000 for the week ending May 16, falling below analyst expectations. This decline indicates a continued trend of low lay-offs, contributing to a stable but somewhat stagnant labor market. Despite a low unemployment rate of 4.3%, the market is characterized by a 'low-hire, low-fire' dynamic, making it challenging for those out of work to find new positions. AI
TOOL · arXiv cs.AI · 1d

DIVE: Embedding Compression via Self-Limiting Gradient Updates

Researchers have developed DIVE, a new method for compressing high-dimensional embeddings from large language models to reduce storage and computational costs in vector search systems. Unlike previous methods that overfit with scarce labeled data, DIVE uses a self-limiting triplet loss to bound perturbations and a contrastive loss to provide dense self-supervised gradients. This approach reportedly outperforms existing compression adapters across multiple datasets and compression ratios, with an open-source implementation available. AI

IMPACT This new embedding compression technique could significantly reduce the resource requirements for deploying and scaling vector search systems, making LLM-powered applications more efficient.
TOOL · arXiv cs.LG · 1d

Automatic Discovery of Disease Subgroups by Contrasting with Healthy Controls

Researchers have developed a new method called Deep UCSL for identifying distinct subgroups within patient populations by contrasting them with healthy controls. This approach uses a deep feature extractor to learn a representation space that isolates disease-specific factors, ignoring common variations shared with healthy individuals. The method optimizes a novel loss function through an Expectation-Maximization strategy and has shown quantitative improvements in subgroup quality on both synthetic and real medical imaging datasets. AI

IMPACT Introduces a novel contrastive learning approach for more precise disease subgroup identification in medical imaging.
- arXiv
- Deep UCSL
TOOL · arXiv cs.CV (CA) · 1d

Let EEG Models Learn EEG

Researchers have developed a new framework called Just EEG Transformer (JET) for generating high-fidelity electroencephalogram (EEG) data. Unlike previous methods that use discrete denoising objectives, JET models EEG as continuous temporal sequences, better capturing the inherent dynamics and spectral structure of neural activity. This approach allows JET to preserve long-range temporal dependencies and generate more realistic signals, achieving over 40% reduction in TS-FID compared to existing baselines across multiple benchmarks. AI

IMPACT Enables more realistic EEG data generation, potentially accelerating research in neural modeling and brain-computer interfaces.
- arXiv
- Just EEG Transformer (JET)
TOOL · arXiv cs.CV · 1d

Vision Transformers and Convolutional Neural Networks for Land Use Scene Classification

A new research paper compares the effectiveness of Vision Transformers (ViTs) and Convolutional Neural Networks (CNNs) for land use scene classification using remote sensing imagery. The study evaluated AlexNet and ViT on the UC Merced Land Use and EuroSAT datasets, analyzing metrics like accuracy, precision, recall, and F1-score. Results indicate that CNNs are more robust with limited data and strong local textures, while ViTs excel at capturing global spatial relationships with sufficient training data, though they require more computational resources. AI

IMPACT Provides insights for selecting appropriate deep learning models for remote sensing land use classification tasks.
TOOL · arXiv cs.LG · 1d

FedCoE: Bridging Generalization and Personalization via Federated Coordinated Dual-level MoEs

Researchers have introduced FedCoE, a novel framework for Federated Learning that aims to balance global generalization with local personalization. Unlike traditional methods that struggle with non-IID data or overfit to local information, FedCoE utilizes a dual-level Mixture-of-Experts approach. This system maintains independent global expert models and uses a shared gating network to manage client-expert correlations, preventing expert drift. FedCoE also includes an adaptive mechanism to help new clients quickly utilize global experts without extensive local training, showing significant accuracy improvements in both general and cold-start scenarios. AI

IMPACT Introduces a new method to improve federated learning performance, potentially enabling more robust and personalized AI models in distributed environments.
TOOL · arXiv cs.CV · 1d

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools

Researchers have introduced IndusAgent, a novel framework designed to enhance open-vocabulary industrial anomaly detection using agentic tools. This system addresses limitations in multimodal large language models by integrating domain-specific reasoning and external tools for clearer visual interpretation. IndusAgent utilizes a structured dataset, Indus-CoT, and a reinforcement learning objective to optimize anomaly classification, localization, and efficient tool usage, achieving state-of-the-art zero-shot performance across multiple benchmarks. AI

IMPACT Enhances zero-shot anomaly detection capabilities in industrial settings, potentially improving quality control and reducing manual inspection needs.
TOOL · arXiv cs.CV · 1d

DarkShake-DVS: Event-based Human Action Recognition under Low-light andShaking Camera Conditions

Researchers have introduced DarkShake-DVS, a new benchmark dataset designed for human action recognition in challenging low-light and high-motion scenarios. The dataset includes over 18,000 real-world clips captured with synchronized IMU data to address limitations in existing event-based vision research. They also propose EIS-HAR, a novel method that combines motion compensation with a hybrid architecture for improved spatiotemporal feature extraction and action recognition. AI

IMPACT Introduces a new benchmark and method to improve AI's ability to recognize actions in challenging real-world conditions.
TOOL · arXiv cs.CV · 1d

Local-sensitive connectivity filter (ls-cf): A post-processing unsupervised improvement of the frangi, hessian and vesselness filters for multimodal vessel segmentation

Researchers have developed a new unsupervised method called the local-sensitive connectivity filter (LS-CF) to improve the segmentation of retinal blood vessels. This technique enhances existing filters like the Frangi filter by addressing discontinuities and ensuring pixel-level continuity. The LS-CF demonstrated superior performance on several multimodal datasets, outperforming state-of-the-art approaches in accuracy on the OSIRIX and IOSTAR datasets, and showing competitive results on DRIVE, STARE, and CHASE-DB. AI

IMPACT Introduces a novel unsupervised method for medical image analysis, potentially improving diagnostic accuracy in ophthalmology.
TOOL · arXiv cs.AI · 1d

RePCM: Region-Specific and Phenotype-Adaptive Bi-Ventricular Cardiac Motion Synthesis

Researchers have developed a novel method called RePCM for synthesizing cardiac motion from a single end-diastolic frame. This approach addresses limitations in traditional methods that often oversmooth data by creating models optimized for global patterns. RePCM utilizes a two-stage process: first, a reconstruction network and clustering identify region-specific motion descriptors, and second, a specialized module enforces synchronized region exchange within a conditional VAE to preserve localized dynamics. The system also incorporates a phenotype-adaptive prior to model inter-disease variability, showing improved geometric and functional metrics across multiple datasets. AI

IMPACT This new method could improve the analysis of regional cardiac function and disease-specific dynamics by enabling more accurate motion synthesis from limited data.
TOOL · arXiv cs.CV · 1d

LER-YOLO: Reliability-Aware Expert Routing for Misaligned RGB-Infrared UAV Detection

Researchers have developed LER-YOLO, a novel framework designed to improve the detection of small unmanned aerial vehicles using misaligned RGB and infrared imagery. The system incorporates an Uncertainty-Aware Target Alignment module to estimate spatial reliability and guide expert selection. This reliability-guided approach adaptively chooses experts for cross-modal fusion, effectively suppressing unreliable data and enhancing detection accuracy. AI

IMPACT Enhances drone detection capabilities by improving the fusion of multi-modal sensor data.
- MBU benchmark
- LER-YOLO
TOOL · arXiv cs.CV · 1d

SR-Ground: Image Quality Grounding for Super-Resolved Content

Researchers have introduced SR-Ground, a new dataset designed to improve image quality assessment for super-resolved images. This dataset features pixel-level annotations for various artifact types introduced by modern super-resolution models. By training models on SR-Ground, researchers have shown improved performance in identifying and even reducing these artifacts, demonstrating practical applications for the dataset. AI

IMPACT This dataset could lead to more reliable and interpretable image quality assessment for AI-generated images, improving user trust and downstream applications.
- arXiv
- SR-Ground
TOOL · arXiv cs.LG · 1d

Divide and Contrast: Learning Robust Temporal Features without Augmentation

Researchers have developed a new unsupervised framework called Divide and Contrast (Di-COT) for learning robust temporal features from time-series data without relying on data augmentation. Di-COT works by contrasting informative substructures within data windows, rather than individual timesteps, which allows for efficient and meaningful contrast while avoiding false positives. This method has demonstrated state-of-the-art performance across various tasks including classification and clustering on multiple large-scale datasets and benchmarks, while also significantly reducing training time. AI

IMPACT Introduces a novel unsupervised learning method for time-series data that improves efficiency and performance on downstream tasks.
TOOL · arXiv cs.AI · 1d

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

A new study evaluated AI reviewers on Nature-family papers, finding that while they can outperform top human reviewers in identifying correct, significant, and well-evidenced criticisms, they also exhibit distinct weaknesses. The research involved 45 scientists annotating over 2,900 criticisms from human and AI reviews. While AI reviewers like GPT-5.2, Gemini 3.0 Pro, and Claude Opus 4.5 showed strengths in accuracy and identifying unique issues, they also demonstrated limitations in specialized knowledge, handling multiple files, and an overly critical stance on minor points, suggesting they are best used as complements to human reviewers. AI

IMPACT AI reviewers show promise in scientific critique but require human oversight, potentially speeding up peer review.
TOOL · arXiv cs.AI · 1d

AMAR: Lightweight Attention-Based Multi-User Activity Recognition from Wi-Fi CSI

Researchers have developed AMAR, a novel framework for recognizing multiple simultaneous human activities using Wi-Fi channel state information (CSI). This attention-based system treats activity recognition as a set prediction problem, employing learnable query embeddings to detect concurrent actions from complex CSI data. AMAR utilizes an edge-cloud split architecture, with edge devices performing initial feature extraction and the cloud component handling final prediction, significantly outperforming existing methods in multi-user environments. AI

IMPACT This research could enable more sophisticated contactless sensing applications by improving the ability to track multiple individuals simultaneously using existing Wi-Fi infrastructure.
TOOL · Forbes — Innovation · 7h

Garmin Cirqa Price May Be Far Higher Than Expected

A Ukrainian retailer has listed the unannounced Garmin Cirqa wearable for approximately $450, a price significantly higher than its expected competitors like the Whoop and Fitbit Air. However, the retailer is not a major Garmin dealer, and its pricing for other Garmin models is also inflated compared to U.S. market rates. This suggests the listed price may not accurately reflect the Cirqa's final cost, especially given its screen-free design and the availability of similar devices at lower price points. AI

IMPACT This is a product pricing leak for a wearable device, with minimal direct impact on AI operators.
TOOL · arXiv cs.AI · 1d

SURGE: An Event-Centric Social Media Sentiment Time Series Benchmark with Interaction Structure

Researchers have introduced SURGE, a new benchmark dataset designed to analyze social media sentiment dynamics around public events. SURGE organizes over 800,000 posts from 67 events across five categories into time-series data, preserving the interaction structure between posts. This benchmark aims to improve opinion forecasting and crisis response by enabling the study of how post interactions influence collective dynamics and event evolution. AI

IMPACT Provides a new dataset for training and evaluating models in social media sentiment analysis and event forecasting.
TOOL · arXiv cs.LG · 1d

Reinforcement Learning-based Control via Y-wise Affine Neural Networks: Comparative Case Studies for Chemical Processes

Researchers have developed a new reinforcement learning (RL) approach called Y-wise Affine Neural Network (YANN-RL) for controlling chemical processes. This method aims to overcome the typical challenges of trust and lengthy training times associated with RL in this domain. By providing interpretable starting points, YANN-RL significantly reduces training time and data requirements compared to other RL algorithms and approaches the performance of nonlinear model predictive control without needing a full nonlinear model. AI

IMPACT This new RL method could significantly reduce training time and data needs for controlling complex chemical processes.
TOOL · arXiv cs.AI · 1d

Jointly Learning Predicates and Actions Enables Zero-Shot Skill Composition

Researchers have developed a new method called Predicate Action Skills (PACTS) that allows robots to learn and compose skills without retraining. PACTS models both the physical actions and the symbolic outcomes of these actions, enabling better generalization. This approach facilitates zero-shot skill composition through planning by using predicted outcomes to sequence and monitor task execution. AI

IMPACT Enables robots to learn and combine skills more flexibly, potentially accelerating the development of more adaptable robotic systems.
- Benedict Quartey
TOOL · arXiv cs.CV · 1d

PGC: Peak-Guided Calibration for Generalizable AI-Generated Image Detection

Researchers have developed a new framework called Peak-Guided Calibration (PGC) to improve the detection of AI-generated images. This method focuses on aggregating salient, local features using a peak-sensitive mechanism to overcome the limitations of detectors that rely solely on global image representations. PGC effectively calibrates global decisions by accentuating subtle, discriminative clues that might otherwise be lost. The framework demonstrates state-of-the-art performance, significantly improving accuracy on a new benchmark dataset, CommGen15, and setting new records on existing benchmarks. AI

IMPACT Improves the ability to distinguish real images from AI-generated ones, crucial for combating misinformation.
TOOL · dev.to — Claude Code tag · 7h

View Transitions API: 5 Patterns I Use Across RAXXO Sites in 2026

The View Transitions API allows developers to create smooth visual transitions between different states or pages within web applications. This API enables features like animated content swaps and shared element morphing, enhancing the user experience by making interfaces feel more polished and expensive. With widespread browser support, developers can implement these transitions with minimal JavaScript, leveraging the browser's compositor for efficient animations. AI

IMPACT Minimal direct impact on AI operators; focuses on web development tooling.
- Firefox
- Chrome
- Shopify
- Next.js
- Safari
- RAXXO
- View Transitions API
TOOL · arXiv cs.CV · 1d

Semantic Granularity Navigation in Image Editing

Researchers have developed NaviEdit, a new method to improve image editing by decoupling the editing process from the scale of the diffusion or flow model used. This approach aims to resolve the trade-off between semantic editability and structural fidelity by reallocating computational steps towards semantically relevant scales. NaviEdit operates at inference time without altering the pretrained model, showing improved results across various compatible editors and flow backbones. AI

IMPACT Enhances image editing capabilities by improving semantic control and structural fidelity in generative models.
- diffusion models
- NaviEdit
TOOL · arXiv cs.CL · 1d

Metaphors in Literary Post-Editing: Opening Pandora's Box?

A new paper explores how human post-editors handle metaphors translated by Neural Machine Translation and Large Language Models in literary texts. The study found that post-editors frequently altered metaphors, rating the machine translation output as poor and the post-editing process as more demanding than translating from scratch. These findings suggest that current NMT and LLM approaches struggle with figurative language in literary contexts, potentially limiting translator creativity and ownership. AI

IMPACT Reveals significant challenges for LLMs and NMT in translating nuanced figurative language, potentially impacting literary translation workflows.
TOOL · arXiv cs.LG · 1d

Q-SYNTH: Hybrid Quantum-Classical Adversarial Augmentation for Imbalanced Fraud Detection

Researchers have developed Q-SYNTH, a novel hybrid quantum-classical framework designed to address the challenge of imbalanced data in credit card fraud detection. This system uses a parameterized quantum circuit as the generator and a classical neural network as the discriminator to synthesize minority-class fraud samples. Evaluations show Q-SYNTH offers a promising balance between statistical fidelity to real fraud data and improved downstream fraud detection performance, outperforming some classical baselines in specific metrics. AI

IMPACT Introduces a novel hybrid quantum-classical approach to improve AI model performance on imbalanced datasets, potentially enhancing fraud detection systems.
TOOL · r/cursor · 8h

Built a workflow tool for AI coders. Took 3 months. Here's what it actually does.

A new tool called Herb has been developed to help AI coders manage their prompts and rules. It allows users to tag and search their AI coding instructions, preventing the loss of effective prompts into old chat histories. A key feature is a community library where developers can share and import working prompts, aiming to streamline the AI coding process. AI

IMPACT Provides AI coders with a centralized system for managing and sharing effective prompts and rules, potentially improving productivity.
TOOL · 36氪 (36Kr) 中文(ZH) · 7h

*ST Win-Semi: Controlling Shareholder Zhang Xuezheng Increases Holding by 0.04%

Xiaomi has officially launched the SU7 GT, a high-performance electric vehicle priced at 389,900 yuan. This new model features an upgraded Xiaomi motor V8s EVO, capable of reaching 28,000 rpm, and a dual-motor system delivering 1003 PS, a top speed of 300 km/h, and a 0-100 km/h acceleration of 2.92 seconds. The SU7 GT also set a new Nürburgring lap record for SUVs with a time of 7 minutes and 22.755 seconds. AI

IMPACT Minimal direct impact for AI operators; showcases advancements in EV performance metrics.
- Xiaomi
- SU7 GT
- V8s EVO