arXiv
PulseAugur coverage of arXiv — every cluster mentioning arXiv across labs, papers, and developer communities, ranked by signal.
- authored by graph neural networks 95%
- instance of graph neural networks 95%
- instance of reinforcement learning 90%
- instance of MLLMs 90%
- instance of Large Language Models 90%
- authored by GPT-5.1 90%
- authored by Convolutional neural networks in medical image understanding: a survey 90%
- used by SAM2 90%
- authored by Llama 3.3 70B Instruct 90%
- authored by Langevin dynamics 90%
- instance of Physics-Informed Neural Network 90%
- used by YOLOv11 90%
9 day(s) with sentiment data
-
Transformer model TAMO performs multi-objective optimization in-context
Researchers have developed TAMO, a novel transformer-based policy for multi-objective Bayesian optimization that operates entirely in-context. This approach eliminates the need for per-task surrogate fitting and acquisi…
-
arXiv paper coins "harness" for AI agent structure
A recent arXiv paper introduces the term "harness" to formally describe the components that structure and control AI agents, moving beyond informal terms like "setup" or "config." The paper, "Natural-Language Agent Harn…
-
New paper explores advanced semantic similarity in AI language processing
A new research paper titled "Beyond Semantic Similarity" has been published on arXiv, exploring advancements in language processing and machine learning. The paper delves into methods that go beyond traditional semantic…
-
CausalCine framework enables real-time multi-shot video generation
Researchers have introduced CausalCine, a new framework designed for generating multi-shot video narratives in real-time. Unlike existing autoregressive models that struggle with long sequences and semantic drift, Causa…
-
Vision Transformer uses core-periphery attention for linear scaling
Researchers have developed VECA, a novel Vision Transformer architecture that addresses the quadratic computational cost associated with high-resolution images. VECA utilizes an efficient linear-time attention mechanism…
-
LLM guidance refines text embeddings for better zero-shot task performance
Researchers have developed a method to improve the performance of text embedding models for zero-shot search and classification tasks. Their approach uses a large language model (LLM) to refine query embeddings in real-…
-
OmniNFT framework enhances joint audio-video generation with diffusion RL
Researchers have introduced OmniNFT, a new framework for generating joint audio and video content. This approach utilizes a modality-aware online diffusion reinforcement learning method to overcome challenges in multi-o…
-
New algorithm samples composite log-concave distributions efficiently
Researchers have developed a new proximal gradient algorithm designed to sample from composite log-concave distributions. This algorithm assumes access to gradient evaluations for one part of the distribution and a rest…
-
Researchers propose standardized evaluation for controlled text generation
A new research paper proposes a level-playing-field (LPF) evaluation approach to fairly compare controlled text generation (CTG) systems. The study found that when re-evaluated using standardized methods and datasets, t…
-
AI models fail to detect danger in long transcripts
A new paper reveals that leading AI models like Opus 4.6, GPT 5.4, and Gemini 3.1 exhibit significant performance degradation when classifying long transcripts, a crucial task for monitoring coding agents. These models …
-
New framework recasts graph learning via sequence modeling
Researchers have introduced a new framework called Linearized Graph Sequence Models, which reframes message-passing graph computations from a sequence modeling perspective. This approach aims to simplify architectural c…
-
New conformal prediction methods optimize prediction sets without data splitting
Two new research papers introduce advanced conformal prediction techniques to improve the accuracy and efficiency of prediction sets. The first paper, "Multi-Variable Conformal Prediction (MCP)," extends conformal predi…
-
New method visualizes spatial uncertainty to speed up data annotation
Researchers have developed a new method to improve the quality and efficiency of data annotation for machine learning models. Their approach visualizes spatial uncertainty in model predictions, guiding human annotators …
-
CAD-enhanced ML improves sheet metal bending effort estimation
Researchers have developed a novel machine learning approach for estimating manufacturing effort in sheet metal bending. This method enhances graph-based learning by integrating manufacturing-specific features, such as …
-
Cross-domain training boosts LLM monitor generalization
Researchers explored the effectiveness of cross-domain generalization for training language model monitors. Their findings indicate that training on multiple classification tasks with distinct prompts can partially impr…
-
LLMs and citation topology reconnect fragmented academic networks
Researchers have developed a new framework to address fragmentation in citation networks by integrating citation topology with large language model-based text similarity. This hybrid approach uses LLMs to identify seman…
-
New HashSCD framework enables efficient scene change detection
Researchers have developed HashSCD, a novel framework for scene change detection that utilizes patch-wise image hashing. This method allows for efficient identification of changes within images by encoding spatially ali…
-
New generative augmentation boosts 3D human pose estimation
Researchers have developed a new framework for generating diverse 3D human pose data to improve the generalization capabilities of pose estimation models. This controllable generative augmentation method synthesizes var…
-
New framework CoDAAR enhances multimodal learning with discrete representations
Researchers have developed a new framework called CoDAAR to improve multimodal learning by creating semantically aligned discrete representations. This approach balances the need for cross-modal generalizability with th…
-
MoCam uses diffusion dynamics for unified novel view synthesis
Researchers have introduced MoCam, a novel approach to generating new views of a scene by combining geometric and appearance information. This method uses a structured denoising process within a diffusion model, first e…