ENTITY Computer vision and pattern recognition

Computer vision and pattern recognition

PulseAugur coverage of Computer vision and pattern recognition — every cluster mentioning Computer vision and pattern recognition across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

22 over 90d

Releases · 30d

0 over 90d

Papers · 30d

22 over 90d

TIER MIX · 90D

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

13 day(s) with sentiment data

RECENT · PAGE 1/2 · 22 TOTAL

TOOL · CL_110032 · Jun 25 · 04:00

New method detects synthesized images efficiently on low-end devices

Researchers have developed a new, computationally efficient method for detecting synthesized images. This approach focuses on analyzing pixel fluctuations using gradient calculations, effectively acting as a high-pass f…
RESEARCH · CL_109620 · Jun 24 · 02:32

REViT: New Vision Transformer Achieves Roto-reflection Equivariance

Researchers have introduced REViT, a novel vision transformer that incorporates roto-reflection equivariance and convolutional attention. This approach aims to preserve rotational and flip symmetries in feature maps, wh…
RESEARCH · CL_109622 · Jun 24 · 00:00

MVTrack4Gen enhances 4D video generation with multi-view point tracking · 4 sources tracked

Researchers have introduced MVTrack4Gen, a novel framework designed to enhance 4D video generation from monocular reference videos. This method utilizes multi-view point tracking as a geometric and motion supervision si…
RESEARCH · CL_107898 · Jun 23 · 16:54

DDStereo Transformer achieves real-time 3D object detection

Researchers have introduced DDStereo, a new Dual-Decoder Stereo Transformer designed for real-time, open-set 3D object detection. This model addresses the critical safety challenges of speed and generalization in stereo…
RESEARCH · CL_107905 · Jun 23 · 16:00

VSANet introduced for light field image denoising using sparse attention

Researchers have developed VSANet, a novel network designed for light field image denoising. This network utilizes a view-aware sparse attention (VSA) block that processes 4D light field data by treating it as unified s…
RESEARCH · CL_107926 · Jun 23 · 10:59

New EgoSAT benchmark tests vision-language models on egocentric video reasoning

Researchers have introduced EgoSAT, a new benchmark designed to evaluate vision-language models (VLMs) on their ability to understand egocentric video streams. This benchmark unifies various tasks into a single streamin…
RESEARCH · CL_107937 · Jun 23 · 08:03

UniRED framework unifies RGB-D video interpolation with event guidance

Researchers have developed UniRED, a novel framework for interpolating RGB-D videos by integrating RGB appearance, depth geometry, and event-based temporal cues. This approach addresses limitations in existing methods t…
RESEARCH · CL_105283 · Jun 22 · 08:38

CanonicalGS pipeline enhances novel view synthesis with stable scene representation · 2 sources tracked

Researchers have developed CanonicalGS, a novel feed-forward pipeline designed to improve novel view synthesis by creating a stable, scene-centric representation from cluttered multi-view observations. This method aggre…
TOOL · CL_100242 · Jun 19 · 04:00

New AI framework enhances cinematic compositing with realistic character-environment integration

Researchers have developed a new video diffusion framework designed to improve cinematic compositing by better integrating green-screen characters into new environments. The model addresses challenges in bidirectional i…
RESEARCH · CL_99769 · Jun 18 · 17:59

UNIEGO framework uses proxy models for unified egocentric video representation

Researchers have developed UNIEGO, a novel unified egocentric video representation learning framework. UNIEGO utilizes a hierarchical multi-teacher distillation process with proxy models to translate diverse knowledge f…
RESEARCH · CL_97626 · Jun 17 · 14:46

New dataset and CRNN model advance Urdu handwritten text recognition

Researchers have introduced the Urdu Katib Handwritten Dataset (UKHD), the first offline dataset of historical Urdu handwritten text lines. This dataset aims to address the scarcity of resources for Urdu Handwritten Tex…
TOOL · CL_93941 · Jun 16 · 04:00

New framework unifies segmentation and VQA for robotic surgery

Researchers have developed a novel framework that unifies pixel-level segmentation and visual question answering (VQA) for robotic surgery. This approach uses object tokens generated by a vision-language model (VLM) to …
TOOL · CL_93899 · Jun 16 · 04:00

New Fusion Method Enhances Space Object Detection

Researchers have developed a novel multi-view feature high-order fusion (MHF) method to improve the detection and segmentation of weak objects in space imagery. This approach extends traditional low-order feature fusion…
RESEARCH · CL_93079 · Jun 15 · 11:41

New method estimates object pose without 3D models using rotational symmetry

Researchers have developed a novel method for object pose estimation from point clouds that does not require known 3D models. This approach leverages the rotational symmetry inherent in many industrial objects to overco…
RESEARCH · CL_91001 · Jun 12 · 15:35

New framework uses Deformable-DETR for automated quality assessment

Researchers have developed a new multi-view framework utilizing Deformable-DETR to automate the visual quality assessment of large white goods in remanufacturing. This approach aggregates information from multiple redun…
TOOL · CL_85003 · Jun 11 · 04:00

New models achieve 93% accuracy for node-link diagram segmentation

Researchers have developed new deep learning models for the semantic segmentation of node-link diagrams, which are commonly used to represent complex relationships and flowcharts. These diagrams are often inaccessible t…
RESEARCH · CL_68543 · Jun 3 · 04:00

AI models tackle single-image reflection separation with new techniques

Two new research papers propose advanced methods for separating reflections from single images, a challenging task in computer vision. One paper introduces a diffusion model that jointly generates transmission and refle…
RESEARCH · CL_68580 · Jun 2 · 13:38

New benchmark tackles semi-supervised multi-modal crowd counting

Researchers have introduced the first benchmark for semi-supervised multi-modal crowd counting. This new benchmark defines the task's setting and a standardized protocol for data partitioning. It also includes an evalua…
RESEARCH · CL_48299 · May 22 · 03:19

New framework improves video editing by selecting keyframes

Researchers have developed a new framework for robust video editing that addresses challenges posed by occlusions, viewpoint changes, and rapid object motion. The method focuses on selecting optimal anchor frames by eva…
TOOL · CL_30563 · May 13 · 08:22

New model unifies image restoration across adverse weather conditions

Researchers have developed a novel network architecture that unifies image restoration across various adverse weather conditions. This approach incorporates a unified imaging model that accounts for both individual part…

New method detects synthesized images efficiently on low-end devices

REViT: New Vision Transformer Achieves Roto-reflection Equivariance

MVTrack4Gen enhances 4D video generation with multi-view point tracking · 4 sources tracked

DDStereo Transformer achieves real-time 3D object detection

VSANet introduced for light field image denoising using sparse attention

New EgoSAT benchmark tests vision-language models on egocentric video reasoning

UniRED framework unifies RGB-D video interpolation with event guidance

CanonicalGS pipeline enhances novel view synthesis with stable scene representation · 2 sources tracked

New AI framework enhances cinematic compositing with realistic character-environment integration

UNIEGO framework uses proxy models for unified egocentric video representation

New dataset and CRNN model advance Urdu handwritten text recognition

New framework unifies segmentation and VQA for robotic surgery

New Fusion Method Enhances Space Object Detection

New method estimates object pose without 3D models using rotational symmetry

New framework uses Deformable-DETR for automated quality assessment

New models achieve 93% accuracy for node-link diagram segmentation

AI models tackle single-image reflection separation with new techniques

New benchmark tackles semi-supervised multi-modal crowd counting

New framework improves video editing by selecting keyframes

New model unifies image restoration across adverse weather conditions