实体 computer vision

computer vision

PulseAugur coverage of computer vision — every cluster mentioning computer vision across labs, papers, and developer communities, ranked by signal.

总计 · 30天

34

90 天内 34

发布 · 30天

0

90 天内 0

论文 · 30天

27

90 天内 27

层级分布 · 90 天

significant 1
research 12
tool 19
commentary 2

关系

情绪 · 30 天

8 天有情绪数据

最近 · 第 2/2 页 · 共 34 条

RESEARCH · CL_11394 · Apr 30 · 06:54

REVIVE 3D generates voluminous 3D assets from flat images with novel enhancement pipeline

Researchers have developed REVIVE 3D, a novel two-stage pipeline designed to generate detailed 3D assets from flat 2D images. The system first creates an "Inflated Prior" by recovering global volume and adding part-awar…
RESEARCH · CL_11380 · Apr 30 · 04:00

Surveys explore robot learning from human videos and world models, while new networks tackle driver monitoring.

Two new survey papers explore advancements in robot learning, focusing on different data acquisition and utilization strategies. One paper provides a comprehensive review of world models, which are predictive representa…
RESEARCH · CL_09745 · Apr 29 · 13:27

New AI methods advance 3D reconstruction, image segmentation, and sound recovery

Researchers have developed new methods for image segmentation and reconstruction. One paper introduces a novel approach for topology-preserving image segmentation using a differentiable method for simple point detection…
RESEARCH · CL_08571 · Apr 29 · 04:00

AI-generated outpainted vehicles dataset boosts detection performance

Researchers have developed AIDOVECL, a novel dataset for vehicle classification and localization generated using AI outpainting techniques. This method addresses the bottleneck of manual image labeling in computer visio…
RESEARCH · CL_08520 · Apr 28 · 16:02

New knowledge distillation methods enhance model compression and diversity

Two new research papers propose methods to improve black-box knowledge distillation, a technique for compressing large AI models into smaller ones without direct access to the teacher model's training data. The first pa…
RESEARCH · CL_15753 · Apr 28 · 04:00

New research explores 4D geometry and dynamic scene understanding with novel frameworks

Researchers have introduced several new frameworks and datasets for advancing 4D (three spatial dimensions plus time) understanding and reconstruction from visual data. These include 4DThinker, which enables vision-lang…
RESEARCH · CL_06547 · Apr 28 · 04:00

BIR-Adapter offers parameter-efficient blind image restoration with diffusion models

Researchers have developed the BIR-Adapter, a novel parameter-efficient method for blind image restoration using diffusion models. This adapter integrates an attention mechanism and a sampling guidance strategy to reduc…
RESEARCH · CL_06534 · Apr 28 · 04:00

YOLOv8 to YOLO11: Review details architecture evolution and challenges

This paper provides a detailed comparative review of the YOLOv8 through YOLO11 computer vision models. It aims to clarify the architectures and distinctions between these rapidly evolving object detection systems, many …
RESEARCH · CL_06531 · Apr 28 · 04:00

OmniVTG dataset and CoT paradigm enhance open-world video temporal grounding

Researchers have introduced OmniVTG, a large-scale dataset and training paradigm designed to improve open-world Video Temporal Grounding (VTG) for Multimodal Large Language Models (MLLMs). The dataset was created using …
RESEARCH · CL_06454 · Apr 28 · 04:00

MetaErr framework predicts deep neural network failures before they happen

Researchers have introduced MetaErr, a novel framework designed to predict when deep neural networks are likely to fail on specific data samples. Unlike previous efforts focused solely on reducing error rates, MetaErr e…
RESEARCH · CL_05113 · Apr 27 · 04:00

UAVs use vision-only system for altitude-adaptive geo-localization in GPS-denied environments

Researchers have developed a novel vision-only system for Unmanned Aerial Vehicles (UAVs) to determine their location even when GPS is unavailable. The system first estimates the UAV's altitude from a single image by an…
RESEARCH · CL_04945 · Apr 24 · 04:12

Computer vision research advances multimodal understanding and robust segmentation

Researchers have developed WeatherSeg, a semi-supervised segmentation framework designed to improve autonomous driving perception in adverse weather conditions by using a dual teacher-student model for knowledge distill…
RESEARCH · CL_04914 · Apr 23 · 17:59

AI模型学会以不同速度分析和生成视频

研究人员开发了新的方法来理解和操纵视频中的时间流。一篇论文探讨了自监督学习在检测速度变化和估计播放速度方面的应用，从而能够创建大型慢动作数据集以及用于速度条件视频生成和时间超分辨率的模型。另一项研究分析了三十年来主题地图设计的演变，利用计算机视觉和大模型量化了多语种期刊中的地图元素、颜色和布局，发现了设计实践中的机构趋同。
RESEARCH · CL_02911 · Apr 23 · 13:18

New FR-IQA method uses causal inference for image quality assessment

Researchers have developed a new framework for full-reference image quality assessment (FR-IQA) that utilizes causal inference and decoupled representation learning. This approach separates image content from degradatio…