English(EN)EgoPressDiff: Multimodal Video Diffusion for Egocentric UV-Domain Hand-Pressure Estimation
机器人从自拍视角视频中学习操作和抓握压力
作者PulseAugur 编辑部·[5 个来源]·
研究人员开发了新的框架,使机器人能够从单个人类演示的自拍视角视频中学习灵巧的操作和抓握压力。EgoAERO使机器人能够在不需要物体资产的情况下从单个人类演示中学习操作技能,并将其转换为机器人策略。EgoTactile和EgoPressDiff专注于从自拍视角视频中估计抓握压力,利用扩散模型和多模态条件在现实场景中取得最先进的结果。
AI
arXiv:2606.08057v1 Announce Type: cross Abstract: Egocentric RGB-D videos offer a natural source of human dexterous manipulation demonstrations, but existing data is difficult to use for robot learning because object pose, geometry, and contact information are often missing or re…
arXiv:2606.09243v1 Announce Type: cross Abstract: Estimating full-hand grasp pressure from egocentric video is critical for immersive VR and robotic manipulation, yet dense tactile sensing often relies on intrusive hardware. Existing vision-based methods predominantly rely on pla…
arXiv:2606.06872v1 Announce Type: cross Abstract: Estimating hand-surface contact pressure from an egocentric view is crucial for AR/VR devices, robotic imitation, and ergonomic analysis. Existing methods often discretize pressure signal and process frames independently, leading …
Estimating full-hand grasp pressure from egocentric video is critical for immersive VR and robotic manipulation, yet dense tactile sensing often relies on intrusive hardware. Existing vision-based methods predominantly rely on planar surfaces or fingertip contacts, failing to gen…
Estimating hand-surface contact pressure from an egocentric view is crucial for AR/VR devices, robotic imitation, and ergonomic analysis. Existing methods often discretize pressure signal and process frames independently, leading to quantization errors and temporal inconsistencie…