Yu Deng

IBM

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Feb 07, 2025

MorphiNet: A Graph Subdivision Network for Adaptive Bi-ventricle Surface Reconstruction

Dec 14, 2024

Structured 3D Latents for Scalable and Versatile 3D Generation

Dec 02, 2024

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Nov 29, 2024

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Oct 24, 2024

GearTrack: Automating 6D Pose Estimation

Sep 30, 2024

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer

Mar 20, 2024

Deep learning with noisy labels in medical prediction problems: a scoping review

Mar 19, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains

Jan 23, 2024

AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents

Dec 04, 2023