Picture for Yizhou Wang

Yizhou Wang

Enhanced MRI Representation via Cross-series Masking

Add code
Dec 10, 2024
Viaarxiv icon

Simulating Human-like Daily Activities with Desire-driven Autonomy

Add code
Dec 09, 2024
Viaarxiv icon

Towards Zero-shot 3D Anomaly Localization

Add code
Dec 05, 2024
Viaarxiv icon

BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View

Add code
Dec 01, 2024
Figure 1 for BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View
Figure 2 for BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View
Figure 3 for BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View
Figure 4 for BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View
Viaarxiv icon

SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens

Add code
Nov 29, 2024
Viaarxiv icon

Free-form Generation Enhances Challenging Clothed Human Modeling

Add code
Nov 29, 2024
Viaarxiv icon

GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data

Add code
Nov 27, 2024
Viaarxiv icon

AlphaChimp: Tracking and Behavior Recognition of Chimpanzees

Add code
Oct 22, 2024
Viaarxiv icon

Ego3DT: Tracking Every 3D Object in Ego-centric Videos

Add code
Oct 11, 2024
Figure 1 for Ego3DT: Tracking Every 3D Object in Ego-centric Videos
Viaarxiv icon

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models

Add code
Oct 04, 2024
Figure 1 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 2 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 3 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Figure 4 for Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
Viaarxiv icon