Picture for Chen Wang

Chen Wang

Hye-Young

ChineseSimpleVQA -- "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

Add code
Feb 19, 2025
Viaarxiv icon

Latent Swap Joint Diffusion for Long-Form Audio Generation

Add code
Feb 07, 2025
Figure 1 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 2 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 3 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Figure 4 for Latent Swap Joint Diffusion for Long-Form Audio Generation
Viaarxiv icon

Nearly Tight Bounds for Exploration in Streaming Multi-armed Bandits with Known Optimality Gap

Add code
Feb 03, 2025
Viaarxiv icon

VL-Nav: Real-time Vision-Language Navigation with Spatial Reasoning

Add code
Feb 02, 2025
Viaarxiv icon

Balance Divergence for Knowledge Distillation

Add code
Jan 14, 2025
Figure 1 for Balance Divergence for Knowledge Distillation
Figure 2 for Balance Divergence for Knowledge Distillation
Figure 3 for Balance Divergence for Knowledge Distillation
Figure 4 for Balance Divergence for Knowledge Distillation
Viaarxiv icon

LPRnet: A self-supervised registration network for LiDAR and photogrammetric point clouds

Add code
Jan 10, 2025
Viaarxiv icon

Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation

Add code
Jan 09, 2025
Viaarxiv icon

ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Add code
Jan 06, 2025
Viaarxiv icon

FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs

Add code
Dec 30, 2024
Viaarxiv icon

iKap: Kinematics-aware Planning with Imperative Learning

Add code
Dec 12, 2024
Viaarxiv icon