Picture for Jian Yang

Jian Yang

additional authors not shown

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

Add code
Feb 17, 2025
Viaarxiv icon

Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance

Add code
Feb 17, 2025
Viaarxiv icon

Multi-Agent Collaboration for Multilingual Code Instruction Tuning

Add code
Feb 11, 2025
Viaarxiv icon

Learning Inverse Laplacian Pyramid for Progressive Depth Completion

Add code
Feb 11, 2025
Viaarxiv icon

Adaptive Perception for Unified Visual Multi-modal Object Tracking

Add code
Feb 10, 2025
Viaarxiv icon

CryptoX : Compositional Reasoning Evaluation of Large Language Models

Add code
Feb 08, 2025
Viaarxiv icon

MindAligner: Explicit Brain Functional Alignment for Cross-Subject Visual Decoding from Limited fMRI Data

Add code
Feb 07, 2025
Viaarxiv icon

MetaFE-DE: Learning Meta Feature Embedding for Depth Estimation from Monocular Endoscopic Images

Add code
Feb 05, 2025
Viaarxiv icon

InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration

Add code
Feb 04, 2025
Viaarxiv icon