Picture for Yaojie Zhang

Yaojie Zhang

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

Add code
Feb 09, 2026
Viaarxiv icon

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Add code
Jan 27, 2026
Viaarxiv icon

Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

Add code
Jun 12, 2025
Viaarxiv icon

GS-Matching: Reconsidering Feature Matching task in Point Cloud Registration

Add code
Dec 06, 2024
Viaarxiv icon

Sight View Constraint for Robust Point Cloud Registration

Add code
Sep 08, 2024
Figure 1 for Sight View Constraint for Robust Point Cloud Registration
Figure 2 for Sight View Constraint for Robust Point Cloud Registration
Figure 3 for Sight View Constraint for Robust Point Cloud Registration
Figure 4 for Sight View Constraint for Robust Point Cloud Registration
Viaarxiv icon

Robust Multi-Robot Global Localization with Unknown Initial Pose based on Neighbor Constraints

Add code
Jun 27, 2024
Figure 1 for Robust Multi-Robot Global Localization with Unknown Initial Pose based on Neighbor Constraints
Figure 2 for Robust Multi-Robot Global Localization with Unknown Initial Pose based on Neighbor Constraints
Figure 3 for Robust Multi-Robot Global Localization with Unknown Initial Pose based on Neighbor Constraints
Figure 4 for Robust Multi-Robot Global Localization with Unknown Initial Pose based on Neighbor Constraints
Viaarxiv icon