Picture for Xiangyu Zeng

Xiangyu Zeng

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Add code
Oct 25, 2024
Viaarxiv icon

Spatiotemporal Attention Enhances Lidar-Based Robot Navigation in Dynamic Environments

Add code
Oct 30, 2023
Viaarxiv icon

Understanding AI Cognition: A Neural Module for Inference Inspired by Human Memory Mechanisms

Add code
Oct 01, 2023
Viaarxiv icon

Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning

Add code
Mar 02, 2023
Viaarxiv icon

Single View Physical Distance Estimation using Human Pose

Add code
Jun 18, 2021
Figure 1 for Single View Physical Distance Estimation using Human Pose
Figure 2 for Single View Physical Distance Estimation using Human Pose
Figure 3 for Single View Physical Distance Estimation using Human Pose
Figure 4 for Single View Physical Distance Estimation using Human Pose
Viaarxiv icon

User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks

Add code
Sep 18, 2018
Figure 1 for User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks
Figure 2 for User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks
Figure 3 for User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks
Figure 4 for User Information Augmented Semantic Frame Parsing using Coarse-to-Fine Neural Networks
Viaarxiv icon

Learning Speech Rate in Speech Recognition

Add code
Jun 02, 2015
Figure 1 for Learning Speech Rate in Speech Recognition
Figure 2 for Learning Speech Rate in Speech Recognition
Figure 3 for Learning Speech Rate in Speech Recognition
Figure 4 for Learning Speech Rate in Speech Recognition
Viaarxiv icon