Picture for Tong Xu

Tong Xu

An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation

Add code
Jan 25, 2025
Viaarxiv icon

A Contrastive Pretrain Model with Prompt Tuning for Multi-center Medication Recommendation

Add code
Dec 28, 2024
Viaarxiv icon

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

Add code
Dec 02, 2024
Figure 1 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 2 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 3 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 4 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Viaarxiv icon

ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval

Add code
Nov 24, 2024
Figure 1 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 2 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 3 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Figure 4 for ScalingNote: Scaling up Retrievers with Large Language Models for Real-World Dense Retrieval
Viaarxiv icon

Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain

Add code
Sep 26, 2024
Figure 1 for Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 2 for Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 3 for Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 4 for Verti-Selector: Automatic Curriculum Learning for Wheeled Mobility on Vertically Challenging Terrain
Viaarxiv icon

Generating Event-oriented Attribution for Movies via Two-Stage Prefix-Enhanced Multimodal LLM

Add code
Sep 14, 2024
Viaarxiv icon

PIETRA: Physics-Informed Evidential Learning for Traversing Out-of-Distribution Terrain

Add code
Sep 04, 2024
Viaarxiv icon

Reinforcement Learning for Wheeled Mobility on Vertically Challenging Terrain

Add code
Sep 04, 2024
Figure 1 for Reinforcement Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 2 for Reinforcement Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 3 for Reinforcement Learning for Wheeled Mobility on Vertically Challenging Terrain
Figure 4 for Reinforcement Learning for Wheeled Mobility on Vertically Challenging Terrain
Viaarxiv icon

VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation

Add code
Aug 29, 2024
Figure 1 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 2 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 3 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Figure 4 for VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation
Viaarxiv icon

An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models

Add code
Aug 21, 2024
Figure 1 for An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
Figure 2 for An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
Figure 3 for An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
Figure 4 for An Asymptotically Optimal Coordinate Descent Algorithm for Learning Bayesian Networks from Gaussian Models
Viaarxiv icon