Picture for Tao He

Tao He

LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs

Add code
Mar 14, 2025
Viaarxiv icon

FastVID: Dynamic Density Pruning for Fast Video Large Language Models

Add code
Mar 14, 2025
Viaarxiv icon

You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Add code
Feb 28, 2025
Viaarxiv icon

Qwen2.5-1M Technical Report

Add code
Jan 26, 2025
Viaarxiv icon

Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Add code
Jan 15, 2025
Figure 1 for Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving
Figure 2 for Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving
Figure 3 for Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving
Figure 4 for Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving
Viaarxiv icon

Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues

Add code
Dec 19, 2024
Figure 1 for Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues
Figure 2 for Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues
Figure 3 for Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues
Figure 4 for Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues
Viaarxiv icon

A Lightweight U-like Network Utilizing Neural Memory Ordinary Differential Equations for Slimming the Decoder

Add code
Dec 09, 2024
Viaarxiv icon

Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation

Add code
Oct 23, 2024
Viaarxiv icon

MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality

Add code
Sep 07, 2024
Figure 1 for MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Figure 2 for MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Figure 3 for MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Figure 4 for MuAP: Multi-step Adaptive Prompt Learning for Vision-Language Model with Missing Modality
Viaarxiv icon

Strengthening Layer Interaction via Dynamic Layer Attention

Add code
Jun 19, 2024
Viaarxiv icon