Picture for Shiyu Hu

Shiyu Hu

Students Rather Than Experts: A New AI For Education Pipeline To Model More Human-Like And Personalised Early Adolescences

Add code
Oct 21, 2024
Viaarxiv icon

Can LVLMs Describe Videos like Humans? A Five-in-One Video Annotations Benchmark for Better Human-Machine Comparison

Add code
Oct 20, 2024
Viaarxiv icon

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Add code
Oct 03, 2024
Viaarxiv icon

Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark

Add code
Sep 13, 2024
Figure 1 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 2 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 3 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Viaarxiv icon

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Add code
May 20, 2024
Viaarxiv icon

BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision

Add code
Feb 07, 2024
Viaarxiv icon

SOTVerse: A User-defined Task Space of Single Object Tracking

Add code
Apr 15, 2022
Figure 1 for SOTVerse: A User-defined Task Space of Single Object Tracking
Figure 2 for SOTVerse: A User-defined Task Space of Single Object Tracking
Figure 3 for SOTVerse: A User-defined Task Space of Single Object Tracking
Figure 4 for SOTVerse: A User-defined Task Space of Single Object Tracking
Viaarxiv icon

Global Instance Tracking: Locating Target More Like Humans

Add code
Feb 26, 2022
Viaarxiv icon