Picture for Tao Jin

Tao Jin

Andrew

HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval

Add code
Jan 22, 2026
Viaarxiv icon

Delving Deeper: Hierarchical Visual Perception for Robust Video-Text Retrieval

Add code
Jan 19, 2026
Viaarxiv icon

Hybrid LLM and Higher-Order Quantum Approximate Optimization for CSA Collateral Management

Add code
Oct 30, 2025
Viaarxiv icon

Chat-Driven Text Generation and Interaction for Person Retrieval

Add code
Sep 16, 2025
Viaarxiv icon

SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

Add code
Sep 04, 2025
Figure 1 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 2 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 3 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Figure 4 for SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Viaarxiv icon

TAP: Parameter-efficient Task-Aware Prompting for Adverse Weather Removal

Add code
Aug 11, 2025
Viaarxiv icon

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval

Add code
Jun 17, 2025
Viaarxiv icon

IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models

Add code
May 30, 2025
Viaarxiv icon

Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation

Add code
May 30, 2025
Viaarxiv icon

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

Add code
May 20, 2025
Figure 1 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 2 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 3 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Figure 4 for TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
Viaarxiv icon