Picture for Tian Gan

Tian Gan

BMRL: Bi-Modal Guided Multi-Perspective Representation Learning for Zero-Shot Deepfake Attribution

Add code
Apr 19, 2025
Viaarxiv icon

Preview-based Category Contrastive Learning for Knowledge Distillation

Add code
Oct 18, 2024
Viaarxiv icon

Social Debiasing for Fair Multi-modal LLMs

Add code
Aug 13, 2024
Viaarxiv icon

SHE-Net: Syntax-Hierarchy-Enhanced Text-Video Retrieval

Add code
Apr 22, 2024
Viaarxiv icon

SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks

Add code
Jan 31, 2024
Viaarxiv icon

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

Add code
Jan 09, 2024
Viaarxiv icon

RTQ: Rethinking Video-language Understanding Based on Image-text Model

Add code
Dec 01, 2023
Figure 1 for RTQ: Rethinking Video-language Understanding Based on Image-text Model
Figure 2 for RTQ: Rethinking Video-language Understanding Based on Image-text Model
Figure 3 for RTQ: Rethinking Video-language Understanding Based on Image-text Model
Figure 4 for RTQ: Rethinking Video-language Understanding Based on Image-text Model
Viaarxiv icon

EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints

Add code
Aug 21, 2023
Figure 1 for EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Figure 2 for EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Figure 3 for EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Figure 4 for EVE: Efficient zero-shot text-based Video Editing with Depth Map Guidance and Temporal Consistency Constraints
Viaarxiv icon

Temporal Sentence Grounding in Streaming Videos

Add code
Aug 14, 2023
Viaarxiv icon

Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation

Add code
Mar 15, 2023
Viaarxiv icon