Picture for Taian Guo

Taian Guo

Multimodal Label Relevance Ranking via Reinforcement Learning

Add code
Jul 18, 2024
Viaarxiv icon

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Add code
Jun 29, 2024
Figure 1 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 2 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 3 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Figure 4 for MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
Viaarxiv icon

Unified and Dynamic Graph for Temporal Character Grouping in Long Videos

Add code
Aug 29, 2023
Viaarxiv icon

D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation

Add code
Aug 08, 2023
Figure 1 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 2 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 3 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 4 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Viaarxiv icon

VLMAE: Vision-Language Masked Autoencoder

Add code
Aug 19, 2022
Figure 1 for VLMAE: Vision-Language Masked Autoencoder
Figure 2 for VLMAE: Vision-Language Masked Autoencoder
Figure 3 for VLMAE: Vision-Language Masked Autoencoder
Figure 4 for VLMAE: Vision-Language Masked Autoencoder
Viaarxiv icon

Exploiting Feature Diversity for Make-up Temporal Video Grounding

Add code
Aug 12, 2022
Figure 1 for Exploiting Feature Diversity for Make-up Temporal Video Grounding
Figure 2 for Exploiting Feature Diversity for Make-up Temporal Video Grounding
Figure 3 for Exploiting Feature Diversity for Make-up Temporal Video Grounding
Figure 4 for Exploiting Feature Diversity for Make-up Temporal Video Grounding
Viaarxiv icon

Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer

Add code
Jul 05, 2022
Figure 1 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 2 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 3 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Figure 4 for Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer
Viaarxiv icon

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution

Add code
Jul 23, 2020
Figure 1 for MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
Figure 2 for MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
Figure 3 for MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
Figure 4 for MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
Viaarxiv icon