Picture for Xinxiao Wu

Xinxiao Wu

Storyboard guided Alignment for Fine-grained Video Action Recognition

Add code
Oct 18, 2024
Viaarxiv icon

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

Add code
Mar 02, 2024
Viaarxiv icon

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification

Add code
May 25, 2023
Figure 1 for DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Figure 2 for DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Figure 3 for DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Figure 4 for DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
Viaarxiv icon

Meta-causal Learning for Single Domain Generalization

Add code
Apr 07, 2023
Viaarxiv icon

Learning What You Should Learn

Add code
Dec 11, 2022
Viaarxiv icon

Knowledge Prompting for Few-shot Action Recognition

Add code
Nov 22, 2022
Viaarxiv icon

Bootstrap Generalization Ability from Loss Landscape Perspective

Add code
Sep 18, 2022
Figure 1 for Bootstrap Generalization Ability from Loss Landscape Perspective
Figure 2 for Bootstrap Generalization Ability from Loss Landscape Perspective
Figure 3 for Bootstrap Generalization Ability from Loss Landscape Perspective
Figure 4 for Bootstrap Generalization Ability from Loss Landscape Perspective
Viaarxiv icon

Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos

Add code
May 12, 2022
Figure 1 for Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos
Figure 2 for Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos
Figure 3 for Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos
Figure 4 for Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos
Viaarxiv icon

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph

Add code
Jul 26, 2021
Figure 1 for Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Figure 2 for Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Figure 3 for Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Figure 4 for Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph
Viaarxiv icon

Adaptive Recursive Circle Framework for Fine-grained Action Recognition

Add code
Jul 25, 2021
Figure 1 for Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Figure 2 for Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Figure 3 for Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Figure 4 for Adaptive Recursive Circle Framework for Fine-grained Action Recognition
Viaarxiv icon