Picture for Shiyuan Huang

Shiyuan Huang

WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization

Add code
May 28, 2024
Viaarxiv icon

Characterizing Video Question Answering with Sparsified Inputs

Add code
Nov 27, 2023
Viaarxiv icon

Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations

Add code
Oct 17, 2023
Figure 1 for Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Figure 2 for Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Figure 3 for Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Figure 4 for Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations
Viaarxiv icon

Supervised Masked Knowledge Distillation for Few-Shot Transformers

Add code
Mar 29, 2023
Viaarxiv icon

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection

Add code
Mar 16, 2023
Viaarxiv icon

TempCLR: Temporal Alignment Representation with Contrastive Learning

Add code
Dec 28, 2022
Viaarxiv icon

Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy

Add code
Oct 18, 2022
Figure 1 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 2 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 3 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Figure 4 for Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy
Viaarxiv icon

Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval

Add code
Jun 05, 2022
Figure 1 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 2 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 3 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Figure 4 for Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval
Viaarxiv icon

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting

Add code
Apr 16, 2022
Figure 1 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 2 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 3 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Figure 4 for Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting
Viaarxiv icon

Few-Shot Object Detection with Fully Cross-Transformer

Add code
Mar 28, 2022
Figure 1 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 2 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 3 for Few-Shot Object Detection with Fully Cross-Transformer
Figure 4 for Few-Shot Object Detection with Fully Cross-Transformer
Viaarxiv icon