Picture for Keze Wang

Keze Wang

Category-Adaptive Cross-Modal Semantic Refinement and Transfer for Open-Vocabulary Multi-Label Recognition

Add code
Dec 09, 2024
Viaarxiv icon

Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body

Add code
Nov 21, 2024
Figure 1 for Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Figure 2 for Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Figure 3 for Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Figure 4 for Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
Viaarxiv icon

Improving Network Interpretability via Explanation Consistency Evaluation

Add code
Aug 08, 2024
Figure 1 for Improving Network Interpretability via Explanation Consistency Evaluation
Figure 2 for Improving Network Interpretability via Explanation Consistency Evaluation
Figure 3 for Improving Network Interpretability via Explanation Consistency Evaluation
Figure 4 for Improving Network Interpretability via Explanation Consistency Evaluation
Viaarxiv icon

On Training Data Influence of GPT Models

Add code
Apr 11, 2024
Viaarxiv icon

NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning

Add code
Mar 02, 2024
Figure 1 for NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Figure 2 for NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Figure 3 for NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Figure 4 for NeRF-VPT: Learning Novel View Representations with Neural Radiance Fields via View Prompt Tuning
Viaarxiv icon

Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention

Add code
Jan 15, 2024
Figure 1 for Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
Figure 2 for Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
Figure 3 for Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
Figure 4 for Video Super-Resolution Transformer with Masked Inter&Intra-Frame Attention
Viaarxiv icon

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

Add code
Dec 18, 2023
Viaarxiv icon

Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering

Add code
Nov 29, 2023
Figure 1 for Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
Figure 2 for Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
Figure 3 for Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
Figure 4 for Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering
Viaarxiv icon

SQLNet: Scale-Modulated Query and Localization Network for Few-Shot Class-Agnostic Counting

Add code
Nov 16, 2023
Viaarxiv icon

VisualProg Distiller: Learning to Fine-tune Non-differentiable Visual Programming Frameworks

Add code
Sep 18, 2023
Viaarxiv icon