Picture for Wen-Huang Cheng

Wen-Huang Cheng

National Yang Ming Chiao Tung University

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language

Add code
Sep 02, 2024
Viaarxiv icon

ReCorD: Reasoning and Correcting Diffusion for HOI Generation

Add code
Jul 25, 2024
Viaarxiv icon

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation

Add code
Jul 17, 2024
Figure 1 for The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Figure 2 for The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Figure 3 for The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Figure 4 for The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Viaarxiv icon

A DeNoising FPN With Transformer R-CNN for Tiny Object Detection

Add code
Jun 11, 2024
Figure 1 for A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Figure 2 for A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Figure 3 for A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Figure 4 for A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Viaarxiv icon

SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge

Add code
May 17, 2024
Figure 1 for SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
Figure 2 for SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
Figure 3 for SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
Figure 4 for SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge
Viaarxiv icon

An Investigation of Incorporating Mamba for Speech Enhancement

Add code
May 10, 2024
Figure 1 for An Investigation of Incorporating Mamba for Speech Enhancement
Figure 2 for An Investigation of Incorporating Mamba for Speech Enhancement
Figure 3 for An Investigation of Incorporating Mamba for Speech Enhancement
Figure 4 for An Investigation of Incorporating Mamba for Speech Enhancement
Viaarxiv icon

EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning

Add code
Apr 25, 2024
Viaarxiv icon

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

Add code
Apr 12, 2024
Viaarxiv icon

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection

Add code
Apr 11, 2024
Viaarxiv icon

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection

Add code
Apr 07, 2024
Viaarxiv icon