Picture for Yuhong Zhang

Yuhong Zhang

HumanMM: Global Human Motion Recovery from Multi-shot Videos

Add code
Mar 10, 2025
Viaarxiv icon

Frequency-Based Alignment of EEG and Audio Signals Using Contrastive Learning and SincNet for Auditory Attention Detection

Add code
Mar 06, 2025
Viaarxiv icon

Consistent Video Colorization via Palette Guidance

Add code
Jan 31, 2025
Figure 1 for Consistent Video Colorization via Palette Guidance
Figure 2 for Consistent Video Colorization via Palette Guidance
Figure 3 for Consistent Video Colorization via Palette Guidance
Figure 4 for Consistent Video Colorization via Palette Guidance
Viaarxiv icon

Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset

Add code
Jan 09, 2025
Figure 1 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 2 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 3 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Figure 4 for Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset
Viaarxiv icon

Towards Effective Graph Rationalization via Boosting Environment Diversity

Add code
Dec 17, 2024
Figure 1 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 2 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 3 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Figure 4 for Towards Effective Graph Rationalization via Boosting Environment Diversity
Viaarxiv icon

DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding

Add code
Nov 21, 2024
Figure 1 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 2 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 3 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Figure 4 for DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding
Viaarxiv icon

Diff-Restorer: Unleashing Visual Prompts for Diffusion-based Universal Image Restoration

Add code
Jul 04, 2024
Viaarxiv icon

MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration

Add code
Jul 04, 2024
Figure 1 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 2 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 3 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Figure 4 for MRIR: Integrating Multimodal Insights for Diffusion-based Realistic Image Restoration
Viaarxiv icon

CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition

Add code
Jun 12, 2024
Figure 1 for CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition
Figure 2 for CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition
Figure 3 for CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition
Figure 4 for CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition
Viaarxiv icon

Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior

Add code
Apr 25, 2024
Figure 1 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 2 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 3 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Figure 4 for Multimodal Semantic-Aware Automatic Colorization with Diffusion Prior
Viaarxiv icon