Picture for Yao Du

Yao Du

EAGLE: Elevating Geometric Reasoning through LLM-empowered Visual Instruction Tuning

Add code
Aug 21, 2024
Viaarxiv icon

Teach CLIP to Develop a Number Sense for Ordinal Regression

Add code
Aug 07, 2024
Viaarxiv icon

MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

Add code
Jun 11, 2024
Viaarxiv icon

Toward Efficient Visual Gyroscopes: Spherical Moments, Harmonics Filtering, and Masking Techniques for Spherical Camera Applications

Add code
Apr 02, 2024
Viaarxiv icon

Sign Language Production with Latent Motion Transformer

Add code
Dec 20, 2023
Viaarxiv icon

ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers

Add code
Aug 30, 2023
Viaarxiv icon

Inferring Attracting Basins of Power System with Machine Learning

Add code
May 20, 2023
Viaarxiv icon

Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation

Add code
Aug 19, 2022
Figure 1 for Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation
Figure 2 for Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation
Figure 3 for Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation
Figure 4 for Vector Quantized Diffusion Model with CodeUnet for Text-to-Sign Pose Sequences Generation
Viaarxiv icon

Multi-Scale Local-Temporal Similarity Fusion for Continuous Sign Language Recognition

Add code
Jul 27, 2021
Figure 1 for Multi-Scale Local-Temporal Similarity Fusion for Continuous Sign Language Recognition
Figure 2 for Multi-Scale Local-Temporal Similarity Fusion for Continuous Sign Language Recognition
Figure 3 for Multi-Scale Local-Temporal Similarity Fusion for Continuous Sign Language Recognition
Figure 4 for Multi-Scale Local-Temporal Similarity Fusion for Continuous Sign Language Recognition
Viaarxiv icon