Xavier Alameda-Pineda

ROBOTLEARN

Diffusion-based Unsupervised Audio-visual Speech Enhancement

Oct 04, 2024
Via arXiv

Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking

Jul 16, 2024
Via arXiv

MEGA: Masked Generative Autoencoder for Human Mesh Recovery

May 29, 2024
Via arXiv

Socially Pertinent Robots in Gerontological Healthcare

Apr 11, 2024
Via arXiv

VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space

Dec 13, 2023
Via arXiv

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

Dec 07, 2023
Via arXiv

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers

Aug 18, 2023
Via arXiv

A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation

Jul 04, 2023
Via arXiv

Semi-supervised learning made simple with self-supervised clustering

Jun 13, 2023
Via arXiv

Unsupervised speech enhancement with deep dynamical generative speech and noise models

Jun 13, 2023
Via arXiv