Picture for Shiqi Yang

Shiqi Yang

Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control

Add code
Dec 10, 2024
Viaarxiv icon

OpenMU: Your Swiss Army Knife for Music Understanding

Add code
Oct 21, 2024
Viaarxiv icon

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models

Add code
Oct 08, 2024
Viaarxiv icon

Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models

Add code
Oct 02, 2024
Figure 1 for Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Figure 2 for Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Figure 3 for Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Figure 4 for Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models
Viaarxiv icon

ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation

Add code
Aug 21, 2024
Figure 1 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 2 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 3 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Figure 4 for ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
Viaarxiv icon

Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning

Add code
Jul 03, 2024
Viaarxiv icon

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Add code
Jul 01, 2024
Figure 1 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 2 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 3 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Figure 4 for Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
Viaarxiv icon

SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond

Add code
Jun 26, 2024
Viaarxiv icon

Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation

Add code
May 23, 2024
Figure 1 for Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Figure 2 for Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Figure 3 for Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Figure 4 for Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Viaarxiv icon

DestripeCycleGAN: Stripe Simulation CycleGAN for Unsupervised Infrared Image Destriping

Add code
Feb 14, 2024
Viaarxiv icon