Picture for Mingshuang Luo

Mingshuang Luo

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

Add code
Jan 29, 2026
Viaarxiv icon

CLIP-Guided Adaptable Self-Supervised Learning for Human-Centric Visual Tasks

Add code
Jan 19, 2026
Viaarxiv icon

FlowAct-R1: Towards Interactive Humanoid Video Generation

Add code
Jan 15, 2026
Viaarxiv icon

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Add code
Nov 22, 2024
Viaarxiv icon

M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation

Add code
May 29, 2024
Viaarxiv icon

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Add code
Oct 31, 2022
Figure 1 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 2 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 3 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 4 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Viaarxiv icon

Fast and parallel decoding for transducer

Add code
Oct 31, 2022
Viaarxiv icon

Pruned RNN-T for fast, memory-efficient ASR training

Add code
Jun 23, 2022
Figure 1 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 2 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 3 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 4 for Pruned RNN-T for fast, memory-efficient ASR training
Viaarxiv icon

Synchronous Bidirectional Learning for Multilingual Lip Reading

Add code
May 12, 2020
Figure 1 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 2 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 3 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 4 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Viaarxiv icon

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading

Add code
Mar 09, 2020
Figure 1 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 2 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 3 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 4 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Viaarxiv icon