Picture for Mingshuang Luo

Mingshuang Luo

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Add code
Nov 22, 2024
Viaarxiv icon

M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation

Add code
May 29, 2024
Viaarxiv icon

Fast and parallel decoding for transducer

Add code
Oct 31, 2022
Viaarxiv icon

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Add code
Oct 31, 2022
Figure 1 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 2 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 3 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Figure 4 for Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation
Viaarxiv icon

Pruned RNN-T for fast, memory-efficient ASR training

Add code
Jun 23, 2022
Figure 1 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 2 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 3 for Pruned RNN-T for fast, memory-efficient ASR training
Figure 4 for Pruned RNN-T for fast, memory-efficient ASR training
Viaarxiv icon

Synchronous Bidirectional Learning for Multilingual Lip Reading

Add code
May 12, 2020
Figure 1 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 2 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 3 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Figure 4 for Synchronous Bidirectional Learning for Multilingual Lip Reading
Viaarxiv icon

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading

Add code
Mar 09, 2020
Figure 1 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 2 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 3 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Figure 4 for Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading
Viaarxiv icon