Frank Zhang

Jack

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

Mar 14, 2024
Via arXiv

Pushing the performances of ASR models on English and Spanish accents

Dec 22, 2022
Via arXiv

Scaling ASR Improves Zero and Few Shot Learning

Nov 29, 2021
Via arXiv

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings

Oct 08, 2021
Via arXiv

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

Aug 04, 2021
Via arXiv

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models

Jul 09, 2021
Via arXiv

Improving RNN Transducer Based ASR with Auxiliary Tasks

Nov 09, 2020
Via arXiv

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition

Nov 03, 2020
Via arXiv

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications

Oct 29, 2020
Via arXiv