Tian-Hao Zhang

I2TTS: Image-indicated Immersive Text-to-speech Synthesis with Spatial Perception

Nov 20, 2024
Via arXiv

Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores

Jun 06, 2024
Via arXiv

Say Goodbye to RNN-T Loss: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition

Jul 27, 2023
Via arXiv

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding

May 23, 2023
Via arXiv

Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition

Sep 14, 2021
Via arXiv