Picture for Hang Lv

Hang Lv

SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR

Add code
Dec 07, 2024
Viaarxiv icon

MuseGraph: Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining

Add code
Mar 13, 2024
Viaarxiv icon

Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation

Add code
Oct 22, 2023
Viaarxiv icon

Minimizing Sequential Confusion Error in Speech Command Recognition

Add code
Jul 04, 2022
Figure 1 for Minimizing Sequential Confusion Error in Speech Command Recognition
Figure 2 for Minimizing Sequential Confusion Error in Speech Command Recognition
Viaarxiv icon

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

Add code
Mar 29, 2022
Figure 1 for WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Figure 2 for WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Figure 3 for WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Figure 4 for WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Viaarxiv icon

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Add code
Oct 18, 2021
Figure 1 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 2 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 3 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Figure 4 for WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Viaarxiv icon

An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

Add code
Mar 16, 2021
Figure 1 for An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
Figure 2 for An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
Figure 3 for An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
Figure 4 for An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
Viaarxiv icon

Wake Word Detection with Streaming Transformers

Add code
Feb 08, 2021
Figure 1 for Wake Word Detection with Streaming Transformers
Figure 2 for Wake Word Detection with Streaming Transformers
Figure 3 for Wake Word Detection with Streaming Transformers
Figure 4 for Wake Word Detection with Streaming Transformers
Viaarxiv icon

Wake Word Detection with Alignment-Free Lattice-Free MMI

Add code
May 25, 2020
Figure 1 for Wake Word Detection with Alignment-Free Lattice-Free MMI
Figure 2 for Wake Word Detection with Alignment-Free Lattice-Free MMI
Figure 3 for Wake Word Detection with Alignment-Free Lattice-Free MMI
Figure 4 for Wake Word Detection with Alignment-Free Lattice-Free MMI
Viaarxiv icon

Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

Add code
Oct 15, 2019
Figure 1 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 2 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 3 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 4 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Viaarxiv icon