Picture for Pengcheng Guo

Pengcheng Guo

SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR

Add code
Dec 07, 2024
Viaarxiv icon

Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge

Add code
Sep 16, 2024
Viaarxiv icon

NPU-NTU System for Voice Privacy 2024 Challenge

Add code
Sep 06, 2024
Figure 1 for NPU-NTU System for Voice Privacy 2024 Challenge
Figure 2 for NPU-NTU System for Voice Privacy 2024 Challenge
Viaarxiv icon

Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

Add code
Aug 28, 2024
Figure 1 for Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Figure 2 for Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Figure 3 for Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Figure 4 for Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
Viaarxiv icon

Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

Add code
Aug 20, 2024
Viaarxiv icon

Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

Add code
Aug 07, 2024
Figure 1 for Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Figure 2 for Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Figure 3 for Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Figure 4 for Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Viaarxiv icon

CRMSP: A Semi-supervised Approach for Key Information Extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling

Add code
Jul 19, 2024
Viaarxiv icon

MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement

Add code
Jul 16, 2024
Figure 1 for MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement
Figure 2 for MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement
Figure 3 for MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement
Figure 4 for MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement
Viaarxiv icon

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix

Add code
May 17, 2024
Figure 1 for Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix
Figure 2 for Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix
Figure 3 for Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix
Figure 4 for Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix
Viaarxiv icon

Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

Add code
May 06, 2024
Figure 1 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 2 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 3 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Figure 4 for Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets
Viaarxiv icon