Picture for Helen Meng

Helen Meng

Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection

Add code
Dec 09, 2024
Viaarxiv icon

Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease

Add code
Nov 28, 2024
Viaarxiv icon

A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models

Add code
Nov 13, 2024
Viaarxiv icon

Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models

Add code
Nov 12, 2024
Viaarxiv icon

Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains

Add code
Oct 24, 2024
Figure 1 for Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains
Figure 2 for Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains
Figure 3 for Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains
Figure 4 for Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains
Viaarxiv icon

Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech

Add code
Sep 22, 2024
Figure 1 for Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech
Figure 2 for Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech
Figure 3 for Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech
Figure 4 for Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech
Viaarxiv icon

Speaking from Coarse to Fine: Improving Neural Codec Language Model via Multi-Scale Speech Coding and Generation

Add code
Sep 18, 2024
Viaarxiv icon

Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions

Add code
Sep 13, 2024
Figure 1 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 2 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 3 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Figure 4 for Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions
Viaarxiv icon

SongCreator: Lyrics-based Universal Song Generation

Add code
Sep 09, 2024
Figure 1 for SongCreator: Lyrics-based Universal Song Generation
Figure 2 for SongCreator: Lyrics-based Universal Song Generation
Figure 3 for SongCreator: Lyrics-based Universal Song Generation
Figure 4 for SongCreator: Lyrics-based Universal Song Generation
Viaarxiv icon

SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis

Add code
Sep 02, 2024
Viaarxiv icon