Picture for Xixin Wu

Xixin Wu

Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution

Add code
Mar 03, 2025
Viaarxiv icon

Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data

Add code
Jan 19, 2025
Figure 1 for Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
Figure 2 for Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
Figure 3 for Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
Figure 4 for Leveraging Chain of Thought towards Empathetic Spoken Dialogue without Corresponding Question-Answering Data
Viaarxiv icon

DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions

Add code
Jan 08, 2025
Figure 1 for DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions
Figure 2 for DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions
Figure 3 for DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions
Figure 4 for DrawSpeech: Expressive Speech Synthesis Using Prosodic Sketches as Control Conditions
Viaarxiv icon

Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives

Add code
Jan 07, 2025
Figure 1 for Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives
Figure 2 for Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives
Figure 3 for Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives
Figure 4 for Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives
Viaarxiv icon

Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT

Add code
Jan 02, 2025
Figure 1 for Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
Figure 2 for Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
Figure 3 for Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
Figure 4 for Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT
Viaarxiv icon

learning discriminative features from spectrograms using center loss for speech emotion recognition

Add code
Jan 02, 2025
Viaarxiv icon

Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema

Add code
Dec 30, 2024
Figure 1 for Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema
Figure 2 for Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema
Viaarxiv icon

Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection

Add code
Dec 09, 2024
Viaarxiv icon

Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease

Add code
Nov 28, 2024
Figure 1 for Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease
Figure 2 for Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease
Figure 3 for Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease
Figure 4 for Devising a Set of Compact and Explainable Spoken Language Feature for Screening Alzheimer's Disease
Viaarxiv icon

Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models

Add code
Nov 12, 2024
Viaarxiv icon