Xueyuan Chen

A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models

Nov 13, 2024

NC-NCD: Novel Class Discovery for Node Classification

Jul 25, 2024

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction

Jun 12, 2024

SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models

Jun 04, 2024

Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy

Mar 24, 2024

Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction

Jan 31, 2024

StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis

Dec 19, 2023

SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning

May 08, 2023

Structural Entropy Guided Graph Hierarchical Pooling

Jun 26, 2022

A Character-level Span-based Model for Mandarin Prosodic Structure Prediction

Mar 31, 2022