Jiahe Lei

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Feb 06, 2025
Via arXiv

pTSE-T: Presentation Target Speaker Extraction using Unaligned Text Cues

Nov 05, 2024
Via arXiv

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Aug 30, 2024
Via arXiv

MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

Feb 20, 2024
Via arXiv

TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering

Oct 23, 2023
Via arXiv

HRoT: Hybrid prompt strategy and Retrieval of Thought for Table-Text Hybrid Question Answering

Sep 22, 2023
Via arXiv

MMHQA-ICL: Multimodal In-context Learning for Hybrid Question Answering over Text, Tables and Images

Sep 09, 2023
Via arXiv