Picture for Yi-Chang Chen

Yi-Chang Chen

TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling

Add code
Apr 09, 2025
Viaarxiv icon

BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights

Add code
Jan 29, 2025
Viaarxiv icon

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

Add code
Jan 25, 2025
Viaarxiv icon

Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation

Add code
Dec 02, 2024
Viaarxiv icon

Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition

Add code
May 23, 2024
Viaarxiv icon

Breeze-7B Technical Report

Add code
Mar 05, 2024
Viaarxiv icon

Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite

Add code
Oct 02, 2023
Figure 1 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Figure 2 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Figure 3 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Viaarxiv icon

Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning

Add code
Jul 18, 2023
Viaarxiv icon

g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin

Add code
Mar 24, 2022
Figure 1 for g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin
Figure 2 for g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin
Figure 3 for g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin
Figure 4 for g2pW: A Conditional Weighted Softmax BERT for Polyphone Disambiguation in Mandarin
Viaarxiv icon

SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition

Add code
Feb 24, 2022
Figure 1 for SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
Figure 2 for SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
Figure 3 for SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition
Viaarxiv icon