Picture for Chan-Jan Hsu

Chan-Jan Hsu

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

Add code
Jan 25, 2025
Viaarxiv icon

Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation

Add code
Dec 02, 2024
Viaarxiv icon

FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web

Add code
Nov 25, 2024
Figure 1 for FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web
Figure 2 for FineWeb-zhtw: Scalable Curation of Traditional Chinese Text Data from the Web
Viaarxiv icon

Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

Add code
Nov 11, 2024
Viaarxiv icon

Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition

Add code
May 23, 2024
Viaarxiv icon

Breeze-7B Technical Report

Add code
Mar 05, 2024
Viaarxiv icon

Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite

Add code
Oct 02, 2023
Figure 1 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Figure 2 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Figure 3 for Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Viaarxiv icon

Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning

Add code
Jul 18, 2023
Viaarxiv icon

Extending the Pre-Training of BLOOM for Improved Support of Traditional Chinese: Models, Methods and Results

Add code
Mar 08, 2023
Viaarxiv icon

Bridging Speech and Textual Pre-trained Models with Unsupervised ASR

Add code
Nov 06, 2022
Viaarxiv icon