Chanjun Park

Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching

Oct 24, 2024

Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs

Oct 16, 2024

Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models

Oct 07, 2024

InstaTrans: An Instruction-Aware Translation Framework for Non-English Instruction Datasets

Oct 02, 2024

1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models

Sep 30, 2024

Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora

Sep 15, 2024

Understanding LLM Development Through Longitudinal Study: Insights from the Open Ko-LLM Leaderboard

Sep 05, 2024

ChatLang-8: An LLM-Based Synthetic Data Generation Framework for Grammatical Error Correction

Jun 05, 2024

Enhancing Consistency and Role-Specific Knowledge Capturing by Rebuilding Fictional Character's Persona

Jun 01, 2024

Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark

May 31, 2024