Dongjun Jang

RCScore: Quantifying Response Consistency in Large Language Models

Oct 30, 2025

P-CoT: A Pedagogically-motivated Participatory Chain-of-Thought Prompting for Phonological Reasoning in LLMs

Jul 22, 2025

KoBALT: Korean Benchmark For Advanced Linguistic Tasks

May 22, 2025

DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine

Nov 14, 2024

A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark

Mar 25, 2024

KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models

Mar 25, 2024

Korean Bio-Medical Corpus for Medical Named Entity Recognition

Mar 24, 2024

CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean

Feb 23, 2024

Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models

Nov 30, 2023

DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

Nov 23, 2023