Hyopil Shin

RCScore: Quantifying Response Consistency in Large Language Models

Oct 30, 2025
Via arXiv

P-CoT: A Pedagogically-motivated Participatory Chain-of-Thought Prompting for Phonological Reasoning in LLMs

Jul 22, 2025
Via arXiv

KoBALT: Korean Benchmark For Advanced Linguistic Tasks

May 22, 2025
Via arXiv

MoFE: Mixture of Frozen Experts Architecture

Mar 09, 2025
Via arXiv

How does a Language-Specific Tokenizer affect LLMs?

Feb 18, 2025
Via arXiv

DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine

Nov 14, 2024
Via arXiv

KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models

Mar 25, 2024
Via arXiv

A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark

Mar 25, 2024
Via arXiv

Korean Bio-Medical Corpus for Medical Named Entity Recognition

Mar 24, 2024
Via arXiv

CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean

Feb 23, 2024
Via arXiv