Picture for Kyubyong Park

Kyubyong Park

K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings

Add code
Oct 24, 2023
Viaarxiv icon

A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models

Add code
Jun 06, 2023
Viaarxiv icon

An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks

Add code
Oct 06, 2020
Figure 1 for An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks
Figure 2 for An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks
Figure 3 for An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks
Figure 4 for An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks
Viaarxiv icon

KoParadigm: A Korean Conjugation Paradigm Generator

Add code
Apr 28, 2020
Figure 1 for KoParadigm: A Korean Conjugation Paradigm Generator
Figure 2 for KoParadigm: A Korean Conjugation Paradigm Generator
Figure 3 for KoParadigm: A Korean Conjugation Paradigm Generator
Figure 4 for KoParadigm: A Korean Conjugation Paradigm Generator
Viaarxiv icon

An Empirical Study of Invariant Risk Minimization

Add code
Apr 10, 2020
Figure 1 for An Empirical Study of Invariant Risk Minimization
Figure 2 for An Empirical Study of Invariant Risk Minimization
Figure 3 for An Empirical Study of Invariant Risk Minimization
Figure 4 for An Empirical Study of Invariant Risk Minimization
Viaarxiv icon

KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding

Add code
Apr 08, 2020
Figure 1 for KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Figure 2 for KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Figure 3 for KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Figure 4 for KorNLI and KorSTS: New Benchmark Datasets for Korean Natural Language Understanding
Viaarxiv icon

g2pM: A Neural Grapheme-to-Phoneme Conversion Package for MandarinChinese Based on a New Open Benchmark Dataset

Add code
Apr 07, 2020
Figure 1 for g2pM: A Neural Grapheme-to-Phoneme Conversion Package for MandarinChinese Based on a New Open Benchmark Dataset
Figure 2 for g2pM: A Neural Grapheme-to-Phoneme Conversion Package for MandarinChinese Based on a New Open Benchmark Dataset
Figure 3 for g2pM: A Neural Grapheme-to-Phoneme Conversion Package for MandarinChinese Based on a New Open Benchmark Dataset
Figure 4 for g2pM: A Neural Grapheme-to-Phoneme Conversion Package for MandarinChinese Based on a New Open Benchmark Dataset
Viaarxiv icon

Jejueo Datasets for Machine Translation and Speech Synthesis

Add code
Nov 27, 2019
Figure 1 for Jejueo Datasets for Machine Translation and Speech Synthesis
Figure 2 for Jejueo Datasets for Machine Translation and Speech Synthesis
Figure 3 for Jejueo Datasets for Machine Translation and Speech Synthesis
Figure 4 for Jejueo Datasets for Machine Translation and Speech Synthesis
Viaarxiv icon

word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs

Add code
Nov 27, 2019
Figure 1 for word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Figure 2 for word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Figure 3 for word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Figure 4 for word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
Viaarxiv icon

A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning

Add code
Jul 02, 2019
Figure 1 for A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning
Figure 2 for A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning
Figure 3 for A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning
Figure 4 for A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning
Viaarxiv icon