Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

SangHun Im

TARDiS : Text Augmentation for Refining Diversity and Separability

Jan 06, 2025

Kyungmin Kim, SangHun Im, GiBaeg Kim, Heung-Seon Oh

Figure 1 for TARDiS : Text Augmentation for Refining Diversity and Separability

Figure 2 for TARDiS : Text Augmentation for Refining Diversity and Separability

Figure 3 for TARDiS : Text Augmentation for Refining Diversity and Separability

Figure 4 for TARDiS : Text Augmentation for Refining Diversity and Separability

Abstract:Text augmentation (TA) is a critical technique for text classification, especially in few-shot settings. This paper introduces a novel LLM-based TA method, TARDiS, to address challenges inherent in the generation and alignment stages of two-stage TA methods. For the generation stage, we propose two generation processes, SEG and CEG, incorporating multiple class-specific prompts to enhance diversity and separability. For the alignment stage, we introduce a class adaptation (CA) method to ensure that generated examples align with their target classes through verification and modification. Experimental results demonstrate TARDiS's effectiveness, outperforming state-of-the-art LLM-based TA methods in various few-shot text classification tasks. An in-depth analysis confirms the detailed behaviors at each stage.

* 10 pages

Via

Access Paper or Ask Questions

Hierarchy Decoder is All You Need To Text Classification

Nov 22, 2021

SangHun Im, Gibaeg Kim, Heung-Seon Oh, Seongung Jo, Donghwan Kim

Figure 1 for Hierarchy Decoder is All You Need To Text Classification

Figure 2 for Hierarchy Decoder is All You Need To Text Classification

Figure 3 for Hierarchy Decoder is All You Need To Text Classification

Figure 4 for Hierarchy Decoder is All You Need To Text Classification

Abstract:Hierarchical text classification (HTC) to a taxonomy is essential for various real applications butchallenging since HTC models often need to process a large volume of data that are severelyimbalanced and have hierarchy dependencies. Existing local and global approaches use deep learningto improve HTC by reducing the time complexity and incorporating the hierarchy dependencies.However, it is difficult to satisfy both conditions in a single HTC model. This paper proposes ahierarchy decoder (HiDEC) that uses recursive hierarchy decoding based on an encoder-decoderarchitecture. The key idea of the HiDEC involves decoding a context matrix into a sub-hierarchysequence using recursive hierarchy decoding, while staying aware of hierarchical dependenciesand level information. The HiDEC is a unified model that incorporates the benefits of existingapproaches, thereby alleviating the aforementioned difficulties without any trade-off. In addition, itcan be applied to both single- and multi-label classification with a minor modification. The superiorityof the proposed model was verified on two benchmark datasets (WOS-46985 and RCV1) with anexplanation of the reasons for its success

* 13 pages, 5 figures

Via

Access Paper or Ask Questions