Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Apr 27, 2022

Ayan Sengupta, Tharun Suresh, Md Shad Akhtar, Tanmoy Chakraborty

Figure 1 for A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Figure 2 for A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Figure 3 for A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Figure 4 for A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Share this with someone who'll enjoy it:

Abstract:Being a popular mode of text-based communication in multilingual communities, code-mixing in online social media has became an important subject to study. Learning the semantics and morphology of code-mixed language remains a key challenge, due to scarcity of data and unavailability of robust and language-invariant representation learning technique. Any morphologically-rich language can benefit from character, subword, and word-level embeddings, aiding in learning meaningful correlations. In this paper, we explore a hierarchical transformer-based architecture (HIT) to learn the semantics of code-mixed languages. HIT consists of multi-headed self-attention and outer product attention components to simultaneously comprehend the semantic and syntactic structures of code-mixed texts. We evaluate the proposed method across 6 Indian languages (Bengali, Gujarati, Hindi, Tamil, Telugu and Malayalam) and Spanish for 9 NLP tasks on 17 datasets. The HIT model outperforms state-of-the-art code-mixed representation learning and multilingual language models in all tasks. We further demonstrate the generalizability of the HIT architecture using masked language modeling-based pre-training, zero-shot learning, and transfer learning approaches. Our empirical results show that the pre-training objectives significantly improve the performance on downstream tasks.

* 12 pages, 1 figure, 11 tables

View paper on

Share this with someone who'll enjoy it:

Title:A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer

Paper and Code