Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hesham Al-Bataineh

Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions

Sep 19, 2019

Hesham Al-Bataineh, Wael Farhan, Ahmad Mustafa, Haitham Seelawi, Hussein T. Al-Natsheh

Figure 1 for Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions

Figure 2 for Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions

Figure 3 for Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions

Figure 4 for Deep Contextualized Pairwise Semantic Similarity for Arabic Language Questions

Abstract:Question semantic similarity is a challenging and active research problem that is very useful in many NLP applications, such as detecting duplicate questions in community question answering platforms such as Quora. Arabic is considered to be an under-resourced language, has many dialects, and rich in morphology. Combined together, these challenges make identifying semantically similar questions in Arabic even more difficult. In this paper, we introduce a novel approach to tackle this problem, and test it on two benchmarks; one for Modern Standard Arabic (MSA), and another for the 24 major Arabic dialects. We are able to show that our new system outperforms state-of-the-art approaches by achieving 93% F1-score on the MSA benchmark and 82% on the dialectical one. This is achieved by utilizing contextualized word representations (ELMo embeddings) trained on a text corpus containing MSA and dialectic sentences. This in combination with a pairwise fine-grained similarity layer, helps our question-to-question similarity model to generalize predictions on different dialects while being trained only on question-to-question MSA data.

* Accepted at ICTAI 2019

Via

Access Paper or Ask Questions

NSURL-2019 Shared Task 8: Semantic Question Similarity in Arabic

Sep 12, 2019

Haitham Seelawi, Ahmad Mustafa, Hesham Al-Bataineh, Wael Farhan, Hussein T. Al-Natsheh

Figure 1 for NSURL-2019 Shared Task 8: Semantic Question Similarity in Arabic

Figure 2 for NSURL-2019 Shared Task 8: Semantic Question Similarity in Arabic

Figure 3 for NSURL-2019 Shared Task 8: Semantic Question Similarity in Arabic

Figure 4 for NSURL-2019 Shared Task 8: Semantic Question Similarity in Arabic

Abstract:Question semantic similarity (Q2Q) is a challenging task that is very useful in many NLP applications, such as detecting duplicate questions and question answering systems. In this paper, we present the results and findings of the shared task (Semantic Question Similarity in Arabic). The task was organized as part of the first workshop on NLP Solutions for Under Resourced Languages (NSURL 2019) The goal of the task is to predict whether two questions are semantically similar or not, even if they are phrased differently. A total of 9 teams participated in the task. The datasets created for this task are made publicly available to support further research on Arabic Q2Q.

* 8 pages, 2 figure, 3 tables, conference paper

Via

Access Paper or Ask Questions