Talaat Khalil

Combining Lexical Features and a Supervised Learning Approach for Arabic Sentiment Analysis

Oct 23, 2017

Samhaa R. El-Beltagy, Talaat Khalil, Amal Halaby, Muhammad Hammad

Abstract: The importance of building sentiment analysis tools for Arabic social media has been recognized during the past couple of years, especially with the rapid increase in the number of Arabic social media users. One of the main difficulties in tackling this problem is that text within social media is mostly colloquial, with many dialects being used across platforms. In this paper, we present a set of features that were integrated with a machine-learning-based sentiment analysis model and applied to Egyptian, Saudi, Levantine, and MSA Arabic social media datasets. Many of the proposed features were derived through the use of an Arabic sentiment lexicon. The model also uses emoticon-based features, as well as features of the input text itself, such as the number of segments within the text, the length of the text, and whether the text ends with a question mark. We show that the presented features increased accuracy on six of the seven benchmark datasets we experimented with. Since the developed model outperforms all existing Arabic sentiment analysis systems that have publicly available datasets, we can state that this model represents the state of the art in Arabic sentiment analysis.

* arXiv admin note: This version has been removed because it is in violation of arXiv's copyright policy
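
The kinds of features the abstract lists (lexicon hits, emoticon counts, segment count, text length, trailing question mark) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the function name `extract_features`, the tiny word and emoticon sets, and the punctuation-based definition of a "segment" are hypothetical and are not taken from the paper, which does not specify them in this listing.

```python
import re

# Hypothetical toy resources for illustration only; the paper's actual
# sentiment lexicon and emoticon lists are not reproduced here.
POSITIVE_TERMS = {"رائع", "جميل"}
NEGATIVE_TERMS = {"سيء", "فظيع"}
POSITIVE_EMOTICONS = (":)", ":D")
NEGATIVE_EMOTICONS = (":(", ":'(")

# Latin and Arabic punctuation stripped from token edges before lexicon lookup.
EDGE_PUNCT = ".,!?؟،"


def extract_features(text):
    """Return a dict of simple surface features for one social media post."""
    tokens = text.split()
    words = [t.strip(EDGE_PUNCT) for t in tokens]
    # Assumption: a "segment" is a non-empty run of text between
    # sentence-level punctuation marks or line breaks.
    segments = [s for s in re.split(r"[.!?؟،,\n]+", text) if s.strip()]
    return {
        "num_segments": len(segments),
        "num_tokens": len(tokens),
        "num_chars": len(text),
        "ends_with_question": text.rstrip().endswith(("?", "؟")),
        "pos_lexicon_hits": sum(w in POSITIVE_TERMS for w in words),
        "neg_lexicon_hits": sum(w in NEGATIVE_TERMS for w in words),
        "pos_emoticons": sum(text.count(e) for e in POSITIVE_EMOTICONS),
        "neg_emoticons": sum(text.count(e) for e in NEGATIVE_EMOTICONS),
    }


if __name__ == "__main__":
    print(extract_features("الفيلم رائع :) هل شاهدته؟"))
```

A dictionary like this could be concatenated with bag-of-words features and passed to any supervised classifier; which classifier and which exact feature set the authors used is not stated in the abstract.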

Toward a full-scale neural machine translation in production: the Booking.com use case

Sep 25, 2017

Pavel Levin, Nishikant Dhanuka, Talaat Khalil, Fedor Kovalev, Maxim Khalilov

Abstract: While remarkable progress has been made in neural machine translation (NMT) research, there have not been many reports on its development and evaluation in practice. This paper tries to fill this gap by presenting some of our findings from building an in-house travel-domain NMT system in a large-scale e-commerce setting. The three major topics that we cover are optimization and training (including different optimization strategies and corpus sizes), handling real-world content, and evaluating results.

* 11 pages, 4 figures, presented at MT Summit XVI, Commercial MT Users and Translators Track
