Abstract: In light of unprecedented increases in the popularity of the internet and social media, comment moderation has never been a more relevant task. Semi-automated comment moderation systems greatly aid human moderators, either by classifying comments automatically or by allowing moderators to prioritize which comments to consider first. However, the notion of inappropriate content is often subjective, and such content can be conveyed in many subtle and indirect ways. In this work, we propose CoRAL -- a linguistically and culturally aware Croatian Abusive dataset covering phenomena of implicitness and reliance on local and global context. We show experimentally that current models degrade when comments are not explicit, and degrade further when language skill and contextual knowledge are required to interpret the comment.
Abstract: Moderation of reader comments is a significant problem for online news platforms. Here, we experiment with models for automatic moderation, using a dataset of comments from a popular Croatian newspaper. Our analysis shows that while comments that violate the moderation rules mostly share common linguistic and thematic features, their content varies across the different sections of the newspaper. We therefore make our models topic-aware, incorporating semantic features from a topic model into the classification decision. Our results show that topic information improves the performance of the model, increases its confidence in correct outputs, and helps us understand the model's outputs.
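As an illustrative sketch of the topic-aware setup described above (not the paper's exact model), topic information can be injected by concatenating each comment's topic-model distribution with its text features before classification; the TF-IDF features, LDA topic model, logistic-regression classifier, and placeholder data below are all assumptions.

from sklearn.feature_extraction.text import TfidfVectorizer, CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.linear_model import LogisticRegression
import numpy as np

# Placeholder comments and labels (1 = violates the moderation rules).
comments = ["example reader comment one", "another example reader comment"]
labels = [0, 1]

# Text features for the classifier.
tfidf = TfidfVectorizer()
X_text = tfidf.fit_transform(comments)

# Topic features from an LDA topic model (per-comment topic distribution).
counts = CountVectorizer().fit_transform(comments)
lda = LatentDirichletAllocation(n_components=10, random_state=0)
X_topics = lda.fit_transform(counts)

# Concatenating text and topic features makes the classifier "topic-aware".
X = np.hstack([X_text.toarray(), X_topics])
clf = LogisticRegression(max_iter=1000).fit(X, labels)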
Abstract: Neural Machine Translation (NMT) has seen tremendous growth in less than ten years and has already entered a mature phase. Although NMT is the most widely used solution for Machine Translation, its performance on low-resource language pairs remains sub-optimal compared to that on high-resource pairs, due to the unavailability of large parallel corpora. The implementation of NMT techniques for low-resource language pairs has therefore been receiving the spotlight in recent NMT research, leading to a substantial amount of work reported on this topic. This paper presents a detailed survey of research advancements in low-resource language NMT (LRL-NMT), along with a quantitative analysis aimed at identifying the most popular solutions. Based on our findings from reviewing previous work, this survey provides a set of guidelines for selecting a suitable NMT technique for a given LRL data setting. It also presents a holistic view of the LRL-NMT research landscape and offers a list of recommendations to further enhance research efforts on LRL-NMT.
Abstract: The multimodal models used in the emerging field at the intersection of computational linguistics and computer vision implement the bottom-up processing of the 'Hub and Spoke' architecture proposed in cognitive science to represent how the brain processes and combines multi-sensory inputs. In particular, the Hub is implemented as a neural network encoder. We investigate the effect on this encoder of various vision-and-language tasks proposed in the literature: visual question answering, visual reference resolution, and visually grounded dialogue. To measure the quality of the representations learned by the encoder, we use two kinds of analyses. First, we evaluate the encoder pre-trained on the different vision-and-language tasks on an existing diagnostic task designed to assess multimodal semantic understanding. Second, we carry out a battery of analyses aimed at studying how the encoder merges and exploits the two modalities.
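A minimal sketch of the kind of 'Hub' fusion encoder discussed above, assuming pre-extracted image and text feature vectors; the layer sizes and the fusion-by-concatenation design are illustrative assumptions rather than the architecture used in any of the studied tasks.

import torch
import torch.nn as nn

class HubEncoder(nn.Module):
    """Sketch of a multimodal "hub" that merges vision and language features."""
    def __init__(self, img_dim=2048, txt_dim=512, hidden=512):
        super().__init__()
        self.fuse = nn.Sequential(
            nn.Linear(img_dim + txt_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, hidden),
        )

    def forward(self, img_feats, txt_feats):
        # Concatenate the two modalities and project to a joint representation.
        return self.fuse(torch.cat([img_feats, txt_feats], dim=-1))

encoder = HubEncoder()
joint = encoder(torch.randn(4, 2048), torch.randn(4, 512))  # batch of 4 image-text pairs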
Abstract: We are interested in understanding how the ability to ground language in vision interacts with other abilities at play in dialogue, such as asking a series of questions to obtain the necessary information to perform a certain task. With this aim, we develop a Questioner agent in the context of the GuessWhat?! game. Our model exploits a neural network architecture to build a continuous representation of the dialogue state that integrates information from the visual and linguistic modalities and conditions future action. To play the GuessWhat?! game, the Questioner agent has to be able to do both: ask questions and guess a target object in the visual environment. In our architecture, these two capabilities are considered jointly as a supervised multi-task learning problem, to which cooperative learning can be further applied. We show that the introduction of our new architecture combined with these learning regimes yields an increase of 19.5% in task success accuracy with respect to a baseline model that treats submodules independently. With this increase, we reach an accuracy comparable to state-of-the-art models that use reinforcement learning, with the advantage that our architecture is entirely differentiable and thus easier to train. This suggests that combining our approach with reinforcement learning could lead to further improvements in the future. Finally, we present a range of analyses that examine the quality of the dialogues and shed light on the internal dynamics of the model.
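To make the multi-task formulation concrete, the sketch below pairs a shared dialogue-state encoder with a question-generation head and a guessing head trained under a single supervised objective; the module sizes, the simple additive loss, and all tensor shapes are assumptions, not the paper's exact architecture.

import torch
import torch.nn as nn

class Questioner(nn.Module):
    """Sketch: shared dialogue-state encoder with two supervised heads."""
    def __init__(self, vocab=5000, hidden=512, img_dim=2048, n_objects=20):
        super().__init__()
        self.embed = nn.Embedding(vocab, hidden)
        self.state_rnn = nn.LSTM(hidden, hidden, batch_first=True)  # dialogue state
        self.img_proj = nn.Linear(img_dim, hidden)
        self.question_head = nn.Linear(hidden, vocab)      # next-word scores
        self.guess_head = nn.Linear(hidden, n_objects)     # target-object scores

    def forward(self, dialogue_tokens, img_feats):
        _, (state, _) = self.state_rnn(self.embed(dialogue_tokens))
        state = state[-1] + self.img_proj(img_feats)       # fuse vision and language
        return self.question_head(state), self.guess_head(state)

model = Questioner()
q_logits, g_logits = model(torch.randint(0, 5000, (4, 12)), torch.randn(4, 2048))
# Joint supervised objective: question-generation loss + guessing loss.
loss = nn.functional.cross_entropy(q_logits, torch.randint(0, 5000, (4,))) + \
       nn.functional.cross_entropy(g_logits, torch.randint(0, 20, (4,)))
loss.backward()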
Abstract: Our goal is to explore how the abilities brought in by a dialogue manager can be included in end-to-end visually grounded conversational agents. We take initial steps towards this general goal by augmenting a task-oriented visual dialogue model with a decision-making component that decides whether to ask a follow-up question to identify a target referent in an image, or to stop the conversation to make a guess. Our analyses show that adding a decision-making component produces dialogues that are less repetitive and that include fewer unnecessary questions, thus potentially leading to more efficient and less unnatural interactions.
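A minimal sketch of such a decision-making component, assuming a fixed-size dialogue-state vector: a learned scalar confidence decides whether to ask another follow-up question or to stop and guess. The threshold, state dimension, and class name are illustrative assumptions.

import torch
import torch.nn as nn

class AskOrGuess(nn.Module):
    """Sketch of a decision module over the current dialogue state."""
    def __init__(self, state_dim=512):
        super().__init__()
        self.decider = nn.Linear(state_dim, 1)   # scalar "confidence to guess"

    def forward(self, dialogue_state, threshold=0.5):
        p_guess = torch.sigmoid(self.decider(dialogue_state))
        return "guess" if p_guess.item() >= threshold else "ask"

policy = AskOrGuess()
action = policy(torch.randn(1, 512))   # "ask" -> generate a follow-up question; "guess" -> stop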
Abstract: In this paper, we aim to understand whether current language and vision (LaVi) models truly grasp the interaction between the two modalities. To this end, we propose an extension of the MSCOCO dataset, FOIL-COCO, which associates images with both correct and "foil" captions, that is, descriptions of the image that are highly similar to the original ones, but contain one single mistake ("foil word"). We show that current LaVi models fall into the traps of this data and perform badly on three tasks: a) caption classification (correct vs. foil); b) foil word detection; c) foil word correction. Humans, in contrast, have near-perfect performance on those tasks. We demonstrate that merely utilising language cues is not enough to model FOIL-COCO and that it challenges the state-of-the-art by requiring a fine-grained understanding of the relation between text and image.
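To make task (a) concrete, the sketch below frames correct-vs-foil caption classification as an image-text matching decision, scoring both captions against the image and picking the higher-scoring one; it uses an off-the-shelf CLIP model purely for illustration (CLIP is not one of the LaVi models evaluated here), and the image path and captions are placeholders.

from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("example.jpg")                          # placeholder image
captions = ["a man riding a bicycle down the street",      # correct caption
            "a man riding a motorcycle down the street"]   # foil: one word changed

# Score each caption against the image and keep the better match.
inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
scores = model(**inputs).logits_per_image[0]               # image-text similarity scores
predicted_correct = captions[scores.argmax().item()]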