Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Silviu Oprea

Unsupervised Word Translation Pairing using Refinement based Point Set Registration

Nov 26, 2020

Silviu Oprea, Sourav Dutta, Haytham Assem

Figure 1 for Unsupervised Word Translation Pairing using Refinement based Point Set Registration

Figure 2 for Unsupervised Word Translation Pairing using Refinement based Point Set Registration

Figure 3 for Unsupervised Word Translation Pairing using Refinement based Point Set Registration

Figure 4 for Unsupervised Word Translation Pairing using Refinement based Point Set Registration

Abstract:Cross-lingual alignment of word embeddings play an important role in knowledge transfer across languages, for improving machine translation and other multi-lingual applications. Current unsupervised approaches rely on similarities in geometric structure of word embedding spaces across languages, to learn structure-preserving linear transformations using adversarial networks and refinement strategies. However, such techniques, in practice, tend to suffer from instability and convergence issues, requiring tedious fine-tuning for precise parameter setting. This paper proposes BioSpere, a novel framework for unsupervised mapping of bi-lingual word embeddings onto a shared vector space, by combining adversarial initialization and refinement procedure with point set registration algorithm used in image processing. We show that our framework alleviates the shortcomings of existing methodologies, and is relatively invariant to variable adversarial learning performance, depicting robustness in terms of parameter choices and training losses. Experimental evaluation on parallel dictionary induction task demonstrates state-of-the-art results for our framework on diverse language pairs.

Via

Access Paper or Ask Questions

iSarcasm: A Dataset of Intended Sarcasm

Nov 08, 2019

Silviu Oprea, Walid Magdy

Figure 1 for iSarcasm: A Dataset of Intended Sarcasm

Figure 2 for iSarcasm: A Dataset of Intended Sarcasm

Figure 3 for iSarcasm: A Dataset of Intended Sarcasm

Figure 4 for iSarcasm: A Dataset of Intended Sarcasm

Abstract:This paper considers the distinction between intended and perceived sarcasm in the context of textual sarcasm detection. The former occurs when an utterance is sarcastic from the perspective of its author, while the latter occurs when the utterance is interpreted as sarcastic by the audience. We show the limitations of previous labelling methods in capturing intended sarcasm and introduce the iSarcasm dataset of tweets labeled for sarcasm directly by their authors. We experiment with sarcasm detection models on our dataset. The low performance indicates that sarcasm might be a phenomenon under-studied computationally thus far.

* 10 pages

Via

Access Paper or Ask Questions

Exploring Author Context for Detecting Intended vs Perceived Sarcasm

Oct 25, 2019

Silviu Oprea, Walid Magdy

Figure 1 for Exploring Author Context for Detecting Intended vs Perceived Sarcasm

Figure 2 for Exploring Author Context for Detecting Intended vs Perceived Sarcasm

Figure 3 for Exploring Author Context for Detecting Intended vs Perceived Sarcasm

Figure 4 for Exploring Author Context for Detecting Intended vs Perceived Sarcasm

Abstract:We investigate the impact of using author context on textual sarcasm detection. We define author context as the embedded representation of their historical posts on Twitter and suggest neural models that extract these representations. We experiment with two tweet datasets, one labelled manually for sarcasm, and the other via tag-based distant supervision. We achieve state-of-the-art performance on the second dataset, but not on the one labelled manually, indicating a difference between intended sarcasm, captured by distant supervision, and perceived sarcasm, captured by manual labelling.

* Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019, pages 2854-2859
* 6 pages, 1 figure, ACL 2020

Via

Access Paper or Ask Questions

Flood Detection On Low Cost Orbital Hardware

Oct 14, 2019

Gonzalo Mateo-Garcia, Silviu Oprea, Lewis Smith, Josh Veitch-Michaelis, Guy Schumann, Yarin Gal, Atılım Güneş Baydin, Dietmar Backes

Figure 1 for Flood Detection On Low Cost Orbital Hardware

Figure 2 for Flood Detection On Low Cost Orbital Hardware

Figure 3 for Flood Detection On Low Cost Orbital Hardware

Figure 4 for Flood Detection On Low Cost Orbital Hardware

Abstract:Satellite imaging is a critical technology for monitoring and responding to natural disasters such as flooding. Despite the capabilities of modern satellites, there is still much to be desired from the perspective of first response organisations like UNICEF. Two main challenges are rapid access to data, and the ability to automatically identify flooded regions in images. We describe a prototypical flood segmentation system, identifying cloud, water and land, that could be deployed on a constellation of small satellites, performing processing on board to reduce downlink bandwidth by 2 orders of magnitude. We target PhiSat-1, part of the FSSCAT mission, which is planned to be launched by the European Space Agency (ESA) near the start of 2020 as a proof of concept for this new technology.

* Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

Via

Access Paper or Ask Questions