Mehdi Drissi

Harvey Mudd College at SemEval-2019 Task 4: The Clint Buchanan Hyperpartisan News Detector

Apr 10, 2019

Mehdi Drissi, Pedro Sandoval, Vivaswat Ojha, Julie Medero

Abstract: We investigate the recently developed Bidirectional Encoder Representations from Transformers (BERT) model for the hyperpartisan news detection task. Using a subset of hand-labeled articles from SemEval as a validation set, we test the performance of different parameters for BERT models. We find that accuracy from two different BERT models using different proportions of the articles is consistently high, with our best-performing model on the validation set achieving 85% accuracy and the best-performing model on the test set achieving 77%. We further determine that our model exhibits strong consistency, labeling independent slices of the same article identically. Finally, we find that randomizing the order of word pieces dramatically reduces validation accuracy (to approximately 60%), but that shuffling groups of four or more word pieces maintains an accuracy of about 80%, indicating that the model mainly gains value from local context.

* Submitted to The 13th International Workshop on Semantic Evaluation (SemEval 2019). 5 pages including references
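
The group-shuffling experiment described in the abstract above can be illustrated with a minimal sketch. This is not the authors' code: the function name `shuffle_in_groups`, the `group_size` and `seed` parameters, and the example word pieces are all assumptions made for illustration. The idea is to split a sequence of word pieces into consecutive fixed-size groups, shuffle the order of the groups, and keep the order inside each group, so that local context survives while global order is destroyed.

```python
import random


def shuffle_in_groups(tokens, group_size, seed=None):
    """Shuffle consecutive groups of tokens, keeping order inside each group.

    Hypothetical helper for illustration; group_size=1 shuffles individual
    word pieces, larger values preserve more local context.
    """
    if group_size < 1:
        raise ValueError("group_size must be at least 1")
    rng = random.Random(seed)
    # Split into consecutive chunks; the last chunk may be shorter.
    groups = [tokens[i:i + group_size] for i in range(0, len(tokens), group_size)]
    rng.shuffle(groups)
    # Flatten the shuffled chunks back into one sequence.
    return [token for group in groups for token in group]


if __name__ == "__main__":
    # Made-up word pieces, not taken from the SemEval data.
    pieces = ["the", "sen", "##ator", "said", "the", "bill", "would", "pass"]
    print(shuffle_in_groups(pieces, group_size=1, seed=0))
    print(shuffle_in_groups(pieces, group_size=4, seed=0))
```

Feeding both variants to the same classifier and comparing accuracy is one way to probe how much a model relies on local versus global word order.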

Hierarchical Text Generation using an Outline

Oct 20, 2018

Mehdi Drissi, Olivia Watkins, Jugal Kalita

Abstract: Many challenges in natural language processing require generating text, including language translation, dialogue generation, and speech recognition. For all of these problems, text generation becomes more difficult as the text becomes longer. Current language models often struggle to maintain coherence over long pieces of text. Here, we attempt to have the model construct and use an outline of the text it generates to keep it focused. We find that using an outline improves perplexity. We do not find that using the outline improves human evaluation over a simpler baseline, revealing a discrepancy between perplexity and human perception. Similarly, hierarchical generation is not found to improve human evaluation scores.

* 8 pages, Accepted to International Conference on Natural Language Processing
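
For readers unfamiliar with the perplexity metric mentioned in the abstract above, here is a minimal sketch of the standard definition: the exponential of the average negative log-likelihood per token. The function name `perplexity` and its input format (a list of natural-log token probabilities) are assumptions for illustration and do not come from the paper.

```python
import math


def perplexity(token_log_probs):
    """Perplexity from per-token natural-log probabilities.

    Computes exp(-(1/N) * sum(log p_i)). Lower is better; a model that
    assigns uniform probability over k choices has perplexity k.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    avg_neg_log_likelihood = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_neg_log_likelihood)


if __name__ == "__main__":
    # Made-up probabilities for a four-token sequence.
    log_probs = [math.log(p) for p in (0.5, 0.25, 0.25, 0.125)]
    print(perplexity(log_probs))
```

Because perplexity only measures how well a model predicts held-out tokens, it can improve without human raters preferring the generated text, which is the discrepancy the abstract reports.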

Program Language Translation Using a Grammar-Driven Tree-to-Tree Model

Jul 04, 2018

Mehdi Drissi, Olivia Watkins, Aditya Khant, Vivaswat Ojha, Pedro Sandoval, Rakia Segev, Eric Weiner, Robert Keller

Abstract: The task of translating between programming languages differs from the challenge of translating natural languages in that programming languages are designed with a far more rigid set of structural and grammatical rules. Previous work has used a tree-to-tree encoder/decoder model to take advantage of the inherent tree structure of programs during translation. Neural decoders, however, by default do not exploit known grammar rules of the target language. In this paper, we describe a tree decoder that leverages knowledge of a language's grammar rules to exclusively generate syntactically correct programs. We find that this grammar-based tree-to-tree model outperforms the state-of-the-art tree-to-tree model in translating between two programming languages on a previously used synthetic task.

* Accepted at the ICML workshop Neural Abstract Machines & Program Induction v2. 4 pages excluding acknowledgements/references (6 pages total)
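
One common way to make a decoder emit only syntactically valid output is to restrict its choice at each step to the grammar rules that can expand the current nonterminal. The sketch below illustrates that general idea under stated assumptions; it is not the decoder from the paper. The toy grammar `RULES`, the function `masked_argmax`, and the score values are all hypothetical.

```python
# Hypothetical toy grammar: each rule is (left-hand side, right-hand side).
RULES = [
    ("Expr", ("Num",)),
    ("Expr", ("Expr", "+", "Expr")),
    ("Num", ("0",)),
    ("Num", ("1",)),
]


def masked_argmax(nonterminal, scores):
    """Pick the best-scoring rule among those that can expand `nonterminal`.

    `scores` holds one score per entry in RULES (for example, decoder logits).
    Rules with a different left-hand side are ignored, so the chosen
    expansion is always valid under the grammar.
    """
    if len(scores) != len(RULES):
        raise ValueError("expected one score per grammar rule")
    valid = [i for i, (lhs, _) in enumerate(RULES) if lhs == nonterminal]
    if not valid:
        raise ValueError(f"no rule expands {nonterminal!r}")
    return max(valid, key=lambda i: scores[i])


if __name__ == "__main__":
    # Made-up scores: the globally best rule (index 2) expands Num, not Expr.
    scores = [0.1, 0.2, 5.0, 0.3]
    choice = masked_argmax("Expr", scores)
    print(RULES[choice])
```

An unconstrained decoder would pick index 2 here and produce an invalid tree when expanding `Expr`; masking by the current nonterminal rules that choice out.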
