Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Abram Handler

Investigating Sports Commentator Bias within a Large Corpus of American Football Broadcasts

Oct 19, 2019

Jack Merullo, Luke Yeh, Abram Handler, Alvin Grissom II, Brendan O'Connor, Mohit Iyyer

Figure 1 for Investigating Sports Commentator Bias within a Large Corpus of American Football Broadcasts

Figure 2 for Investigating Sports Commentator Bias within a Large Corpus of American Football Broadcasts

Figure 3 for Investigating Sports Commentator Bias within a Large Corpus of American Football Broadcasts

Figure 4 for Investigating Sports Commentator Bias within a Large Corpus of American Football Broadcasts

Abstract:Sports broadcasters inject drama into play-by-play commentary by building team and player narratives through subjective analyses and anecdotes. Prior studies based on small datasets and manual coding show that such theatrics evince commentator bias in sports broadcasts. To examine this phenomenon, we assemble FOOTBALL, which contains 1,455 broadcast transcripts from American football games across six decades that are automatically annotated with 250K player mentions and linked with racial metadata. We identify major confounding factors for researchers examining racial bias in FOOTBALL, and perform a computational analysis that supports conclusions from prior social science studies.

Via

Access Paper or Ask Questions

Query-focused Sentence Compression in Linear Time

Apr 19, 2019

Abram Handler, Brendan O'Connor

Figure 1 for Query-focused Sentence Compression in Linear Time

Figure 2 for Query-focused Sentence Compression in Linear Time

Figure 3 for Query-focused Sentence Compression in Linear Time

Figure 4 for Query-focused Sentence Compression in Linear Time

Abstract:Search applications often display shortened sentences which must contain certain query terms and must fit within the space constraints of a user interface. This work introduces a new transition-based sentence compression technique developed for such settings. Our method constructs length and lexically constrained compressions in linear time, by growing a subgraph in the dependency parse of a sentence. This approach achieves a 4x speed up over baseline ILP compression techniques, and better reconstructs gold shortenings under constraints. Such efficiency gains permit constrained compression of multiple sentences, without unreasonable lag.

Via

Access Paper or Ask Questions

Human acceptability judgements for extractive sentence compression

Feb 01, 2019

Abram Handler, Brian Dillon, Brendan O'Connor

Figure 1 for Human acceptability judgements for extractive sentence compression

Figure 2 for Human acceptability judgements for extractive sentence compression

Figure 3 for Human acceptability judgements for extractive sentence compression

Figure 4 for Human acceptability judgements for extractive sentence compression

Abstract:Recent approaches to English-language sentence compression rely on parallel corpora consisting of sentence-compression pairs. However, a sentence may be shortened in many different ways, which each might be suited to the needs of a particular application. Therefore, in this work, we collect and model crowdsourced judgements of the acceptability of many possible sentence shortenings. We then show how a model of such judgements can be used to support a flexible approach to the compression task. We release our model and dataset for future work.

Via

Access Paper or Ask Questions

Rookie: A unique approach for exploring news archives

Aug 06, 2017

Abram Handler, Brendan O'Connor

Figure 1 for Rookie: A unique approach for exploring news archives

Figure 2 for Rookie: A unique approach for exploring news archives

Figure 3 for Rookie: A unique approach for exploring news archives

Figure 4 for Rookie: A unique approach for exploring news archives

Abstract:News archives are an invaluable primary source for placing current events in historical context. But current search engine tools do a poor job at uncovering broad themes and narratives across documents. We present Rookie: a practical software system which uses natural language processing (NLP) to help readers, reporters and editors uncover broad stories in news archives. Unlike prior work, Rookie's design emerged from 18 months of iterative development in consultation with editors and computational journalists. This process lead to a dramatically different approach from previous academic systems with similar goals. Our efforts offer a generalizable case study for others building real-world journalism software using NLP.

* Presented at KDD 2017: Data Science + Journalism workshop

Via

Access Paper or Ask Questions

Identifying civilians killed by police with distantly supervised entity-event extraction

Jul 22, 2017

Katherine A. Keith, Abram Handler, Michael Pinkham, Cara Magliozzi, Joshua McDuffie, Brendan O'Connor

Figure 1 for Identifying civilians killed by police with distantly supervised entity-event extraction

Figure 2 for Identifying civilians killed by police with distantly supervised entity-event extraction

Abstract:We propose a new, socially-impactful task for natural language processing: from a news corpus, extract names of persons who have been killed by police. We present a newly collected police fatality corpus, which we release publicly, and present a model to solve this problem that uses EM-based distant supervision with logistic regression and convolutional neural network classifiers. Our model outperforms two off-the-shelf event extractor systems, and it can suggest candidate victim names in some cases faster than one of the major manually-collected police fatality databases.

* Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

Via

Access Paper or Ask Questions

Visualizing textual models with in-text and word-as-pixel highlighting

Jun 20, 2016

Abram Handler, Su Lin Blodgett, Brendan O'Connor

Figure 1 for Visualizing textual models with in-text and word-as-pixel highlighting

Abstract:We explore two techniques which use color to make sense of statistical text models. One method uses in-text annotations to illustrate a model's view of particular tokens in particular documents. Another uses a high-level, "words-as-pixels" graphic to display an entire corpus. Together, these methods offer both zoomed-in and zoomed-out perspectives into a model's understanding of text. We show how these interconnected methods help diagnose a classifier's poor performance on Twitter slang, and make sense of a topic model on historical political texts.

* Presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY

Via

Access Paper or Ask Questions