Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Inferring the Origin Locations of Tweets with Quantitative Confidence

Nov 16, 2013

Reid Priedhorsky, Aron Culotta, Sara Y. Del Valle

Figure 1 for Inferring the Origin Locations of Tweets with Quantitative Confidence

Figure 2 for Inferring the Origin Locations of Tweets with Quantitative Confidence

Figure 3 for Inferring the Origin Locations of Tweets with Quantitative Confidence

Figure 4 for Inferring the Origin Locations of Tweets with Quantitative Confidence

Share this with someone who'll enjoy it:

Abstract:Social Internet content plays an increasingly critical role in many domains, including public health, disaster management, and politics. However, its utility is limited by missing geographic information; for example, fewer than 1.6% of Twitter messages (tweets) contain a geotag. We propose a scalable, content-based approach to estimate the location of tweets using a novel yet simple variant of gaussian mixture models. Further, because real-world applications depend on quantified uncertainty for such estimates, we propose novel metrics of accuracy, precision, and calibration, and we evaluate our approach accordingly. Experiments on 13 million global, comprehensively multi-lingual tweets show that our approach yields reliable, well-calibrated results competitive with previous computationally intensive methods. We also show that a relatively small number of training data are required for good estimates (roughly 30,000 tweets) and models are quite time-invariant (effective on tweets many weeks newer than the training set). Finally, we show that toponyms and languages with small geographic footprint provide the most useful location signals.

* 14 pages, 6 figures. Version 2: Move mathematics to appendix, 2 new references, various other presentation improvements. Version 3: Various presentation improvements, accepted at ACM CSCW 2014

View paper on

Share this with someone who'll enjoy it:

Title:Inferring the Origin Locations of Tweets with Quantitative Confidence

Paper and Code