Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Guoning Hu

Analyzing users' sentiment towards popular consumer industries and brands on Twitter

Sep 21, 2017

Guoning Hu, Preeti Bhargava, Saul Fuhrmann, Sarah Ellinger, Nemanja Spasojevic

Figure 1 for Analyzing users' sentiment towards popular consumer industries and brands on Twitter

Figure 2 for Analyzing users' sentiment towards popular consumer industries and brands on Twitter

Figure 3 for Analyzing users' sentiment towards popular consumer industries and brands on Twitter

Figure 4 for Analyzing users' sentiment towards popular consumer industries and brands on Twitter

Abstract:Social media serves as a unified platform for users to express their thoughts on subjects ranging from their daily lives to their opinion on consumer brands and products. These users wield an enormous influence in shaping the opinions of other consumers and influence brand perception, brand loyalty and brand advocacy. In this paper, we analyze the opinion of 19M Twitter users towards 62 popular industries, encompassing 12,898 enterprise and consumer brands, as well as associated subject matter topics, via sentiment analysis of 330M tweets over a period spanning a month. We find that users tend to be most positive towards manufacturing and most negative towards service industries. In addition, they tend to be more positive or negative when interacting with brands than generally on Twitter. We also find that sentiment towards brands within an industry varies greatly and we demonstrate this using two industries as use cases. In addition, we discover that there is no strong correlation between topic sentiments of different industries, demonstrating that topic sentiments are highly dependent on the context of the industry that they are mentioned in. We demonstrate the value of such an analysis in order to assess the impact of brands on social media. We hope that this initial study will prove valuable for both researchers and companies in understanding users' perception of industries, brands and associated topics and encourage more research in this field.

* 2017 IEEE International Conference on Data Mining Workshops (ICDMW 2017)
* 8 pages, 11 figures, 1 table, 2017 IEEE International Conference on Data Mining Workshops (ICDMW 2017), ICDM Sentiment Elicitation from Natural Text for Information Retrieval and Extraction (ICDM SENTIRE) 2017 workshop

Via

Access Paper or Ask Questions

Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

Jul 13, 2017

Preeti Bhargava, Nemanja Spasojevic, Guoning Hu

Figure 1 for Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

Figure 2 for Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

Figure 3 for Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

Figure 4 for Lithium NLP: A System for Rich Information Extraction from Noisy User Generated Text on Social Media

Abstract:In this paper, we describe the Lithium Natural Language Processing (NLP) system - a resource-constrained, high- throughput and language-agnostic system for information extraction from noisy user generated text on social media. Lithium NLP extracts a rich set of information including entities, topics, hashtags and sentiment from text. We discuss several real world applications of the system currently incorporated in Lithium products. We also compare our system with existing commercial and academic NLP systems in terms of performance, information extracted and languages supported. We show that Lithium NLP is at par with and in some cases, outperforms state- of-the-art commercial NLP systems.

* 9 pages, 6 figures, 2 tables, EMNLP 2017 Workshop on Noisy User Generated Text WNUT 2017

Via

Access Paper or Ask Questions

High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data

Mar 13, 2017

Preeti Bhargava, Nemanja Spasojevic, Guoning Hu

Figure 1 for High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data

Figure 2 for High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data

Figure 3 for High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data

Figure 4 for High-Throughput and Language-Agnostic Entity Disambiguation and Linking on User Generated Data

Abstract:The Entity Disambiguation and Linking (EDL) task matches entity mentions in text to a unique Knowledge Base (KB) identifier such as a Wikipedia or Freebase id. It plays a critical role in the construction of a high quality information network, and can be further leveraged for a variety of information retrieval and NLP tasks such as text categorization and document tagging. EDL is a complex and challenging problem due to ambiguity of the mentions and real world text being multi-lingual. Moreover, EDL systems need to have high throughput and should be lightweight in order to scale to large datasets and run on off-the-shelf machines. More importantly, these systems need to be able to extract and disambiguate dense annotations from the data in order to enable an Information Retrieval or Extraction task running on the data to be more efficient and accurate. In order to address all these challenges, we present the Lithium EDL system and algorithm - a high-throughput, lightweight, language-agnostic EDL system that extracts and correctly disambiguates 75% more entities than state-of-the-art EDL systems and is significantly faster than them.

* 10 pages, 7 figures, 5 tables, WWW2017, Linked Data on the Web workshop 2017, LDOW'17

Via

Access Paper or Ask Questions

DAWT: Densely Annotated Wikipedia Texts across multiple languages

Mar 02, 2017

Nemanja Spasojevic, Preeti Bhargava, Guoning Hu

Figure 1 for DAWT: Densely Annotated Wikipedia Texts across multiple languages

Figure 2 for DAWT: Densely Annotated Wikipedia Texts across multiple languages

Figure 3 for DAWT: Densely Annotated Wikipedia Texts across multiple languages

Figure 4 for DAWT: Densely Annotated Wikipedia Texts across multiple languages

Abstract:In this work, we open up the DAWT dataset - Densely Annotated Wikipedia Texts across multiple languages. The annotations include labeled text mentions mapping to entities (represented by their Freebase machine ids) as well as the type of the entity. The data set contains total of 13.6M articles, 5.0B tokens, 13.8M mention entity co-occurrences. DAWT contains 4.8 times more anchor text to entity links than originally present in the Wikipedia markup. Moreover, it spans several languages including English, Spanish, Italian, German, French and Arabic. We also present the methodology used to generate the dataset which enriches Wikipedia markup in order to increase number of links. In addition to the main dataset, we open up several derived datasets including mention entity co-occurrence counts and entity embeddings, as well as mappings between Freebase ids and Wikidata item ids. We also discuss two applications of these datasets and hope that opening them up would prove useful for the Natural Language Processing and Information Retrieval communities, as well as facilitate multi-lingual research.

* 8 pages, 3 figures, 7 tables, WWW2017, WWW 2017 Companion proceedings

Via

Access Paper or Ask Questions