Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Evgeniy Gabrilovich

Laboratory for Computational Linguistics, Technion, Israel

Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Dec 16, 2022

Jai Gupta, Yi Tay, Chaitanya Kamath, Vinh Q. Tran, Donald Metzler, Shailesh Bavadekar, Mimi Sun, Evgeniy Gabrilovich

Figure 1 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Figure 2 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Figure 3 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Figure 4 for Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

Abstract:With the devastating outbreak of COVID-19, vaccines are one of the crucial lines of defense against mass infection in this global pandemic. Given the protection they provide, vaccines are becoming mandatory in certain social and professional settings. This paper presents a classification model for detecting COVID-19 vaccination related search queries, a machine learning model that is used to generate search insights for COVID-19 vaccinations. The proposed method combines and leverages advancements from modern state-of-the-art (SOTA) natural language understanding (NLU) techniques such as pretrained Transformers with traditional dense features. We propose a novel approach of considering dense features as memory tokens that the model can attend to. We show that this new modeling approach enables a significant improvement to the Vaccine Search Insights (VSI) task, improving a strong well-established gradient-boosting baseline by relative +15% improvement in F1 score and +14% in precision.

* EMNLP 2022

Via

Access Paper or Ask Questions

Machine-learned epidemiology: real-time detection of foodborne illness at scale

Dec 05, 2018

Adam Sadilek, Stephanie Caty, Lauren DiPrete, Raed Mansour, Tom Schenk Jr, Mark Bergtholdt, Ashish Jha, Prem Ramaswami, Evgeniy Gabrilovich

Figure 1 for Machine-learned epidemiology: real-time detection of foodborne illness at scale

Figure 2 for Machine-learned epidemiology: real-time detection of foodborne illness at scale

Figure 3 for Machine-learned epidemiology: real-time detection of foodborne illness at scale

Figure 4 for Machine-learned epidemiology: real-time detection of foodborne illness at scale

Abstract:Machine learning has become an increasingly powerful tool for solving complex problems, and its application in public health has been underutilized. The objective of this study is to test the efficacy of a machine-learned model of foodborne illness detection in a real-world setting. To this end, we built FINDER, a machine-learned model for real-time detection of foodborne illness using anonymous and aggregated web search and location data. We computed the fraction of people who visited a particular restaurant and later searched for terms indicative of food poisoning to identify potentially unsafe restaurants. We used this information to focus restaurant inspections in two cities and demonstrated that FINDER improves the accuracy of health inspections; restaurants identified by FINDER are 3.1 times as likely to be deemed unsafe during the inspection as restaurants identified by existing methods. Additionally, FINDER enables us to ascertain previously intractable epidemiological information, for example, in 38% of cases the restaurant potentially causing food poisoning was not the last one visited, which may explain the lower precision of complaint-based inspections. We found that FINDER is able to reliably identify restaurants that have an active lapse in food safety, allowing for implementation of corrective actions that would prevent the potential spread of foodborne illness.

* npj Digital Medicine 1:36 (2018)

Via

Access Paper or Ask Questions

A Review of Relational Machine Learning for Knowledge Graphs

Sep 28, 2015

Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich

Figure 1 for A Review of Relational Machine Learning for Knowledge Graphs

Figure 2 for A Review of Relational Machine Learning for Knowledge Graphs

Figure 3 for A Review of Relational Machine Learning for Knowledge Graphs

Figure 4 for A Review of Relational Machine Learning for Knowledge Graphs

Abstract:Relational machine learning studies methods for the statistical analysis of relational, or graph-structured, data. In this paper, we provide a review of how such statistical models can be "trained" on large knowledge graphs, and then used to predict new facts about the world (which is equivalent to predicting new edges in the graph). In particular, we discuss two fundamentally different kinds of statistical relational models, both of which can scale to massive datasets. The first is based on latent feature models such as tensor factorization and multiway neural networks. The second is based on mining observable patterns in the graph. We also show how to combine these latent and observable models to get improved modeling power at decreased computational cost. Finally, we discuss how such statistical models of graphs can be combined with text-based information extraction methods for automatically constructing knowledge graphs from the Web. To this end, we also discuss Google's Knowledge Vault project as an example of such combination.

* To appear in Proceedings of the IEEE

Via

Access Paper or Ask Questions

Quizz: Targeted crowdsourcing with a billion (potential) users

Jun 02, 2015

Panagiotis G. Ipeirotis, Evgeniy Gabrilovich

Figure 1 for Quizz: Targeted crowdsourcing with a billion (potential) users

Figure 2 for Quizz: Targeted crowdsourcing with a billion (potential) users

Figure 3 for Quizz: Targeted crowdsourcing with a billion (potential) users

Figure 4 for Quizz: Targeted crowdsourcing with a billion (potential) users

Abstract:We describe Quizz, a gamified crowdsourcing system that simultaneously assesses the knowledge of users and acquires new knowledge from them. Quizz operates by asking users to complete short quizzes on specific topics; as a user answers the quiz questions, Quizz estimates the user's competence. To acquire new knowledge, Quizz also incorporates questions for which we do not have a known answer; the answers given by competent users provide useful signals for selecting the correct answers for these questions. Quizz actively tries to identify knowledgeable users on the Internet by running advertising campaigns, effectively leveraging the targeting capabilities of existing, publicly available, ad placement services. Quizz quantifies the contributions of the users using information theory and sends feedback to the advertisingsystem about each user. The feedback allows the ad targeting mechanism to further optimize ad placement. Our experiments, which involve over ten thousand users, confirm that we can crowdsource knowledge curation for niche and specialized topics, as the advertising network can automatically identify users with the desired expertise and interest in the given topic. We present controlled experiments that examine the effect of various incentive mechanisms, highlighting the need for having short-term rewards as goals, which incentivize the users to contribute. Finally, our cost-quality analysis indicates that the cost of our approach is below that of hiring workers through paid-crowdsourcing platforms, while offering the additional advantage of giving access to billions of potential users all over the planet, and being able to reach users with specialized expertise that is not typically available through existing labor marketplaces.

* WWW '14 Proceedings of the 23rd international conference on World Wide Web. 11 pages

Via

Access Paper or Ask Questions

Wikipedia-based Semantic Interpretation for Natural Language Processing

Jan 15, 2014

Evgeniy Gabrilovich, Shaul Markovitch

Figure 1 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 2 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 3 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Figure 4 for Wikipedia-based Semantic Interpretation for Natural Language Processing

Abstract:Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, called Explicit Semantic Analysis (ESA), for fine-grained semantic interpretation of unrestricted natural language texts. Our method represents meaning in a high-dimensional space of concepts derived from Wikipedia, the largest encyclopedia in existence. We explicitly represent the meaning of any text in terms of Wikipedia-based concepts. We evaluate the effectiveness of our method on text categorization and on computing the degree of semantic relatedness between fragments of natural language text. Using ESA results in significant improvements over the previous state of the art in both tasks. Importantly, due to the use of natural concepts, the ESA model is easy to explain to human users.

* Journal Of Artificial Intelligence Research, Volume 34, pages 443-498, 2009

Via

Access Paper or Ask Questions

Amalia -- A Unified Platform for Parsing and Generation

Sep 24, 1997

Shuly Wintner, Evgeniy Gabrilovich, Nissim Francez

Figure 1 for Amalia -- A Unified Platform for Parsing and Generation

Figure 2 for Amalia -- A Unified Platform for Parsing and Generation

Figure 3 for Amalia -- A Unified Platform for Parsing and Generation

Figure 4 for Amalia -- A Unified Platform for Parsing and Generation

Abstract:Contemporary linguistic theories (in particular, HPSG) are declarative in nature: they specify constraints on permissible structures, not how such structures are to be computed. Grammars designed under such theories are, therefore, suitable for both parsing and generation. However, practical implementations of such theories don't usually support bidirectional processing of grammars. We present a grammar development system that includes a compiler of grammars (for parsing and generation) to abstract machine instructions, and an interpreter for the abstract machine language. The generation compiler inverts input grammars (designed for parsing) to a form more suitable for generation. The compiled grammars are then executed by the interpreter using one control strategy, regardless of whether the grammar is the original or the inverted version. We thus obtain a unified, efficient platform for developing reversible grammars.

* Proceedings of Recent Advances in Natural Language Programming (RANLP97), Tzigov Chark, Bulgaria, 11-13 September 1997, pp. 135-142
* 8 pages postscript

Via

Access Paper or Ask Questions