Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shobhit Hathi

Hybrid Ranking Network for Text-to-SQL

Aug 11, 2020

Qin Lyu, Kaushik Chakrabarti, Shobhit Hathi, Souvik Kundu, Jianwen Zhang, Zheng Chen

Figure 1 for Hybrid Ranking Network for Text-to-SQL

Figure 2 for Hybrid Ranking Network for Text-to-SQL

Abstract:In this paper, we study how to leverage pre-trained language models in Text-to-SQL. We argue that previous approaches under utilize the base language models by concatenating all columns together with the NL question and feeding them into the base language model in the encoding stage. We propose a neat approach called Hybrid Ranking Network (HydraNet) which breaks down the problem into column-wise ranking and decoding and finally assembles the column-wise outputs into a SQL query by straightforward rules. In this approach, the encoder is given a NL question and one individual column, which perfectly aligns with the original tasks BERT/RoBERTa is trained on, and hence we avoid any ad-hoc pooling or additional encoding layers which are necessary in prior approaches. Experiments on the WikiSQL dataset show that the proposed approach is very effective, achieving the top place on the leaderboard.

Via

Access Paper or Ask Questions

Community Member Retrieval on Social Media using Textual Information

Apr 16, 2018

Aaron Jaech, Shobhit Hathi, Mari Ostendorf

Figure 1 for Community Member Retrieval on Social Media using Textual Information

Figure 2 for Community Member Retrieval on Social Media using Textual Information

Figure 3 for Community Member Retrieval on Social Media using Textual Information

Figure 4 for Community Member Retrieval on Social Media using Textual Information

Abstract:This paper addresses the problem of community membership detection using only text features in a scenario where a small number of positive labeled examples defines the community. The solution introduces an unsupervised proxy task for learning user embeddings: user re-identification. Experiments with 16 different communities show that the resulting embeddings are more effective for community membership identification than common unsupervised representations.

* NAACL 2018

Via

Access Paper or Ask Questions

Hierarchical Character-Word Models for Language Identification

Aug 10, 2016

Aaron Jaech, George Mulcaire, Shobhit Hathi, Mari Ostendorf, Noah A. Smith

Figure 1 for Hierarchical Character-Word Models for Language Identification

Figure 2 for Hierarchical Character-Word Models for Language Identification

Figure 3 for Hierarchical Character-Word Models for Language Identification

Figure 4 for Hierarchical Character-Word Models for Language Identification

Abstract:Social media messages' brevity and unconventional spelling pose a challenge to language identification. We introduce a hierarchical model that learns character and contextualized word-level representations for language identification. Our method performs well against strong base- lines, and can also reveal code-switching.

Via

Access Paper or Ask Questions