Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shib Sankar Dasgupta

Box Embeddings: An open-source library for representation learning using geometric structures

Sep 10, 2021

Tejas Chheda, Purujit Goyal, Trang Tran, Dhruvesh Patel, Michael Boratko, Shib Sankar Dasgupta, Andrew McCallum

Figure 1 for Box Embeddings: An open-source library for representation learning using geometric structures

Figure 2 for Box Embeddings: An open-source library for representation learning using geometric structures

Figure 3 for Box Embeddings: An open-source library for representation learning using geometric structures

Figure 4 for Box Embeddings: An open-source library for representation learning using geometric structures

Abstract:A major factor contributing to the success of modern representation learning is the ease of performing various vector operations. Recently, objects with geometric structures (eg. distributions, complex or hyperbolic vectors, or regions such as cones, disks, or boxes) have been explored for their alternative inductive biases and additional representational capacities. In this work, we introduce Box Embeddings, a Python library that enables researchers to easily apply and extend probabilistic box embeddings.

* The source code and the usage and API documentation for the library is available at https://github.com/iesl/box-embeddings and https://www.iesl.cs.umass.edu/box-embeddings/main/index.html

Via

Access Paper or Ask Questions

Word2Box: Learning Word Representation Using Box Embeddings

Jun 28, 2021

Shib Sankar Dasgupta, Michael Boratko, Shriya Atmakuri, Xiang Lorraine Li, Dhruvesh Patel, Andrew McCallum

Figure 1 for Word2Box: Learning Word Representation Using Box Embeddings

Figure 2 for Word2Box: Learning Word Representation Using Box Embeddings

Figure 3 for Word2Box: Learning Word Representation Using Box Embeddings

Figure 4 for Word2Box: Learning Word Representation Using Box Embeddings

Abstract:Learning vector representations for words is one of the most fundamental topics in NLP, capable of capturing syntactic and semantic relationships useful in a variety of downstream NLP tasks. Vector representations can be limiting, however, in that typical scoring such as dot product similarity intertwines position and magnitude of the vector in space. Exciting innovations in the space of representation learning have proposed alternative fundamental representations, such as distributions, hyperbolic vectors, or regions. Our model, Word2Box, takes a region-based approach to the problem of word representation, representing words as $n$-dimensional rectangles. These representations encode position and breadth independently and provide additional geometric operations such as intersection and containment which allow them to model co-occurrence patterns vectors struggle with. We demonstrate improved performance on various word similarity tasks, particularly on less common words, and perform a qualitative analysis exploring the additional unique expressivity provided by Word2Box.

* Work in progress

Via

Access Paper or Ask Questions

Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Apr 09, 2021

Xuelu Chen, Michael Boratko, Muhao Chen, Shib Sankar Dasgupta, Xiang Lorraine Li, Andrew McCallum

Figure 1 for Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Figure 2 for Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Figure 3 for Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Figure 4 for Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning

Abstract:Knowledge bases often consist of facts which are harvested from a variety of sources, many of which are noisy and some of which conflict, resulting in a level of uncertainty for each triple. Knowledge bases are also often incomplete, prompting the use of embedding methods to generalize from known facts, however, existing embedding methods only model triple-level uncertainty, and reasoning results lack global consistency. To address these shortcomings, we propose BEUrRE, a novel uncertain knowledge graph embedding method with calibrated probabilistic semantics. BEUrRE models each entity as a box (i.e. axis-aligned hyperrectangle) and relations between two entities as affine transforms on the head and tail entity boxes. The geometry of the boxes allows for efficient calculation of intersections and volumes, endowing the model with calibrated probabilistic semantics and facilitating the incorporation of relational constraints. Extensive experiments on two benchmark datasets show that BEUrRE consistently outperforms baselines on confidence prediction and fact ranking due to its probabilistic calibration and ability to capture high-order dependencies among facts.

* NAACL-HLT 2021

Via

Access Paper or Ask Questions

Improving Local Identifiability in Probabilistic Box Embeddings

Oct 29, 2020

Shib Sankar Dasgupta, Michael Boratko, Dongxu Zhang, Luke Vilnis, Xiang Lorraine Li, Andrew McCallum

Figure 1 for Improving Local Identifiability in Probabilistic Box Embeddings

Figure 2 for Improving Local Identifiability in Probabilistic Box Embeddings

Figure 3 for Improving Local Identifiability in Probabilistic Box Embeddings

Figure 4 for Improving Local Identifiability in Probabilistic Box Embeddings

Abstract:Geometric embeddings have recently received attention for their natural ability to represent transitive asymmetric relations via containment. Box embeddings, where objects are represented by n-dimensional hyperrectangles, are a particularly promising example of such an embedding as they are closed under intersection and their volume can be calculated easily, allowing them to naturally represent calibrated probability distributions. The benefits of geometric embeddings also introduce a problem of local identifiability, however, where whole neighborhoods of parameters result in equivalent loss which impedes learning. Prior work addressed some of these issues by using an approximation to Gaussian convolution over the box parameters, however, this intersection operation also increases the sparsity of the gradient. In this work, we model the box parameters with min and max Gumbel distributions, which were chosen such that space is still closed under the operation of the intersection. The calculation of the expected intersection volume involves all parameters, and we demonstrate experimentally that this drastically improves the ability of such models to learn.

* Accepted at NeurIPS2020

Via

Access Paper or Ask Questions

Dating Documents using Graph Convolution Networks

Feb 01, 2019

Shikhar Vashishth, Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar

Figure 1 for Dating Documents using Graph Convolution Networks

Figure 2 for Dating Documents using Graph Convolution Networks

Figure 3 for Dating Documents using Graph Convolution Networks

Figure 4 for Dating Documents using Graph Convolution Networks

Abstract:Document date is essential for many important tasks, such as document retrieval, summarization, event detection, etc. While existing approaches for these tasks assume accurate knowledge of the document date, this is not always available, especially for arbitrary documents from the Web. Document Dating is a challenging problem which requires inference over the temporal structure of the document. Prior document dating systems have largely relied on handcrafted features while ignoring such document internal structures. In this paper, we propose NeuralDater, a Graph Convolutional Network (GCN) based document dating approach which jointly exploits syntactic and temporal graph structures of document in a principled way. To the best of our knowledge, this is the first application of deep learning for the problem of document dating. Through extensive experiments on real-world datasets, we find that NeuralDater significantly outperforms state-of-the-art baseline by 19% absolute (45% relative) accuracy points.

* Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics 2018
* Accepted at ACL 2018

Via

Access Paper or Ask Questions