Abstract: Due to the semantic complexity of the relation extraction (RE) task, obtaining high-quality human-labelled data is an expensive and noisy process. To improve the sample efficiency of models, semi-supervised learning (SSL) methods aim to leverage unlabelled data in addition to learning from limited labelled data points. Recently, strong data augmentation combined with consistency-based semi-supervised learning has advanced the state of the art in several SSL tasks. However, adapting these methods to the RE task has been challenging due to the difficulty of data augmentation for RE. In this work, we leverage recent advances in controlled text generation to perform high-quality data augmentation for the RE task. We further introduce small but significant changes to the model architecture that allow additional training data to be generated by interpolating different data points in their latent space. These data augmentations, together with consistency training, yield very competitive results for semi-supervised relation extraction on four benchmark datasets.
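To make the latent-space augmentation concrete, the following is a minimal sketch of mixup-style interpolation between two encoded examples. It assumes a PyTorch encoder that maps a relation instance to a single latent vector; the encoder, the Beta(0.4, 0.4) coefficient, and the soft-label recipe are illustrative assumptions, not the paper's exact architecture.

import torch

def interpolate_latents(z_a, z_b, alpha=None):
    # Mixup-style interpolation of two latent vectors.
    # z_a, z_b: latent representations of two labelled examples.
    # alpha: interpolation coefficient, sampled from Beta(0.4, 0.4) if not given.
    if alpha is None:
        alpha = torch.distributions.Beta(0.4, 0.4).sample()
    return alpha * z_a + (1.0 - alpha) * z_b, alpha

# Hypothetical usage: encode two relation instances, interpolate, and treat
# the interpolated point as an additional soft-labelled training example.
# z_a = encoder(tokens_a); z_b = encoder(tokens_b)
# z_mix, alpha = interpolate_latents(z_a, z_b)
# y_mix = alpha * y_a + (1 - alpha) * y_b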
Abstract: Off-policy deep reinforcement learning (RL) algorithms are incapable of learning solely from batch offline data, without online interactions with the environment, due to the phenomenon known as \textit{extrapolation error}. This error arises because past data in the replay buffer may be quite different from the data distribution under the current policy. We argue that most off-policy learning methods fundamentally suffer from a \textit{state distribution shift} due to the mismatch between the state visitation distributions of the data collected by the behavior and target policies. This distribution shift between current and past samples can significantly degrade the performance of most modern off-policy policy optimization algorithms. In this work, we first conduct a systematic analysis of the state distribution mismatch in off-policy learning, and then develop a novel off-policy policy optimization method that constrains the state distribution shift. To do so, we estimate the state distribution from features of the state using a density estimator, and then develop a novel constrained off-policy gradient objective that minimizes the state distribution shift. Our experimental results on continuous control tasks show that minimizing this distribution mismatch can significantly improve the performance of the most popular practical off-policy policy gradient algorithms.
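As a rough sketch of how the state distribution shift could be measured, the snippet below fits kernel density estimators over state features for the behavior (replay-buffer) and target (on-policy) data and forms a Monte-Carlo KL-style penalty. The estimator choice, bandwidth, and the way the penalty enters the objective are assumptions for illustration, not the paper's exact constrained objective.

import numpy as np
from sklearn.neighbors import KernelDensity

def state_shift_penalty(replay_states, onpolicy_states, bandwidth=0.5):
    # Fit densities over state features for behavior and target data.
    kde_behavior = KernelDensity(bandwidth=bandwidth).fit(replay_states)
    kde_target = KernelDensity(bandwidth=bandwidth).fit(onpolicy_states)
    # Monte-Carlo estimate of KL(target || behavior) using on-policy samples.
    log_p_target = kde_target.score_samples(onpolicy_states)
    log_p_behavior = kde_behavior.score_samples(onpolicy_states)
    return float(np.mean(log_p_target - log_p_behavior))

# A constrained objective could then penalize the usual off-policy loss, e.g.
#   total_loss = policy_loss + shift_coef * state_shift_penalty(buf_s, onpol_s)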
Abstract: Inferring missing edges in multi-relational knowledge graphs is a fundamental task in statistical relational learning. However, previous work has largely focused on the transductive relation prediction problem, where missing edges must be predicted for a single, fixed graph. In contrast, many real-world situations require relation prediction on dynamic or previously unseen knowledge graphs (e.g., for question answering, dialogue, or e-commerce applications). Here, we develop a novel graph neural network (GNN) architecture to perform inductive relation prediction and provide a systematic comparison between this GNN approach and a strong, rule-based baseline. Our results highlight the significant difficulty of inductive relational learning, compared to the transductive case, and offer a new challenging set of inductive benchmarks for knowledge graph completion.
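As a small illustration of the transductive/inductive distinction discussed above, the check below verifies that a knowledge-graph split is inductive, i.e. the test graph contains only entities unseen during training (relations may still be shared). The (head, relation, tail) triple format is an assumption, not the benchmarks' actual schema.

def is_inductive_split(train_triples, test_triples):
    # Entities seen in the training graph vs. entities in the test graph.
    train_entities = {e for h, _, t in train_triples for e in (h, t)}
    test_entities = {e for h, _, t in test_triples for e in (h, t)}
    # Inductive evaluation requires predicting edges over entirely new entities.
    return train_entities.isdisjoint(test_entities)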
Abstract: Societal bias towards certain communities is a major problem that affects many machine learning systems. This work aims to address the racial bias present in many modern gender recognition systems. We learn race-invariant representations of human faces with an adversarially trained autoencoder model and show that such representations help us achieve less biased performance in gender classification. We use the variance in classification accuracy across different races as a surrogate for the racial bias of the model and achieve a drop of over 40% in this variance with race-invariant representations.
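The bias surrogate described above is straightforward to compute; the following is a minimal sketch (the group labels and predictions are placeholders) of the variance in gender-classification accuracy across racial groups.

import numpy as np

def accuracy_variance(y_true, y_pred, group):
    # Per-group classification accuracy, then variance across groups,
    # used as a surrogate measure of the model's racial bias.
    y_true, y_pred, group = map(np.asarray, (y_true, y_pred, group))
    accuracies = [
        np.mean(y_pred[group == g] == y_true[group == g])
        for g in np.unique(group)
    ]
    return float(np.var(accuracies))

# A lower variance after training on race-invariant representations indicates
# more uniform gender-classification performance across groups.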