Vid Kocijan

PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning

Mar 31, 2024

Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jure Leskovec, Matthias Fey

Abstract: We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data. PyTorch Frame makes tabular deep learning easy by providing a PyTorch-based data structure to handle complex tabular data, introducing a model abstraction to enable modular implementation of tabular models, and allowing external foundation models to be incorporated to handle complex columns (e.g., LLMs for text columns). We demonstrate the usefulness of PyTorch Frame by implementing diverse tabular models in a modular way, successfully applying these models to complex multi-modal tabular data, and integrating our framework with PyTorch Geometric, a PyTorch library for Graph Neural Networks (GNNs), to perform end-to-end learning over relational databases.

* https://github.com/pyg-team/pytorch-frame

Pre-training and Diagnosing Knowledge Base Completion Models

Jan 27, 2024

Vid Kocijan, Myeongjun Erik Jang, Thomas Lukasiewicz

Abstract: In this work, we introduce and analyze an approach to knowledge transfer from one collection of facts to another without the need for entity or relation matching. The method works for both canonicalized knowledge bases and uncanonicalized or open knowledge bases, i.e., knowledge bases where more than one copy of a real-world entity or relation may exist. The main contribution is a method that can make use of large-scale pre-training on facts, which were collected from unstructured text, to improve predictions on structured data from a specific domain. The introduced method is most impactful on small datasets such as ReVerb20K, where a 6% absolute increase of mean reciprocal rank and a 65% relative decrease of mean rank over the previously best method were achieved, despite not relying on large pre-trained models like BERT. To understand the obtained pre-trained models better, we then introduce a novel dataset for the analysis of pre-trained models for Open Knowledge Base Completion (OKBC), called Doge (Diagnostics of Open knowledge Graph Embeddings). It consists of 6 subsets and is designed to measure multiple properties of a pre-trained model: robustness against synonyms, ability to perform deductive reasoning, presence of gender stereotypes, consistency with reverse relations, and coverage of different areas of general knowledge. Using the introduced dataset, we show that the existing OKBC models lack consistency in the presence of synonyms and inverse relations and are unable to perform deductive reasoning. Moreover, their predictions often align with gender stereotypes, which persist even when presented with counterevidence. We additionally investigate the role of pre-trained word embeddings and demonstrate that avoiding biased word embeddings is not a sufficient measure to prevent biased behavior of OKBC models.

* Accepted to AIJ, reference to follow. arXiv admin note: substantial text overlap with arXiv:2108.13073

Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns

Feb 11, 2023

Zhongbin Xie, Vid Kocijan, Thomas Lukasiewicz, Oana-Maria Camburu

Abstract: Bias-measuring datasets play a critical role in detecting biased behavior of language models and in evaluating progress of bias mitigation methods. In this work, we focus on evaluating gender bias through coreference resolution, where previous datasets are either hand-crafted or fail to reliably measure an explicitly defined bias. To overcome these shortcomings, we propose a novel method to collect diverse, natural, and minimally distant text pairs via counterfactual generation, and construct Counter-GAP, an annotated dataset consisting of 4008 instances grouped into 1002 quadruples. We further identify a bias cancellation problem in previous group-level metrics on Counter-GAP, and propose to use the difference between inconsistency across genders and within genders to measure bias at a quadruple level. Our results show that four pre-trained language models are significantly more inconsistent across different gender groups than within each group, and that a name-based counterfactual data augmentation method is more effective at mitigating such bias than an anonymization-based method.

* Long Paper at EACL 2023

The Defeat of the Winograd Schema Challenge

Jan 16, 2022

Vid Kocijan, Ernest Davis, Thomas Lukasiewicz, Gary Marcus, Leora Morgenstern

Abstract: The Winograd Schema Challenge -- a set of twin sentences involving pronoun reference disambiguation that seem to require the use of commonsense knowledge -- was proposed by Hector Levesque in 2011. By 2019, a number of AI systems, based on large pre-trained transformer-based language models and fine-tuned on these kinds of problems, achieved better than 90% accuracy. In this paper, we review the history of the Winograd Schema Challenge and assess its significance.

Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations

Dec 12, 2021

Yordan Yordanov, Vid Kocijan, Thomas Lukasiewicz, Oana-Maria Camburu

Abstract: Recently, there has been an increasing interest in models that generate natural language explanations (NLEs) for their decisions. However, training a model to provide NLEs requires the acquisition of task-specific NLEs, which is time- and resource-consuming. A potential solution is the out-of-domain transfer of NLEs from a domain with a large number of NLEs to a domain with scarce NLEs but potentially a large number of labels, via few-shot transfer learning. In this work, we introduce three vanilla approaches for few-shot transfer learning of NLEs for the case of few NLEs but abundant labels, along with an adaptation of an existing vanilla fine-tuning approach. We transfer explainability from the natural language inference domain, where a large dataset of human-written NLEs exists (e-SNLI), to the domains of (1) hard cases of pronoun resolution, where we introduce a small dataset of NLEs on top of the WinoGrande dataset (small-e-WinoGrande), and (2) commonsense validation (ComVE). Our results demonstrate that the transfer of NLEs outperforms the single-task methods, and establish the best strategies out of the four identified training regimes. We also investigate the scalability of the best methods, both in terms of training data and model size.

* Accepted at the Deep Generative Models and Downstream Applications Workshop at NeurIPS 2021

Knowledge Base Completion Meets Transfer Learning

Aug 30, 2021

Vid Kocijan, Thomas Lukasiewicz

Abstract: The aim of knowledge base completion is to predict unseen facts from existing facts in knowledge bases. In this work, we introduce the first approach for transfer of knowledge from one collection of facts to another without the need for entity or relation matching. The method works for both canonicalized knowledge bases and uncanonicalized or open knowledge bases, i.e., knowledge bases where more than one copy of a real-world entity or relation may exist. Such knowledge bases are a natural output of automated information extraction tools that extract structured data from unstructured text. Our main contribution is a method that can make use of large-scale pre-training on facts, collected from unstructured text, to improve predictions on structured data from a specific domain. The introduced method is the most impactful on small datasets such as ReVerb20K, where we obtained a 6% absolute increase of mean reciprocal rank and a 65% relative decrease of mean rank over the previously best method, despite not relying on large pre-trained models like BERT.

* EMNLP 2021
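
The abstract above reports gains in mean reciprocal rank (MRR) and mean rank (MR). As a reading aid, the following is a minimal sketch of how these two metrics are commonly computed from the rank of the correct answer for each test query, and of the difference between an absolute and a relative change. The rank values and the helper names are hypothetical illustrations, not taken from the paper or its code.

```python
def mean_reciprocal_rank(ranks):
    """Average of 1/rank over all queries (higher is better)."""
    return sum(1.0 / r for r in ranks) / len(ranks)


def mean_rank(ranks):
    """Average rank of the correct answer (lower is better)."""
    return sum(ranks) / len(ranks)


def absolute_change(old, new):
    """Difference in metric value, e.g. 0.20 -> 0.26 is +0.06."""
    return new - old


def relative_change(old, new):
    """Change as a fraction of the old value, e.g. 100 -> 35 is -0.65."""
    return (new - old) / old


# Hypothetical 1-based ranks of the correct entity for four test queries.
ranks = [1, 2, 4, 10]
print(mean_reciprocal_rank(ranks))  # (1 + 1/2 + 1/4 + 1/10) / 4
print(mean_rank(ranks))             # (1 + 2 + 4 + 10) / 4
```

MRR is dominated by queries answered near the top of the ranking, while MR is sensitive to a few very badly ranked answers, which is one reason papers often report both.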

The Gap on GAP: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

Nov 12, 2020

Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

Abstract: Diagnostic datasets that can detect biased models are an important prerequisite for bias reduction within natural language processing. However, undesired patterns in the collected data can make such tests incorrect. For example, if the feminine subset of a gender-bias-measuring coreference resolution dataset contains sentences with a longer average distance between the pronoun and the correct candidate, an RNN-based model may perform worse on this subset due to long-term dependencies. In this work, we introduce a theoretically grounded method for weighting test samples to cope with such patterns in the test data. We demonstrate the method on the GAP dataset for coreference resolution. We annotate GAP with spans of all personal names and show that examples in the female subset contain more personal names and a longer distance between pronouns and their referents, potentially affecting the bias score in an undesired way. Using our weighting method, we find the set of weights on the test instances that should be used for coping with these correlations, and we re-evaluate 16 recently released coreference models.

* Presented at AFCI workshop at NeurIPS 2020 conference
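
To illustrate what weighting test samples means in practice, here is a minimal sketch of a weighted accuracy score. It only shows how given per-instance weights would enter the evaluation; how the paper derives its weights is not described in the abstract and is not reproduced here. The function name, the predictions, and the weights are hypothetical.

```python
def weighted_accuracy(correct, weights):
    """Accuracy where each test instance counts in proportion to its weight.

    correct: list of booleans, True if the model resolved the instance correctly.
    weights: list of non-negative floats, one per instance (assumed to be given).
    """
    if len(correct) != len(weights):
        raise ValueError("correct and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(w for c, w in zip(correct, weights) if c) / total


# Hypothetical results on four test instances.
correct = [True, False, True, True]

# Uniform weights reduce to ordinary accuracy.
print(weighted_accuracy(correct, [1.0, 1.0, 1.0, 1.0]))

# Non-uniform weights shift the score toward the up-weighted instances.
print(weighted_accuracy(correct, [0.5, 2.0, 1.0, 0.5]))
```

Computing such a score separately on the masculine and feminine subsets, each with its own weights, would give subset scores that are less affected by differences in the underlying data distributions.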

Does the Objective Matter? Comparing Training Objectives for Pronoun Resolution

Oct 06, 2020

Yordan Yordanov, Oana-Maria Camburu, Vid Kocijan, Thomas Lukasiewicz

Abstract: Hard cases of pronoun resolution have been used as a long-standing benchmark for commonsense reasoning. In the recent literature, pre-trained language models have been used to obtain state-of-the-art results on pronoun resolution. Overall, four categories of training and evaluation objectives have been introduced. The variety of training datasets and pre-trained language models used in these works makes it unclear whether the choice of training objective is critical. In this work, we make a fair comparison of the performance and seed-wise stability of four models that represent the four categories of objectives. Our experiments show that the objective of sequence ranking performs the best in-domain, while the objective of semantic similarity between candidates and pronoun performs the best out-of-domain. We also observe a seed-wise instability of the model using sequence ranking, which is not the case when the other objectives are used.

* Accepted to the EMNLP 2020 conference

A Review of Winograd Schema Challenge Datasets and Approaches

Apr 23, 2020

Vid Kocijan, Thomas Lukasiewicz, Ernest Davis, Gary Marcus, Leora Morgenstern

Abstract: The Winograd Schema Challenge is both a commonsense reasoning and natural language understanding challenge, introduced as an alternative to the Turing test. A Winograd schema is a pair of sentences differing in one or two words with a highly ambiguous pronoun, resolved differently in the two sentences, that appears to require commonsense knowledge to be resolved correctly. The examples were designed to be easily solvable by humans but difficult for machines, in principle requiring a deep understanding of the content of the text and the situation it describes. This paper reviews existing Winograd Schema Challenge benchmark datasets and approaches that have been published since its introduction.

WikiCREM: A Large Unsupervised Corpus for Coreference Resolution

Aug 23, 2019

Vid Kocijan, Oana-Maria Camburu, Ana-Maria Cretu, Yordan Yordanov, Phil Blunsom, Thomas Lukasiewicz

Abstract: Pronoun resolution is a major area of natural language understanding. However, large-scale training sets are still scarce, since manually labelling data is costly. In this work, we introduce WikiCREM (Wikipedia CoREferences Masked), a large-scale, yet accurate dataset of pronoun disambiguation instances. We use a language-model-based approach for pronoun resolution in combination with our WikiCREM dataset. We compare a series of models on a collection of diverse and challenging coreference resolution problems, where we match or outperform previous state-of-the-art approaches on 6 out of 7 datasets, such as GAP, DPR, WNLI, PDP, WinoBias, and WinoGender. We release our model to be used off-the-shelf for solving pronoun disambiguation.

* Accepted to the EMNLP 2019 conference
