Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gaurav Maheshwari

Efficacy of Synthetic Data as a Benchmark

Sep 18, 2024

Gaurav Maheshwari, Dmitry Ivanov, Kevin El Haddad

Abstract:Large language models (LLMs) have enabled a range of applications in zero-shot and few-shot learning settings, including the generation of synthetic datasets for training and testing. However, to reliably use these synthetic datasets, it is essential to understand how representative they are of real-world data. We investigate this by assessing the effectiveness of generating synthetic data through LLM and using it as a benchmark for various NLP tasks. Our experiments across six datasets, and three different tasks, show that while synthetic data can effectively capture performance of various methods for simpler tasks, such as intent classification, it falls short for more complex tasks like named entity recognition. Additionally, we propose a new metric called the bias factor, which evaluates the biases introduced when the same LLM is used to both generate benchmarking data and to perform the tasks. We find that smaller LLMs exhibit biases towards their own generated data, whereas larger models do not. Overall, our findings suggest that the effectiveness of synthetic data as a benchmark varies depending on the task, and that practitioners should rely on data generated from multiple larger models whenever possible.

Via

Access Paper or Ask Questions

ASR Benchmarking: Need for a More Representative Conversational Dataset

Sep 18, 2024

Gaurav Maheshwari, Dmitry Ivanov, Théo Johannet, Kevin El Haddad

Figure 1 for ASR Benchmarking: Need for a More Representative Conversational Dataset

Figure 2 for ASR Benchmarking: Need for a More Representative Conversational Dataset

Figure 3 for ASR Benchmarking: Need for a More Representative Conversational Dataset

Figure 4 for ASR Benchmarking: Need for a More Representative Conversational Dataset

Abstract:Automatic Speech Recognition (ASR) systems have achieved remarkable performance on widely used benchmarks such as LibriSpeech and Fleurs. However, these benchmarks do not adequately reflect the complexities of real-world conversational environments, where speech is often unstructured and contains disfluencies such as pauses, interruptions, and diverse accents. In this study, we introduce a multilingual conversational dataset, derived from TalkBank, consisting of unstructured phone conversation between adults. Our results show a significant performance drop across various state-of-the-art ASR models when tested in conversational settings. Furthermore, we observe a correlation between Word Error Rate and the presence of speech disfluencies, highlighting the critical need for more realistic, conversational ASR benchmarks.

Via

Access Paper or Ask Questions

Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure

May 23, 2024

Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller

Abstract:In this paper, we introduce a data augmentation approach specifically tailored to enhance intersectional fairness in classification tasks. Our method capitalizes on the hierarchical structure inherent to intersectionality, by viewing groups as intersections of their parent categories. This perspective allows us to augment data for smaller groups by learning a transformation function that combines data from these parent groups. Our empirical analysis, conducted on four diverse datasets including both text and images, reveals that classifiers trained with this data augmentation approach achieve superior intersectional fairness and are more robust to ``leveling down'' when compared to methods optimizing traditional group fairness metrics.

Via

Access Paper or Ask Questions

How to Capture Intersectional Fairness

May 21, 2023

Gaurav Maheshwari, Aurélien Bellet, Pascal Denis, Mikaela Keller

Figure 1 for How to Capture Intersectional Fairness

Figure 2 for How to Capture Intersectional Fairness

Figure 3 for How to Capture Intersectional Fairness

Figure 4 for How to Capture Intersectional Fairness

Abstract:In this work, we tackle the problem of intersectional group fairness in the classification setting, where the objective is to learn discrimination-free models in the presence of several intersecting sensitive groups. First, we illustrate various shortcomings of existing fairness measures commonly used to capture intersectional fairness. Then, we propose a new framework called the $\alpha$ Intersectional Fairness framework, which combines the absolute and the relative performances between sensitive groups. Finally, we provide various analyses of our proposed framework, including the min-max and efficiency analysis. Our experiments using the proposed framework show that several in-processing fairness approaches show no improvement over a simple unconstrained approach. Moreover, we show that these approaches minimize existing fairness measures by degrading the performance of the best of the group instead of improving the worst.

Via

Access Paper or Ask Questions

FairGrad: Fairness Aware Gradient Descent

Jun 22, 2022

Gaurav Maheshwari, Michaël Perrot

Figure 1 for FairGrad: Fairness Aware Gradient Descent

Figure 2 for FairGrad: Fairness Aware Gradient Descent

Figure 3 for FairGrad: Fairness Aware Gradient Descent

Figure 4 for FairGrad: Fairness Aware Gradient Descent

Abstract:We tackle the problem of group fairness in classification, where the objective is to learn models that do not unjustly discriminate against subgroups of the population. Most existing approaches are limited to simple binary tasks or involve difficult to implement training mechanisms. This reduces their practical applicability. In this paper, we propose FairGrad, a method to enforce fairness based on a reweighting scheme that iteratively learns group specific weights based on whether they are advantaged or not. FairGrad is easy to implement and can accommodate various standard fairness definitions. Furthermore, we show that it is comparable to standard baselines over various datasets including ones used in natural language processing and computer vision.

Via

Access Paper or Ask Questions

Fair NLP Models with Differentially Private Text Encoders

May 12, 2022

Gaurav Maheshwari, Pascal Denis, Mikaela Keller, Aurélien Bellet

Figure 1 for Fair NLP Models with Differentially Private Text Encoders

Figure 2 for Fair NLP Models with Differentially Private Text Encoders

Figure 3 for Fair NLP Models with Differentially Private Text Encoders

Figure 4 for Fair NLP Models with Differentially Private Text Encoders

Abstract:Encoded text representations often capture sensitive attributes about individuals (e.g., race or gender), which raise privacy concerns and can make downstream models unfair to certain groups. In this work, we propose FEDERATE, an approach that combines ideas from differential privacy and adversarial training to learn private text representations which also induces fairer models. We empirically evaluate the trade-off between the privacy of the representations and the fairness and accuracy of the downstream model on four NLP datasets. Our results show that FEDERATE consistently improves upon previous methods, and thus suggest that privacy and fairness can positively reinforce each other.

* submitted to: ACL-ARR 2022 (February) - https://openreview.net/forum?id=BVgNSki6q1c

Via

Access Paper or Ask Questions

Message Passing for Hyper-Relational Knowledge Graphs

Sep 22, 2020

Mikhail Galkin, Priyansh Trivedi, Gaurav Maheshwari, Ricardo Usbeck, Jens Lehmann

Figure 1 for Message Passing for Hyper-Relational Knowledge Graphs

Figure 2 for Message Passing for Hyper-Relational Knowledge Graphs

Figure 3 for Message Passing for Hyper-Relational Knowledge Graphs

Figure 4 for Message Passing for Hyper-Relational Knowledge Graphs

Abstract:Hyper-relational knowledge graphs (KGs) (e.g., Wikidata) enable associating additional key-value pairs along with the main triple to disambiguate, or restrict the validity of a fact. In this work, we propose a message passing based graph encoder - StarE capable of modeling such hyper-relational KGs. Unlike existing approaches, StarE can encode an arbitrary number of additional information (qualifiers) along with the main triple while keeping the semantic roles of qualifiers and triples intact. We also demonstrate that existing benchmarks for evaluating link prediction (LP) performance on hyper-relational KGs suffer from fundamental flaws and thus develop a new Wikidata-based dataset - WD50K. Our experiments demonstrate that StarE based LP model outperforms existing approaches across multiple benchmarks. We also confirm that leveraging qualifiers is vital for link prediction with gains up to 25 MRR points compared to triple-based representations.

* Accepted to EMNLP 2020

Via

Access Paper or Ask Questions

Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs

Jul 22, 2019

Nilesh Chakraborty, Denis Lukovnikov, Gaurav Maheshwari, Priyansh Trivedi, Jens Lehmann, Asja Fischer

Figure 1 for Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs

Figure 2 for Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs

Abstract:Question answering has emerged as an intuitive way of querying structured data sources, and has attracted significant advancements over the years. In this article, we provide an overview over these recent advancements, focusing on neural network based question answering systems over knowledge graphs. We introduce readers to the challenges in the tasks, current paradigms of approaches, discuss notable advancements, and outline the emerging trends in the field. Through this article, we aim to provide newcomers to the field with a suitable entry point, and ease their process of making informed decisions while creating their own QA system.

* Preprint, under review. The first four authors contributed equally to this paper, and should be regarded as co-first authors

Via

Access Paper or Ask Questions

Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Nov 02, 2018

Gaurav Maheshwari, Priyansh Trivedi, Denis Lukovnikov, Nilesh Chakraborty, Asja Fischer, Jens Lehmann

Figure 1 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 2 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 3 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Figure 4 for Learning to Rank Query Graphs for Complex Question Answering over Knowledge Graphs

Abstract:In this paper, we conduct an empirical investigation of neural query graph ranking approaches for the task of complex question answering over knowledge graphs. We experiment with six different ranking models and propose a novel self-attention based slot matching model which exploits the inherent structure of query graphs, our logical form of choice. Our proposed model generally outperforms the other models on two QA datasets over the DBpedia knowledge graph, evaluated in different settings. In addition, we show that transfer learning from the larger of those QA datasets to the smaller dataset yields substantial improvements, effectively offsetting the general lack of training data.

Via

Access Paper or Ask Questions

Formal Ontology Learning from English IS-A Sentences

Feb 11, 2018

Sourish Dasgupta, Ankur Padia, Gaurav Maheshwari, Priyansh Trivedi, Jens Lehmann

Figure 1 for Formal Ontology Learning from English IS-A Sentences

Figure 2 for Formal Ontology Learning from English IS-A Sentences

Figure 3 for Formal Ontology Learning from English IS-A Sentences

Figure 4 for Formal Ontology Learning from English IS-A Sentences

Abstract:Ontology learning (OL) is the process of automatically generating an ontological knowledge base from a plain text document. In this paper, we propose a new ontology learning approach and tool, called DLOL, which generates a knowledge base in the description logic (DL) SHOQ(D) from a collection of factual non-negative IS-A sentences in English. We provide extensive experimental results on the accuracy of DLOL, giving experimental comparisons to three state-of-the-art existing OL tools, namely Text2Onto, FRED, and LExO. Here, we use the standard OL accuracy measure, called lexical accuracy, and a novel OL accuracy measure, called instance-based inference model. In our experimental results, DLOL turns out to be about 21% and 46%, respectively, better than the best of the other three approaches.

Via

Access Paper or Ask Questions