Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Krithika Ramesh

Evaluating Differentially Private Synthetic Data Generation in High-Stakes Domains

Oct 10, 2024

Krithika Ramesh, Nupoor Gandhi, Pulkit Madaan, Lisa Bauer, Charith Peris, Anjalie Field

Abstract:The difficulty of anonymizing text data hinders the development and deployment of NLP in high-stakes domains that involve private data, such as healthcare and social services. Poorly anonymized sensitive data cannot be easily shared with annotators or external researchers, nor can it be used to train public models. In this work, we explore the feasibility of using synthetic data generated from differentially private language models in place of real data to facilitate the development of NLP in these domains without compromising privacy. In contrast to prior work, we generate synthetic data for real high-stakes domains, and we propose and conduct use-inspired evaluations to assess data quality. Our results show that prior simplistic evaluations have failed to highlight utility, privacy, and fairness issues in the synthetic data. Overall, our work underscores the need for further improvements to synthetic data generation for it to be a viable way to enable privacy-preserving data sharing.

* Accepted to EMNLP 2024 (Findings)

Via

Access Paper or Ask Questions

Fairness in Language Models Beyond English: Gaps and Challenges

Feb 28, 2023

Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury

Abstract:With language models becoming increasingly ubiquitous, it has become essential to address their inequitable treatment of diverse demographic groups and factors. Most research on evaluating and mitigating fairness harms has been concentrated on English, while multilingual models and non-English languages have received comparatively little attention. This paper presents a survey of fairness in multilingual and non-English contexts, highlighting the shortcomings of current research and the difficulties faced by methods designed for English. We contend that the multitude of diverse cultures and languages across the world makes it infeasible to achieve comprehensive coverage in terms of constructing fairness datasets. Thus, the measurement and mitigation of biases must evolve beyond the current dataset-driven practices that are narrowly focused on specific dimensions and types of biases and, therefore, impossible to scale across languages and cultures.

* Accepted to EACL 2023 (Findings)

Via

Access Paper or Ask Questions

'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Feb 17, 2022

Krithika Ramesh, Ashiqur R. KhudaBukhsh, Sumeet Kumar

Figure 1 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Figure 2 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Figure 3 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Figure 4 for 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

Abstract:Over the last few years, YouTube Kids has emerged as one of the highly competitive alternatives to television for children's entertainment. Consequently, YouTube Kids' content should receive an additional level of scrutiny to ensure children's safety. While research on detecting offensive or inappropriate content for kids is gaining momentum, little or no current work exists that investigates to what extent AI applications can (accidentally) introduce content that is inappropriate for kids. In this paper, we present a novel (and troubling) finding that well-known automatic speech recognition (ASR) systems may produce text content highly inappropriate for kids while transcribing YouTube Kids' videos. We dub this phenomenon as \emph{inappropriate content hallucination}. Our analyses suggest that such hallucinations are far from occasional, and the ASR systems often produce them with high confidence. We release a first-of-its-kind data set of audios for which the existing state-of-the-art ASR systems hallucinate inappropriate content for kids. In addition, we demonstrate that some of these errors can be fixed using language models.

* This paper got accepted at AAAI 2022, AI for Social Impact track

Via

Access Paper or Ask Questions

Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

Oct 11, 2021

Mirza Yusuf, Praatibh Surana, Gauri Gupta, Krithika Ramesh

Figure 1 for Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

Figure 2 for Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

Figure 3 for Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

Figure 4 for Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation

Abstract:In recent times, there has been definitive progress in the field of NLP, with its applications growing as the utility of our language models increases with advances in their performance. However, these models require a large amount of computational power and data to train, consequently leading to large carbon footprints. Therefore, it is imperative that we study the carbon efficiency and look for alternatives to reduce the overall environmental impact of training models, in particular large language models. In our work, we assess the performance of models for machine translation, across multiple language pairs to assess the difference in computational power required to train these models for each of these language pairs and examine the various components of these models to analyze aspects of our pipeline that can be optimized to reduce these carbon emissions.

Via

Access Paper or Ask Questions

Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

Jul 14, 2021

Rakshit Naidu, Harshita Diddee, Ajinkya Mulay, Aleti Vardhan, Krithika Ramesh, Ahmed Zamzam

Figure 1 for Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

Figure 2 for Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

Figure 3 for Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

Figure 4 for Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

Abstract:In recent years, machine learning techniques utilizing large-scale datasets have achieved remarkable performance. Differential privacy, by means of adding noise, provides strong privacy guarantees for such learning algorithms. The cost of differential privacy is often a reduced model accuracy and a lowered convergence speed. This paper investigates the impact of differential privacy on learning algorithms in terms of their carbon footprint due to either longer run-times or failed experiments. Through extensive experiments, further guidance is provided on choosing the noise levels which can strike a balance between desired privacy levels and reduced carbon emissions.

* 4+3 pages; 6 figures; 8 tables. Accepted at SRML workshop at ICML'21

Via

Access Paper or Ask Questions

Evaluating Gender Bias in Hindi-English Machine Translation

Jun 16, 2021

Gauri Gupta, Krithika Ramesh, Sanjay Singh

Figure 1 for Evaluating Gender Bias in Hindi-English Machine Translation

Figure 2 for Evaluating Gender Bias in Hindi-English Machine Translation

Abstract:With language models being deployed increasingly in the real world, it is essential to address the issue of the fairness of their outputs. The word embedding representations of these language models often implicitly draw unwanted associations that form a social bias within the model. The nature of gendered languages like Hindi, poses an additional problem to the quantification and mitigation of bias, owing to the change in the form of the words in the sentence, based on the gender of the subject. Additionally, there is sparse work done in the realm of measuring and debiasing systems for Indic languages. In our work, we attempt to evaluate and quantify the gender bias within a Hindi-English machine translation system. We implement a modified version of the existing TGBI metric based on the grammatical considerations for Hindi. We also compare and contrast the resulting bias measurements across multiple metrics for pre-trained embeddings and the ones learned by our machine translation model.

Via

Access Paper or Ask Questions