Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tasnim Mohiuddin

GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge

Dec 24, 2024

Shammur Absar Chowdhury, Hind Almerekhi, Mucahid Kutlu, Kaan Efe Keles, Fatema Ahmad, Tasnim Mohiuddin, George Mikros, Firoj Alam

Figure 1 for GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge

Figure 2 for GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge

Figure 3 for GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge

Figure 4 for GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge

Abstract:This paper presents a comprehensive overview of the first edition of the Academic Essay Authenticity Challenge, organized as part of the GenAI Content Detection shared tasks collocated with COLING 2025. This challenge focuses on detecting machine-generated vs. human-authored essays for academic purposes. The task is defined as follows: "Given an essay, identify whether it is generated by a machine or authored by a human.'' The challenge involves two languages: English and Arabic. During the evaluation phase, 25 teams submitted systems for English and 21 teams for Arabic, reflecting substantial interest in the task. Finally, seven teams submitted system description papers. The majority of submissions utilized fine-tuned transformer-based models, with one team employing Large Language Models (LLMs) such as Llama 2 and Llama 3. This paper outlines the task formulation, details the dataset construction process, and explains the evaluation framework. Additionally, we present a summary of the approaches adopted by participating teams. Nearly all submitted systems outperformed the n-gram-based baseline, with the top-performing systems achieving F1 scores exceeding 0.98 for both languages, indicating significant progress in the detection of machine-generated text.

* AI Generated Content, Academic Essay, LLMs, Arabic, English

Via

Access Paper or Ask Questions

DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Oct 19, 2024

Md Mubtasim Ahasan, Md Fahim, Tasnim Mohiuddin, A K M Mahbubur Rahman, Aman Chadha, Tariq Iqbal, M Ashraful Amin, Md Mofijul Islam, Amin Ahsan Ali

Figure 1 for DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Figure 2 for DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Figure 3 for DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Figure 4 for DM-Codec: Distilling Multimodal Representations for Speech Tokenization

Abstract:Recent advancements in speech-language models have yielded significant improvements in speech tokenization and synthesis. However, effectively mapping the complex, multidimensional attributes of speech into discrete tokens remains challenging. This process demands acoustic, semantic, and contextual information for precise speech representations. Existing speech representations generally fall into two categories: acoustic tokens from audio codecs and semantic tokens from speech self-supervised learning models. Although recent efforts have unified acoustic and semantic tokens for improved performance, they overlook the crucial role of contextual representation in comprehensive speech modeling. Our empirical investigations reveal that the absence of contextual representations results in elevated Word Error Rate (WER) and Word Information Lost (WIL) scores in speech transcriptions. To address these limitations, we propose two novel distillation approaches: (1) a language model (LM)-guided distillation method that incorporates contextual information, and (2) a combined LM and self-supervised speech model (SM)-guided distillation technique that effectively distills multimodal representations (acoustic, semantic, and contextual) into a comprehensive speech tokenizer, termed DM-Codec. The DM-Codec architecture adopts a streamlined encoder-decoder framework with a Residual Vector Quantizer (RVQ) and incorporates the LM and SM during the training process. Experiments show DM-Codec significantly outperforms state-of-the-art speech tokenization models, reducing WER by up to 13.46%, WIL by 9.82%, and improving speech quality by 5.84% and intelligibility by 1.85% on the LibriSpeech benchmark dataset. The code, samples, and model checkpoints are available at https://github.com/mubtasimahasan/DM-Codec.

Via

Access Paper or Ask Questions

Data Selection Curriculum for Neural Machine Translation

Mar 25, 2022

Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq Joty

Figure 1 for Data Selection Curriculum for Neural Machine Translation

Figure 2 for Data Selection Curriculum for Neural Machine Translation

Figure 3 for Data Selection Curriculum for Neural Machine Translation

Figure 4 for Data Selection Curriculum for Neural Machine Translation

Abstract:Neural Machine Translation (NMT) models are typically trained on heterogeneous data that are concatenated and randomly shuffled. However, not all of the training data are equally useful to the model. Curriculum training aims to present the data to the NMT models in a meaningful order. In this work, we introduce a two-stage curriculum training framework for NMT where we fine-tune a base NMT model on subsets of data, selected by both deterministic scoring using pre-trained methods and online scoring that considers prediction scores of the emerging NMT model. Through comprehensive experiments on six language pairs comprising low- and high-resource languages from WMT'21, we have shown that our curriculum strategies consistently demonstrate better quality (up to +2.2 BLEU improvement) and faster convergence (approximately 50% fewer updates).

Via

Access Paper or Ask Questions

AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

Jun 09, 2021

Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

Figure 1 for AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

Figure 2 for AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

Figure 3 for AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

Figure 4 for AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT

Abstract:The success of Neural Machine Translation (NMT) largely depends on the availability of large bitext training corpora. Due to the lack of such large corpora in low-resource language pairs, NMT systems often exhibit poor performance. Extra relevant monolingual data often helps, but acquiring it could be quite expensive, especially for low-resource languages. Moreover, domain mismatch between bitext (train/test) and monolingual data might degrade the performance. To alleviate such issues, we propose AUGVIC, a novel data augmentation framework for low-resource NMT which exploits the vicinal samples of the given bitext without using any extra monolingual data explicitly. It can diversify the in-domain bitext data with finer level control. Through extensive experiments on four low-resource language pairs comprising data from different domains, we have shown that our method is comparable to the traditional back-translation that uses extra in-domain monolingual data. When we combine the synthetic parallel data generated from AUGVIC with the ones from the extra monolingual data, we achieve further improvements. We show that AUGVIC helps to attenuate the discrepancies between relevant and distant-domain monolingual data in traditional back-translation. To understand the contributions of different components of AUGVIC, we perform an in-depth framework analysis.

* ACL-2021 accepted paper

Via

Access Paper or Ask Questions

CohEval: Benchmarking Coherence Models

Apr 30, 2020

Tasnim Mohiuddin, Prathyusha Jwalapuram, Xiang Lin, Shafiq Joty

Figure 1 for CohEval: Benchmarking Coherence Models

Figure 2 for CohEval: Benchmarking Coherence Models

Figure 3 for CohEval: Benchmarking Coherence Models

Figure 4 for CohEval: Benchmarking Coherence Models

Abstract:Although coherence modeling has come a long way in developing novel models, their evaluation on downstream applications has largely been neglected. With the advancements made by neural approaches in applications such as machine translation, text summarization and dialogue systems, the need for standard coherence evaluation is now more crucial than ever. In this paper, we propose to benchmark coherence models on a number of synthetic and downstream tasks. In particular, we evaluate well-known traditional and neural coherence models on sentence ordering tasks, and also on three downstream applications including coherence evaluation for machine translation, summarization and next utterance prediction. We also show model produced rankings for pre-trained language model outputs as another use-case. Our results demonstrate a weak correlation between the model performances in the synthetic tasks and the downstream applications, motivating alternate evaluation methods for coherence models. This work has led us to create a leaderboard to foster further research in coherence modeling.

Via

Access Paper or Ask Questions

LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Apr 28, 2020

Tasnim Mohiuddin, M Saiful Bari, Shafiq Joty

Figure 1 for LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Figure 2 for LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Figure 3 for LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Figure 4 for LNMap: Departures from Isomorphic Assumption in Bilingual Lexicon Induction Through Non-Linear Mapping in Latent Space

Abstract:Most of the successful and predominant methods for bilingual lexicon induction (BLI) are mapping-based, where a linear mapping function is learned with the assumption that the word embedding spaces of different languages exhibit similar geometric structures (i.e., approximately isomorphic). However, several recent studies have criticized this simplified assumption showing that it does not hold in general even for closely related languages. In this work, we propose a novel semi-supervised method to learn cross-lingual word embeddings for BLI. Our model is independent of the isomorphic assumption and uses nonlinear mapping in the latent space of two independently trained auto-encoders. Through extensive experiments on fifteen (15) different language pairs (in both directions) comprising resource-rich and low-resource languages from two different datasets, we demonstrate that our method outperforms existing models by a good margin. Ablation studies show the importance of different model components and the necessity of non-linear mapping.

* 10 pages, 1 figure

Via

Access Paper or Ask Questions

A Unified Neural Coherence Model

Sep 01, 2019

Han Cheol Moon, Tasnim Mohiuddin, Shafiq Joty, Xu Chi

Figure 1 for A Unified Neural Coherence Model

Figure 2 for A Unified Neural Coherence Model

Figure 3 for A Unified Neural Coherence Model

Figure 4 for A Unified Neural Coherence Model

Abstract:Recently, neural approaches to coherence modeling have achieved state-of-the-art results in several evaluation tasks. However, we show that most of these models often fail on harder tasks with more realistic application scenarios. In particular, the existing models underperform on tasks that require the model to be sensitive to local contexts such as candidate ranking in conversational dialogue and in machine translation. In this paper, we propose a unified coherence model that incorporates sentence grammar, inter-sentence coherence relations, and global coherence patterns into a common neural framework. With extensive experiments on local and global discrimination tasks, we demonstrate that our proposed model outperforms existing models by a good margin, and establish a new state-of-the-art.

* To appear at EMNLP-IJCNLP 2019

Via

Access Paper or Ask Questions

Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Apr 04, 2019

Tasnim Mohiuddin, Shafiq Joty

Figure 1 for Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Figure 2 for Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Figure 3 for Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Figure 4 for Revisiting Adversarial Autoencoder for Unsupervised Word Translation with Cycle Consistency and Improved Training

Abstract:Adversarial training has shown impressive success in learning bilingual dictionary without any parallel data by mapping monolingual embeddings to a shared space. However, recent work has shown superior performance for non-adversarial methods in more challenging language pairs. In this work, we revisit adversarial autoencoder for unsupervised word translation and propose two novel extensions to it that yield more stable training and improved results. Our method includes regularization terms to enforce cycle consistency and input reconstruction, and puts the target encoders as an adversary against the corresponding discriminator. Extensive experimentations with European, non-European and low-resource languages show that our method is more robust and achieves better performance than recently proposed adversarial and non-adversarial approaches.

* Published in NAACL-HLT 2019

Via

Access Paper or Ask Questions

Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Apr 01, 2019

Tasnim Mohiuddin, Thanh-Tung Nguyen, Shafiq Joty

Figure 1 for Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Figure 2 for Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Figure 3 for Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Figure 4 for Adaptation of Hierarchical Structured Models for Speech Act Recognition in Asynchronous Conversation

Abstract:We address the problem of speech act recognition (SAR) in asynchronous conversations (forums, emails). Unlike synchronous conversations (e.g., meetings, phone), asynchronous domains lack large labeled datasets to train an effective SAR model. In this paper, we propose methods to effectively leverage abundant unlabeled conversational data and the available labeled data from synchronous domains. We carry out our research in three main steps. First, we introduce a neural architecture based on hierarchical LSTMs and conditional random fields (CRF) for SAR, and show that our method outperforms existing methods when trained on in-domain data only. Second, we improve our initial SAR models by semi-supervised learning in the form of pretrained word embeddings learned from a large unlabeled conversational corpus. Finally, we employ adversarial training to improve the results further by leveraging the labeled data from synchronous domains and by explicitly modeling the distributional shift in two domains.

* To appear in NAACL 2019

Via

Access Paper or Ask Questions

Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

May 06, 2018

Tasnim Mohiuddin, Shafiq Joty, Dat Tien Nguyen

Figure 1 for Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

Figure 2 for Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

Figure 3 for Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

Figure 4 for Coherence Modeling of Asynchronous Conversations: A Neural Entity Grid Approach

Abstract:We propose a novel coherence model for written asynchronous conversations (e.g., forums, emails), and show its applications in coherence assessment and thread reconstruction tasks. We conduct our research in two steps. First, we propose improvements to the recently proposed neural entity grid model by lexicalizing its entity transitions. Then, we extend the model to asynchronous conversations by incorporating the underlying conversational structure in the entity grid representation and feature computation. Our model achieves state of the art results on standard coherence assessment tasks in monologue and conversations outperforming existing models. We also demonstrate its effectiveness in reconstructing thread structures.

Via

Access Paper or Ask Questions