Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Raj Nath Patel

AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification

Nov 01, 2023

Yongxin Huang, Kexin Wang, Sourav Dutta, Raj Nath Patel, Goran Glavaš, Iryna Gurevych

Abstract:Recent work has found that few-shot sentence classification based on pre-trained Sentence Encoders (SEs) is efficient, robust, and effective. In this work, we investigate strategies for domain-specialization in the context of few-shot sentence classification with SEs. We first establish that unsupervised Domain-Adaptive Pre-Training (DAPT) of a base Pre-trained Language Model (PLM) (i.e., not an SE) substantially improves the accuracy of few-shot sentence classification by up to 8.4 points. However, applying DAPT on SEs, on the one hand, disrupts the effects of their (general-domain) Sentence Embedding Pre-Training (SEPT). On the other hand, applying general-domain SEPT on top of a domain-adapted base PLM (i.e., after DAPT) is effective but inefficient, since the computationally expensive SEPT needs to be executed on top of a DAPT-ed PLM of each domain. As a solution, we propose AdaSent, which decouples SEPT from DAPT by training a SEPT adapter on the base PLM. The adapter can be inserted into DAPT-ed PLMs from any domain. We demonstrate AdaSent's effectiveness in extensive experiments on 17 different few-shot sentence classification datasets. AdaSent matches or surpasses the performance of full SEPT on DAPT-ed PLM, while substantially reducing the training costs. The code for AdaSent is available.

* Accepted at EMNLP 2023 Main

Via

Access Paper or Ask Questions

Improving Robustness in Real-World Neural Machine Translation Engines

Jul 02, 2019

Rohit Gupta, Patrik Lambert, Raj Nath Patel, John Tinsley

Figure 1 for Improving Robustness in Real-World Neural Machine Translation Engines

Figure 2 for Improving Robustness in Real-World Neural Machine Translation Engines

Figure 3 for Improving Robustness in Real-World Neural Machine Translation Engines

Figure 4 for Improving Robustness in Real-World Neural Machine Translation Engines

Abstract:As a commercial provider of machine translation, we are constantly training engines for a variety of uses, languages, and content types. In each case, there can be many variables, such as the amount of training data available, and the quality requirements of the end user. These variables can have an impact on the robustness of Neural MT engines. On the whole, Neural MT cures many ills of other MT paradigms, but at the same time, it has introduced a new set of challenges to address. In this paper, we describe some of the specific issues with practical NMT and the approaches we take to improve model robustness in real-world scenarios.

* 6 Pages, Accepted in Machine Translation Summit 2019

Via

Access Paper or Ask Questions

Machine Translation in Indian Languages: Challenges and Resolution

Aug 01, 2018

Raj Nath Patel, Prakash B. Pimpale, M Sasikumar

Figure 1 for Machine Translation in Indian Languages: Challenges and Resolution

Figure 2 for Machine Translation in Indian Languages: Challenges and Resolution

Figure 3 for Machine Translation in Indian Languages: Challenges and Resolution

Figure 4 for Machine Translation in Indian Languages: Challenges and Resolution

Abstract:English to Indian language machine translation poses the challenge of structural and morphological divergence. This paper describes English to Indian language statistical machine translation using pre-ordering and suffix separation. The pre-ordering uses rules to transfer the structure of the source sentences prior to training and translation. This syntactic restructuring helps statistical machine translation to tackle the structural divergence and hence better translation quality. The suffix separation is used to tackle the morphological divergence between English and highly agglutinative Indian languages. We demonstrate that the use of pre-ordering and suffix separation helps in improving the quality of English to Indian Language machine translation.

* 11 pages journal paper

Via

Access Paper or Ask Questions

Personalized Machine Translation: Preserving Original Author Traits

Jan 12, 2017

Ella Rabinovich, Shachar Mirkin, Raj Nath Patel, Lucia Specia, Shuly Wintner

Figure 1 for Personalized Machine Translation: Preserving Original Author Traits

Figure 2 for Personalized Machine Translation: Preserving Original Author Traits

Figure 3 for Personalized Machine Translation: Preserving Original Author Traits

Figure 4 for Personalized Machine Translation: Preserving Original Author Traits

Abstract:The language that we produce reflects our personality, and various personal and demographic characteristics can be detected in natural language texts. We focus on one particular personal trait of the author, gender, and study how it is manifested in original texts and in translations. We show that author's gender has a powerful, clear signal in originals texts, but this signal is obfuscated in human and machine translation. We then propose simple domain-adaptation techniques that help retain the original gender traits in the translation, without harming the quality of the translation, thereby creating more personalized machine translation systems.

* EACL 2017, 11 pages

Via

Access Paper or Ask Questions

Recurrent Neural Network based Part-of-Speech Tagger for Code-Mixed Social Media Text

Nov 16, 2016

Raj Nath Patel, Prakash B. Pimpale, M Sasikumar

Figure 1 for Recurrent Neural Network based Part-of-Speech Tagger for Code-Mixed Social Media Text

Figure 2 for Recurrent Neural Network based Part-of-Speech Tagger for Code-Mixed Social Media Text

Abstract:This paper describes Centre for Development of Advanced Computing's (CDACM) submission to the shared task-'Tool Contest on POS tagging for Code-Mixed Indian Social Media (Facebook, Twitter, and Whatsapp) Text', collocated with ICON-2016. The shared task was to predict Part of Speech (POS) tag at word level for a given text. The code-mixed text is generated mostly on social media by multilingual users. The presence of the multilingual words, transliterations, and spelling variations make such content linguistically complex. In this paper, we propose an approach to POS tag code-mixed social media text using Recurrent Neural Network Language Model (RNN-LM) architecture. We submitted the results for Hindi-English (hi-en), Bengali-English (bn-en), and Telugu-English (te-en) code-mixed data.

* In Proceedings of the Tool Contest on POS tagging for Indian Social Media Text, ICON 2016
* 7 pages, Published at the Tool Contest on POS tagging for Indian Social Media Text, ICON 2016

Via

Access Paper or Ask Questions

Experiments with POS Tagging Code-mixed Indian Social Media Text

Oct 31, 2016

Prakash B. Pimpale, Raj Nath Patel

Figure 1 for Experiments with POS Tagging Code-mixed Indian Social Media Text

Figure 2 for Experiments with POS Tagging Code-mixed Indian Social Media Text

Figure 3 for Experiments with POS Tagging Code-mixed Indian Social Media Text

Abstract:This paper presents Centre for Development of Advanced Computing Mumbai's (CDACM) submission to the NLP Tools Contest on Part-Of-Speech (POS) Tagging For Code-mixed Indian Social Media Text (POSCMISMT) 2015 (collocated with ICON 2015). We submitted results for Hindi (hi), Bengali (bn), and Telugu (te) languages mixed with English (en). In this paper, we have described our approaches to the POS tagging techniques, we exploited for this task. Machine learning has been used to POS tag the mixed language text. For POS tagging, distributed representations of words in vector space (word2vec) for feature extraction and Log-linear models have been tried. We report our work on all three languages hi, bn, and te mixed with en.

* In the Proceedings of the 12th International Conference on Natural Language Processing (ICON 2015)
* 3 Pages, Published in the Proceedings of the Tool Contest on POS Tagging for Code-mixed Indian Social Media (Facebook, Twitter, and Whatsapp) Text

Via

Access Paper or Ask Questions

Statistical Machine Translation for Indian Languages: Mission Hindi 2

Oct 25, 2016

Raj Nath Patel, Prakash B. Pimpale

Figure 1 for Statistical Machine Translation for Indian Languages: Mission Hindi 2

Figure 2 for Statistical Machine Translation for Indian Languages: Mission Hindi 2

Figure 3 for Statistical Machine Translation for Indian Languages: Mission Hindi 2

Abstract:This paper presents Centre for Development of Advanced Computing Mumbai's (CDACM) submission to NLP Tools Contest on Statistical Machine Translation in Indian Languages (ILSMT) 2015 (collocated with ICON 2015). The aim of the contest was to collectively explore the effectiveness of Statistical Machine Translation (SMT) while translating within Indian languages and between English and Indian languages. In this paper, we report our work on all five language pairs, namely Bengali-Hindi (\bnhi), Marathi-Hindi (\mrhi), Tamil-Hindi (\tahi), Telugu-Hindi (\tehi), and English-Hindi (\enhi) for Health, Tourism, and General domains. We have used suffix separation, compound splitting and preordering prior to SMT training and testing.

* In the Proceedings of the 12th International Conference on Natural Language Processing (ICON 2015)
* 4 pages, Published in the Proceedings of NLP Tools Contest: Statistical Machine Translation in Indian Languages

Via

Access Paper or Ask Questions

Reordering rules for English-Hindi SMT

Oct 24, 2016

Raj Nath Patel, Rohit Gupta, Prakash B. Pimpale, Sasikumar M

Figure 1 for Reordering rules for English-Hindi SMT

Figure 2 for Reordering rules for English-Hindi SMT

Figure 3 for Reordering rules for English-Hindi SMT

Figure 4 for Reordering rules for English-Hindi SMT

Abstract:Reordering is a preprocessing stage for Statistical Machine Translation (SMT) system where the words of the source sentence are reordered as per the syntax of the target language. We are proposing a rich set of rules for better reordering. The idea is to facilitate the training process by better alignments and parallel phrase extraction for a phrase-based SMT system. Reordering also helps the decoding process and hence improving the machine translation quality. We have observed significant improvements in the translation quality by using our approach over the baseline SMT. We have used BLEU, NIST, multi-reference word error rate, multi-reference position independent error rate for judging the improvements. We have exploited open source SMT toolkit MOSES to develop the system.

* 8 pages, Published at the Second Workshop on Hybrid Approaches to Translation, ACL 2013

Via

Access Paper or Ask Questions

Statistical Machine Translation for Indian Languages: Mission Hindi

Oct 24, 2016

Raj Nath Patel, Prakash B. Pimpale, Sasikumar M

Figure 1 for Statistical Machine Translation for Indian Languages: Mission Hindi

Figure 2 for Statistical Machine Translation for Indian Languages: Mission Hindi

Figure 3 for Statistical Machine Translation for Indian Languages: Mission Hindi

Figure 4 for Statistical Machine Translation for Indian Languages: Mission Hindi

Abstract:This paper discusses Centre for Development of Advanced Computing Mumbai's (CDACM) submission to the NLP Tools Contest on Statistical Machine Translation in Indian Languages (ILSMT) 2014 (collocated with ICON 2014). The objective of the contest was to explore the effectiveness of Statistical Machine Translation (SMT) for Indian language to Indian language and English-Hindi machine translation. In this paper, we have proposed that suffix separation and word splitting for SMT from agglutinative languages to Hindi significantly improves over the baseline (BL). We have also shown that the factored model with reordering outperforms the phrase-based SMT for English-Hindi (\enhi). We report our work on all five pairs of languages, namely Bengali-Hindi (\bnhi), Marathi-Hindi (\mrhi), Tamil-Hindi (\tahi), Telugu-Hindi (\tehi), and \enhi for Health, Tourism, and General domains.

* 5 pages, Published at NLP Tools Contest: Statistical Machine Translation in Indian Languages, ICON-2015

Via

Access Paper or Ask Questions

Translation Quality Estimation using Recurrent Neural Network

Oct 21, 2016

Raj Nath Patel, Sasikumar M

Figure 1 for Translation Quality Estimation using Recurrent Neural Network

Figure 2 for Translation Quality Estimation using Recurrent Neural Network

Figure 3 for Translation Quality Estimation using Recurrent Neural Network

Figure 4 for Translation Quality Estimation using Recurrent Neural Network

Abstract:This paper describes our submission to the shared task on word/phrase level Quality Estimation (QE) in the First Conference on Statistical Machine Translation (WMT16). The objective of the shared task was to predict if the given word/phrase is a correct/incorrect (OK/BAD) translation in the given sentence. In this paper, we propose a novel approach for word level Quality Estimation using Recurrent Neural Network Language Model (RNN-LM) architecture. RNN-LMs have been found very effective in different Natural Language Processing (NLP) applications. RNN-LM is mainly used for vector space language modeling for different NLP problems. For this task, we modify the architecture of RNN-LM. The modified system predicts a label (OK/BAD) in the slot rather than predicting the word. The input to the system is a word sequence, similar to the standard RNN-LM. The approach is language independent and requires only the translated text for QE. To estimate the phrase level quality, we use the output of the word level QE system.

* 7 pages, published at First Conference on Machine Translation

Via

Access Paper or Ask Questions