Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Phuong Minh Nguyen

An Effective Method using Phrase Mechanism in Neural Machine Translation

Aug 22, 2023

Phuong Minh Nguyen, Le Minh Nguyen

Abstract:Machine Translation is one of the essential tasks in Natural Language Processing (NLP), which has massive applications in real life as well as contributing to other tasks in the NLP research community. Recently, Transformer -based methods have attracted numerous researchers in this domain and achieved state-of-the-art results in most of the pair languages. In this paper, we report an effective method using a phrase mechanism, PhraseTransformer, to improve the strong baseline model Transformer in constructing a Neural Machine Translation (NMT) system for parallel corpora Vietnamese-Chinese. Our experiments on the MT dataset of the VLSP 2022 competition achieved the BLEU score of 35.3 on Vietnamese to Chinese and 33.2 BLEU scores on Chinese to Vietnamese data. Our code is available at https://github.com/phuongnm94/PhraseTransformer.

Via

Access Paper or Ask Questions

Miko Team: Deep Learning Approach for Legal Question Answering in ALQAC 2022

Nov 04, 2022

Hieu Nguyen Van, Dat Nguyen, Phuong Minh Nguyen, Minh Le Nguyen

Abstract:We introduce efficient deep learning-based methods for legal document processing including Legal Document Retrieval and Legal Question Answering tasks in the Automated Legal Question Answering Competition (ALQAC 2022). In this competition, we achieve 1\textsuperscript{st} place in the first task and 3\textsuperscript{rd} place in the second task. Our method is based on the XLM-RoBERTa model that is pre-trained from a large amount of unlabeled corpus before fine-tuning to the specific tasks. The experimental results showed that our method works well in legal retrieval information tasks with limited labeled data. Besides, this method can be applied to other information retrieval tasks in low-resource languages.

Via

Access Paper or Ask Questions

JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021

Jun 25, 2021

Ha-Thanh Nguyen, Phuong Minh Nguyen, Thi-Hai-Yen Vuong, Quan Minh Bui, Chau Minh Nguyen, Binh Tran Dang, Vu Tran, Minh Le Nguyen, Ken Satoh

Figure 1 for JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021

Figure 2 for JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021

Figure 3 for JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021

Figure 4 for JNLP Team: Deep Learning Approaches for Legal Processing Tasks in COLIEE 2021

Abstract:COLIEE is an annual competition in automatic computerized legal text processing. Automatic legal document processing is an ambitious goal, and the structure and semantics of the law are often far more complex than everyday language. In this article, we survey and report our methods and experimental results in using deep learning in legal document processing. The results show the difficulties as well as potentials in this family of approaches.

* Also published in COLIEE 2021's proceeding

Via

Access Paper or Ask Questions

ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing

Jun 25, 2021

Ha-Thanh Nguyen, Vu Tran, Phuong Minh Nguyen, Thi-Hai-Yen Vuong, Quan Minh Bui, Chau Minh Nguyen, Binh Tran Dang, Minh Le Nguyen, Ken Satoh

Figure 1 for ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing

Figure 2 for ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing

Figure 3 for ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing

Figure 4 for ParaLaw Nets -- Cross-lingual Sentence-level Pretraining for Legal Text Processing

Abstract:Ambiguity is a characteristic of natural language, which makes expression ideas flexible. However, in a domain that requires accurate statements, it becomes a barrier. Specifically, a single word can have many meanings and multiple words can have the same meaning. When translating a text into a foreign language, the translator needs to determine the exact meaning of each element in the original sentence to produce the correct translation sentence. From that observation, in this paper, we propose ParaLaw Nets, a pretrained model family using sentence-level cross-lingual information to reduce ambiguity and increase the performance in legal text processing. This approach achieved the best result in the Question Answering task of COLIEE-2021.

* Also published in COLIEE 2021's Proceeding

Via

Access Paper or Ask Questions

JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

Nov 04, 2020

Ha-Thanh Nguyen, Hai-Yen Thi Vuong, Phuong Minh Nguyen, Binh Tran Dang, Quan Minh Bui, Sinh Trong Vu, Chau Minh Nguyen, Vu Tran, Ken Satoh, Minh Le Nguyen

Figure 1 for JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

Figure 2 for JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

Figure 3 for JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

Figure 4 for JNLP Team: Deep Learning for Legal Processing in COLIEE 2020

Abstract:We propose deep learning based methods for automatic systems of legal retrieval and legal question-answering in COLIEE 2020. These systems are all characterized by being pre-trained on large amounts of data before being finetuned for the specified tasks. This approach helps to overcome the data scarcity and achieve good performance, thus can be useful for tackling related problems in information retrieval, and decision support in the legal domain. Besides, the approach can be explored to deal with other domain specific problems.

* Also be published in JURISIN2020

Via

Access Paper or Ask Questions