Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dat Quoc Nguyen

Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Oct 21, 2024

Quang Hieu Pham, Hoang Ngo, Anh Tuan Luu, Dat Quoc Nguyen

Figure 1 for Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Figure 2 for Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Figure 3 for Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Figure 4 for Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

Abstract:Retrieval-augmented generation (RAG) methods are viable solutions for addressing the static memory limits of pre-trained language models. Nevertheless, encountering conflicting sources of information within the retrieval context is an inevitable practical challenge. In such situations, the language models are recommended to transparently inform users about the conflicts rather than autonomously deciding what to present based on their inherent biases. To analyze how current large language models (LLMs) align with our recommendation, we introduce WhoQA, a public benchmark dataset to examine model's behavior in knowledge conflict situations. We induce conflicts by asking about a common property among entities having the same name, resulting in questions with up to 8 distinctive answers. WhoQA evaluation set includes 5K questions across 13 Wikidata property types and 150K Wikipedia entities. Our experiments show that despite the simplicity of WhoQA questions, knowledge conflicts significantly degrades LLMs' performance in RAG settings.

* Accepted to EMNLP 2024 Findings

Via

Access Paper or Ask Questions

RecGPT: Generative Pre-training for Text-based Recommendation

May 21, 2024

Hoang Ngo, Dat Quoc Nguyen

Figure 1 for RecGPT: Generative Pre-training for Text-based Recommendation

Figure 2 for RecGPT: Generative Pre-training for Text-based Recommendation

Figure 3 for RecGPT: Generative Pre-training for Text-based Recommendation

Figure 4 for RecGPT: Generative Pre-training for Text-based Recommendation

Abstract:We present the first domain-adapted and fully-trained large language model, RecGPT-7B, and its instruction-following variant, RecGPT-7B-Instruct, for text-based recommendation. Experimental results on rating prediction and sequential recommendation tasks show that our model, RecGPT-7B-Instruct, outperforms previous strong baselines. We are releasing our RecGPT models as well as their pre-training and fine-tuning datasets to facilitate future research and downstream applications in text-based recommendation. Public "huggingface" links to our RecGPT models and datasets are available at: https://github.com/VinAIResearch/RecGPT

* Accepted to the ACL 2024 main conference

Via

Access Paper or Ask Questions

Improving Vietnamese-English Medical Machine Translation

Mar 28, 2024

Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine

Figure 1 for Improving Vietnamese-English Medical Machine Translation

Figure 2 for Improving Vietnamese-English Medical Machine Translation

Figure 3 for Improving Vietnamese-English Medical Machine Translation

Figure 4 for Improving Vietnamese-English Medical Machine Translation

Abstract:Machine translation for Vietnamese-English in the medical domain is still an under-explored research area. In this paper, we introduce MedEV -- a high-quality Vietnamese-English parallel dataset constructed specifically for the medical domain, comprising approximately 360K sentence pairs. We conduct extensive experiments comparing Google Translate, ChatGPT (gpt-3.5-turbo), state-of-the-art Vietnamese-English neural machine translation models and pre-trained bilingual/multilingual sequence-to-sequence models on our new MedEV dataset. Experimental results show that the best performance is achieved by fine-tuning "vinai-translate" for each translation direction. We publicly release our dataset to promote further research.

* To appear in Proceedings of LREC-COLING 2024

Via

Access Paper or Ask Questions

JPIS: A Joint Model for Profile-based Intent Detection and Slot Filling with Slot-to-Intent Attention

Dec 16, 2023

Thinh Pham, Dat Quoc Nguyen

Abstract:Profile-based intent detection and slot filling are important tasks aimed at reducing the ambiguity in user utterances by leveraging user-specific supporting profile information. However, research in these two tasks has not been extensively explored. To fill this gap, we propose a joint model, namely JPIS, designed to enhance profile-based intent detection and slot filling. JPIS incorporates the supporting profile information into its encoder and introduces a slot-to-intent attention mechanism to transfer slot information representations to intent detection. Experimental results show that our JPIS substantially outperforms previous profile-based models, establishing a new state-of-the-art performance in overall accuracy on the Chinese benchmark dataset ProSLU.

* To appear in Proceedings of ICASSP 2024 (Camera-ready version)

Via

Access Paper or Ask Questions

MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Dec 10, 2023

Thinh Pham, Chi Tran, Dat Quoc Nguyen

Figure 1 for MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Figure 2 for MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Figure 3 for MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Figure 4 for MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention

Abstract:The research study of detecting multiple intents and filling slots is becoming more popular because of its relevance to complicated real-world situations. Recent advanced approaches, which are joint models based on graphs, might still face two potential issues: (i) the uncertainty introduced by constructing graphs based on preliminary intents and slots, which may transfer intent-slot correlation information to incorrect label node destinations, and (ii) direct incorporation of multiple intent labels for each token w.r.t. token-level intent voting might potentially lead to incorrect slot predictions, thereby hurting the overall performance. To address these two issues, we propose a joint model named MISCA. Our MISCA introduces an intent-slot co-attention mechanism and an underlying layer of label attention mechanism. These mechanisms enable MISCA to effectively capture correlations between intents and slot labels, eliminating the need for graph construction. They also facilitate the transfer of correlation information in both directions: from intents to slots and from slots to intents, through multiple levels of label-specific representations, without relying on token-level intent information. Experimental results show that MISCA outperforms previous models, achieving new state-of-the-art overall accuracy performances on two benchmark datasets MixATIS and MixSNIPS. This highlights the effectiveness of our attention mechanisms.

* Findings of EMNLP 2023 (https://aclanthology.org/2023.findings-emnlp.841.pdf); Long paper - 10 pages; 3 figures and 3 tables

Via

Access Paper or Ask Questions

PhoGPT: Generative Pre-training for Vietnamese

Nov 06, 2023

Dat Quoc Nguyen, Linh The Nguyen, Chi Tran, Dung Ngoc Nguyen, Nhung Nguyen, Thien Huu Nguyen, Dinh Phung, Hung Bui

Figure 1 for PhoGPT: Generative Pre-training for Vietnamese

Abstract:We open-source a state-of-the-art 7.5B-parameter generative model series named PhoGPT for Vietnamese, which includes the base pre-trained monolingual model PhoGPT-7B5 and its instruction-following variant, PhoGPT-7B5-Instruct. In addition, we also demonstrate its superior performance compared to previous open-source models through a human evaluation experiment. GitHub: https://github.com/VinAIResearch/PhoGPT

* PhoGPT Technical Report - 4 pages

Via

Access Paper or Ask Questions

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech

May 31, 2023

Linh The Nguyen, Thinh Pham, Dat Quoc Nguyen

Abstract:We present XPhoneBERT, the first multilingual model pre-trained to learn phoneme representations for the downstream text-to-speech (TTS) task. Our XPhoneBERT has the same model architecture as BERT-base, trained using the RoBERTa pre-training approach on 330M phoneme-level sentences from nearly 100 languages and locales. Experimental results show that employing XPhoneBERT as an input phoneme encoder significantly boosts the performance of a strong neural TTS model in terms of naturalness and prosody and also helps produce fairly high-quality speech with limited training data. We publicly release our pre-trained XPhoneBERT with the hope that it would facilitate future research and downstream TTS applications for multiple languages. Our XPhoneBERT model is available at https://github.com/VinAIResearch/XPhoneBERT

* In Proceedings of INTERSPEECH 2023 (to appear)

Via

Access Paper or Ask Questions

Joint Multilingual Knowledge Graph Completion and Alignment

Oct 18, 2022

Vinh Tong, Dat Quoc Nguyen, Trung Thanh Huynh, Tam Thanh Nguyen, Quoc Viet Hung Nguyen, Mathias Niepert

Figure 1 for Joint Multilingual Knowledge Graph Completion and Alignment

Figure 2 for Joint Multilingual Knowledge Graph Completion and Alignment

Figure 3 for Joint Multilingual Knowledge Graph Completion and Alignment

Figure 4 for Joint Multilingual Knowledge Graph Completion and Alignment

Abstract:Knowledge graph (KG) alignment and completion are usually treated as two independent tasks. While recent work has leveraged entity and relation alignments from multiple KGs, such as alignments between multilingual KGs with common entities and relations, a deeper understanding of the ways in which multilingual KG completion (MKGC) can aid the creation of multilingual KG alignments (MKGA) is still limited. Motivated by the observation that structural inconsistencies -- the main challenge for MKGA models -- can be mitigated through KG completion methods, we propose a novel model for jointly completing and aligning knowledge graphs. The proposed model combines two components that jointly accomplish KG completion and alignment. These two components employ relation-aware graph neural networks that we propose to encode multi-hop neighborhood structures into entity and relation representations. Moreover, we also propose (i) a structural inconsistency reduction mechanism to incorporate information from the completion into the alignment component, and (ii) an alignment seed enlargement and triple transferring mechanism to enlarge alignment seeds and transfer triples during KGs alignment. Extensive experiments on a public multilingual benchmark show that our proposed model outperforms existing competitive baselines, obtaining new state-of-the-art results on both MKGC and MKGA tasks. We publicly release the implementation of our model at https://github.com/vinhsuhi/JMAC

* EMNLP 2022 (Findings), to appear

Via

Access Paper or Ask Questions

From Disfluency Detection to Intent Detection and Slot Filling

Sep 17, 2022

Mai Hoang Dao, Thinh Hung Truong, Dat Quoc Nguyen

Figure 1 for From Disfluency Detection to Intent Detection and Slot Filling

Figure 2 for From Disfluency Detection to Intent Detection and Slot Filling

Figure 3 for From Disfluency Detection to Intent Detection and Slot Filling

Figure 4 for From Disfluency Detection to Intent Detection and Slot Filling

Abstract:We present the first empirical study investigating the influence of disfluency detection on downstream tasks of intent detection and slot filling. We perform this study for Vietnamese -- a low-resource language that has no previous study as well as no public dataset available for disfluency detection. First, we extend the fluent Vietnamese intent detection and slot filling dataset PhoATIS by manually adding contextual disfluencies and annotating them. Then, we conduct experiments using strong baselines for disfluency detection and joint intent detection and slot filling, which are based on pre-trained language models. We find that: (i) disfluencies produce negative effects on the performances of the downstream intent detection and slot filling tasks, and (ii) in the disfluency context, the pre-trained multilingual language model XLM-R helps produce better intent detection and slot filling performances than the pre-trained monolingual language model PhoBERT, and this is opposite to what generally found in the fluency context.

* In Proceedings of INTERSPEECH 2022

Via

Access Paper or Ask Questions

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

Aug 08, 2022

Linh The Nguyen, Nguyen Luong Tran, Long Doan, Manh Luong, Dat Quoc Nguyen

Figure 1 for A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

Figure 2 for A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

Figure 3 for A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

Abstract:In this paper, we introduce a high-quality and large-scale benchmark dataset for English-Vietnamese speech translation with 508 audio hours, consisting of 331K triplets of (sentence-lengthed audio, English source transcript sentence, Vietnamese target subtitle sentence). We also conduct empirical experiments using strong baselines and find that the traditional "Cascaded" approach still outperforms the modern "End-to-End" approach. To the best of our knowledge, this is the first large-scale English-Vietnamese speech translation study. We hope both our publicly available dataset and study can serve as a starting point for future research and applications on English-Vietnamese speech translation. Our dataset is available at https://github.com/VinAIResearch/PhoST

* In Proceedings of INTERSPEECH 2022, to appear. The first three authors contributed equally to this work

Via

Access Paper or Ask Questions