Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Liran Wang

MdEval: Massively Multilingual Code Debugging

Nov 04, 2024

Shukai Liu, Linzheng Chai, Jian Yang, Jiajun Shi, He Zhu, Liran Wang, Ke Jin, Wei Zhang, Hualei Zhu, Shuyue Guo(+8 more)

Figure 1 for MdEval: Massively Multilingual Code Debugging

Figure 2 for MdEval: Massively Multilingual Code Debugging

Figure 3 for MdEval: Massively Multilingual Code Debugging

Figure 4 for MdEval: Massively Multilingual Code Debugging

Abstract:Code large language models (LLMs) have made significant progress in code debugging by directly generating the correct code based on the buggy code snippet. Programming benchmarks, typically consisting of buggy code snippet and their associated test cases, are used to assess the debugging capabilities of LLMs. However, many existing benchmarks primarily focus on Python and are often limited in terms of language diversity (e.g., DebugBench and DebugEval). To advance the field of multilingual debugging with LLMs, we propose the first massively multilingual debugging benchmark, which includes 3.6K test samples of 18 programming languages and covers the automated program repair (APR) task, the code review (CR) task, and the bug identification (BI) task. Further, we introduce the debugging instruction corpora MDEVAL-INSTRUCT by injecting bugs into the correct multilingual queries and solutions (xDebugGen). Further, a multilingual debugger xDebugCoder trained on MDEVAL-INSTRUCT as a strong baseline specifically to handle the bugs of a wide range of programming languages (e.g. "Missing Mut" in language Rust and "Misused Macro Definition" in language C). Our extensive experiments on MDEVAL reveal a notable performance gap between open-source models and closed-source LLMs (e.g., GPT and Claude series), highlighting huge room for improvement in multilingual code debugging scenarios.

* 15 pages

Via

Access Paper or Ask Questions

TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Apr 09, 2022

Ze Yang, Liran Wang, Zhoujin Tian, Wei Wu, Zhoujun Li

Figure 1 for TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Figure 2 for TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Figure 3 for TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Figure 4 for TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization

Abstract:Although pre-trained language models (PLMs) have achieved great success and become a milestone in NLP, abstractive conversational summarization remains a challenging but less studied task. The difficulty lies in two aspects. One is the lack of large-scale conversational summary data. Another is that applying the existing pre-trained models to this task is tricky because of the structural dependence within the conversation and its informal expression, etc. In this work, we first build a large-scale (11M) pretraining dataset called RCS, based on the multi-person discussions in the Reddit community. We then present TANet, a thread-aware Transformer-based network. Unlike the existing pre-trained models that treat a conversation as a sequence of sentences, we argue that the inherent contextual dependency among the utterances plays an essential role in understanding the entire conversation and thus propose two new techniques to incorporate the structural information into our model. The first is thread-aware attention which is computed by taking into account the contextual dependency within utterances. Second, we apply thread prediction loss to predict the relations between utterances. We evaluate our model on four datasets of real conversations, covering types of meeting transcripts, customer-service records, and forum threads. Experimental results demonstrate that TANET achieves a new state-of-the-art in terms of both automatic evaluation and human judgment.

* NAACL2022-findings

Via

Access Paper or Ask Questions

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Oct 06, 2020

Ze Yang, Wei Wu, Can Xu, Xinnian Liang, Jiaqi Bai, Liran Wang, Wei Wang, Zhoujun Li

Figure 1 for StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Figure 2 for StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Figure 3 for StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Figure 4 for StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Abstract:Generating responses following a desired style has great potentials to extend applications of open-domain dialogue systems, yet is refrained by lacking of parallel data for training. In this work, we explore the challenging task with pre-trained language models that have brought breakthrough to various natural language tasks. To this end, we introduce a KL loss and a style classifier to the fine-tuning step in order to steer response generation towards the target style in both a word-level and a sentence-level. Comprehensive empirical studies with two public datasets indicate that our model can significantly outperform state-of-the-art methods in terms of both style consistency and contextual coherence.

* Findings of EMNLP2020

Via

Access Paper or Ask Questions