Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yunshi Lan

SEAGraph: Unveiling the Whole Story of Paper Review Comments

Dec 16, 2024

Jianxiang Yu, Jiaqi Tan, Zichen Ding, Jiapeng Zhu, Jiahao Li, Yao Cheng, Qier Cui, Yunshi Lan, Xiang Li

Figure 1 for SEAGraph: Unveiling the Whole Story of Paper Review Comments

Figure 2 for SEAGraph: Unveiling the Whole Story of Paper Review Comments

Figure 3 for SEAGraph: Unveiling the Whole Story of Paper Review Comments

Figure 4 for SEAGraph: Unveiling the Whole Story of Paper Review Comments

Abstract:Peer review, as a cornerstone of scientific research, ensures the integrity and quality of scholarly work by providing authors with objective feedback for refinement. However, in the traditional peer review process, authors often receive vague or insufficiently detailed feedback, which provides limited assistance and leads to a more time-consuming review cycle. If authors can identify some specific weaknesses in their paper, they can not only address the reviewer's concerns but also improve their work. This raises the critical question of how to enhance authors' comprehension of review comments. In this paper, we present SEAGraph, a novel framework developed to clarify review comments by uncovering the underlying intentions behind them. We construct two types of graphs for each paper: the semantic mind graph, which captures the author's thought process, and the hierarchical background graph, which delineates the research domains related to the paper. A retrieval method is then designed to extract relevant content from both graphs, facilitating coherent explanations for the review comments. Extensive experiments show that SEAGraph excels in review comment understanding tasks, offering significant benefits to authors.

Via

Access Paper or Ask Questions

NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Dec 11, 2024

Yuanyuan Liang, Tingyu Xie, Gan Peng, Zihao Huang, Yunshi Lan, Weining Qian

Figure 1 for NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Figure 2 for NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Figure 3 for NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Figure 4 for NAT-NL2GQL: A Novel Multi-Agent Framework for Translating Natural Language to Graph Query Language

Abstract:The emergence of Large Language Models (LLMs) has revolutionized many fields, not only traditional natural language processing (NLP) tasks. Recently, research on applying LLMs to the database field has been booming, and as a typical non-relational database, the use of LLMs in graph database research has naturally gained significant attention. Recent efforts have increasingly focused on leveraging LLMs to translate natural language into graph query language (NL2GQL). Although some progress has been made, these methods have clear limitations, such as their reliance on streamlined processes that often overlook the potential of LLMs to autonomously plan and collaborate with other LLMs in tackling complex NL2GQL challenges. To address this gap, we propose NAT-NL2GQL, a novel multi-agent framework for translating natural language to graph query language. Specifically, our framework consists of three synergistic agents: the Preprocessor agent, the Generator agent, and the Refiner agent. The Preprocessor agent manages data processing as context, including tasks such as name entity recognition, query rewriting, path linking, and the extraction of query-related schemas. The Generator agent is a fine-tuned LLM trained on NL-GQL data, responsible for generating corresponding GQL statements based on queries and their related schemas. The Refiner agent is tasked with refining the GQL or context using error information obtained from the GQL execution results. Given the scarcity of high-quality open-source NL2GQL datasets based on nGQL syntax, we developed StockGQL, a dataset constructed from a financial market graph database. It is available at: https://github.com/leonyuancode/StockGQL. Experimental results on the StockGQL and SpCQL datasets reveal that our method significantly outperforms baseline approaches, highlighting its potential for advancing NL2GQL research.

* 12 pages,6 figures

Via

Access Paper or Ask Questions

Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting

Oct 02, 2024

Siyi Liu, Yang Li, Jiang Li, Shan Yang, Yunshi Lan

Figure 1 for Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting

Figure 2 for Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting

Figure 3 for Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting

Figure 4 for Unleashing the Power of Large Language Models in Zero-shot Relation Extraction via Self-Prompting

Abstract:Recent research in zero-shot Relation Extraction (RE) has focused on using Large Language Models (LLMs) due to their impressive zero-shot capabilities. However, current methods often perform suboptimally, mainly due to a lack of detailed, context-specific prompts needed for understanding various sentences and relations. To address this, we introduce the Self-Prompting framework, a novel method designed to fully harness the embedded RE knowledge within LLMs. Specifically, our framework employs a three-stage diversity approach to prompt LLMs, generating multiple synthetic samples that encapsulate specific relations from scratch. These generated samples act as in-context learning samples, offering explicit and context-specific guidance to efficiently prompt LLMs for RE. Experimental evaluations on benchmark datasets show our approach outperforms existing LLM-based zero-shot RE methods. Additionally, our experiments confirm the effectiveness of our generation pipeline in producing high-quality synthetic data that enhances performance.

* EMNLP 2024 Short

Via

Access Paper or Ask Questions

DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

Sep 12, 2024

Kangyang Luo, Shuai Wang, Yexuan Fu, Renrong Shao, Xiang Li, Yunshi Lan, Ming Gao, Jinlong Shu

Figure 1 for DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

Figure 2 for DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

Figure 3 for DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

Figure 4 for DFDG: Data-Free Dual-Generator Adversarial Distillation for One-Shot Federated Learning

Abstract:Federated Learning (FL) is a distributed machine learning scheme in which clients jointly participate in the collaborative training of a global model by sharing model information rather than their private datasets. In light of concerns associated with communication and privacy, one-shot FL with a single communication round has emerged as a de facto promising solution. However, existing one-shot FL methods either require public datasets, focus on model homogeneous settings, or distill limited knowledge from local models, making it difficult or even impractical to train a robust global model. To address these limitations, we propose a new data-free dual-generator adversarial distillation method (namely DFDG) for one-shot FL, which can explore a broader local models' training space via training dual generators. DFDG is executed in an adversarial manner and comprises two parts: dual-generator training and dual-model distillation. In dual-generator training, we delve into each generator concerning fidelity, transferability and diversity to ensure its utility, and additionally tailor the cross-divergence loss to lessen the overlap of dual generators' output spaces. In dual-model distillation, the trained dual generators work together to provide the training data for updates of the global model. At last, our extensive experiments on various image classification tasks show that DFDG achieves significant performance gains in accuracy compared to SOTA baselines.

* Accepted by ICDM2024 main conference (long paper)

Via

Access Paper or Ask Questions

Aligning Large Language Models to a Domain-specific Graph Database

Feb 28, 2024

Yuanyuan Liang, Keren Tan, Tingyu Xie, Wenbiao Tao, Siyuan Wang, Yunshi Lan, Weining Qian

Abstract:Graph Databases (Graph DB) are widely applied in various fields, including finance, social networks, and medicine. However, translating Natural Language (NL) into the Graph Query Language (GQL), commonly known as NL2GQL, proves to be challenging due to its inherent complexity and specialized nature. Some approaches have sought to utilize Large Language Models (LLMs) to address analogous tasks like text2SQL. Nevertheless, when it comes to NL2GQL taskson a particular domain, the absence of domain-specific NL-GQL data pairs makes it difficult to establish alignment between LLMs and the graph DB. To address this challenge, we propose a well-defined pipeline. Specifically, we utilize ChatGPT to create NL-GQL data pairs based on the given graph DB with self-instruct. Then, we use the created data to fine-tune LLMs, thereby achieving alignment between LLMs and the graph DB. Additionally, during inference, we propose a method that extracts relevant schema to the queried NL as the input context to guide LLMs for generating accurate GQLs.We evaluate our method on two constructed datasets deriving from graph DBs in finance domain and medicine domain, namely FinGQL and MediGQL. Experimental results demonstrate that our method significantly outperforms a set of baseline methods, with improvements of 5.90 and 6.36 absolute points on EM, and 6.00 and 7.09 absolute points on EX, respectively.

* 13 pages,2 figures

Via

Access Paper or Ask Questions

An LLM-Enhanced Adversarial Editing System for Lexical Simplification

Feb 23, 2024

Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan, Jinlong Shu

Abstract:Lexical Simplification (LS) aims to simplify text at the lexical level. Existing methods rely heavily on annotated data, making it challenging to apply in low-resource scenarios. In this paper, we propose a novel LS method without parallel corpora. This method employs an Adversarial Editing System with guidance from a confusion loss and an invariance loss to predict lexical edits in the original sentences. Meanwhile, we introduce an innovative LLM-enhanced loss to enable the distillation of knowledge from Large Language Models (LLMs) into a small-size LS system. From that, complex words within sentences are masked and a Difficulty-aware Filling module is crafted to replace masked positions with simpler words. At last, extensive experimental results and analyses on three benchmark LS datasets demonstrate the effectiveness of our proposed method.

* Accepted by COLING 2024 main conference

Via

Access Paper or Ask Questions

Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions

Feb 21, 2024

Lei Pan, Yunshi Lan, Yang Li, Weining Qian

Abstract:Unsupervised Text Style Transfer (UTST) has emerged as a critical task within the domain of Natural Language Processing (NLP), aiming to transfer one stylistic aspect of a sentence into another style without changing its semantics, syntax, or other attributes. This task is especially challenging given the intrinsic lack of parallel text pairings. Among existing methods for UTST tasks, attention masking approach and Large Language Models (LLMs) are deemed as two pioneering methods. However, they have shortcomings in generating unsmooth sentences and changing the original contents, respectively. In this paper, we investigate if we can combine these two methods effectively. We propose four ways of interactions, that are pipeline framework with tuned orders; knowledge distillation from LLMs to attention masking model; in-context learning with constructed parallel examples. We empirically show these multi-way interactions can improve the baselines in certain perspective of style strength, content preservation and text fluency. Experiments also demonstrate that simply conducting prompting followed by attention masking-based revision can consistently surpass the other systems, including supervised text style transfer systems. On Yelp-clean and Amazon-clean datasets, it improves the previously best mean metric by 0.5 and 3.0 absolute percentages respectively, and achieves new SOTA results.

Via

Access Paper or Ask Questions

TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Feb 20, 2024

Xiang Li, Yunshi Lan, Chao Yang

Figure 1 for TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Figure 2 for TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Figure 3 for TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Figure 4 for TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Abstract:Recently, numerous new benchmarks have been established to evaluate the performance of large language models (LLMs) via either computing a holistic score or employing another LLM as a judge. However, these approaches suffer from data leakage due to the open access of the benchmark and inflexible evaluation process. To address this issue, we introduce $\textbf{TreeEval}$, a benchmark-free evaluation method for LLMs that let a high-performance LLM host an irreproducible evaluation session and essentially avoids the data leakage. Moreover, this LLM performs as an examiner to raise up a series of questions under a topic with a tree planing strategy, which considers the current evaluation status to decide the next question generation and ensures the completeness and efficiency of the evaluation process. We evaluate $6$ models of different parameter sizes, including $7$B, $13$B, and $33$B, and ultimately achieved the highest correlation coefficient with AlpacaEval2.0 using only around $45$ questions. We also conduct more analysis to show the robustness and reliability of TreeEval. Our code can be accessed via the provided https://github.com/Ashura5/TreeEval.

Via

Access Paper or Ask Questions

Safety of Multimodal Large Language Models on Images and Text

Feb 01, 2024

Xin Liu, Yichen Zhu, Yunshi Lan, Chao Yang, Yu Qiao

Abstract:Attracted by the impressive power of Multimodal Large Language Models (MLLMs), the public is increasingly utilizing them to improve the efficiency of daily work. Nonetheless, the vulnerabilities of MLLMs to unsafe instructions bring huge safety risks when these models are deployed in real-world scenarios. In this paper, we systematically survey current efforts on the evaluation, attack, and defense of MLLMs' safety on images and text. We begin with introducing the overview of MLLMs on images and text and understanding of safety, which helps researchers know the detailed scope of our survey. Then, we review the evaluation datasets and metrics for measuring the safety of MLLMs. Next, we comprehensively present attack and defense techniques related to MLLMs' safety. Finally, we analyze several unsolved issues and discuss promising research directions.

Via

Access Paper or Ask Questions

Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Jan 31, 2024

Yunshi Lan, Xinyuan Li, Hanyue Du, Xuesong Lu, Ming Gao, Weining Qian, Aoying Zhou

Figure 1 for Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Figure 2 for Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Figure 3 for Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Figure 4 for Survey of Natural Language Processing for Education: Taxonomy, Systematic Review, and Future Trends

Abstract:Natural Language Processing (NLP) aims to analyze the text via techniques in the computer science field. It serves the applications in healthcare, commerce, and education domains. Particularly, NLP has been applied to the education domain to help teaching and learning. In this survey, we review recent advances in NLP with a focus on solving problems related to the education domain. In detail, we begin with introducing the relevant background. Then, we present the taxonomy of NLP in the education domain. Next, we illustrate the task definition, challenges, and corresponding techniques based on the above taxonomy. After that, we showcase some off-the-shelf demonstrations in this domain and conclude with future directions.

Via

Access Paper or Ask Questions