Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hongbin Ye

IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

Feb 22, 2024

Honghao Gui, Hongbin Ye, Lin Yuan, Ningyu Zhang, Mengshu Sun, Lei Liang, Huajun Chen

Figure 1 for IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

Figure 2 for IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

Figure 3 for IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

Figure 4 for IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus

Abstract:Large Language Models (LLMs) demonstrate remarkable potential across various domains; however, they exhibit a significant performance gap in Information Extraction (IE). Note that high-quality instruction data is the vital key for enhancing the specific capabilities of LLMs, while current IE datasets tend to be small in scale, fragmented, and lack standardized schema. To this end, we introduce IEPile, a comprehensive bilingual (English and Chinese) IE instruction corpus, which contains approximately 0.32B tokens. We construct IEPile by collecting and cleaning 33 existing IE datasets, and introduce schema-based instruction generation to unearth a large-scale corpus. Experimental results on LLaMA and Baichuan demonstrate that using IEPile can enhance the performance of LLMs for IE, especially the zero-shot generalization. We open-source the resource and pre-trained models, hoping to provide valuable support to the NLP community.

* Ongoing work; 18 pages; Github: https://github.com/zjunlp/IEPile

Via

Access Paper or Ask Questions

Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Dec 05, 2023

Hongbin Ye, Honghao Gui, Aijia Zhang, Tong Liu, Wei Hua, Weiqiang Jia

Abstract:Knowledge graph construction (KGC) is a multifaceted undertaking involving the extraction of entities, relations, and events. Traditionally, large language models (LLMs) have been viewed as solitary task-solving agents in this complex landscape. However, this paper challenges this paradigm by introducing a novel framework, CooperKGC. Departing from the conventional approach, CooperKGC establishes a collaborative processing network, assembling a KGC collaboration team capable of concurrently addressing entity, relation, and event extraction tasks. Our experiments unequivocally demonstrate that fostering collaboration and information interaction among diverse agents within CooperKGC yields superior results compared to individual cognitive processes operating in isolation. Importantly, our findings reveal that the collaboration facilitated by CooperKGC enhances knowledge selection, correction, and aggregation capabilities across multiple rounds of interactions.

* work in progress; 12 pages

Via

Access Paper or Ask Questions

Cognitive Mirage: A Review of Hallucinations in Large Language Models

Sep 13, 2023

Hongbin Ye, Tong Liu, Aijia Zhang, Wei Hua, Weiqiang Jia

Abstract:As large language models continue to develop in the field of AI, text generation systems are susceptible to a worrisome phenomenon known as hallucination. In this study, we summarize recent compelling insights into hallucinations in LLMs. We present a novel taxonomy of hallucinations from various text generation tasks, thus provide theoretical insights, detection methods and improvement approaches. Based on this, future research directions are proposed. Our contribution are threefold: (1) We provide a detailed and complete taxonomy for hallucinations appearing in text generation tasks; (2) We provide theoretical analyses of hallucinations in LLMs and provide existing detection and improvement methods; (3) We propose several research directions that can be developed in the future. As hallucinations garner significant attention from the community, we will maintain updates on relevant research progress.

* work in progress; 21 pages

Via

Access Paper or Ask Questions

InstructIE: A Chinese Instruction-based Information Extraction Dataset

May 19, 2023

Honghao Gui, Jintian Zhang, Hongbin Ye, Ningyu Zhang

Figure 1 for InstructIE: A Chinese Instruction-based Information Extraction Dataset

Figure 2 for InstructIE: A Chinese Instruction-based Information Extraction Dataset

Figure 3 for InstructIE: A Chinese Instruction-based Information Extraction Dataset

Figure 4 for InstructIE: A Chinese Instruction-based Information Extraction Dataset

Abstract:We introduce a new Information Extraction (IE) task dubbed Instruction-based IE, which aims to ask the system to follow specific instructions or guidelines to extract information. To facilitate research in this area, we construct a dataset called InstructIE, consisting of 270,000 weakly supervised data from Chinese Wikipedia and 1,000 high-quality crowdsourced annotated instances. We further evaluate the performance of various baseline models on the InstructIE dataset. The results reveal that although current models exhibit promising performance, there is still room for improvement. Furthermore, we conduct a comprehensive case study analysis, underlining the challenges inherent in the Instruction-based IE task. Code and dataset are available at https://github.com/zjunlp/DeepKE/tree/main/example/llm.

* Work in progress

Via

Access Paper or Ask Questions

Schema-adaptable Knowledge Graph Construction

May 19, 2023

Hongbin Ye, Honghao Gui, Xin Xu, Huajun Chen, Ningyu Zhang

Figure 1 for Schema-adaptable Knowledge Graph Construction

Figure 2 for Schema-adaptable Knowledge Graph Construction

Figure 3 for Schema-adaptable Knowledge Graph Construction

Figure 4 for Schema-adaptable Knowledge Graph Construction

Abstract:Conventional Knowledge Graph Construction (KGC) approaches typically follow the static information extraction paradigm with a closed set of pre-defined schema. As a result, such approaches fall short when applied to dynamic scenarios or domains, whereas a new type of knowledge emerges. This necessitates a system that can handle evolving schema automatically to extract information for KGC. To address this need, we propose a new task called schema-adaptable KGC, which aims to continually extract entity, relation, and event based on a dynamically changing schema graph without re-training. We first split and convert existing datasets based on three principles to build a benchmark, i.e., horizontal schema expansion, vertical schema expansion, and hybrid schema expansion; then investigate the schema-adaptable performance of several well-known approaches such as Text2Event, TANL, UIE and GPT-3.5. We further propose a simple yet effective baseline dubbed AdaKGC, which contains schema-enriched prefix instructor and schema-conditioned dynamic decoding to better handle evolving schema. Comprehensive experimental results illustrate that AdaKGC can outperform baselines but still have room for improvement. We hope the proposed work can deliver benefits to the community. Code and datasets will be available in https://github.com/zjunlp/AdaKGC.

* Work in progress

Via

Access Paper or Ask Questions

Generative Knowledge Graph Construction: A Review

Oct 23, 2022

Hongbin Ye, Ningyu Zhang, Hui Chen, Huajun Chen

Figure 1 for Generative Knowledge Graph Construction: A Review

Figure 2 for Generative Knowledge Graph Construction: A Review

Figure 3 for Generative Knowledge Graph Construction: A Review

Figure 4 for Generative Knowledge Graph Construction: A Review

Abstract:Generative Knowledge Graph Construction (KGC) refers to those methods that leverage the sequence-to-sequence framework for building knowledge graphs, which is flexible and can be adapted to widespread tasks. In this study, we summarize the recent compelling progress in generative knowledge graph construction. We present the advantages and weaknesses of each paradigm in terms of different generation targets and provide theoretical insight and empirical analysis. Based on the review, we suggest promising research directions for the future. Our contributions are threefold: (1) We present a detailed, complete taxonomy for the generative KGC methods; (2) We provide a theoretical and empirical analysis of the generative KGC methods; (3) We propose several research directions that can be developed in the future.

* Accepted to EMNLP 2022 and a public repository is available in https://github.com/zjunlp/Generative_KG_Construction_Papers

Via

Access Paper or Ask Questions

Ontology-enhanced Prompt-tuning for Few-shot Learning

Jan 27, 2022

Hongbin Ye, Ningyu Zhang, Shumin Deng, Xiang Chen, Hui Chen, Feiyu Xiong, Xi Chen, Huajun Chen

Figure 1 for Ontology-enhanced Prompt-tuning for Few-shot Learning

Figure 2 for Ontology-enhanced Prompt-tuning for Few-shot Learning

Figure 3 for Ontology-enhanced Prompt-tuning for Few-shot Learning

Figure 4 for Ontology-enhanced Prompt-tuning for Few-shot Learning

Abstract:Few-shot Learning (FSL) is aimed to make predictions based on a limited number of samples. Structured data such as knowledge graphs and ontology libraries has been leveraged to benefit the few-shot setting in various tasks. However, the priors adopted by the existing methods suffer from challenging knowledge missing, knowledge noise, and knowledge heterogeneity, which hinder the performance for few-shot learning. In this study, we explore knowledge injection for FSL with pre-trained language models and propose ontology-enhanced prompt-tuning (OntoPrompt). Specifically, we develop the ontology transformation based on the external knowledge graph to address the knowledge missing issue, which fulfills and converts structure knowledge to text. We further introduce span-sensitive knowledge injection via a visible matrix to select informative knowledge to handle the knowledge noise issue. To bridge the gap between knowledge and text, we propose a collective training algorithm to optimize representations jointly. We evaluate our proposed OntoPrompt in three tasks, including relation extraction, event extraction, and knowledge graph completion, with eight datasets. Experimental results demonstrate that our approach can obtain better few-shot performance than baselines.

* Accepted by WWW2022

Via

Access Paper or Ask Questions

DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Jan 24, 2022

Ningyu Zhang, Xin Xu, Liankuan Tao, Haiyang Yu, Hongbin Ye, Xin Xie, Xiang Chen, Zhoubo Li, Lei Li, Xiaozhuan Liang(+8 more)

Figure 1 for DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Figure 2 for DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Figure 3 for DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Figure 4 for DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

Abstract:We present a new open-source and extensible knowledge extraction toolkit, called DeepKE (Deep learning based Knowledge Extraction), supporting standard fully supervised, low-resource few-shot and document-level scenarios. DeepKE implements various information extraction tasks, including named entity recognition, relation extraction and attribute extraction. With a unified framework, DeepKE allows developers and researchers to customize datasets and models to extract information from unstructured texts according to their requirements. Specifically, DeepKE not only provides various functional modules and model implementation for different tasks and scenarios but also organizes all components by consistent frameworks to maintain sufficient modularity and extensibility. Besides, we present an online platform in http://deepke.zjukg.cn/ for real-time extraction of various tasks. DeepKE has been equipped with Google Colab tutorials and comprehensive documents for beginners. We release the source code at https://github.com/zjunlp/DeepKE, with a demo video.

* work in progress

Via

Access Paper or Ask Questions

LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Dec 02, 2021

Ningyu Zhang, Hongbin Ye, Jiacheng Yang, Shumin Deng, Chuanqi Tan, Mosha Chen, Songfang Huang, Fei Huang, Huajun Chen

Figure 1 for LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Figure 2 for LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Figure 3 for LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Figure 4 for LOGEN: Few-shot Logical Knowledge-Conditioned Text Generation with Self-training

Abstract:Natural language generation from structured data mainly focuses on surface-level descriptions, suffering from uncontrollable content selection and low fidelity. Previous works leverage logical forms to facilitate logical knowledge-conditioned text generation. Though achieving remarkable progress, they are data-hungry, which makes the adoption for real-world applications challenging with limited data. To this end, this paper proposes a unified framework for logical knowledge-conditioned text generation in the few-shot setting. With only a few seeds logical forms (e.g., 20/100 shot), our approach leverages self-training and samples pseudo logical forms based on content and structure consistency. Experimental results demonstrate that our approach can obtain better few-shot performance than baselines.

Via

Access Paper or Ask Questions

Learning to Ask for Data-Efficient Event Argument Extraction

Oct 01, 2021

Hongbin Ye, Ningyu Zhang, Zhen Bi, Shumin Deng, Chuanqi Tan, Hui Chen, Fei Huang, Huajun Chen

Figure 1 for Learning to Ask for Data-Efficient Event Argument Extraction

Figure 2 for Learning to Ask for Data-Efficient Event Argument Extraction

Figure 3 for Learning to Ask for Data-Efficient Event Argument Extraction

Abstract:Event argument extraction (EAE) is an important task for information extraction to discover specific argument roles. In this study, we cast EAE as a question-based cloze task and empirically analyze fixed discrete token template performance. As generating human-annotated question templates is often time-consuming and labor-intensive, we further propose a novel approach called "Learning to Ask," which can learn optimized question templates for EAE without human annotations. Experiments using the ACE-2005 dataset demonstrate that our method based on optimized questions achieves state-of-the-art performance in both the few-shot and supervised settings.

* work in progress

Via

Access Paper or Ask Questions