Abstract:As the volume of scientific submissions continues to grow rapidly, traditional peer review systems are facing unprecedented scalability pressures, highlighting the urgent need for automated reviewing methods that are both scalable and reliable. Existing supervised fine-tuning approaches based on real review data are fundamentally constrained by a single source of data as well as the inherent subjectivity and inconsistency of human reviews, limiting their ability to support high-quality automated reviewers. To address these issues, we propose EchoReview, a citation-context-driven data synthesis framework that systematically mines implicit collective evaluative signals from academic citations and transforms the scientific community's long-term judgments into structured review-style data. Based on this pipeline, we construct EchoReview-16K, the first large-scale, cross-conference, and cross-year citation-driven review dataset, and train an automated reviewer, EchoReviewer-7B. Experimental results demonstrate that EchoReviewer-7B achieves significant and stable improvements on core review dimensions such as evidence support and review comprehensiveness, validating citation context as a robust and effective data paradigm for reliable automated peer review.
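A minimal sketch of the kind of citation-context aggregation described above, with hypothetical field names and a toy cue-word polarity heuristic standing in for EchoReview's actual pipeline:

# Illustrative sketch only: aggregates citation sentences about a paper into
# review-style strengths and weaknesses. Field names and the polarity heuristic
# are hypothetical, not the actual EchoReview pipeline.
from collections import defaultdict

POSITIVE_CUES = {"state-of-the-art", "effective", "robust", "outperforms"}
NEGATIVE_CUES = {"fails", "limited", "does not scale", "inconsistent"}

def synthesize_review(citation_contexts):
    """citation_contexts: list of dicts like
    {"citing_paper": str, "sentence": str, "aspect": "method" | "evaluation" | ...}.
    Returns aspect -> {"strengths": [...], "weaknesses": [...]}."""
    review = defaultdict(lambda: {"strengths": [], "weaknesses": []})
    for ctx in citation_contexts:
        text = ctx["sentence"].lower()
        if any(cue in text for cue in NEGATIVE_CUES):
            review[ctx["aspect"]]["weaknesses"].append(ctx["sentence"])
        elif any(cue in text for cue in POSITIVE_CUES):
            review[ctx["aspect"]]["strengths"].append(ctx["sentence"])
    return dict(review)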
Abstract:Large Language Models (LLMs) and agent-based systems often struggle with compositional generalization due to a data bottleneck in which complex skill combinations follow a long-tailed, power-law distribution, limiting both instruction-following performance and generalization in agent-centric tasks. To address this challenge, we propose STEPS, a Skill Taxonomy guided Entropy-based Post-training data Synthesis framework for generating compositionally challenging data. STEPS explicitly targets compositional generalization by uncovering latent relationships among skills and organizing them into an interpretable, hierarchical skill taxonomy using structural information theory. Building on this taxonomy, we formulate data synthesis as a constrained information maximization problem, selecting skill combinations that maximize marginal structural information within the hierarchy while preserving semantic coherence. Experiments on challenging instruction-following benchmarks show that STEPS outperforms existing data synthesis baselines, while also yielding improved compositional generalization in downstream agent-based evaluations.
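A simplified sketch of the selection step implied by the constrained information-maximization formulation; structural_info_gain and semantic_coherence are placeholders for the structural-information and coherence measures STEPS actually uses:

# Illustrative sketch: greedily pick skill combinations with the largest
# marginal gain subject to a coherence constraint. The two callables stand in
# for the measures defined by STEPS and are assumptions for illustration.
from itertools import combinations

def select_skill_combinations(skills, structural_info_gain, semantic_coherence,
                              k=2, budget=100, min_coherence=0.5):
    selected = []
    # Constraint: only semantically coherent skill combinations are candidates.
    candidates = [c for c in combinations(skills, k)
                  if semantic_coherence(c) >= min_coherence]
    while candidates and len(selected) < budget:
        # Greedy step: maximize marginal structural-information gain.
        best = max(candidates, key=lambda c: structural_info_gain(c, selected))
        selected.append(best)
        candidates.remove(best)
    return selected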
Abstract:Large Language Models (LLMs), when enhanced through reasoning-oriented post-training, evolve into powerful Large Reasoning Models (LRMs). Tool-Integrated Reasoning (TIR) further extends their capabilities by incorporating external tools, but existing methods often rely on rigid, predefined tool-use patterns that risk degrading core language competence. Inspired by the human ability to adaptively select tools, we introduce AutoTIR, a reinforcement learning framework that enables LLMs to autonomously decide whether and which tool to invoke during the reasoning process, rather than following static tool-use strategies. AutoTIR leverages a hybrid reward mechanism that jointly optimizes for task-specific answer correctness, structured output adherence, and penalization of incorrect tool usage, thereby encouraging both precise reasoning and efficient tool integration. Extensive evaluations across diverse knowledge-intensive, mathematical, and general language modeling tasks demonstrate that AutoTIR achieves superior overall performance, significantly outperforming baselines and exhibiting stronger generalization in tool-use behavior. These results highlight the promise of reinforcement learning in building truly generalizable and scalable TIR capabilities in LLMs. The code and data are available at https://github.com/weiyifan1023/AutoTIR.
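A schematic version of the hybrid reward described above; the weights and the misuse criterion are illustrative assumptions, not the paper's exact values:

# Illustrative reward shaping: answer correctness + format adherence, minus a
# penalty for incorrect tool usage. Weights and the misuse test are assumptions.
def hybrid_reward(answer_correct: bool, output_well_formed: bool,
                  tool_called: bool, tool_call_valid: bool,
                  w_ans=1.0, w_fmt=0.2, w_tool=0.5) -> float:
    reward = w_ans * float(answer_correct) + w_fmt * float(output_well_formed)
    if tool_called and not tool_call_valid:   # penalize incorrect tool usage
        reward -= w_tool
    return reward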
Abstract:Large language models (LLMs) have achieved unprecedented performance by leveraging vast pretraining corpora, yet they remain suboptimal in knowledge-intensive domains such as medicine and scientific research, where high factual precision is required. While synthetic data provides a promising avenue for augmenting domain knowledge, existing methods frequently generate redundant samples that do not align with the model's true knowledge gaps. To overcome this limitation, we propose a novel Structural Entropy-guided Knowledge Navigator (SENATOR) framework that addresses the intrinsic knowledge deficiencies of LLMs. Our approach employs the structural entropy (SE) metric to quantify uncertainty along knowledge graph paths and leverages Monte Carlo Tree Search (MCTS) to selectively explore regions where the model lacks domain-specific knowledge. Guided by these insights, the framework generates targeted synthetic data for supervised fine-tuning, enabling continuous self-improvement. Experimental results on LLaMA-3 and Qwen2 across multiple domain-specific benchmarks show that SENATOR effectively detects and repairs knowledge deficiencies, achieving notable performance improvements. The code and data for our methods and experiments are available at https://github.com/weiyifan1023/senator.
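An illustrative sketch of uncertainty-guided path selection; SENATOR combines structural entropy with MCTS, whereas the mean negative log-probability score and top-k pick below are simplified stand-ins:

# Illustrative sketch: rank knowledge-graph paths by the model's uncertainty so
# that synthetic data targets likely knowledge gaps. The uncertainty score here
# is the mean negative log-probability, not SENATOR's structural entropy.
import math

def path_uncertainty(token_probs):
    """token_probs: per-token probabilities the model assigns when verbalizing a path."""
    return -sum(math.log(max(p, 1e-12)) for p in token_probs) / len(token_probs)

def pick_deficient_paths(paths_with_probs, k=10):
    """paths_with_probs: list of (path, token_probs) pairs; return the k most uncertain paths."""
    ranked = sorted(paths_with_probs, key=lambda x: path_uncertainty(x[1]), reverse=True)
    return [path for path, _ in ranked[:k]]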
Abstract:Large Language Models (LLMs) excel in tasks such as retrieval and question answering but require updates to incorporate new knowledge and reduce inaccuracies and hallucinations. Traditional updating methods, like fine-tuning and incremental learning, face challenges such as overfitting and high computational costs. Knowledge Editing (KE) provides a promising alternative but often overlooks the Knowledge Element Overlap (KEO) phenomenon, where multiple triplets share common elements, leading to editing conflicts. We identify the prevalence of KEO in existing KE datasets and show its significant impact on current KE methods, causing performance degradation in handling such triplets. To address this, we propose a new formulation, Knowledge Set Editing (KSE), and introduce SetKE, a method that edits sets of triplets simultaneously. Experimental results demonstrate that SetKE outperforms existing methods in KEO scenarios on mainstream LLMs. Additionally, we introduce EditSet, a dataset containing KEO triplets, providing a comprehensive benchmark.
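A worked example of the KEO phenomenon: the triplets below share a common head entity, so editing them one at a time can conflict; grouping them by the shared element (a simplified stand-in for the KSE formulation) allows them to be edited jointly:

# Worked example of Knowledge Element Overlap (KEO). The three triplets share
# the head entity "Paris"; grouping on the shared element yields one edit set.
from collections import defaultdict

triplets = [
    ("Paris", "capital_of", "France"),
    ("Paris", "located_in", "Île-de-France"),
    ("Paris", "population", "2.1M"),
]

def group_by_shared_element(triplets):
    groups = defaultdict(list)
    for h, r, t in triplets:
        groups[h].append((h, r, t))   # here: group on the shared head entity
    return dict(groups)

edit_sets = group_by_shared_element(triplets)   # {"Paris": [all three triplets]}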


Abstract:SocialED is a comprehensive, open-source Python library designed to support social event detection (SED) tasks, integrating 19 detection algorithms and 14 diverse datasets. It provides a unified API with detailed documentation, offering researchers and practitioners a complete solution for event detection in social media. The library is designed with modularity in mind, allowing users to easily adapt and extend components for various use cases. SocialED supports a wide range of preprocessing techniques, such as graph construction and tokenization, and includes standardized interfaces for training models and making predictions. By integrating popular deep learning frameworks, SocialED ensures high efficiency and scalability across both CPU and GPU environments. The library is built adhering to high code quality standards, including unit testing, continuous integration, and code coverage, ensuring that SocialED delivers robust, maintainable software. SocialED is publicly available at \url{https://github.com/RingBDStack/SocialED} and can be installed via PyPI.
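A hypothetical usage sketch of a unified fit/predict/evaluate workflow; the class and method names are assumptions for illustration, so consult the SocialED documentation for the actual API:

# Hypothetical sketch of a unified detector interface. Names are illustrative,
# not verified against the real SocialED API.
# from socialed import load_dataset, SomeDetector   # assumed names, not verified

def run_detector(detector, dataset):
    train, test = dataset.split()
    detector.fit(train)                               # preprocessing + training
    predicted_events = detector.predict(test)
    return detector.evaluate(test, predicted_events)  # e.g., NMI / AMI / ARI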




Abstract:The vast, complex, and dynamic nature of social message data has posed challenges to social event detection (SED). Despite considerable effort, these challenges persist, often resulting in inadequately expressive message representations (ineffective) and prolonged learning durations (inefficient). In response to these challenges, this work introduces an unsupervised framework, HyperSED (Hyperbolic SED). Specifically, the proposed framework first models social messages into semantic-based message anchors, and then leverages the structure of the anchor graph and the expressiveness of the hyperbolic space to acquire structure- and geometry-aware anchor representations. Finally, HyperSED builds the partitioning tree of the anchor message graph by incorporating differentiable structural information as the reflection of the detected events. Extensive experiments on public datasets demonstrate HyperSED's competitive performance, along with a substantial improvement in efficiency compared to the current state-of-the-art unsupervised paradigm. Quantitatively, HyperSED boosts incremental SED by an average of 2%, 2%, and 25% in NMI, AMI, and ARI, respectively, and improves efficiency by at least 12.10 times and up to 37.41 times, illustrating the advancement of the proposed framework. Our code is publicly available at https://github.com/XiaoyanWork/HyperSED.
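For reference, the standard Poincaré-ball distance that underlies hyperbolic representation learning of this kind (an illustration, not HyperSED's code):

# Standard Poincaré-ball distance: d(u, v) = arccosh(1 + 2*||u-v||^2 /
# ((1-||u||^2)(1-||v||^2))) for points inside the unit ball.
import numpy as np

def poincare_distance(u, v, eps=1e-9):
    u, v = np.asarray(u, dtype=float), np.asarray(v, dtype=float)
    sq = np.sum((u - v) ** 2)
    denom = (1.0 - np.sum(u ** 2)) * (1.0 - np.sum(v ** 2))
    return np.arccosh(1.0 + 2.0 * sq / max(denom, eps))

# Points near the ball boundary are far apart, which suits tree-like
# (hierarchical) structure such as a partitioning tree.
print(poincare_distance([0.1, 0.0], [0.0, 0.9]))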
Abstract:Multimodal sarcasm detection (MSD) is essential for various downstream tasks. Existing MSD methods tend to rely on spurious correlations. These methods often mistakenly prioritize non-essential features yet still make correct predictions, demonstrating poor generalizability beyond training environments. Regarding this phenomenon, this paper undertakes several initiatives. Firstly, we identify two primary causes that lead to the reliance on spurious correlations. Secondly, we address these challenges by proposing a novel method that integrates Multimodal Incongruities via Contrastive Learning (MICL) for multimodal sarcasm detection. Specifically, we first leverage incongruity to drive multi-view learning from three views: token-patch, entity-object, and sentiment. Then, we introduce extensive data augmentation to mitigate the biased learning of the textual modality. Additionally, we construct a test set, SPMSD, which contains potential spurious correlations to evaluate the model's generalizability. Experimental results demonstrate the superiority of MICL on benchmark datasets, along with analyses showcasing MICL's advancement in mitigating the effect of spurious correlations.
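A minimal InfoNCE-style contrastive loss of the kind used to align paired views (e.g., token-patch, entity-object, sentiment); an illustration rather than MICL's actual objective:

# Minimal InfoNCE-style contrastive loss between two batches of view embeddings;
# row i of each batch is treated as a positive pair. Temperature is illustrative.
import torch
import torch.nn.functional as F

def info_nce(view_a, view_b, temperature=0.07):
    """view_a, view_b: (batch, dim) embeddings of the same samples under two views."""
    a = F.normalize(view_a, dim=-1)
    b = F.normalize(view_b, dim=-1)
    logits = a @ b.t() / temperature                     # (batch, batch) similarities
    targets = torch.arange(a.size(0), device=a.device)   # diagonal entries are positives
    return F.cross_entropy(logits, targets)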




Abstract:Uncertainty Quantification (UQ) is crucial for reliable image segmentation. Yet, while the field sees continual development of novel methods, a lack of agreed-upon benchmarks limits their systematic comparison and evaluation: Current UQ methods are typically tested either on overly simplistic toy datasets or on complex real-world datasets that do not allow the true uncertainty to be discerned. To unify both controllability and complexity, we introduce Arctique, a procedurally generated dataset modeled after histopathological colon images. We chose histopathological images for two reasons: 1) their complexity in terms of intricate object structures and highly variable appearance, which yields challenging segmentation problems, and 2) their broad prevalence in medical diagnosis and the corresponding relevance of high-quality UQ. To generate Arctique, we established a Blender-based framework for 3D scene creation with intrinsic noise manipulation. Arctique contains 50,000 rendered images with precise masks as well as noisy label simulations. We show that by independently controlling the uncertainty in both images and labels, we can effectively study the performance of several commonly used UQ methods. Hence, Arctique serves as a critical resource for benchmarking and advancing UQ techniques and other methodologies in complex, multi-object environments, bridging the gap between realism and controllability. All code is publicly available, allowing re-creation and controlled manipulations of our shipped images as well as creation and rendering of new scenes.
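An illustrative sketch of controllable label-noise simulation on a single instance mask, mimicking the independent label-noise control described above (not the Arctique generation code itself):

# Illustrative label-noise simulation: perturb an instance mask by random
# dilation or erosion with a controllable strength. Parameters are assumptions.
import numpy as np
from scipy import ndimage

def perturb_mask(mask, noise_level=1, seed=0):
    """mask: boolean 2D array for one object; noise_level: morphology iterations."""
    rng = np.random.default_rng(seed)
    if rng.random() < 0.5:
        return ndimage.binary_dilation(mask, iterations=noise_level)
    return ndimage.binary_erosion(mask, iterations=noise_level)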




Abstract:Training social event detection models through federated learning (FedSED) aims to improve participants' performance on the task. However, existing federated learning paradigms are inadequate for achieving FedSED's objective and exhibit limitations in handling the inherent heterogeneity in social data. This paper proposes a personalized federated learning framework with a dual aggregation mechanism for social event detection, namely DAMe. We present a novel local aggregation strategy utilizing Bayesian optimization to incorporate global knowledge while retaining local characteristics. Moreover, we introduce a global aggregation strategy to provide clients with the maximum external knowledge relevant to their preferences. In addition, we incorporate a global-local event-centric constraint to prevent local overfitting and ``client-drift''. Experiments in a realistic simulation of a natural federated setting, using six social event datasets spanning six languages and two social media platforms, together with an ablation study, demonstrate the effectiveness of the proposed framework. Further robustness analyses show that DAMe is resistant to injection attacks.
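A simplified sketch of the local aggregation idea: interpolate global and local parameters and keep the mixing coefficient that scores best on local validation data; DAMe selects this coefficient with Bayesian optimization, for which the candidate search below is a stand-in:

# Simplified local aggregation sketch: mix global and local parameters and keep
# the best-scoring coefficient. The grid of alphas and the evaluate callable are
# illustrative; DAMe uses Bayesian optimization for this selection.
def aggregate_locally(local_state, global_state, evaluate,
                      alphas=(0.0, 0.25, 0.5, 0.75, 1.0)):
    """local_state, global_state: dicts of parameter arrays with matching keys.
    evaluate: callable scoring a candidate state on local validation data."""
    def mix(alpha):
        return {k: alpha * global_state[k] + (1 - alpha) * local_state[k]
                for k in local_state}
    candidates = {alpha: mix(alpha) for alpha in alphas}
    best_alpha = max(candidates, key=lambda a: evaluate(candidates[a]))
    return candidates[best_alpha], best_alpha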