Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aijia Zhang

Distributionally Robust Deep Q-Learning

May 25, 2025

Chung I Lu, Julian Sester, Aijia Zhang

Abstract:We propose a novel distributionally robust $Q$-learning algorithm for the non-tabular case accounting for continuous state spaces where the state transition of the underlying Markov decision process is subject to model uncertainty. The uncertainty is taken into account by considering the worst-case transition from a ball around a reference probability measure. To determine the optimal policy under the worst-case state transition, we solve the associated non-linear Bellman equation by dualising and regularising the Bellman operator with the Sinkhorn distance, which is then parameterized with deep neural networks. This approach allows us to modify the Deep Q-Network algorithm to optimise for the worst case state transition. We illustrate the tractability and effectiveness of our approach through several applications, including a portfolio optimisation task based on S\&{P}~500 data.

Via

Access Paper or Ask Questions

An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Nov 11, 2024

Dong Li, Aijia Zhang, Junqi Gao, Biqing Qi

Figure 1 for An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Figure 2 for An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Figure 3 for An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Figure 4 for An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Abstract:Incremental graph learning has gained significant attention for its ability to address the catastrophic forgetting problem in graph representation learning. However, traditional methods often rely on a large number of labels for node classification, which is impractical in real-world applications. This makes few-shot incremental learning on graphs a pressing need. Current methods typically require extensive training samples from meta-learning to build memory and perform intensive fine-tuning of GNN parameters, leading to high memory consumption and potential loss of previously learned knowledge. To tackle these challenges, we introduce Mecoin, an efficient method for building and maintaining memory. Mecoin employs Structured Memory Units to cache prototypes of learned categories, as well as Memory Construction Modules to update these prototypes for new categories through interactions between the nodes and the cached prototypes. Additionally, we have designed a Memory Representation Adaptation Module to store probabilities associated with each class prototype, reducing the need for parameter fine-tuning and lowering the forgetting rate. When a sample matches its corresponding class prototype, the relevant probabilities are retrieved from the MRaM. Knowledge is then distilled back into the GNN through a Graph Knowledge Distillation Module, preserving the model's memory. We analyze the effectiveness of Mecoin in terms of generalization error and explore the impact of different distillation strategies on model performance through experiments and VC-dimension analysis. Compared to other related works, Mecoin shows superior performance in accuracy and forgetting rate. Our code is publicly available on the https://github.com/Arvin0313/Mecoin-GFSCIL.git .

* 16 pages, 6 figures, 38th Conference on Neural Information Processing Systems, 2024

Via

Access Paper or Ask Questions

Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Dec 05, 2023

Hongbin Ye, Honghao Gui, Aijia Zhang, Tong Liu, Wei Hua, Weiqiang Jia

Figure 1 for Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Figure 2 for Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Figure 3 for Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Figure 4 for Beyond Isolation: Multi-Agent Synergy for Improving Knowledge Graph Construction

Abstract:Knowledge graph construction (KGC) is a multifaceted undertaking involving the extraction of entities, relations, and events. Traditionally, large language models (LLMs) have been viewed as solitary task-solving agents in this complex landscape. However, this paper challenges this paradigm by introducing a novel framework, CooperKGC. Departing from the conventional approach, CooperKGC establishes a collaborative processing network, assembling a KGC collaboration team capable of concurrently addressing entity, relation, and event extraction tasks. Our experiments unequivocally demonstrate that fostering collaboration and information interaction among diverse agents within CooperKGC yields superior results compared to individual cognitive processes operating in isolation. Importantly, our findings reveal that the collaboration facilitated by CooperKGC enhances knowledge selection, correction, and aggregation capabilities across multiple rounds of interactions.

* work in progress; 12 pages

Via

Access Paper or Ask Questions

Cognitive Mirage: A Review of Hallucinations in Large Language Models

Sep 13, 2023

Hongbin Ye, Tong Liu, Aijia Zhang, Wei Hua, Weiqiang Jia

Figure 1 for Cognitive Mirage: A Review of Hallucinations in Large Language Models

Figure 2 for Cognitive Mirage: A Review of Hallucinations in Large Language Models

Figure 3 for Cognitive Mirage: A Review of Hallucinations in Large Language Models

Figure 4 for Cognitive Mirage: A Review of Hallucinations in Large Language Models

Abstract:As large language models continue to develop in the field of AI, text generation systems are susceptible to a worrisome phenomenon known as hallucination. In this study, we summarize recent compelling insights into hallucinations in LLMs. We present a novel taxonomy of hallucinations from various text generation tasks, thus provide theoretical insights, detection methods and improvement approaches. Based on this, future research directions are proposed. Our contribution are threefold: (1) We provide a detailed and complete taxonomy for hallucinations appearing in text generation tasks; (2) We provide theoretical analyses of hallucinations in LLMs and provide existing detection and improvement methods; (3) We propose several research directions that can be developed in the future. As hallucinations garner significant attention from the community, we will maintain updates on relevant research progress.

* work in progress; 21 pages

Via

Access Paper or Ask Questions