Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bokui Chen

Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

May 22, 2024

Zhiyuan Wang, Bokui Chen, Xiaoyang Qu, Zhenhou Hong, Jing Xiao, Jianzong Wang

Figure 1 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Figure 2 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Figure 3 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Figure 4 for Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training

Abstract:With the rapid advancements in artificial intelligence, the development of knowledgeable and personalized agents has become increasingly prevalent. However, the inherent variability in state variables and action spaces among personalized agents poses significant aggregation challenges for traditional federated learning algorithms. To tackle these challenges, we introduce the Federated Split Decision Transformer (FSDT), an innovative framework designed explicitly for AI agent decision tasks. The FSDT framework excels at navigating the intricacies of personalized agents by harnessing distributed data for training while preserving data privacy. It employs a two-stage training process, with local embedding and prediction models on client agents and a global transformer decoder model on the server. Our comprehensive evaluation using the benchmark D4RL dataset highlights the superior performance of our algorithm in federated split learning for personalized agents, coupled with significant reductions in communication and computational overhead compared to traditional centralized training approaches. The FSDT framework demonstrates strong potential for enabling efficient and privacy-preserving collaborative learning in applications such as autonomous driving decision systems. Our findings underscore the efficacy of the FSDT framework in effectively leveraging distributed offline reinforcement learning data to enable powerful multi-type agent decision systems.

* Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

Via

Access Paper or Ask Questions

Large Language Models Powered Context-aware Motion Prediction

Mar 17, 2024

Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong

Figure 1 for Large Language Models Powered Context-aware Motion Prediction

Figure 2 for Large Language Models Powered Context-aware Motion Prediction

Figure 3 for Large Language Models Powered Context-aware Motion Prediction

Figure 4 for Large Language Models Powered Context-aware Motion Prediction

Abstract:Motion prediction is among the most fundamental tasks in autonomous driving. Traditional methods of motion forecasting primarily encode vector information of maps and historical trajectory data of traffic participants, lacking a comprehensive understanding of overall traffic semantics, which in turn affects the performance of prediction tasks. In this paper, we utilized Large Language Models (LLMs) to enhance the global traffic context understanding for motion prediction tasks. We first conducted systematic prompt engineering, visualizing complex traffic environments and historical trajectory information of traffic participants into image prompts -- Transportation Context Map (TC-Map), accompanied by corresponding text prompts. Through this approach, we obtained rich traffic context information from the LLM. By integrating this information into the motion prediction model, we demonstrate that such context can enhance the accuracy of motion predictions. Furthermore, considering the cost associated with LLMs, we propose a cost-effective deployment strategy: enhancing the accuracy of motion prediction tasks at scale with 0.7\% LLM-augmented datasets. Our research offers valuable insights into enhancing the understanding of traffic scenes of LLMs and the motion prediction performance of autonomous driving.

* 6 pages,4 figures

Via

Access Paper or Ask Questions

INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Jan 22, 2024

Zhiyuan Wang, Xiaoyang Qu, Jing Xiao, Bokui Chen, Jianzong Wang

Figure 1 for INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Figure 2 for INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Figure 3 for INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Figure 4 for INCPrompt: Task-Aware incremental Prompting for Rehearsal-Free Class-incremental Learning

Abstract:This paper introduces INCPrompt, an innovative continual learning solution that effectively addresses catastrophic forgetting. INCPrompt's key innovation lies in its use of adaptive key-learner and task-aware prompts that capture task-relevant information. This unique combination encapsulates general knowledge across tasks and encodes task-specific knowledge. Our comprehensive evaluation across multiple continual learning benchmarks demonstrates INCPrompt's superiority over existing algorithms, showing its effectiveness in mitigating catastrophic forgetting while maintaining high performance. These results highlight the significant impact of task-aware incremental prompting on continual learning performance.

* Accepted by the 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

Via

Access Paper or Ask Questions

P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Jan 22, 2024

Zhiyuan Wang, Xiaoyang Qu, Jing Xiao, Bokui Chen, Jianzong Wang

Figure 1 for P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Figure 2 for P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Figure 3 for P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Figure 4 for P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer

Abstract:Catastrophic forgetting poses a substantial challenge for managing intelligent agents controlled by a large model, causing performance degradation when these agents face new tasks. In our work, we propose a novel solution - the Progressive Prompt Decision Transformer (P2DT). This method enhances a transformer-based model by dynamically appending decision tokens during new task training, thus fostering task-specific policies. Our approach mitigates forgetting in continual and offline reinforcement learning scenarios. Moreover, P2DT leverages trajectories collected via traditional reinforcement learning from all tasks and generates new task-specific tokens during training, thereby retaining knowledge from previous studies. Preliminary results demonstrate that our model effectively alleviates catastrophic forgetting and scales well with increasing task environments.

* Accepted by the 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

Via

Access Paper or Ask Questions