Pengjie Ren

MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization

Oct 10, 2024
Via arXiv

Uncovering Overfitting in Large Language Model Editing

Oct 10, 2024
Via arXiv

Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing

Aug 22, 2024
Via arXiv

Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning

Aug 18, 2024
Via arXiv

Chain-of-Strategy Planning with LLMs: Aligning the Generation of Psychotherapy Dialogue with Strategy in Motivational Interviewing

Aug 12, 2024
Via arXiv

Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering

Jun 21, 2024
Via arXiv

MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter

Jun 07, 2024
Via arXiv

Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality

May 16, 2024
Via arXiv

ExcluIR: Exclusionary Neural Information Retrieval

Apr 26, 2024
Via arXiv

Offline Trajectory Generalization for Offline Reinforcement Learning

Apr 16, 2024
Via arXiv