Chengdong Ma

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Oct 22, 2024
Via arXiv

A Survey on Self-play Methods in Reinforcement Learning

Aug 02, 2024
Via arXiv

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

Jun 03, 2024
Via arXiv

Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects

Mar 01, 2024
Via arXiv

Panacea: Pareto Alignment via Preference Adaptation for LLMs

Feb 03, 2024
Via arXiv

Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models

Oct 10, 2023
Via arXiv

Fully Decentralized Model-based Policy Optimization for Networked Systems

Jul 13, 2022
Via arXiv