Picture for Xiaolin Sun

Xiaolin Sun

Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations

Add code
Mar 06, 2024
Viaarxiv icon

Enhancing LLM Safety via Constrained Direct Preference Optimization

Add code
Mar 04, 2024
Figure 1 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 2 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 3 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Figure 4 for Enhancing LLM Safety via Constrained Direct Preference Optimization
Viaarxiv icon

Pandering in a Flexible Representative Democracy

Add code
Nov 18, 2022
Figure 1 for Pandering in a Flexible Representative Democracy
Figure 2 for Pandering in a Flexible Representative Democracy
Figure 3 for Pandering in a Flexible Representative Democracy
Viaarxiv icon

An exact solution in Markov decision process with multiplicative rewards as a general framework

Add code
Dec 15, 2020
Viaarxiv icon

Leveraging Legacy Data to Accelerate Materials Design via Preference Learning

Add code
Oct 25, 2019
Figure 1 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 2 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 3 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Figure 4 for Leveraging Legacy Data to Accelerate Materials Design via Preference Learning
Viaarxiv icon