Xiang Ji

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Feb 10, 2025
Via arXiv

Improving Vision-Language-Action Model with Online Reinforcement Learning

Jan 28, 2025
Via arXiv

RS-NeRF: Neural Radiance Fields from Rolling Shutter Images

Jul 14, 2024
Via arXiv

Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models

Jun 06, 2024
Via arXiv

Motion Blur Decomposition with Cross-shutter Guidance

Apr 01, 2024
Via arXiv

Sample Complexity of Preference-Based Nonparametric Off-Policy Evaluation with Deep Networks

Oct 16, 2023
Via arXiv

Towards Deep Learning Models Resistant to Transfer-based Adversarial Attacks via Data-centric Robust Learning

Oct 15, 2023
Via arXiv

Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds

Sep 25, 2023
Via arXiv

Hard Adversarial Example Mining for Improving Robust Fairness

Aug 03, 2023
Via arXiv

Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems

Jul 24, 2023
Via arXiv