Picture for Simon Shaolei Du

Simon Shaolei Du

Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation

Add code
Dec 17, 2024
Viaarxiv icon

Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration

Add code
Dec 13, 2024
Viaarxiv icon

On Erroneous Agreements of CLIP Image Embeddings

Add code
Nov 07, 2024
Viaarxiv icon

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning

Add code
Jul 02, 2024
Viaarxiv icon

CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning

Add code
May 29, 2024
Viaarxiv icon

Offline Multi-task Transfer RL with Representational Penalization

Add code
Feb 19, 2024
Viaarxiv icon

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

Add code
Feb 03, 2024
Viaarxiv icon

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Add code
Oct 30, 2023
Viaarxiv icon

Robust Offline Reinforcement Learning -- Certify the Confidence Interval

Add code
Oct 03, 2023
Viaarxiv icon

LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning

Add code
Jun 16, 2023
Figure 1 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 2 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 3 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 4 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Viaarxiv icon