Picture for Ching-An Cheng

Ching-An Cheng

Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC

Add code
Nov 11, 2024
Viaarxiv icon

How to Solve Contextual Goal-Oriented Problems with Offline Datasets?

Add code
Aug 14, 2024
Viaarxiv icon

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Add code
Jun 23, 2024
Viaarxiv icon

The Importance of Directional Feedback for LLM-based Optimizers

Add code
May 26, 2024
Viaarxiv icon

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Add code
Apr 04, 2024
Figure 1 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 2 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 3 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 4 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Viaarxiv icon

PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem

Add code
Feb 16, 2024
Viaarxiv icon

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Add code
Dec 13, 2023
Viaarxiv icon

Interactive Robot Learning from Verbal Correction

Add code
Oct 26, 2023
Viaarxiv icon

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

Add code
Jun 30, 2023
Viaarxiv icon

Survival Instinct in Offline Reinforcement Learning

Add code
Jun 05, 2023
Viaarxiv icon