Ching-An Cheng

POLCA: Stochastic Generative Optimization with LLM

Mar 16, 2026

Provably Learning from Language Feedback

Jun 12, 2025

Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC

Nov 11, 2024

How to Solve Contextual Goal-Oriented Problems with Offline Datasets?

Aug 14, 2024

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Jun 23, 2024

The Importance of Directional Feedback for LLM-based Optimizers

May 26, 2024

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Apr 04, 2024

PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem

Feb 16, 2024

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Dec 13, 2023

Interactive Robot Learning from Verbal Correction

Oct 26, 2023