Picture for Jaekyeom Kim

Jaekyeom Kim

Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents

Add code
Oct 29, 2024
Viaarxiv icon

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Add code
Apr 26, 2024
Viaarxiv icon

AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents

Add code
Mar 13, 2024
Viaarxiv icon

Lipschitz-constrained Unsupervised Skill Discovery

Add code
Feb 08, 2022
Figure 1 for Lipschitz-constrained Unsupervised Skill Discovery
Figure 2 for Lipschitz-constrained Unsupervised Skill Discovery
Figure 3 for Lipschitz-constrained Unsupervised Skill Discovery
Figure 4 for Lipschitz-constrained Unsupervised Skill Discovery
Viaarxiv icon

Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods

Add code
Dec 01, 2021
Figure 1 for Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Figure 2 for Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Figure 3 for Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Figure 4 for Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Viaarxiv icon

Unsupervised Skill Discovery with Bottleneck Option Learning

Add code
Jun 27, 2021
Figure 1 for Unsupervised Skill Discovery with Bottleneck Option Learning
Figure 2 for Unsupervised Skill Discovery with Bottleneck Option Learning
Figure 3 for Unsupervised Skill Discovery with Bottleneck Option Learning
Figure 4 for Unsupervised Skill Discovery with Bottleneck Option Learning
Viaarxiv icon

Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration

Add code
Mar 23, 2021
Figure 1 for Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration
Figure 2 for Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration
Figure 3 for Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration
Figure 4 for Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration
Viaarxiv icon

EMI: Exploration with Mutual Information Maximizing State and Action Embeddings

Add code
Oct 04, 2018
Figure 1 for EMI: Exploration with Mutual Information Maximizing State and Action Embeddings
Figure 2 for EMI: Exploration with Mutual Information Maximizing State and Action Embeddings
Figure 3 for EMI: Exploration with Mutual Information Maximizing State and Action Embeddings
Figure 4 for EMI: Exploration with Mutual Information Maximizing State and Action Embeddings
Viaarxiv icon