Picture for Kyungjae Lee

Kyungjae Lee

Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation

Add code
Jul 31, 2024
Viaarxiv icon

Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation

Add code
Jul 01, 2024
Figure 1 for Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Figure 2 for Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Figure 3 for Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Figure 4 for Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation
Viaarxiv icon

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Add code
Jun 09, 2024
Figure 1 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 2 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 3 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 4 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Viaarxiv icon

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

Add code
May 29, 2024
Viaarxiv icon

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Add code
May 02, 2024
Viaarxiv icon

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

Add code
Mar 21, 2024
Viaarxiv icon

Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion

Add code
Oct 27, 2023
Viaarxiv icon

PreWoMe: Exploiting Presuppositions as Working Memory for Long Form Question Answering

Add code
Oct 24, 2023
Viaarxiv icon

SPOTS: Stable Placement of Objects with Reasoning in Semi-Autonomous Teleoperation Systems

Add code
Sep 25, 2023
Viaarxiv icon

On Monotonic Aggregation for Open-domain QA

Add code
Aug 08, 2023
Viaarxiv icon