Picture for Keerthiram Murugesan

Keerthiram Murugesan

Combinatorial Multi-armed Bandits: Arm Selection via Group Testing

Add code
Oct 14, 2024
Viaarxiv icon

Towards Aligning Language Models with Textual Feedback

Add code
Jul 24, 2024
Viaarxiv icon

CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design

Add code
Jun 25, 2024
Viaarxiv icon

STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models

Add code
Jun 09, 2024
Viaarxiv icon

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

Add code
May 30, 2024
Viaarxiv icon

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

Add code
May 24, 2024
Viaarxiv icon

On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

Add code
Apr 15, 2024
Viaarxiv icon

EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Add code
Mar 15, 2024
Viaarxiv icon

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Add code
Mar 09, 2024
Viaarxiv icon

Language Guided Exploration for RL Agents in Text Environments

Add code
Mar 05, 2024
Viaarxiv icon