Blazej Manczak

PrimeGuard: Safe and Helpful LLMs through Tuning-Free Routing

Jul 23, 2024
Via arXiv

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Feb 07, 2024
Via arXiv

Hierarchical Reinforcement Learning for Power Network Topology Control

Nov 03, 2023
Via arXiv