Picture for Markel Sanz Ausin

Markel Sanz Ausin

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Add code
May 02, 2024
Viaarxiv icon

Polaris: A Safety-focused LLM Constellation Architecture for Healthcare

Add code
Mar 20, 2024
Viaarxiv icon

HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare

Add code
Feb 18, 2023
Viaarxiv icon

InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem

Add code
May 02, 2021
Figure 1 for InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem
Figure 2 for InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem
Figure 3 for InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem
Figure 4 for InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem
Viaarxiv icon