Picture for Haitham Bou-Ammar

Haitham Bou-Ammar

Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information

Add code
Jan 02, 2025
Viaarxiv icon

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics

Add code
Nov 08, 2024
Viaarxiv icon

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Add code
Nov 05, 2024
Figure 1 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 2 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 3 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 4 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Viaarxiv icon

SparsePO: Controlling Preference Alignment of LLMs via Sparse Token Masks

Add code
Oct 07, 2024
Viaarxiv icon

ShortCircuit: AlphaZero-Driven Circuit Design

Add code
Aug 19, 2024
Viaarxiv icon

Human-like Episodic Memory for Infinite Context LLMs

Add code
Jul 12, 2024
Figure 1 for Human-like Episodic Memory for Infinite Context LLMs
Figure 2 for Human-like Episodic Memory for Infinite Context LLMs
Figure 3 for Human-like Episodic Memory for Infinite Context LLMs
Figure 4 for Human-like Episodic Memory for Infinite Context LLMs
Viaarxiv icon

ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

Add code
Jun 28, 2024
Figure 1 for ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Figure 2 for ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Figure 3 for ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Figure 4 for ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning
Viaarxiv icon

Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications

Add code
Apr 13, 2024
Figure 1 for Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Figure 2 for Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Figure 3 for Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Figure 4 for Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Viaarxiv icon

ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization

Add code
Mar 04, 2024
Figure 1 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 2 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 3 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Figure 4 for ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization
Viaarxiv icon

Bayesian Reward Models for LLM Alignment

Add code
Feb 20, 2024
Viaarxiv icon