Matthieu Geist

INRIA Lorraine - LORIA

Audio-to-Image Bird Species Retrieval without Audio-Image Pairs via Text Distillation

Jan 31, 2026
Via arXiv

Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval

Jan 30, 2026
Via arXiv

RL-AVIST: Reinforcement Learning for Autonomous Visual Inspection of Space Targets

Oct 26, 2025
Via arXiv

Learning Equilibria from Data: Provably Efficient Multi-Agent Imitation Learning

May 23, 2025
Via arXiv

NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation

May 20, 2025
Via arXiv

ShiQ: Bringing back Bellman to LLMs

May 16, 2025
Via arXiv

Command A: An Enterprise-Ready Large Language Model

Apr 01, 2025
Via arXiv

Understanding Likelihood Over-optimisation in Direct Alignment Algorithms

Oct 15, 2024
Via arXiv

Solving robust MDPs as a sequence of static RL problems

Oct 08, 2024
Via arXiv

Imitating Language via Scalable Inverse Reinforcement Learning

Sep 02, 2024
Via arXiv