Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Youssef Attia El Hili

LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection

Oct 30, 2025

Youssef Attia El Hili, Albert Thomas, Malik Tiomoko, Abdelhakim Benechehab, Corentin Léger, Corinne Ancourt, Balázs Kégl

Figure 1 for LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection

Figure 2 for LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection

Figure 3 for LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection

Figure 4 for LLMs as In-Context Meta-Learners for Model and Hyperparameter Selection

Abstract:Model and hyperparameter selection are critical but challenging in machine learning, typically requiring expert intuition or expensive automated search. We investigate whether large language models (LLMs) can act as in-context meta-learners for this task. By converting each dataset into interpretable metadata, we prompt an LLM to recommend both model families and hyperparameters. We study two prompting strategies: (1) a zero-shot mode relying solely on pretrained knowledge, and (2) a meta-informed mode augmented with examples of models and their performance on past tasks. Across synthetic and real-world benchmarks, we show that LLMs can exploit dataset metadata to recommend competitive models and hyperparameters without search, and that improvements from meta-informed prompting demonstrate their capacity for in-context meta-learning. These results highlight a promising new role for LLMs as lightweight, general-purpose assistants for model selection and hyperparameter optimization.

* 27 pages, 6 figures

Via

Access Paper or Ask Questions

Zero-shot Model-based Reinforcement Learning using Large Language Models

Oct 15, 2024

Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat, Oussama Zekri, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Ievgen Redko, Balázs Kégl

Figure 1 for Zero-shot Model-based Reinforcement Learning using Large Language Models

Figure 2 for Zero-shot Model-based Reinforcement Learning using Large Language Models

Figure 3 for Zero-shot Model-based Reinforcement Learning using Large Language Models

Figure 4 for Zero-shot Model-based Reinforcement Learning using Large Language Models

Abstract:The emerging zero-shot capabilities of Large Language Models (LLMs) have led to their applications in areas extending well beyond natural language processing tasks. In reinforcement learning, while LLMs have been extensively used in text-based environments, their integration with continuous state spaces remains understudied. In this paper, we investigate how pre-trained LLMs can be leveraged to predict in context the dynamics of continuous Markov decision processes. We identify handling multivariate data and incorporating the control signal as key challenges that limit the potential of LLMs' deployment in this setup and propose Disentangled In-Context Learning (DICL) to address them. We present proof-of-concept applications in two reinforcement learning settings: model-based policy evaluation and data-augmented off-policy reinforcement learning, supported by theoretical analysis of the proposed methods. Our experiments further demonstrate that our approach produces well-calibrated uncertainty estimates. We release the code at https://github.com/abenechehab/dicl.

Via

Access Paper or Ask Questions