Picture for Jan Ebert

Jan Ebert

Time Transfer: On Optimal Learning Rate and Batch Size In The Infinite Data Limit

Add code
Oct 08, 2024
Viaarxiv icon

Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?

Add code
Feb 21, 2024
Viaarxiv icon

Tokenizer Choice For LLM Training: Negligible or Crucial?

Add code
Oct 18, 2023
Figure 1 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 2 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 3 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Figure 4 for Tokenizer Choice For LLM Training: Negligible or Crucial?
Viaarxiv icon

Physics informed Neural Networks applied to the description of wave-particle resonance in kinetic simulations of fusion plasmas

Add code
Aug 23, 2023
Viaarxiv icon

StarCoder: may the source be with you!

Add code
May 09, 2023
Viaarxiv icon

Hearts Gym: Learning Reinforcement Learning as a Team Event

Add code
Sep 07, 2022
Viaarxiv icon