Picture for Rio Yokota

Rio Yokota

Takeuchi's Information Criteria as Generalization Measures for DNNs Close to NTK Regime

Add code
Feb 26, 2026
Viaarxiv icon

Evolutionary Context Search for Automated Skill Acquisition

Add code
Feb 18, 2026
Viaarxiv icon

On the Optimal Reasoning Length for RL-Trained Language Models

Add code
Feb 11, 2026
Viaarxiv icon

FedPM: Federated Learning Using Second-order Optimization with Preconditioned Mixing of Local Parameters

Add code
Nov 12, 2025
Viaarxiv icon

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Add code
Aug 26, 2025
Viaarxiv icon

Improving LoRA with Variational Learning

Add code
Jun 17, 2025
Viaarxiv icon

Variational Learning Finds Flatter Solutions at the Edge of Stability

Add code
Jun 15, 2025
Viaarxiv icon

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

Add code
May 05, 2025
Viaarxiv icon

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

Add code
Mar 31, 2025
Viaarxiv icon

On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process

Add code
Mar 04, 2025
Viaarxiv icon