Takuya Akiba

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Feb 26, 2025
Via arXiv

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Jan 29, 2025
Via arXiv

Agent Skill Acquisition for Large Language Models via CycleQD

Oct 16, 2024
Via arXiv

Evolutionary Optimization of Model Merging Recipes

Mar 19, 2024
Via arXiv

Team PFDet's Methods for Open Images Challenge 2019

Oct 25, 2019
Via arXiv

Chainer: A Deep Learning Framework for Accelerating the Research Cycle

Aug 01, 2019
Via arXiv

Optuna: A Next-generation Hyperparameter Optimization Framework

Jul 25, 2019
Via arXiv

A Graph Theoretic Framework of Recomputation Algorithms for Memory-Efficient Backpropagation

May 28, 2019
Via arXiv

Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects

Nov 27, 2018
Via arXiv

PFDet: 2nd Place Solution to Open Images Challenge 2018 Object Detection Track

Sep 04, 2018
Via arXiv