Picture for Takuya Akiba

Takuya Akiba

UnMaskFork: Test-Time Scaling for Masked Diffusion via Deterministic Action Branching

Add code
Feb 04, 2026
Viaarxiv icon

Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings

Add code
Dec 13, 2025
Figure 1 for Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings
Figure 2 for Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings
Figure 3 for Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings
Figure 4 for Extending the Context of Pretrained LLMs by Dropping Their Positional Embeddings
Viaarxiv icon

DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion

Add code
Jun 17, 2025
Viaarxiv icon

ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering

Add code
Jun 10, 2025
Figure 1 for ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering
Figure 2 for ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering
Figure 3 for ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering
Figure 4 for ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering
Viaarxiv icon

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Add code
Mar 06, 2025
Figure 1 for Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
Figure 2 for Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
Figure 3 for Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
Figure 4 for Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
Viaarxiv icon

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Add code
Feb 26, 2025
Viaarxiv icon

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Add code
Jan 29, 2025
Figure 1 for TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
Figure 2 for TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
Figure 3 for TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
Figure 4 for TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models
Viaarxiv icon

Agent Skill Acquisition for Large Language Models via CycleQD

Add code
Oct 16, 2024
Figure 1 for Agent Skill Acquisition for Large Language Models via CycleQD
Figure 2 for Agent Skill Acquisition for Large Language Models via CycleQD
Figure 3 for Agent Skill Acquisition for Large Language Models via CycleQD
Figure 4 for Agent Skill Acquisition for Large Language Models via CycleQD
Viaarxiv icon

Evolutionary Optimization of Model Merging Recipes

Add code
Mar 19, 2024
Figure 1 for Evolutionary Optimization of Model Merging Recipes
Figure 2 for Evolutionary Optimization of Model Merging Recipes
Figure 3 for Evolutionary Optimization of Model Merging Recipes
Figure 4 for Evolutionary Optimization of Model Merging Recipes
Viaarxiv icon

Team PFDet's Methods for Open Images Challenge 2019

Add code
Oct 25, 2019
Figure 1 for Team PFDet's Methods for Open Images Challenge 2019
Figure 2 for Team PFDet's Methods for Open Images Challenge 2019
Figure 3 for Team PFDet's Methods for Open Images Challenge 2019
Figure 4 for Team PFDet's Methods for Open Images Challenge 2019
Viaarxiv icon