Picture for Naoaki Okazaki

Naoaki Okazaki

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Add code
Feb 09, 2026
Viaarxiv icon

Diffusion-State Policy Optimization for Masked Diffusion Language Models

Add code
Feb 09, 2026
Viaarxiv icon

Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

Add code
Feb 06, 2026
Viaarxiv icon

From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models

Add code
Jan 16, 2026
Viaarxiv icon

Bit-level BPE: Below the byte boundary

Add code
Jun 09, 2025
Viaarxiv icon

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

Add code
May 05, 2025
Viaarxiv icon

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

Add code
Mar 31, 2025
Viaarxiv icon

Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Add code
Mar 08, 2025
Viaarxiv icon

LCTG Bench: LLM Controlled Text Generation Benchmark

Add code
Jan 27, 2025
Figure 1 for LCTG Bench: LLM Controlled Text Generation Benchmark
Figure 2 for LCTG Bench: LLM Controlled Text Generation Benchmark
Figure 3 for LCTG Bench: LLM Controlled Text Generation Benchmark
Figure 4 for LCTG Bench: LLM Controlled Text Generation Benchmark
Viaarxiv icon

HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model

Add code
Dec 19, 2024
Figure 1 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 2 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 3 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Figure 4 for HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model
Viaarxiv icon