Picture for Yushun Zhang

Yushun Zhang

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

ErrEval: Error-Aware Evaluation for Question Generation through Explicit Diagnostics

Add code
Jan 15, 2026
Viaarxiv icon

GeoLaux: A Benchmark for Evaluating MLLMs' Geometry Performance on Long-Step Problems Requiring Auxiliary Lines

Add code
Aug 08, 2025
Viaarxiv icon

$XX^{t}$ Can Be Faster

Add code
May 14, 2025
Figure 1 for $XX^{t}$ Can Be Faster
Figure 2 for $XX^{t}$ Can Be Faster
Figure 3 for $XX^{t}$ Can Be Faster
Figure 4 for $XX^{t}$ Can Be Faster
Viaarxiv icon

Towards Quantifying the Hessian Structure of Neural Networks

Add code
May 05, 2025
Viaarxiv icon

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

Add code
Jul 31, 2024
Viaarxiv icon

Adam-mini: Use Fewer Learning Rates To Gain More

Add code
Jun 26, 2024
Viaarxiv icon

Why Transformers Need Adam: A Hessian Perspective

Add code
Feb 26, 2024
Figure 1 for Why Transformers Need Adam: A Hessian Perspective
Figure 2 for Why Transformers Need Adam: A Hessian Perspective
Figure 3 for Why Transformers Need Adam: A Hessian Perspective
Figure 4 for Why Transformers Need Adam: A Hessian Perspective
Viaarxiv icon

Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks

Add code
Nov 28, 2023
Viaarxiv icon

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Add code
Oct 17, 2023
Figure 1 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 2 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 3 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Figure 4 for ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
Viaarxiv icon