Picture for Yi-Kai Zhang

Yi-Kai Zhang

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Add code
Feb 03, 2026
Viaarxiv icon

$V_0$: A Generalist Value Model for Any Policy at State Zero

Add code
Feb 03, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

Model Assembly Learning with Heterogeneous Layer Weight Merging

Add code
Mar 27, 2025
Viaarxiv icon

Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing

Add code
Feb 24, 2025
Viaarxiv icon

OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions

Add code
Dec 09, 2024
Viaarxiv icon

Wings: Learning Multimodal LLMs without Text-only Forgetting

Add code
Jun 05, 2024
Figure 1 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 2 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 3 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Figure 4 for Wings: Learning Multimodal LLMs without Text-only Forgetting
Viaarxiv icon

Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration

Add code
Dec 08, 2023
Figure 1 for Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
Figure 2 for Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
Figure 3 for Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
Figure 4 for Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
Viaarxiv icon

ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse

Add code
Aug 17, 2023
Figure 1 for ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Figure 2 for ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Viaarxiv icon

Model Spider: Learning to Rank Pre-Trained Models Efficiently

Add code
Jun 06, 2023
Figure 1 for Model Spider: Learning to Rank Pre-Trained Models Efficiently
Figure 2 for Model Spider: Learning to Rank Pre-Trained Models Efficiently
Figure 3 for Model Spider: Learning to Rank Pre-Trained Models Efficiently
Figure 4 for Model Spider: Learning to Rank Pre-Trained Models Efficiently
Viaarxiv icon