Picture for Kai Yang

Kai Yang

Sherman

Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts

Add code
Jan 15, 2026
Viaarxiv icon

KDCM: Reducing Hallucination in LLMs through Explicit Reasoning Structures

Add code
Jan 07, 2026
Viaarxiv icon

Mitigating Prompt-Induced Hallucinations in Large Language Models via Structured Reasoning

Add code
Jan 06, 2026
Viaarxiv icon

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Add code
Nov 19, 2025
Viaarxiv icon

RoS-Guard: Robust and Scalable Online Change Detection with Delay-Optimal Guarantees

Add code
Nov 17, 2025
Viaarxiv icon

Contributions to Robust and Efficient Methods for Analysis of High Dimensional Data

Add code
Sep 09, 2025
Viaarxiv icon

Graph Federated Learning for Personalized Privacy Recommendation

Add code
Aug 08, 2025
Viaarxiv icon

Stable Preference Optimization for LLMs: A Bilevel Approach Beyond Direct Preference Optimization

Add code
Jul 10, 2025
Viaarxiv icon