Picture for Ziyue Li

Ziyue Li

Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test

Add code
Jun 26, 2025
Viaarxiv icon

GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors

Add code
Jun 17, 2025
Viaarxiv icon

Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges

Add code
Jun 12, 2025
Viaarxiv icon

TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review

Add code
Jun 09, 2025
Viaarxiv icon

Unlocking the Power of SAM 2 for Few-Shot Segmentation

Add code
May 21, 2025
Viaarxiv icon

Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era

Add code
May 05, 2025
Viaarxiv icon

Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation

Add code
May 04, 2025
Viaarxiv icon

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Add code
Apr 14, 2025
Viaarxiv icon

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Add code
Apr 10, 2025
Viaarxiv icon

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Add code
Apr 10, 2025
Viaarxiv icon