Zhikai Lei

Zhejiang Provincial Hospital of Chinese Medicine, 54 Youdian Road, Hangzhou, 310003, China

Steering LLMs via Scalable Interactive Oversight

Feb 04, 2026

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Jan 16, 2026

Building Self-Evolving Agents via Experience-Driven Lifelong Learning: A Framework and Benchmark

Aug 26, 2025

Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences

Mar 17, 2025

GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?

Dec 13, 2024

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues

Nov 11, 2024

Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check

Mar 27, 2024

InternLM2 Technical Report

Mar 26, 2024

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Mar 12, 2024

AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style Assistant

Feb 04, 2024