Kai Yan

School of Land Science and Techniques, China University of Geosciences

Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance

May 14, 2026

Autonomous Laparoscope Control through Unified Mechanics-Based Representation of Multimodal Intraoperative Information

May 06, 2026

SAP: Segment Any 4K Panorama

Mar 13, 2026

Latent Wasserstein Adversarial Imitation Learning

Mar 05, 2026

Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR

Jan 08, 2026

Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?

Apr 01, 2025

CodeARC: Benchmarking Reasoning Capabilities of LLM Agents for Inductive Program Synthesis

Mar 29, 2025

LongReason: A Synthetic Long-Context Reasoning Benchmark via Context Expansion

Jan 25, 2025

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

Oct 31, 2024

Natural Adversarial Patch Generation Method Based on Latent Diffusion Model

Dec 27, 2023