Picture for Shu Yang

Shu Yang

Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs

Add code
Jun 24, 2025
Viaarxiv icon

The Compositional Architecture of Regret in Large Language Models

Add code
Jun 18, 2025
Viaarxiv icon

Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images

Add code
Jun 08, 2025
Viaarxiv icon

Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs

Add code
Jun 08, 2025
Viaarxiv icon

Stable Vision Concept Transformers for Medical Diagnosis

Add code
Jun 05, 2025
Viaarxiv icon

Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

Add code
May 23, 2025
Viaarxiv icon

A Survey on the Safety and Security Threats of Computer-Using Agents: JARVIS or Ultron?

Add code
May 16, 2025
Viaarxiv icon

Doubly Robust Fusion of Many Treatments for Policy Learning

Add code
May 12, 2025
Viaarxiv icon

Investigating LLMs in Clinical Triage: Promising Capabilities, Persistent Intersectional Biases

Add code
Apr 22, 2025
Viaarxiv icon

Understanding the Repeat Curse in Large Language Models from a Feature Perspective

Add code
Apr 19, 2025
Viaarxiv icon