Jingkuan Song

Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation

Oct 02, 2025
Via arXiv

More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration

Oct 02, 2025
Via arXiv

Learning Generalizable and Efficient Image Watermarking via Hierarchical Two-Stage Optimization

Aug 12, 2025
Via arXiv

Dynamic Pattern Alignment Learning for Pretraining Lightweight Human-Centric Vision Models

Aug 10, 2025
Via arXiv

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

Aug 08, 2025
Via arXiv

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism

Jul 02, 2025
Via arXiv

OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction

May 26, 2025
Via arXiv

Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach

May 22, 2025
Via arXiv

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

May 20, 2025
Via arXiv

Policy Contrastive Decoding for Robotic Foundation Models

May 19, 2025
Via arXiv