Picture for Zheming Yang

Zheming Yang

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

From Atoms to Chains: Divergence-Guided Reasoning Curriculum for Unlabeled LLM Domain Adaptation

Add code
Jan 27, 2026
Viaarxiv icon

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding

Add code
Jan 12, 2026
Viaarxiv icon

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving

Add code
Jan 08, 2026
Viaarxiv icon

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Add code
Jan 08, 2026
Viaarxiv icon

SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks

Add code
Jan 07, 2026
Viaarxiv icon

Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models

Add code
Apr 30, 2025
Viaarxiv icon

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Add code
Mar 15, 2024
Figure 1 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 2 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 3 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Figure 4 for DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
Viaarxiv icon