Jingang Wang

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Sep 03, 2025

Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit

Aug 25, 2025

Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs

May 23, 2025

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

May 23, 2025

Two Minds Better Than One: Collaborative Reward Modeling for LLM Alignment

May 19, 2025

Dynamic Fisher-weighted Model Merging via Bayesian Optimization

Apr 26, 2025

NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables

Apr 09, 2025

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Apr 02, 2025

SampleMix: A Sample-wise Pre-training Data Mixing Strategy by Coordinating Data Quality and Diversity

Mar 03, 2025