Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

DICE: Disentangling Artist Style from Content via Contrastive Subspace Decomposition in Diffusion Models

Add code
Feb 08, 2026
Viaarxiv icon

Humanoid Manipulation Interface: Humanoid Whole-Body Manipulation from Robot-Free Demonstrations

Add code
Feb 06, 2026
Viaarxiv icon

GT-SVJ: Generative-Transformer-Based Self-Supervised Video Judge For Efficient Video Reward Modeling

Add code
Feb 05, 2026
Viaarxiv icon

Mitigating Hallucinations in Video Large Language Models via Spatiotemporal-Semantic Contrastive Decoding

Add code
Jan 30, 2026
Viaarxiv icon

Farewell to Item IDs: Unlocking the Scaling Potential of Large Ranking Models via Semantic Tokens

Add code
Jan 30, 2026
Viaarxiv icon

PhysProver: Advancing Automatic Theorem Proving for Physics

Add code
Jan 22, 2026
Viaarxiv icon

A Training-Free Guess What Vision Language Model from Snippets to Open-Vocabulary Object Detection

Add code
Jan 21, 2026
Viaarxiv icon

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Add code
Jan 15, 2026
Viaarxiv icon

Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification

Add code
Jan 12, 2026
Viaarxiv icon

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Add code
Jan 07, 2026
Viaarxiv icon