Picture for Jie Tang

Jie Tang

Tony

Data-Efficient RLVR via Off-Policy Influence Guidance

Add code
Oct 30, 2025
Viaarxiv icon

Tibetan Language and AI: A Comprehensive Survey of Resources, Methods and Challenges

Add code
Oct 22, 2025
Viaarxiv icon

Pure-Pass: Fine-Grained, Adaptive Masking for Dynamic Token-Mixing Routing in Lightweight Image Super-Resolution

Add code
Oct 02, 2025
Viaarxiv icon

TDRM: Smooth Reward Models with Temporal Difference for LLM RL and Inference

Add code
Sep 18, 2025
Viaarxiv icon

ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding

Add code
Aug 27, 2025
Viaarxiv icon

ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

Add code
Aug 25, 2025
Viaarxiv icon

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Add code
Aug 19, 2025
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Figure 1 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 2 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 3 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 4 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Viaarxiv icon

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Add code
Jun 13, 2025
Viaarxiv icon