Picture for Ruixiang Tang

Ruixiang Tang

When Reward Hacking Rebounds: Understanding and Mitigating It with Representation-Level Signals

Add code
Apr 01, 2026
Viaarxiv icon

Reinforcing Consistency in Video MLLMs with Structured Rewards

Add code
Apr 01, 2026
Viaarxiv icon

Q-Bridge: Code Translation for Quantum Machine Learning via LLMs

Add code
Mar 29, 2026
Viaarxiv icon

Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Shifting Uncertainty to Critical Moments: Towards Reliable Uncertainty Quantification for VLA Model

Add code
Mar 18, 2026
Viaarxiv icon

Improving Visual Reasoning with Iterative Evidence Refinement

Add code
Mar 14, 2026
Viaarxiv icon

Data Augmentation for High-Fidelity Generation of CAR-T/NK Immunological Synapse Images

Add code
Feb 03, 2026
Viaarxiv icon

TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching

Add code
Jan 27, 2026
Viaarxiv icon

Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety

Add code
Jan 12, 2026
Viaarxiv icon

Read the Scene, Not the Script: Outcome-Aware Safety for LLMs

Add code
Oct 05, 2025
Viaarxiv icon