Picture for Jiaqi Wang

Jiaqi Wang

Michael Pokorny

CapRL++: Unified Reinforcement Learning with Verifiable Rewards for Dense Image and Video Captioning

Add code
Jun 08, 2026
Viaarxiv icon

Harnessing Streaming Video in the Wild

Add code
Jun 07, 2026
Viaarxiv icon

OmniCap-IF: Benchmarking and Improving Instruction Following Abilities for Omni-Video Captioning

Add code
Jun 07, 2026
Viaarxiv icon

Light-WAM: Efficient World Action Models with State-Fusion Action Decoding

Add code
Jun 06, 2026
Viaarxiv icon

AdaGRPO: A Capability-Aware Adaptive Enhancement for Flow-based GRPO

Add code
Jun 05, 2026
Viaarxiv icon

Right Makes Might: Aligning Verified Hidden States Empowers RL Reasoning

Add code
Jun 02, 2026
Viaarxiv icon

AdaCodec: A Predictive Visual Code for Video MLLMs

Add code
Jun 01, 2026
Viaarxiv icon

Brain-Atlas-Guided Generative Counterfactual Attention for Explainable Cognitive Decline Diagnosis Using Multimodal Connectomes

Add code
May 31, 2026
Viaarxiv icon

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

Add code
May 28, 2026
Viaarxiv icon

Channel-wise Vector Quantization

Add code
May 25, 2026
Viaarxiv icon