Picture for Xiao Hu

Xiao Hu

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

Add code
Feb 26, 2026
Viaarxiv icon

H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model

Add code
Feb 11, 2026
Viaarxiv icon

CausalGDP: Causality-Guided Diffusion Policies for Reinforcement Learning

Add code
Feb 09, 2026
Viaarxiv icon

Semantically Aware UAV Landing Site Assessment from Remote Sensing Imagery via Multimodal Large Language Models

Add code
Feb 01, 2026
Viaarxiv icon

Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective

Add code
Jan 30, 2026
Viaarxiv icon

SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model

Add code
Jan 28, 2026
Viaarxiv icon

Noninvasive Intracranial Pressure Estimation Using Subspace System Identification and Bespoke Machine Learning Algorithms: A Learning-to-Rank Approach

Add code
Jan 28, 2026
Viaarxiv icon

Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective

Add code
Jan 21, 2026
Viaarxiv icon

Kling-Omni Technical Report

Add code
Dec 18, 2025
Figure 1 for Kling-Omni Technical Report
Figure 2 for Kling-Omni Technical Report
Figure 3 for Kling-Omni Technical Report
Figure 4 for Kling-Omni Technical Report
Viaarxiv icon

BioMedJImpact: A Comprehensive Dataset and LLM Pipeline for AI Engagement and Scientific Impact Analysis of Biomedical Journals

Add code
Nov 16, 2025
Viaarxiv icon