Hao Li

Jack

Omni-Video: Democratizing Unified Video Understanding and Generation

Jul 09, 2025
Via arXiv

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Jul 03, 2025
Via arXiv

TypeTele: Releasing Dexterity in Teleoperation by Dexterous Manipulation Types

Jul 02, 2025
Via arXiv

LogitSpec: Accelerating Retrieval-based Speculative Decoding via Next Next Token Speculation

Jul 02, 2025
Via arXiv

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Jun 24, 2025
Via arXiv

DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

Jun 13, 2025
Via arXiv

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Jun 12, 2025
Via arXiv

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Jun 11, 2025
Via arXiv

FuXi-Air: Urban Air Quality Forecasting Based on Emission-Meteorology-Pollutant multimodal Machine Learning

Jun 09, 2025
Via arXiv

MIRA: Medical Time Series Foundation Model for Real-World Health Data

Jun 09, 2025
Via arXiv