Zhiqi Li

Cosmos 3: Omnimodal World Models for Physical AI

Jun 01, 2026
Via arXiv

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

May 27, 2026
Via arXiv

T-GINEE: A Tensor-Based Multilayer Graph Representation Learning

May 27, 2026
Via arXiv

Hermite-NGP: Gradient-Augmented Hash Encoding for Learning PDEs

May 23, 2026
Via arXiv

A Few-Step Generative Model on Cumulative Flow Maps

May 05, 2026
Via arXiv

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Apr 27, 2026
Via arXiv

Any 3D Scene is Worth 1K Tokens: 3D-Grounded Representation for Scene Generation at Scale

Apr 13, 2026
Via arXiv

Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline

Mar 05, 2026
Via arXiv

A Long-Short Flow-Map Perspective for Drifting Models

Feb 24, 2026
Via arXiv

Trajectory Consistency for One-Step Generation on Euler Mean Flows

Jan 31, 2026
Via arXiv