Picture for Jan Kautz

Jan Kautz

NVIDIA

Scaling Vision Pre-Training to 4K Resolution

Add code
Mar 25, 2025
Viaarxiv icon

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Add code
Mar 18, 2025
Viaarxiv icon

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Viaarxiv icon

FeatSharp: Your Vision Model Features, Sharper

Add code
Feb 22, 2025
Viaarxiv icon

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Add code
Feb 07, 2025
Viaarxiv icon

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

Add code
Feb 06, 2025
Viaarxiv icon

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Add code
Feb 04, 2025
Viaarxiv icon

Parallel Sequence Modeling via Generalized Spatial Propagation Network

Add code
Jan 21, 2025
Figure 1 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 2 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 3 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Figure 4 for Parallel Sequence Modeling via Generalized Spatial Propagation Network
Viaarxiv icon

SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

Add code
Dec 12, 2024
Viaarxiv icon

StreamChat: Chatting with Streaming Video

Add code
Dec 11, 2024
Viaarxiv icon