Jan Kautz

NVIDIA

FeatSharp: Your Vision Model Features, Sharper

Feb 22, 2025

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Feb 07, 2025

Factorized Implicit Global Convolution for Automotive Computational Fluid Dynamics Prediction

Feb 06, 2025

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Feb 04, 2025

Parallel Sequence Modeling via Generalized Spatial Propagation Network

Jan 21, 2025

SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing

Dec 12, 2024

StreamChat: Chatting with Streaming Video

Dec 11, 2024

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

Dec 10, 2024

Gated Delta Networks: Improving Mamba2 with Delta Rule

Dec 09, 2024

NVILA: Efficient Frontier Visual Language Models

Dec 05, 2024