Picture for Pavlo Molchanov

Pavlo Molchanov

Scaling Vision Pre-Training to 4K Resolution

Add code
Mar 25, 2025
Viaarxiv icon

TwinTURBO: Semi-Supervised Fine-Tuning of Foundation Models via Mutual Information Decompositions for Downstream Task and Latent Spaces

Add code
Mar 10, 2025
Viaarxiv icon

FeatSharp: Your Vision Model Features, Sharper

Add code
Feb 22, 2025
Viaarxiv icon

Advancing Weight and Channel Sparsification with Enhanced Saliency

Add code
Feb 05, 2025
Figure 1 for Advancing Weight and Channel Sparsification with Enhanced Saliency
Figure 2 for Advancing Weight and Channel Sparsification with Enhanced Saliency
Figure 3 for Advancing Weight and Channel Sparsification with Enhanced Saliency
Figure 4 for Advancing Weight and Channel Sparsification with Enhanced Saliency
Viaarxiv icon

Entropy-Regularized Process Reward Model

Add code
Dec 15, 2024
Viaarxiv icon

RADIO Amplified: Improved Baselines for Agglomerative Vision Foundation Models

Add code
Dec 10, 2024
Viaarxiv icon

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

Puzzle: Distillation-Based NAS for Inference-Optimized LLMs

Add code
Dec 03, 2024
Figure 1 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 2 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 3 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Figure 4 for Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Viaarxiv icon

Hymba: A Hybrid-head Architecture for Small Language Models

Add code
Nov 20, 2024
Figure 1 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 2 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 3 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 4 for Hymba: A Hybrid-head Architecture for Small Language Models
Viaarxiv icon

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Add code
Nov 19, 2024
Figure 1 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 2 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 3 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Figure 4 for VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
Viaarxiv icon