Picture for Anil Kag

Anil Kag

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Add code
Dec 12, 2024
Viaarxiv icon

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation

Add code
Nov 07, 2024
Figure 1 for AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Figure 2 for AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Figure 3 for AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Figure 4 for AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Viaarxiv icon

Scalable Ranked Preference Optimization for Text-to-Image Generation

Add code
Oct 23, 2024
Figure 1 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 2 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 3 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Figure 4 for Scalable Ranked Preference Optimization for Text-to-Image Generation
Viaarxiv icon

Lightweight Predictive 3D Gaussian Splats

Add code
Jun 27, 2024
Figure 1 for Lightweight Predictive 3D Gaussian Splats
Figure 2 for Lightweight Predictive 3D Gaussian Splats
Figure 3 for Lightweight Predictive 3D Gaussian Splats
Figure 4 for Lightweight Predictive 3D Gaussian Splats
Viaarxiv icon

SF-V: Single Forward Video Generation Model

Add code
Jun 06, 2024
Viaarxiv icon

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model

Add code
Jun 06, 2024
Figure 1 for BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Figure 2 for BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Figure 3 for BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Figure 4 for BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Viaarxiv icon

TextCraftor: Your Text Encoder Can be Image Quality Controller

Add code
Mar 27, 2024
Figure 1 for TextCraftor: Your Text Encoder Can be Image Quality Controller
Figure 2 for TextCraftor: Your Text Encoder Can be Image Quality Controller
Figure 3 for TextCraftor: Your Text Encoder Can be Image Quality Controller
Figure 4 for TextCraftor: Your Text Encoder Can be Image Quality Controller
Viaarxiv icon

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Add code
Feb 22, 2024
Figure 1 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 2 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 3 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 4 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Viaarxiv icon

Online Selective Classification with Limited Feedback

Add code
Oct 27, 2021
Figure 1 for Online Selective Classification with Limited Feedback
Figure 2 for Online Selective Classification with Limited Feedback
Figure 3 for Online Selective Classification with Limited Feedback
Figure 4 for Online Selective Classification with Limited Feedback
Viaarxiv icon

Selective Classification via One-Sided Prediction

Add code
Nov 07, 2020
Figure 1 for Selective Classification via One-Sided Prediction
Figure 2 for Selective Classification via One-Sided Prediction
Figure 3 for Selective Classification via One-Sided Prediction
Figure 4 for Selective Classification via One-Sided Prediction
Viaarxiv icon