Picture for Stefano Soatto

Stefano Soatto

UCLA-CS

Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models

Add code
Dec 17, 2024
Figure 1 for Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
Figure 2 for Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
Figure 3 for Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
Figure 4 for Expansion Span: Combining Fading Memory and Retrieval in Hybrid State Space Models
Viaarxiv icon

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Add code
Dec 16, 2024
Figure 1 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 2 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 3 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 4 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Viaarxiv icon

The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation

Add code
Nov 06, 2024
Figure 1 for The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation
Figure 2 for The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation
Figure 3 for The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation
Figure 4 for The N-Grammys: Accelerating Autoregressive Inference with Learning-Free Batched Speculation
Viaarxiv icon

Conjuring Semantic Similarity

Add code
Oct 21, 2024
Viaarxiv icon

DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models

Add code
Oct 04, 2024
Figure 1 for DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Figure 2 for DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Figure 3 for DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Figure 4 for DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models
Viaarxiv icon

RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions

Add code
Oct 03, 2024
Figure 1 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 2 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 3 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Figure 4 for RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Viaarxiv icon

NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality

Add code
Aug 18, 2024
Figure 1 for NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Figure 2 for NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Figure 3 for NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Figure 4 for NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality
Viaarxiv icon

Compositional Structures in Neural Embedding and Interaction Decompositions

Add code
Jul 12, 2024
Viaarxiv icon

B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory

Add code
Jul 08, 2024
Viaarxiv icon

Diffusion Soup: Model Merging for Text-to-Image Diffusion Models

Add code
Jun 12, 2024
Figure 1 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 2 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 3 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Figure 4 for Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
Viaarxiv icon