Matthew Gwilliam

Towards Understanding Best Practices for Quantization of Vision-Language Models

Jan 21, 2026

Implicit Neural Representation Facilitates Unified Universal Vision Encoding

Jan 20, 2026

AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models

Dec 09, 2025

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Jul 09, 2025

Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model

Jun 18, 2025

Utilization of Neighbor Information for Image Classification with Different Levels of Supervision

Mar 18, 2025

NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields

Nov 04, 2024

Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

Aug 05, 2024

Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions

Jan 18, 2024

Do text-free diffusion models learn discriminative visual representations?

Nov 30, 2023