Derek Hoiem

Can We Generate Visual Programs Without Prompting LLMs?

Dec 11, 2024
Via arXiv

RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations

Dec 02, 2024
Via arXiv

Anytime Continual Learning for Open Vocabulary Classification

Sep 13, 2024
Via arXiv

MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

Apr 12, 2024
Via arXiv

Region-Based Representations Revisited

Feb 04, 2024
Via arXiv

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Dec 28, 2023
Via arXiv

ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation

Nov 22, 2023
Via arXiv

WebWISE: Web Interface Control and Sequential Exploration with Large Language Models

Oct 25, 2023
Via arXiv

Continual Learning in Open-vocabulary Classification with Complementary Memory Systems

Jul 04, 2023
Via arXiv

Consistent Multimodal Generation via A Unified GAN Framework

Jul 04, 2023
Via arXiv