Ethan Shen

Perception Tokens Enhance Visual Reasoning in Multimodal Language Models

Dec 04, 2024

Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass

May 29, 2024

Are "Hierarchical" Visual Representations Hierarchical?

Nov 23, 2023

Generative Visual Question Answering

Jul 18, 2023

Model-Agnostic Graph Regularization for Few-Shot Learning

Feb 14, 2021