Picture for Chuhan Zhang

Chuhan Zhang

From Image to Video: An Empirical Study of Diffusion Representations

Add code
Feb 10, 2025
Viaarxiv icon

Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation

Add code
Feb 06, 2025
Viaarxiv icon

ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs

Add code
Feb 04, 2025
Viaarxiv icon

SpikingSoft: A Spiking Neuron Controller for Bio-inspired Locomotion with Soft Snake Robots

Add code
Jan 31, 2025
Viaarxiv icon

Scaling 4D Representations

Add code
Dec 19, 2024
Figure 1 for Scaling 4D Representations
Figure 2 for Scaling 4D Representations
Figure 3 for Scaling 4D Representations
Figure 4 for Scaling 4D Representations
Viaarxiv icon

TRecViT: A Recurrent Video Transformer

Add code
Dec 18, 2024
Viaarxiv icon

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

Add code
Apr 25, 2024
Viaarxiv icon

NiSNN-A: Non-iterative Spiking Neural Networks with Attention with Application to Motor Imagery EEG Classification

Add code
Dec 09, 2023
Viaarxiv icon

Helping Hands: An Object-Aware Ego-Centric Video Recognition Model

Add code
Aug 15, 2023
Viaarxiv icon

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

Add code
May 03, 2023
Figure 1 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 2 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 3 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 4 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Viaarxiv icon