Picture for Haoran Zhang

Haoran Zhang

Massachusetts Institute of Technology

Towards Vision Mixture of Experts for Wildlife Monitoring on the Edge

Add code
Nov 12, 2024
Viaarxiv icon

From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $α$-NeuS

Add code
Nov 08, 2024
Viaarxiv icon

BendVLM: Test-Time Debiasing of Vision-Language Embeddings

Add code
Nov 07, 2024
Viaarxiv icon

Identifying Implicit Social Biases in Vision-Language Models

Add code
Nov 01, 2024
Viaarxiv icon

ING-VP: MLLMs cannot Play Easy Vision-based Games Yet

Add code
Oct 09, 2024
Figure 1 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 2 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 3 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Figure 4 for ING-VP: MLLMs cannot Play Easy Vision-based Games Yet
Viaarxiv icon

CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation

Add code
Sep 24, 2024
Viaarxiv icon

PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization

Add code
Sep 21, 2024
Viaarxiv icon

Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks

Add code
Sep 11, 2024
Figure 1 for Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
Figure 2 for Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
Figure 3 for Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
Figure 4 for Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks
Viaarxiv icon

Curriculum Prompting Foundation Models for Medical Image Segmentation

Add code
Sep 01, 2024
Viaarxiv icon

AutoPV: Automatically Design Your Photovoltaic Power Forecasting Model

Add code
Aug 01, 2024
Viaarxiv icon