Picture for Weihao Yu

Weihao Yu

Attention Prompting on Image for Large Vision-Language Models

Add code
Sep 25, 2024
Viaarxiv icon

LinFusion: 1 GPU, 1 Minute, 16K Image

Add code
Sep 03, 2024
Viaarxiv icon

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Add code
Aug 01, 2024
Figure 1 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Viaarxiv icon

KAN or MLP: A Fairer Comparison

Add code
Jul 23, 2024
Figure 1 for KAN or MLP: A Fairer Comparison
Figure 2 for KAN or MLP: A Fairer Comparison
Figure 3 for KAN or MLP: A Fairer Comparison
Figure 4 for KAN or MLP: A Fairer Comparison
Viaarxiv icon

GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation

Add code
Jul 08, 2024
Viaarxiv icon

EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

Add code
Jul 01, 2024
Viaarxiv icon

MambaOut: Do We Really Need Mamba for Vision?

Add code
May 14, 2024
Viaarxiv icon

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities

Add code
Aug 04, 2023
Viaarxiv icon

Two-stage Denoising Diffusion Model for Source Localization in Graph Inverse Problems

Add code
Apr 18, 2023
Viaarxiv icon

InceptionNeXt: When Inception Meets ConvNeXt

Add code
Mar 29, 2023
Viaarxiv icon