Picture for Wenhao Zheng

Wenhao Zheng

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Add code
Oct 14, 2024
Figure 1 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 2 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 3 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Figure 4 for MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
Viaarxiv icon

VHELM: A Holistic Evaluation of Vision Language Models

Add code
Oct 09, 2024
Figure 1 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 2 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 3 for VHELM: A Holistic Evaluation of Vision Language Models
Figure 4 for VHELM: A Holistic Evaluation of Vision Language Models
Viaarxiv icon

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Add code
Jun 10, 2024
Figure 1 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 2 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 3 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Figure 4 for CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
Viaarxiv icon

Multimodal Clinical Trial Outcome Prediction with Large Language Models

Add code
Feb 18, 2024
Viaarxiv icon

STAN: Stage-Adaptive Network for Multi-Task Recommendation by Learning User Lifecycle-Based Representation

Add code
Jun 21, 2023
Viaarxiv icon

Knowledge Soft Integration for Multimodal Recommendation

Add code
May 12, 2023
Figure 1 for Knowledge Soft Integration for Multimodal Recommendation
Figure 2 for Knowledge Soft Integration for Multimodal Recommendation
Figure 3 for Knowledge Soft Integration for Multimodal Recommendation
Figure 4 for Knowledge Soft Integration for Multimodal Recommendation
Viaarxiv icon

Robust Image Ordinal Regression with Controllable Image Generation

Add code
May 10, 2023
Viaarxiv icon

Click-aware Structure Transfer with Sample Weight Assignment for Post-Click Conversion Rate Estimation

Add code
Apr 03, 2023
Viaarxiv icon

CTT-Net: A Multi-view Cross-token Transformer for Cataract Postoperative Visual Acuity Prediction

Add code
Dec 12, 2022
Viaarxiv icon

PNM: Pixel Null Model for General Image Segmentation

Add code
Mar 13, 2022
Figure 1 for PNM: Pixel Null Model for General Image Segmentation
Figure 2 for PNM: Pixel Null Model for General Image Segmentation
Figure 3 for PNM: Pixel Null Model for General Image Segmentation
Figure 4 for PNM: Pixel Null Model for General Image Segmentation
Viaarxiv icon