Picture for Zhanyu Ma

Zhanyu Ma

Detailed Object Description with Controllable Dimensions

Add code
Nov 28, 2024
Viaarxiv icon

Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing

Add code
Oct 22, 2024
Figure 1 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 2 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 3 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Figure 4 for Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing
Viaarxiv icon

I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow

Add code
Oct 10, 2024
Figure 1 for I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow
Figure 2 for I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow
Figure 3 for I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow
Figure 4 for I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow
Viaarxiv icon

Evaluating Attribute Comprehension in Large Vision-Language Models

Add code
Aug 25, 2024
Figure 1 for Evaluating Attribute Comprehension in Large Vision-Language Models
Figure 2 for Evaluating Attribute Comprehension in Large Vision-Language Models
Figure 3 for Evaluating Attribute Comprehension in Large Vision-Language Models
Figure 4 for Evaluating Attribute Comprehension in Large Vision-Language Models
Viaarxiv icon

Efficient Face Super-Resolution via Wavelet-based Feature Enhancement Network

Add code
Jul 29, 2024
Viaarxiv icon

M4Fog: A Global Multi-Regional, Multi-Modal, and Multi-Stage Dataset for Marine Fog Detection and Forecasting to Bridge Ocean and Atmosphere

Add code
Jun 19, 2024
Viaarxiv icon

NeRSP: Neural 3D Reconstruction for Reflective Objects with Sparse Polarized Images

Add code
Jun 11, 2024
Viaarxiv icon

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Add code
Jun 10, 2024
Viaarxiv icon

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

Add code
Mar 10, 2024
Viaarxiv icon

Vision-language Assisted Attribute Learning

Add code
Dec 15, 2023
Viaarxiv icon