Picture for Sheng Zhang

Sheng Zhang

University of Southern California

From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond

Add code
Nov 06, 2024
Viaarxiv icon

MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction

Add code
Nov 02, 2024
Viaarxiv icon

MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging

Add code
Oct 09, 2024
Figure 1 for MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Figure 2 for MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Figure 3 for MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Figure 4 for MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging
Viaarxiv icon

Machine learning approach for vibronically renormalized electronic band structures

Add code
Sep 03, 2024
Viaarxiv icon

Probing Perfection: The Relentless Art of Meddling for Pulmonary Airway Segmentation from HRCT via a Human-AI Collaboration Based Active Learning Method

Add code
Jul 03, 2024
Viaarxiv icon

From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning

Add code
Jul 01, 2024
Viaarxiv icon

Fuzzy Attention-based Border Rendering Network for Lung Organ Segmentation

Add code
Jun 23, 2024
Viaarxiv icon

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Add code
Jun 17, 2024
Figure 1 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 2 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 3 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 4 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

GLINT-RU: Gated Lightweight Intelligent Recurrent Units for Sequential Recommender Systems

Add code
Jun 06, 2024
Viaarxiv icon