Wenbo Zhang

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Oct 02, 2025
Via arXiv

PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography

Aug 06, 2025
Via arXiv

Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation

Jun 11, 2025
Via arXiv

SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

Feb 28, 2025
Via arXiv

Inference Computation Scaling for Feature Augmentation in Recommendation Systems

Feb 22, 2025
Via arXiv

SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images

Feb 20, 2025
Via arXiv

Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis

Feb 13, 2025
Via arXiv

Adaptive Pruning for Large Language Models with Structural Importance Awareness

Dec 19, 2024
Via arXiv

INSIGHT: Explainable Weakly-Supervised Medical Image Analysis

Dec 02, 2024
Via arXiv

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Nov 29, 2024
Via arXiv