Picture for Wenbo Zhang

Wenbo Zhang

Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling

Add code
Jan 13, 2026
Viaarxiv icon

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Add code
Jan 08, 2026
Viaarxiv icon

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Add code
Jan 08, 2026
Viaarxiv icon

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Add code
Oct 02, 2025
Figure 1 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 2 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 3 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Figure 4 for MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance
Viaarxiv icon

PET2Rep: Towards Vision-Language Model-Drived Automated Radiology Report Generation for Positron Emission Tomography

Add code
Aug 06, 2025
Viaarxiv icon

Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation

Add code
Jun 11, 2025
Viaarxiv icon

SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

Add code
Feb 28, 2025
Figure 1 for SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models
Figure 2 for SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models
Figure 3 for SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models
Figure 4 for SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models
Viaarxiv icon

Inference Computation Scaling for Feature Augmentation in Recommendation Systems

Add code
Feb 22, 2025
Viaarxiv icon

SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images

Add code
Feb 20, 2025
Figure 1 for SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
Figure 2 for SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
Figure 3 for SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
Figure 4 for SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
Viaarxiv icon

Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis

Add code
Feb 13, 2025
Figure 1 for Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis
Figure 2 for Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis
Figure 3 for Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis
Figure 4 for Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis
Viaarxiv icon