Weiming Zhang

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Jun 24, 2025
Via arXiv

Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment

Jun 17, 2025
Via arXiv

NL-Debugging: Exploiting Natural Language as an Intermediate Representation for Code Debugging

May 21, 2025
Via arXiv

Gaussian Shading++: Rethinking the Realistic Deployment Challenge of Performance-Lossless Image Watermark for Diffusion Models

Apr 21, 2025
Via arXiv

Clean Image May be Dangerous: Data Poisoning Attacks Against Deep Hashing

Mar 27, 2025
Via arXiv

MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG

Mar 17, 2025
Via arXiv

E-SAM: Training-Free Segment Every Entity Model

Mar 15, 2025
Via arXiv

Exploiting Vulnerabilities in Speech Translation Systems through Targeted Adversarial Attacks

Mar 05, 2025
Via arXiv

M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment

Feb 21, 2025
Via arXiv

Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models

Feb 17, 2025
Via arXiv