Picture for Weiming Zhang

Weiming Zhang

FreeText: Training-Free Text Rendering in Diffusion Transformers via Attention Localization and Spectral Glyph Injection

Add code
Jan 02, 2026
Viaarxiv icon

Cross-modal Retrieval Models for Stripped Binary Analysis

Add code
Dec 11, 2025
Figure 1 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 2 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 3 for Cross-modal Retrieval Models for Stripped Binary Analysis
Figure 4 for Cross-modal Retrieval Models for Stripped Binary Analysis
Viaarxiv icon

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

Add code
Dec 10, 2025
Viaarxiv icon

MF-Speech: Achieving Fine-Grained and Compositional Control in Speech Generation via Factor Disentanglement

Add code
Nov 19, 2025
Viaarxiv icon

AuthSig: Safeguarding Scanned Signatures Against Unauthorized Reuse in Paperless Workflows

Add code
Nov 12, 2025
Viaarxiv icon

LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors

Add code
Nov 10, 2025
Figure 1 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 2 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 3 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 4 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Viaarxiv icon

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

Add code
Nov 09, 2025
Figure 1 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 2 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 3 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Figure 4 for LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
Viaarxiv icon

A high-capacity linguistic steganography based on entropy-driven rank-token mapping

Add code
Oct 27, 2025
Figure 1 for A high-capacity linguistic steganography based on entropy-driven rank-token mapping
Figure 2 for A high-capacity linguistic steganography based on entropy-driven rank-token mapping
Figure 3 for A high-capacity linguistic steganography based on entropy-driven rank-token mapping
Figure 4 for A high-capacity linguistic steganography based on entropy-driven rank-token mapping
Viaarxiv icon

T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models

Add code
Oct 25, 2025
Figure 1 for T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
Figure 2 for T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
Figure 3 for T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
Figure 4 for T2SMark: Balancing Robustness and Diversity in Noise-as-Watermark for Diffusion Models
Viaarxiv icon

De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks

Add code
Jul 03, 2025
Viaarxiv icon