Jielei Zhang

Efficient Causal Structure Learning via Modular Subgraph Integration

Jan 28, 2026
Via arXiv

Improving VQA Reliability: A Dual-Assessment Approach with Self-Reflection and Cross-Model Verification

Dec 16, 2025
Via arXiv

MeshRipple: Structured Autoregressive Generation of Artist-Meshes

Dec 09, 2025
Via arXiv

Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models

Sep 17, 2025
Via arXiv

TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis

May 23, 2025
Via arXiv

MX-Font++: Mixture of Heterogeneous Aggregation Experts for Few-shot Font Generation

Mar 04, 2025
Via arXiv

DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training

Aug 01, 2024
Via arXiv

Facial Attribute Transformers for Precise and Robust Makeup Transfer

Apr 07, 2021
Via arXiv

On Vocabulary Reliance in Scene Text Recognition

May 08, 2020
Via arXiv