Picture for Jiacheng Zhang

Jiacheng Zhang

Let's Roll a BiFTA: Bi-refinement for Fine-grained Text-visual Alignment in Vision-Language Models

Add code
Jan 28, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

Sample-Specific Noise Injection For Diffusion-Based Adversarial Purification

Add code
Jun 06, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Figure 1 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 2 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 3 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Figure 4 for NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results
Viaarxiv icon

DDAD: A Two-pronged Adversarial Defense Based on Distributional Discrepancy

Add code
Mar 04, 2025
Viaarxiv icon

Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding

Add code
Jan 03, 2025
Figure 1 for Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding
Figure 2 for Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding
Figure 3 for Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding
Figure 4 for Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding
Viaarxiv icon

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Add code
Dec 19, 2024
Figure 1 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 2 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 3 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 4 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Viaarxiv icon

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Add code
Dec 19, 2024
Figure 1 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 2 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 3 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 4 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Viaarxiv icon

An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training

Add code
Dec 18, 2024
Figure 1 for An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Figure 2 for An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Figure 3 for An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Figure 4 for An Efficient Occupancy World Model via Decoupled Dynamic Flow and Image-assisted Training
Viaarxiv icon

Minimax-optimal trust-aware multi-armed bandits

Add code
Oct 04, 2024
Viaarxiv icon