Picture for Jiawei Zhang

Jiawei Zhang

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Add code
Feb 11, 2025
Viaarxiv icon

DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models

Add code
Feb 06, 2025
Figure 1 for DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Figure 2 for DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Figure 3 for DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Figure 4 for DeblurDiff: Real-World Image Deblurring with Generative Diffusion Models
Viaarxiv icon

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles

Add code
Jan 02, 2025
Figure 1 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 2 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 3 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 4 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Viaarxiv icon

Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution

Add code
Dec 04, 2024
Figure 1 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 2 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 3 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Figure 4 for Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution
Viaarxiv icon

FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video

Add code
Nov 23, 2024
Figure 1 for FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Figure 2 for FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Figure 3 for FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Figure 4 for FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Viaarxiv icon

I2TTS: Image-indicated Immersive Text-to-speech Synthesis with Spatial Perception

Add code
Nov 20, 2024
Viaarxiv icon

RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer

Add code
Nov 17, 2024
Figure 1 for RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer
Figure 2 for RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer
Figure 3 for RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer
Figure 4 for RPN 2: On Interdependence Function Learning Towards Unifying and Advancing CNN, RNN, GNN, and Transformer
Viaarxiv icon

MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI

Add code
Nov 14, 2024
Figure 1 for MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI
Figure 2 for MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI
Figure 3 for MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI
Figure 4 for MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI
Viaarxiv icon

See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers

Add code
Nov 04, 2024
Figure 1 for See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers
Figure 2 for See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers
Figure 3 for See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers
Figure 4 for See it, Think it, Sorted: Large Multimodal Models are Few-shot Time Series Anomaly Analyzers
Viaarxiv icon

Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks

Add code
Oct 28, 2024
Figure 1 for Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks
Figure 2 for Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks
Figure 3 for Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks
Figure 4 for Deep Learning-Based Fatigue Cracks Detection in Bridge Girders using Feature Pyramid Networks
Viaarxiv icon