Picture for Siyi Chen

Siyi Chen

International Institute for Urban Systems Engineering, Southeast University, Nanjing, China

Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning

Add code
Dec 10, 2024
Viaarxiv icon

Learning Diffusion Model from Noisy Measurement using Principled Expectation-Maximization Method

Add code
Oct 15, 2024
Viaarxiv icon

Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering

Add code
Sep 04, 2024
Figure 1 for Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
Figure 2 for Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
Figure 3 for Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
Figure 4 for Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
Viaarxiv icon

Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing

Add code
Sep 04, 2024
Figure 1 for Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing
Figure 2 for Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing
Figure 3 for Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing
Figure 4 for Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing
Viaarxiv icon

Unfolding Videos Dynamics via Taylor Expansion

Add code
Sep 04, 2024
Figure 1 for Unfolding Videos Dynamics via Taylor Expansion
Figure 2 for Unfolding Videos Dynamics via Taylor Expansion
Figure 3 for Unfolding Videos Dynamics via Taylor Expansion
Figure 4 for Unfolding Videos Dynamics via Taylor Expansion
Viaarxiv icon

Blind Inversion using Latent Diffusion Priors

Add code
Jul 01, 2024
Figure 1 for Blind Inversion using Latent Diffusion Priors
Figure 2 for Blind Inversion using Latent Diffusion Priors
Figure 3 for Blind Inversion using Latent Diffusion Priors
Figure 4 for Blind Inversion using Latent Diffusion Priors
Viaarxiv icon

Robust semi-automatic vessel tracing in the human retinal image by an instance segmentation neural network

Add code
Feb 15, 2024
Viaarxiv icon

Understanding 3D Object Articulation in Internet Videos

Add code
Mar 30, 2022
Figure 1 for Understanding 3D Object Articulation in Internet Videos
Figure 2 for Understanding 3D Object Articulation in Internet Videos
Figure 3 for Understanding 3D Object Articulation in Internet Videos
Figure 4 for Understanding 3D Object Articulation in Internet Videos
Viaarxiv icon

Transform-Based Feature Map Compression for CNN Inference

Add code
Jun 24, 2021
Figure 1 for Transform-Based Feature Map Compression for CNN Inference
Figure 2 for Transform-Based Feature Map Compression for CNN Inference
Figure 3 for Transform-Based Feature Map Compression for CNN Inference
Figure 4 for Transform-Based Feature Map Compression for CNN Inference
Viaarxiv icon

Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios

Add code
Nov 16, 2020
Figure 1 for Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios
Figure 2 for Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios
Figure 3 for Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios
Figure 4 for Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios
Viaarxiv icon