Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhizhen Zhao

Hierarchical Rectified Flow Matching with Mini-Batch Couplings

Jul 17, 2025

Yichi Zhang, Yici Yan, Alex Schwing, Zhizhen Zhao

Abstract:Flow matching has emerged as a compelling generative modeling approach that is widely used across domains. To generate data via a flow matching model, an ordinary differential equation (ODE) is numerically solved via forward integration of the modeled velocity field. To better capture the multi-modality that is inherent in typical velocity fields, hierarchical flow matching was recently introduced. It uses a hierarchy of ODEs that are numerically integrated when generating data. This hierarchy of ODEs captures the multi-modal velocity distribution just like vanilla flow matching is capable of modeling a multi-modal data distribution. While this hierarchy enables to model multi-modal velocity distributions, the complexity of the modeled distribution remains identical across levels of the hierarchy. In this paper, we study how to gradually adjust the complexity of the distributions across different levels of the hierarchy via mini-batch couplings. We show the benefits of mini-batch couplings in hierarchical rectified flow matching via compelling results on synthetic and imaging data. Code is available at https://riccizz.github.io/HRF_coupling.

* Project Page: https://riccizz.github.io/HRF_coupling

Via

Access Paper or Ask Questions

Towards Hierarchical Rectified Flow

Feb 24, 2025

Yichi Zhang, Yici Yan, Alex Schwing, Zhizhen Zhao

Abstract:We formulate a hierarchical rectified flow to model data distributions. It hierarchically couples multiple ordinary differential equations (ODEs) and defines a time-differentiable stochastic process that generates a data distribution from a known source distribution. Each ODE resembles the ODE that is solved in a classic rectified flow, but differs in its domain, i.e., location, velocity, acceleration, etc. Unlike the classic rectified flow formulation, which formulates a single ODE in the location domain and only captures the expected velocity field (sufficient to capture a multi-modal data distribution), the hierarchical rectified flow formulation models the multi-modal random velocity field, acceleration field, etc., in their entirety. This more faithful modeling of the random velocity field enables integration paths to intersect when the underlying ODE is solved during data generation. Intersecting paths in turn lead to integration trajectories that are more straight than those obtained in the classic rectified flow formulation, where integration paths cannot intersect. This leads to modeling of data distributions with fewer neural function evaluations. We empirically verify this on synthetic 1D and 2D data as well as MNIST, CIFAR-10, and ImageNet-32 data. Code is available at: https://riccizz.github.io/HRF/.

* ICLR 2025; Project Page: https://riccizz.github.io/HRF/

Via

Access Paper or Ask Questions

Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Dec 17, 2024

Hongyu Shen, Zhizhen Zhao

Figure 1 for Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Figure 2 for Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Figure 3 for Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Abstract:Despite empirical risk minimization (ERM) is widely applied in the machine learning community, its performance is limited on data with spurious correlation or subpopulation that is introduced by hidden attributes. Existing literature proposed techniques to maximize group-balanced or worst-group accuracy when such correlation presents, yet, at the cost of lower average accuracy. In addition, many existing works conduct surveys on different subpopulation methods without revealing the inherent connection between these methods, which could hinder the technology advancement in this area. In this paper, we identify important sampling as a simple yet powerful tool for solving the subpopulation problem. On the theory side, we provide a new systematic formulation of the subpopulation problem and explicitly identify the assumptions that are not clearly stated in the existing works. This helps to uncover the cause of the dropped average accuracy. We provide the first theoretical discussion on the connections of existing methods, revealing the core components that make them different. On the application side, we demonstrate a single estimator is enough to solve the subpopulation problem. In particular, we introduce the estimator in both attribute-known and -unknown scenarios in the subpopulation setup, offering flexibility in practical use cases. And empirically, we achieve state-of-the-art performance on commonly used benchmark datasets.

* Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2025)
* 16 pages, 1 figure, 2 tables

Via

Access Paper or Ask Questions

DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection

Feb 27, 2024

Hongyu Shen, Yici Yan, Zhizhen Zhao

Figure 1 for DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection

Figure 2 for DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection

Figure 3 for DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection

Figure 4 for DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection

Abstract:Model-X knockoff, among various feature selection methods, received much attention recently due to its guarantee on false discovery rate (FDR) control. Subsequent to its introduction in parametric design, knockoff is advanced to handle arbitrary data distributions using deep learning-based generative modeling. However, we observed that current implementations of the deep Model-X knockoff framework exhibit limitations. Notably, the "swap property" that knockoffs necessitate frequently encounter challenges on sample level, leading to a diminished selection power. To overcome, we develop "Deep Dependency Regularized Knockoff (DeepDRK)", a distribution-free deep learning method that strikes a balance between FDR and power. In DeepDRK, a generative model grounded in a transformer architecture is introduced to better achieve the "swap property". Novel efficient regularization techniques are also proposed to reach higher power. Our model outperforms other benchmarks in synthetic, semi-synthetic, and real-world data, especially when sample size is small and data distribution is complex.

* 23 pages, 14 figures, 7 tables

Via

Access Paper or Ask Questions

Convolutional GRU Network for Seasonal Prediction of the El Niño-Southern Oscillation

Jun 18, 2023

Lingda Wang, Savana Ammons, Vera Mikyoung Hur, Ryan L. Sriver, Zhizhen Zhao

Abstract:Predicting sea surface temperature (SST) within the El Ni\~no-Southern Oscillation (ENSO) region has been extensively studied due to its significant influence on global temperature and precipitation patterns. Statistical models such as linear inverse model (LIM), analog forecasting (AF), and recurrent neural network (RNN) have been widely used for ENSO prediction, offering flexibility and relatively low computational expense compared to large dynamic models. However, these models have limitations in capturing spatial patterns in SST variability or relying on linear dynamics. Here we present a modified Convolutional Gated Recurrent Unit (ConvGRU) network for the ENSO region spatio-temporal sequence prediction problem, along with the Ni\~no 3.4 index prediction as a down stream task. The proposed ConvGRU network, with an encoder-decoder sequence-to-sequence structure, takes historical SST maps of the Pacific region as input and generates future SST maps for subsequent months within the ENSO region. To evaluate the performance of the ConvGRU network, we trained and tested it using data from multiple large climate models. The results demonstrate that the ConvGRU network significantly improves the predictability of the Ni\~no 3.4 index compared to LIM, AF, and RNN. This improvement is evidenced by extended useful prediction range, higher Pearson correlation, and lower root-mean-square error. The proposed model holds promise for improving our understanding and predicting capabilities of the ENSO phenomenon and can be broadly applicable to other weather and climate prediction scenarios with spatial patterns and teleconnections.

* 13 pages, 7 figures

Via

Access Paper or Ask Questions

FAIR AI Models in High Energy Physics

Dec 21, 2022

Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori(+7 more)

Abstract:The findable, accessible, interoperable, and reusable (FAIR) data principles have provided a framework for examining, evaluating, and improving how we share data with the aim of facilitating scientific discovery. Efforts have been made to generalize these principles to research software and other digital products. Artificial intelligence (AI) models -- algorithms that have been trained on data rather than explicitly programmed -- are an important target for this because of the ever-increasing pace with which AI is transforming scientific and engineering domains. In this paper, we propose a practical definition of FAIR principles for AI models and create a FAIR AI project template that promotes adherence to these principles. We demonstrate how to implement these principles using a concrete example from experimental high energy physics: a graph neural network for identifying Higgs bosons decaying to bottom quarks. We study the robustness of these FAIR AI models and their portability across hardware architectures and software frameworks, and report new insights on the interpretability of AI predictions by studying the interplay between FAIR datasets and AI models. Enabled by publishing FAIR AI models, these studies pave the way toward reliable and automated AI-driven scientific discovery.

* 32 pages, 8 figures, 9 tables

Via

Access Paper or Ask Questions

Initialization and Alignment for Adversarial Texture Optimization

Jul 28, 2022

Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing

Figure 1 for Initialization and Alignment for Adversarial Texture Optimization

Figure 2 for Initialization and Alignment for Adversarial Texture Optimization

Figure 3 for Initialization and Alignment for Adversarial Texture Optimization

Figure 4 for Initialization and Alignment for Adversarial Texture Optimization

Abstract:While recovery of geometry from image and video data has received a lot of attention in computer vision, methods to capture the texture for a given geometry are less mature. Specifically, classical methods for texture generation often assume clean geometry and reasonably well-aligned image data. While very recent methods, e.g., adversarial texture optimization, better handle lower-quality data obtained from hand-held devices, we find them to still struggle frequently. To improve robustness, particularly of recent adversarial texture optimization, we develop an explicit initialization and an alignment procedure. It deals with complex geometry due to a robust mapping of the geometry to the texture map and a hard-assignment-based initialization. It deals with misalignment of geometry and images by integrating fast image-alignment into the texture refinement optimization. We demonstrate efficacy of our texture generation on a dataset of 11 scenes with a total of 2807 frames, observing 7.8% and 11.1% relative improvements regarding perceptual and sharpness measurements.

* ECCV 2022; Project Page: https://xiaoming-zhao.github.io/projects/advtex_init_align/

Via

Access Paper or Ask Questions

Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

Jul 06, 2022

Shuai Huang, Mona Zehni, Ivan Dokmanić, Zhizhen Zhao

Figure 1 for Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

Figure 2 for Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

Figure 3 for Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

Figure 4 for Orthogonal Matrix Retrieval with Spatial Consensus for 3D Unknown-View Tomography

Abstract:Unknown-view tomography (UVT) reconstructs a 3D density map from its 2D projections at unknown, random orientations. A line of work starting with Kam (1980) employs the method of moments (MoM) with rotation-invariant Fourier features to solve UVT in the frequency domain, assuming that the orientations are uniformly distributed. This line of work includes the recent orthogonal matrix retrieval (OMR) approaches based on matrix factorization, which, while elegant, either require side information about the density that is not available, or fail to be sufficiently robust. In order for OMR to break free from those restrictions, we propose to jointly recover the density map and the orthogonal matrices by requiring that they be mutually consistent. We regularize the resulting non-convex optimization problem by a denoised reference projection and a nonnegativity constraint. This is enabled by the new closed-form expressions for spatial autocorrelation features. Further, we design an easy-to-compute initial density map which effectively mitigates the non-convexity of the reconstruction problem. Experimental results show that the proposed OMR with spatial consensus is more robust and performs significantly better than the previous state-of-the-art OMR approach in the typical low-SNR scenario of 3D UVT.

* Keywords: unknown view tomography, single-particle cryo-electron microscopy, method of moments, autocorrelation, spherical harmonics

Via

Access Paper or Ask Questions

Multi-Frequency Joint Community Detection and Phase Synchronization

Jun 16, 2022

Lingda Wang, Zhizhen Zhao

Figure 1 for Multi-Frequency Joint Community Detection and Phase Synchronization

Figure 2 for Multi-Frequency Joint Community Detection and Phase Synchronization

Figure 3 for Multi-Frequency Joint Community Detection and Phase Synchronization

Figure 4 for Multi-Frequency Joint Community Detection and Phase Synchronization

Abstract:This paper studies the joint community detection and phase synchronization problem on the \textit{stochastic block model with relative phase}, where each node is associated with a phase. This problem, with a variety of real-world applications, aims to recover community memberships and associated phases simultaneously. By studying the maximum likelihood estimation formulation, we show that this problem exhibits a \textit{``multi-frequency''} structure. To this end, two simple yet efficient algorithms that leverage information across multiple frequencies are proposed. The former is a spectral method based on the novel multi-frequency column-pivoted QR factorization, and the latter is an iterative multi-frequency generalized power method. Numerical experiments indicate our proposed algorithms outperform state-of-the-art algorithms, in recovering community memberships and associated phases.

* 20 pages, 5 figures

Via

Access Paper or Ask Questions

A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

Dec 25, 2021

Yifeng Fan, Yuehaw Khoo, Zhizhen Zhao

Figure 1 for A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

Figure 2 for A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

Figure 3 for A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

Figure 4 for A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization

Abstract:Community detection and orthogonal group synchronization are both fundamental problems with a variety of important applications in science and engineering. In this work, we consider the joint problem of community detection and orthogonal group synchronization which aims to recover the communities and perform synchronization simultaneously. To this end, we propose a simple algorithm that consists of a spectral decomposition step followed by a blockwise column pivoted QR factorization (CPQR). The proposed algorithm is efficient and scales linearly with the number of data points. We also leverage the recently developed `leave-one-out' technique to establish a near-optimal guarantee for exact recovery of the cluster memberships and stable recovery of the orthogonal transforms. Numerical experiments demonstrate the efficiency and efficacy of our algorithm and confirm our theoretical characterization of it.

Via

Access Paper or Ask Questions