Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stark C. Draper

Controlled privacy leakage propagation throughout overlapping grouped learning

Mar 06, 2025

Shahrzad Kiani, Franziska Boenisch, Stark C. Draper

Abstract:Federated Learning (FL) is the standard protocol for collaborative learning. In FL, multiple workers jointly train a shared model. They exchange model updates calculated on their data, while keeping the raw data itself local. Since workers naturally form groups based on common interests and privacy policies, we are motivated to extend standard FL to reflect a setting with multiple, potentially overlapping groups. In this setup where workers can belong and contribute to more than one group at a time, complexities arise in understanding privacy leakage and in adhering to privacy policies. To address the challenges, we propose differential private overlapping grouped learning (DPOGL), a novel method to implement privacy guarantees within overlapping groups. Under the honest-but-curious threat model, we derive novel privacy guarantees between arbitrary pairs of workers. These privacy guarantees describe and quantify two key effects of privacy leakage in DP-OGL: propagation delay, i.e., the fact that information from one group will leak to other groups only with temporal offset through the common workers and information degradation, i.e., the fact that noise addition over model updates limits information leakage between workers. Our experiments show that applying DP-OGL enhances utility while maintaining strong privacy compared to standard FL setups.

* IEEE Journal on Selected Areas in Information Theory (JSAIT), vol. 5, pp. 442-463, 2024
* This paper was presented in part at the 2024 IEEE International Symposium on Information Theory (ISIT), Athens, Greece, 2024, pp. 386-391, doi: 10.1109/ISIT57864.2024.10619521. The paper was published in the IEEE Journal on Selected Areas in Information Theory (JSAIT)

Via

Access Paper or Ask Questions

Quasicyclic Principal Component Analysis

Feb 07, 2025

Susanna E. Rumsey, Stark C. Draper, Frank R. Kschischang

Abstract:We present quasicyclic principal component analysis (QPCA), a generalization of principal component analysis (PCA), that determines an optimized basis for a dataset in terms of families of shift-orthogonal principal vectors. This is of particular interest when analyzing cyclostationary data, whose cyclic structure is not exploited by the standard PCA algorithm. We first formulate QPCA as an optimization problem, which we show may be decomposed into a series of PCA problems in the frequency domain. We then formalize our solution as an explicit algorithm and analyze its computational complexity. Finally, we provide some examples of applications of QPCA to cyclostationary signal processing data, including an investigation of carrier pulse recovery, a presentation of methods for estimating an unknown oversampling rate, and a discussion of an appropriate approach for pre-processing data with a non-integer oversampling rate in order to better apply the QPCA algorithm.

Via

Access Paper or Ask Questions

Controllability of Coarsely Measured Networked Linear Dynamical Systems (Extended Version)

Jun 21, 2022

Nafiseh Ghoroghchian, Rajasekhar Anguluri, Gautam Dasarathy, Stark C. Draper

Figure 1 for Controllability of Coarsely Measured Networked Linear Dynamical Systems (Extended Version)

Figure 2 for Controllability of Coarsely Measured Networked Linear Dynamical Systems (Extended Version)

Figure 3 for Controllability of Coarsely Measured Networked Linear Dynamical Systems (Extended Version)

Abstract:We consider the controllability of large-scale linear networked dynamical systems when complete knowledge of network structure is unavailable and knowledge is limited to coarse summaries. We provide conditions under which average controllability of the fine-scale system can be well approximated by average controllability of the (synthesized, reduced-order) coarse-scale system. To this end, we require knowledge of some inherent parametric structure of the fine-scale network that makes this type of approximation possible. Therefore, we assume that the underlying fine-scale network is generated by the stochastic block model (SBM) -- often studied in community detection. We then provide an algorithm that directly estimates the average controllability of the fine-scale system using a coarse summary of SBM. Our analysis indicates the necessity of underlying structure (e.g., in-built communities) to be able to quantify accurately the controllability from coarsely characterized networked dynamics. We also compare our method to that of the reduced-order method and highlight the regimes where both can outperform each other. Finally, we provide simulations to confirm our theoretical results for different scalings of network size and density, and the parameter that captures how much community-structure is retained in the coarse summary.

Via

Access Paper or Ask Questions

Compressing gradients by exploiting temporal correlation in momentum-SGD

Aug 17, 2021

Tharindu B. Adikari, Stark C. Draper

Figure 1 for Compressing gradients by exploiting temporal correlation in momentum-SGD

Figure 2 for Compressing gradients by exploiting temporal correlation in momentum-SGD

Figure 3 for Compressing gradients by exploiting temporal correlation in momentum-SGD

Figure 4 for Compressing gradients by exploiting temporal correlation in momentum-SGD

Abstract:An increasing bottleneck in decentralized optimization is communication. Bigger models and growing datasets mean that decentralization of computation is important and that the amount of information exchanged is quickly growing. While compression techniques have been introduced to cope with the latter, none has considered leveraging the temporal correlations that exist in consecutive vector updates. An important example is distributed momentum-SGD where temporal correlation is enhanced by the low-pass-filtering effect of applying momentum. In this paper we design and analyze compression methods that exploit temporal correlation in systems both with and without error-feedback. Experiments with the ImageNet dataset demonstrate that our proposed methods offer significant reduction in the rate of communication at only a negligible increase in computation complexity. We further analyze the convergence of SGD when compression is applied with error-feedback. In the literature, convergence guarantees are developed only for compressors that provide error-bounds point-wise, i.e., for each input to the compressor. In contrast, many important codes (e.g. rate-distortion codes) provide error-bounds only in expectation and thus provide a more general guarantee. In this paper we prove the convergence of SGD under an expected error assumption by establishing a bound for the minimum gradient norm.

* IEEE Journal on Selected Areas in Information Theory (JSAIT) Volume 2, Issue 3 (2021)
* This paper was presented in part at the 11th International Symposium on Topics in Coding (ISTC), Montreal, QC, Canada, August 2021, and the paper has been accepted for publication in the IEEE Journal on Selected Areas in Information Theory (JSAIT) Volume 2, Issue 3 (2021), https://ieeexplore.ieee.org/document/9511618

Via

Access Paper or Ask Questions

Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Jun 30, 2021

Sankalp Gilda, Stark C. Draper, Sebastien Fabbro, William Mahoney, Simon Prunet, Kanoa Withington, Matthew Wilson, Yuan-Sen Ting, Andrew Sheinis

Figure 1 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 2 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 3 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 4 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Abstract:We leverage state-of-the-art machine learning methods and a decade's worth of archival data from the Canada-France-Hawaii Telescope (CFHT) to predict observatory image quality (IQ) from environmental conditions and observatory operating parameters. Specifically, we develop accurate and interpretable models of the complex dependence between data features and observed IQ for CFHT's wide field camera, MegaCam. Our contributions are several-fold. First, we collect, collate and reprocess several disparate data sets gathered by CFHT scientists. Second, we predict probability distribution functions (PDFs) of IQ, and achieve a mean absolute error of $\sim0.07''$ for the predicted medians. Third, we explore data-driven actuation of the 12 dome ``vents'', installed in 2013-14 to accelerate the flushing of hot air from the dome. We leverage epistemic and aleatoric uncertainties in conjunction with probabilistic generative modeling to identify candidate vent adjustments that are in-distribution (ID) and, for the optimal configuration for each ID sample, we predict the reduction in required observing time to achieve a fixed SNR. On average, the reduction is $\sim15\%$. Finally, we rank sensor data features by Shapley values to identify the most predictive variables for each observation. Our long-term goal is to construct reliable and real-time models that can forecast optimal observatory operating parameters for optimization of IQ. Such forecasts can then be fed into scheduling protocols and predictive maintenance routines. We anticipate that such approaches will become standard in automating observatory operations and maintenance by the time CFHT's successor, the Maunakea Spectroscopic Explorer (MSE), is installed in the next decade.

* 25 pages, 1 appendix, 12 figures. To be submitted to MNRAS. Comments and feedback welcome

Via

Access Paper or Ask Questions

Graph Community Detection from Coarse Measurements: Recovery Conditions for the Coarsened Weighted Stochastic Block Model

Feb 25, 2021

Nafiseh Ghoroghchian, Gautam Dasarathy, Stark C. Draper

Figure 1 for Graph Community Detection from Coarse Measurements: Recovery Conditions for the Coarsened Weighted Stochastic Block Model

Figure 2 for Graph Community Detection from Coarse Measurements: Recovery Conditions for the Coarsened Weighted Stochastic Block Model

Abstract:We study the problem of community recovery from coarse measurements of a graph. In contrast to the problem of community recovery of a fully observed graph, one often encounters situations when measurements of a graph are made at low-resolution, each measurement integrating across multiple graph nodes. Such low-resolution measurements effectively induce a coarse graph with its own communities. Our objective is to develop conditions on the graph structure, the quantity, and properties of measurements, under which we can recover the community organization in this coarse graph. In this paper, we build on the stochastic block model by mathematically formalizing the coarsening process, and characterizing its impact on the community members and connections. Through this novel setup and modeling, we characterize an error bound for community recovery. The error bound yields simple and closed-form asymptotic conditions to achieve the perfect recovery of the coarse graph communities.

Via

Access Paper or Ask Questions

Node-Centric Graph Learning from Data for Brain State Identification

Nov 04, 2020

Nafiseh Ghoroghchian, David M. Groppe, Roman Genov, Taufik A. Valiante, Stark C. Draper

Figure 1 for Node-Centric Graph Learning from Data for Brain State Identification

Figure 2 for Node-Centric Graph Learning from Data for Brain State Identification

Figure 3 for Node-Centric Graph Learning from Data for Brain State Identification

Figure 4 for Node-Centric Graph Learning from Data for Brain State Identification

Abstract:Data-driven graph learning models a network by determining the strength of connections between its nodes. The data refers to a graph signal which associates a value with each graph node. Existing graph learning methods either use simplified models for the graph signal, or they are prohibitively expensive in terms of computational and memory requirements. This is particularly true when the number of nodes is high or there are temporal changes in the network. In order to consider richer models with a reasonable computational tractability, we introduce a graph learning method based on representation learning on graphs. Representation learning generates an embedding for each graph node, taking the information from neighbouring nodes into account. Our graph learning method further modifies the embeddings to compute the graph similarity matrix. In this work, graph learning is used to examine brain networks for brain state identification. We infer time-varying brain graphs from an extensive dataset of intracranial electroencephalographic (iEEG) signals from ten patients. We then apply the graphs as input to a classifier to distinguish seizure vs. non-seizure brain states. Using the binary classification metric of area under the receiver operating characteristic curve (AUC), this approach yields an average of 9.13 percent improvement when compared to two widely used brain network modeling methods.

* IEEE Transactions on Signal and Information Processing over Networks, 6, 120-132 (2020)

Via

Access Paper or Ask Questions

A Hierarchical Graph Signal Processing Approach to Inference from Spatiotemporal Signals

Oct 25, 2020

Nafiseh Ghoroghchian, Stark C. Draper, Roman Genov

Figure 1 for A Hierarchical Graph Signal Processing Approach to Inference from Spatiotemporal Signals

Figure 2 for A Hierarchical Graph Signal Processing Approach to Inference from Spatiotemporal Signals

Figure 3 for A Hierarchical Graph Signal Processing Approach to Inference from Spatiotemporal Signals

Figure 4 for A Hierarchical Graph Signal Processing Approach to Inference from Spatiotemporal Signals

Abstract:Motivated by the emerging area of graph signal processing (GSP), we introduce a novel method to draw inference from spatiotemporal signals. Data acquisition in different locations over time is common in sensor networks, for diverse applications ranging from object tracking in wireless networks to medical uses such as electroencephalography (EEG) signal processing. In this paper we leverage novel techniques of GSP to develop a hierarchical feature extraction approach by mapping the data onto a series of spatiotemporal graphs. Such a model maps signals onto vertices of a graph and the time-space dependencies among signals are modeled by the edge weights. Signal components acquired from different locations and time often have complicated functional dependencies. Accordingly, their corresponding graph weights are learned from data and used in two ways. First, they are used as a part of the embedding related to the topology of graph, such as density. Second, they provide the connectivities of the base graph for extracting higher level GSP-based features. The latter include the energies of the signal's graph Fourier transform in different frequency bands. We test our approach on the intracranial EEG (iEEG) data set of the Kaggle epileptic seizure detection contest. In comparison to the winning code, the results show a slight net improvement and up to 6 percent improvement in per subject analysis, while the number of features are decreased by 75 percent on average.

* In 2018 29th Biennial Symposium on Communications (BSC) (pp. 1-5). IEEE (2018, June)

Via

Access Paper or Ask Questions

Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

Jun 10, 2020

Nuwan Ferdinand, Haider Al-Lawati, Stark C. Draper, Matthew Nokleby

Figure 1 for Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

Figure 2 for Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

Figure 3 for Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

Figure 4 for Anytime MiniBatch: Exploiting Stragglers in Online Distributed Optimization

Abstract:Distributed optimization is vital in solving large-scale machine learning problems. A widely-shared feature of distributed optimization techniques is the requirement that all nodes complete their assigned tasks in each computational epoch before the system can proceed to the next epoch. In such settings, slow nodes, called stragglers, can greatly slow progress. To mitigate the impact of stragglers, we propose an online distributed optimization method called Anytime Minibatch. In this approach, all nodes are given a fixed time to compute the gradients of as many data samples as possible. The result is a variable per-node minibatch size. Workers then get a fixed communication time to average their minibatch gradients via several rounds of consensus, which are then used to update primal variables via dual averaging. Anytime Minibatch prevents stragglers from holding up the system without wasting the work that stragglers can complete. We present a convergence analysis and analyze the wall time performance. Our numerical results show that our approach is up to 1.5 times faster in Amazon EC2 and it is up to five times faster when there is greater variability in compute node performance.

* Proc. of the 7th Int. Conf. on Learning Representations (ICLR), May 2019, New Orleans, LA, USA
* International Conference on Learning Representations (ICLR), May 2019, New Orleans, LA, USA

Via

Access Paper or Ask Questions

Efficient learning of neighbor representations for boundary trees and forests

Oct 26, 2018

Tharindu Adikari, Stark C. Draper

Figure 1 for Efficient learning of neighbor representations for boundary trees and forests

Figure 2 for Efficient learning of neighbor representations for boundary trees and forests

Figure 3 for Efficient learning of neighbor representations for boundary trees and forests

Abstract:We introduce a semiparametric approach to neighbor-based classification. We build off the recently proposed Boundary Trees algorithm by Mathy et al.(2015) which enables fast neighbor-based classification, regression and retrieval in large datasets. While boundary trees use an Euclidean measure of similarity, the Differentiable Boundary Tree algorithm by Zoran et al.(2017) was introduced to learn low-dimensional representations of complex input data, on which semantic similarity can be calculated to train boundary trees. As is pointed out by its authors, the differentiable boundary tree approach contains a few limitations that prevents it from scaling to large datasets. In this paper, we introduce Differentiable Boundary Sets, an algorithm that overcomes the computational issues of the differentiable boundary tree scheme and also improves its classification accuracy and data representability. Our algorithm is efficiently implementable with existing tools and offers a significant reduction in training time. We test and compare the algorithms on the well known MNIST handwritten digits dataset and the newer Fashion-MNIST dataset by Xiao et al.(2017).

* 9 pages, 2 figures

Via

Access Paper or Ask Questions