Guillaume Bourque

A log-linear time algorithm for constrained changepoint detection

Mar 09, 2017

Toby Dylan Hocking, Guillem Rigaill, Paul Fearnhead, Guillaume Bourque

Abstract: Changepoint detection is a central problem in time series and genomic data. For some applications, it is natural to impose constraints on the directions of changes. One example is ChIP-seq data, for which adding an up-down constraint improves peak detection accuracy, but makes the optimization problem more complicated. We show how a recently proposed functional pruning technique can be adapted to solve such constrained changepoint detection problems. This leads to a new algorithm which can solve problems with arbitrary affine constraints on adjacent segment means, and which has empirical time complexity that is log-linear in the amount of data. This algorithm achieves state-of-the-art accuracy in a benchmark of several genomic data sets, and is orders of magnitude faster than existing algorithms that have similar accuracy. Our implementation is available as the PeakSegPDPA function in the coseg R package, https://github.com/tdhock/coseg
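
To make the up-down constraint in the abstract above concrete, here is a minimal Python sketch of a naive exhaustive search for a single peak (three segments whose means go up and then down) under a Poisson loss. It is an illustration only: it runs in quadratic time and is not the functional pruning algorithm described in the paper, nor the `PeakSegPDPA` implementation in coseg. The function names `poisson_loss` and `best_single_peak`, the strict inequalities, and the choice of Poisson loss are assumptions made for this example.

```python
import math

def poisson_loss(total, length):
    """Poisson loss of one segment evaluated at its sample mean.

    For mean m = total / length, the loss length*m - total*log(m)
    simplifies to total - total*log(m). An all-zero segment has loss 0.
    """
    if total == 0:
        return 0.0
    return total - total * math.log(total / length)

def best_single_peak(counts):
    """Exhaustive search for one peak in a sequence of counts.

    Candidate segments are counts[:start], counts[start:end], counts[end:].
    Only candidates whose means go strictly up, then strictly down, are kept
    (hypothetical strict version of the up-down constraint).
    Returns (loss, start, end), or None if no candidate is feasible.
    """
    n = len(counts)
    cum = [0]
    for c in counts:
        cum.append(cum[-1] + c)  # cumulative sums give O(1) segment totals
    best = None
    for start in range(1, n - 1):
        for end in range(start + 1, n):
            sums = (cum[start], cum[end] - cum[start], cum[n] - cum[end])
            lens = (start, end - start, n - end)
            means = [s / l for s, l in zip(sums, lens)]
            if not (means[0] < means[1] > means[2]):
                continue  # violates the up-down constraint
            loss = sum(poisson_loss(s, l) for s, l in zip(sums, lens))
            if best is None or loss < best[0]:
                best = (loss, start, end)
    return best
```

For example, `best_single_peak([1, 1, 1, 10, 10, 10, 1, 1, 1])` places the peak on the three high counts (`start == 3`, `end == 6`). Scaling this idea to many peaks and long sequences is exactly where a faster algorithm becomes necessary.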

PeakSegJoint: fast supervised peak detection via joint segmentation of multiple count data samples

Jun 03, 2015

Toby Dylan Hocking, Guillaume Bourque

Abstract: Joint peak detection is a central problem when comparing samples in genomic data analysis, but current algorithms for this task are unsupervised and limited to at most 2 sample types. We propose PeakSegJoint, a new constrained maximum likelihood segmentation model for any number of sample types. To select the number of peaks in the segmentation, we propose a supervised penalty learning model. To infer the parameters of these two models, we propose to use a discrete optimization heuristic for the segmentation, and convex optimization for the penalty learning. In comparisons with state-of-the-art peak detection algorithms, PeakSegJoint achieves similar accuracy, faster speeds, and a more interpretable model with overlapping peaks that occur in exactly the same positions across all samples.

* 11 pages, 5 figures
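
The abstract above mentions selecting the number of peaks with a penalty. A minimal Python sketch of that selection step follows, assuming the common penalized criterion "loss plus penalty times number of peaks". The function name `select_peaks`, the loss values, and the penalty values are hypothetical; in the paper the penalty is learned from labeled data, which this sketch does not attempt.

```python
def select_peaks(loss_by_peaks, penalty):
    """Return the peak count p that minimizes loss_by_peaks[p] + penalty * p.

    loss_by_peaks maps a number of peaks to the loss of the best model
    with that many peaks (hypothetical inputs for illustration).
    """
    return min(loss_by_peaks, key=lambda p: loss_by_peaks[p] + penalty * p)

# Hypothetical losses: more peaks always fit better, with diminishing returns.
losses = {0: 100.0, 1: 40.0, 2: 35.0, 3: 34.0}
chosen = {penalty: select_peaks(losses, penalty) for penalty in (0.5, 2.0, 10.0, 100.0)}
```

A larger penalty favors fewer peaks, so choosing the penalty well is what determines how many peaks are reported.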
