Simon Urbanek

Scatter Matrix Concordance: A Diagnostic for Regressions on Subsets of Data

Jul 12, 2015

Michael J. Kane, Bryan Lewis, Sekhar Tatikonda, Simon Urbanek

Abstract: Linear regression models depend directly on the design matrix and its properties. Techniques that efficiently estimate model coefficients by partitioning rows of the design matrix are increasingly popular for large-scale problems because they fit well with modern parallel computing architectures. We propose a simple measure of concordance between a design matrix and a subset of its rows that estimates how well a subset captures the variance-covariance structure of a larger data set. We illustrate the use of this measure in a heuristic method for selecting row partition sizes that balance statistical and computational efficiency goals in real-world problems.