Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gentry Johnson

Matching on What Matters: A Pseudo-Metric Learning Approach to Matching Estimation in High Dimensions

May 28, 2019

Gentry Johnson, Brian Quistorff, Matt Goldman

Figure 1 for Matching on What Matters: A Pseudo-Metric Learning Approach to Matching Estimation in High Dimensions

Figure 2 for Matching on What Matters: A Pseudo-Metric Learning Approach to Matching Estimation in High Dimensions

Figure 3 for Matching on What Matters: A Pseudo-Metric Learning Approach to Matching Estimation in High Dimensions

Figure 4 for Matching on What Matters: A Pseudo-Metric Learning Approach to Matching Estimation in High Dimensions

Abstract:When pre-processing observational data via matching, we seek to approximate each unit with maximally similar peers that had an alternative treatment status--essentially replicating a randomized block design. However, as one considers a growing number of continuous features, a curse of dimensionality applies making asymptotically valid inference impossible (Abadie and Imbens, 2006). The alternative of ignoring plausibly relevant features is certainly no better, and the resulting trade-off substantially limits the application of matching methods to "wide" datasets. Instead, Li and Fu (2017) recasts the problem of matching in a metric learning framework that maps features to a low-dimensional space that facilitates "closer matches" while still capturing important aspects of unit-level heterogeneity. However, that method lacks key theoretical guarantees and can produce inconsistent estimates in cases of heterogeneous treatment effects. Motivated by straightforward extension of existing results in the matching literature, we present alternative techniques that learn latent matching features through either MLPs or through siamese neural networks trained on a carefully selected loss function. We benchmark the resulting alternative methods in simulations as well as against two experimental data sets--including the canonical NSW worker training program data set--and find superior performance of the neural-net-based methods.

Via

Access Paper or Ask Questions