Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Matthew Eichhorn

Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps

Jan 05, 2021

Tri Dao, Nimit S. Sohoni, Albert Gu, Matthew Eichhorn, Amit Blonder, Megan Leszczynski, Atri Rudra, Christopher Ré

Figure 1 for Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps

Figure 2 for Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps

Figure 3 for Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps

Figure 4 for Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps

Abstract:Modern neural network architectures use structured linear transformations, such as low-rank matrices, sparse matrices, permutations, and the Fourier transform, to improve inference speed and reduce memory usage compared to general linear maps. However, choosing which of the myriad structured transformations to use (and its associated parameterization) is a laborious task that requires trading off speed, space, and accuracy. We consider a different approach: we introduce a family of matrices called kaleidoscope matrices (K-matrices) that provably capture any structured matrix with near-optimal space (parameter) and time (arithmetic operation) complexity. We empirically validate that K-matrices can be automatically learned within end-to-end pipelines to replace hand-crafted procedures, in order to improve model quality. For example, replacing channel shuffles in ShuffleNet improves classification accuracy on ImageNet by up to 5%. K-matrices can also simplify hand-engineered pipelines -- we replace filter bank feature computation in speech data preprocessing with a learnable kaleidoscope layer, resulting in only 0.4% loss in accuracy on the TIMIT speech recognition task. In addition, K-matrices can capture latent structure in models: for a challenging permuted image classification task, a K-matrix based representation of permutations is able to learn the right latent structure and improves accuracy of a downstream convolutional model by over 9%. We provide a practically efficient implementation of our approach, and use K-matrices in a Transformer network to attain 36% faster end-to-end inference speed on a language translation task.

* International Conference on Learning Representations (ICLR) 2020 spotlight

Via

Access Paper or Ask Questions

Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations

Mar 14, 2019

Tri Dao, Albert Gu, Matthew Eichhorn, Atri Rudra, Christopher Ré

Figure 1 for Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations

Figure 2 for Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations

Figure 3 for Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations

Figure 4 for Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations

Abstract:Fast linear transforms are ubiquitous in machine learning, including the discrete Fourier transform, discrete cosine transform, and other structured transformations such as convolutions. All of these transforms can be represented by dense matrix-vector multiplication, yet each has a specialized and highly efficient (subquadratic) algorithm. We ask to what extent hand-crafting these algorithms and implementations is necessary, what structural priors they encode, and how much knowledge is required to automatically learn a fast algorithm for a provided structured transform. Motivated by a characterization of fast matrix-vector multiplication as products of sparse matrices, we introduce a parameterization of divide-and-conquer methods that is capable of representing a large class of transforms. This generic formulation can automatically learn an efficient algorithm for many important transforms; for example, it recovers the $O(N \log N)$ Cooley-Tukey FFT algorithm to machine precision, for dimensions $N$ up to $1024$. Furthermore, our method can be incorporated as a lightweight replacement of generic matrices in machine learning pipelines to learn efficient and compressible transformations. On a standard task of compressing a single hidden-layer network, our method exceeds the classification accuracy of unconstrained matrices on CIFAR-10 by 3.9 points---the first time a structured approach has done so---with 4X faster inference speed and 40X fewer parameters.

Via

Access Paper or Ask Questions

Fast Counting in Machine Learning Applications

Jun 26, 2018

Subhadeep Karan, Matthew Eichhorn, Blake Hurlburt, Grant Iraci, Jaroslaw Zola

Figure 1 for Fast Counting in Machine Learning Applications

Figure 2 for Fast Counting in Machine Learning Applications

Figure 3 for Fast Counting in Machine Learning Applications

Figure 4 for Fast Counting in Machine Learning Applications

Abstract:We propose scalable methods to execute counting queries in machine learning applications. To achieve memory and computational efficiency, we abstract counting queries and their context such that the counts can be aggregated as a stream. We demonstrate performance and scalability of the resulting approach on random queries, and through extensive experimentation using Bayesian networks learning and association rule mining. Our methods significantly outperform commonly used ADtrees and hash tables, and are practical alternatives for processing large-scale data.

Via

Access Paper or Ask Questions