Nima Asadi

Runtime Optimizations for Prediction with Tree-Based Models

Apr 26, 2013

Nima Asadi, Jimmy Lin, Arjen P. de Vries

Abstract: Tree-based models have proven to be an effective solution for web ranking as well as other problems in diverse domains. This paper focuses on optimizing the runtime performance of applying such models to make predictions, given an already-trained model. Although tree-based models are exceedingly simple conceptually, most implementations do not efficiently utilize modern superscalar processor architectures. By laying out data structures in memory in a more cache-conscious fashion, removing branches from the execution flow using a technique called predication, and micro-batching predictions using a technique called vectorization, we are able to better exploit modern processor architectures and significantly improve the speed of tree-based models over hard-coded if-else blocks. Our work contributes to the exploration of architecture-conscious runtime implementations of machine learning algorithms.
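The three ideas named in the abstract (a flat, array-based layout, predication, and micro-batched traversal) can be illustrated with a minimal sketch. This is not the authors' implementation: the tree, the array names (`FEATURE`, `THRESHOLD`, `LEAF_VALUE`), and the complete-binary-tree layout are assumptions chosen for illustration, and pure Python shows only the control-flow structure, not the hardware-level speedup the paper measures.

```python
# Hypothetical complete binary tree of depth 2, stored in flat arrays.
# Internal node i has children 2*i + 1 (left) and 2*i + 2 (right).
FEATURE = [0, 1, 1]                    # feature index tested at each internal node
THRESHOLD = [0.5, 0.3, 0.7]            # split threshold at each internal node
LEAF_VALUE = [10.0, 20.0, 30.0, 40.0]  # leaf outputs, left to right
DEPTH = 2
NUM_INTERNAL = len(FEATURE)


def predict_branching(x):
    """Baseline: one if-else branch per tree level."""
    i = 0
    for _ in range(DEPTH):
        if x[FEATURE[i]] > THRESHOLD[i]:
            i = 2 * i + 2
        else:
            i = 2 * i + 1
    return LEAF_VALUE[i - NUM_INTERNAL]


def predict_predicated(x):
    """Predication: the comparison result (0 or 1) is added to the child
    index, so the traversal contains no data-dependent if-else."""
    i = 0
    for _ in range(DEPTH):
        i = 2 * i + 1 + (x[FEATURE[i]] > THRESHOLD[i])
    return LEAF_VALUE[i - NUM_INTERNAL]


def predict_batch(batch):
    """Micro-batching: advance every instance in the batch by one level
    before moving to the next level."""
    idx = [0] * len(batch)
    for _ in range(DEPTH):
        for k, x in enumerate(batch):
            i = idx[k]
            idx[k] = 2 * i + 1 + (x[FEATURE[i]] > THRESHOLD[i])
    return [LEAF_VALUE[i - NUM_INTERNAL] for i in idx]
```

In a compiled language the predicated update would typically become a compare followed by an add, which avoids branch mispredictions, and the level-by-level batch loop lets memory accesses for different instances overlap. Those effects are the paper's claims and are not reproduced by this sketch.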

Using Variational Inference and MapReduce to Scale Topic Modeling

Jul 19, 2011

Ke Zhai, Jordan Boyd-Graber, Nima Asadi

Abstract: Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for exploring document collections. Because of the increasing prevalence of large datasets, there is a need to improve the scalability of inference of LDA. In this paper, we propose a technique called MapReduce LDA (Mr. LDA) to accommodate very large corpus collections in the MapReduce framework. In contrast to other techniques to scale inference for LDA, which use Gibbs sampling, we use variational inference. Our solution efficiently distributes computation and is relatively simple to implement. More importantly, this variational implementation, unlike highly tuned and specialized implementations, is easily extensible. We demonstrate two extensions of the model possible with this scalable framework: informed priors to guide topic discovery and modeling topics from a multilingual corpus.