Title: On-the-Fly Rectification for Robust Large-Vocabulary Topic Inference

Nov 12, 2021

Moontae Lee, Sungjun Cho, Kun Dong, David Mimno, David Bindel

Abstract: Across many data domains, co-occurrence statistics about the joint appearance of objects are powerfully informative. By transforming unsupervised learning problems into decompositions of co-occurrence statistics, spectral algorithms provide transparent and efficient algorithms for posterior inference tasks such as latent topic analysis and community detection. As object vocabularies grow, however, it becomes rapidly more expensive to store co-occurrence statistics and to run inference algorithms on them. Rectifying co-occurrence, the key process to uphold model assumptions, becomes increasingly vital in the presence of rare terms, but current techniques cannot scale to large vocabularies. We propose novel methods that simultaneously compress and rectify co-occurrence statistics, scaling gracefully with the size of the vocabulary and the dimension of the latent space. We also present new algorithms for learning latent variables from the compressed statistics, and verify that our methods perform comparably to previous approaches on both textual and non-textual data.
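To make "co-occurrence statistics" concrete, here is a minimal sketch of how a joint co-occurrence table might be built from tokenized documents. It is an illustration only: the function name `cooccurrence`, the toy documents, and the pair-counting convention are assumptions, not the paper's construction, and the sketch performs no rectification or compression.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(docs):
    """Return normalized joint co-occurrence frequencies.

    Hypothetical helper for illustration: `docs` is a list of token lists,
    and the result maps (word_i, word_j) to a frequency that sums to 1.
    """
    counts = Counter()
    for doc in docs:
        # Every pair of distinct token positions in a document is counted
        # in both orders, so the resulting table is symmetric.
        for a, b in combinations(doc, 2):
            counts[(a, b)] += 1
            counts[(b, a)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: c / total for pair, c in counts.items()}


# Toy corpus (made up for this example).
docs = [
    ["topic", "model", "inference"],
    ["topic", "model"],
    ["rare", "inference"],
]
C = cooccurrence(docs)
print(C[("topic", "model")])
```

A dense version of this table has one entry per ordered word pair, so its size grows quadratically with the vocabulary. Rare words such as `"rare"` above contribute only a handful of counts, so their entries are noisy estimates.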
