
voxel2vec: A Natural Language Processing Approach to Learning Distributed Representations for Scientific Data

Jul 11, 2022

Xiangyang He, Yubo Tao, Shuoliu Yang, Haoran Dai, and Hai Lin


Abstract: Relationships in scientific data, such as the numerical and spatial distribution relations of features in univariate data, the scalar-value combinations' relations in multivariate data, and the association of volumes in time-varying and ensemble data, are intricate and complex. This paper presents voxel2vec, a novel unsupervised representation learning model, which is used to learn distributed representations of scalar values/scalar-value combinations in a low-dimensional vector space. Its basic assumption is that if two scalar values/scalar-value combinations have similar contexts, they usually have high similarity in terms of features. By representing scalar values/scalar-value combinations as symbols, voxel2vec learns the similarity between them in the context of spatial distribution and then allows us to explore the overall association between volumes by transfer prediction. We demonstrate the usefulness and effectiveness of voxel2vec by comparing it with the isosurface similarity map of univariate data and applying the learned distributed representations to feature classification for multivariate data and to association analysis for time-varying and ensemble data.

* Accepted by IEEE Transactions on Visualization and Computer Graphics (TVCG)
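
The abstract's core assumption (scalar values with similar spatial contexts tend to belong to similar features) can be illustrated with a small sketch. This is not the voxel2vec model: it swaps the learned embeddings for raw neighbor co-occurrence counts compared by cosine similarity, and every name in it (`quantize`, `cooccurrence`, `context_similarity`, `n_bins`, the face-adjacent neighborhood) is a hypothetical choice made for illustration rather than something taken from the paper.

```python
import numpy as np

def quantize(volume, n_bins):
    """Map each scalar value to an integer symbol in [0, n_bins)."""
    lo, hi = float(volume.min()), float(volume.max())
    if hi == lo:
        # A constant volume has only one symbol.
        return np.zeros(volume.shape, dtype=np.int64)
    scaled = (volume - lo) / (hi - lo) * n_bins
    # The maximum value would land in bin n_bins, so clip it into the last bin.
    return np.minimum(scaled.astype(np.int64), n_bins - 1)

def cooccurrence(symbols, n_bins):
    """Count how often each pair of symbols occurs in face-adjacent voxels."""
    counts = np.zeros((n_bins, n_bins), dtype=np.int64)
    for axis in range(symbols.ndim):
        moved = np.moveaxis(symbols, axis, 0)
        a = moved[:-1].ravel()  # each voxel ...
        b = moved[1:].ravel()   # ... and its next neighbor along this axis
        np.add.at(counts, (a, b), 1)
        np.add.at(counts, (b, a), 1)  # keep the counts symmetric
    return counts

def context_similarity(counts):
    """Cosine similarity between the context-count rows of the symbols."""
    rows = counts.astype(np.float64)
    norms = np.linalg.norm(rows, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # symbols that never occur keep an all-zero row
    unit = rows / norms
    return unit @ unit.T

# Usage on a synthetic volume (random data, for illustration only).
rng = np.random.default_rng(0)
volume = rng.random((8, 8, 8))
symbols = quantize(volume, n_bins=4)
similarity = context_similarity(cooccurrence(symbols, n_bins=4))
```

Entry `similarity[i, j]` is high when symbols `i` and `j` tend to appear next to the same neighbors. A trained embedding model would replace the count rows with learned low-dimensional vectors, but the comparison step would look much the same.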
