Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Oct 16, 2019

Tobias Weber, Dieter Kranzlmüller, Michael Fromm, Nelson Tavares de Sousa

Figure 1 for Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Figure 2 for Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Figure 3 for Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Figure 4 for Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Share this with someone who'll enjoy it:

Abstract:Automated classification of metadata of research data by their discipline(s) of research can be used in scientometric research, by repository service providers, and in the context of research data aggregation services. Openly available metadata of the DataCite index for research data were used to compile a large training and evaluation set comprised of 609,524 records, which is published alongside this paper. These data allow to reproducibly assess classification approaches, such as tree-based models and neural networks. According to our experiments with 20 base classes (multi-label classification), multi-layer perceptron models perform best with a f1-macro score of 0.760 closely followed by Long Short-Term Memory models (f1-macro score of 0.755). A possible application of the trained classification models is the quantitative analysis of trends towards interdisciplinarity of digital scholarly output or the characterization of growth patterns of research data, stratified by discipline of research. Both applications perform at scale with the proposed models which are available for re-use.

View paper on

Share this with someone who'll enjoy it:

Title:Using Supervised Learning to Classify Metadata of Research Data by Discipline of Research

Paper and Code