Kausik Hira

Reconstructing Materials Tetrahedron: Challenges in Materials Information Extraction

Oct 12, 2023

Kausik Hira, Mohd Zaki, Dhruvil Sheth, Mausam, N M Anoop Krishnan

Abstract: Discovery of new materials has a documented history of propelling human progress for centuries and more. The behaviour of a material is a function of its composition, structure, and properties, which further depend on its processing and testing conditions. Recent developments in deep learning and natural language processing have enabled information extraction at scale from published literature such as peer-reviewed publications, books, and patents. However, this information is spread across multiple formats, such as tables, text, and images, with little or no uniformity in reporting style, giving rise to several machine learning challenges. Here, we discuss, quantify, and document these outstanding challenges in automated information extraction (IE) from materials science literature towards the creation of a large materials science knowledge base. Specifically, we focus on IE from text and tables and outline several challenges with examples. We hope the present work inspires researchers to address the challenges in a coherent fashion, providing a fillip to IE for the materials knowledge base.
