Back to Search Start Over

Opportunities and challenges of text mining in materials research

Authors :
Gerbrand Ceder
Haoyan Huo
Olga Kononova
Tanjin He
Elsa Olivetti
Amalie Trewartha
Source :
iScience, Vol 24, Iss 3, Pp 102155-(2021), iScience, vol 24, iss 3, iScience
Publication Year :
2021
Publisher :
Elsevier, 2021.

Abstract

Research publications are the major repository of scientific knowledge. However, their unstructured and highly heterogenous format creates a significant obstacle to large-scale analysis of the information contained within. Recent progress in natural language processing (NLP) has provided a variety of tools for high-quality information extraction from unstructured text. These tools are primarily trained on non-technical text and struggle to produce accurate results when applied to scientific text, involving specific technical terminology. During the last years, significant efforts in information retrieval have been made for biomedical and biochemical publications. For materials science, text mining (TM) methodology is still at the dawn of its development. In this review, we survey the recent progress in creating and applying TM and NLP approaches to materials science field. This review is directed at the broad class of researchers aiming to learn the fundamentals of TM as applied to the materials science publications.<br />Graphical Abstract<br />Data Analysis; Computing Methodology; Computational Materials Science; Materials Design

Details

Language :
English
ISSN :
25890042
Volume :
24
Issue :
3
Database :
OpenAIRE
Journal :
iScience
Accession number :
edsair.doi.dedup.....cc91e86506edc60397cd8585eb638b7f