Back to Search
Start Over
Opportunities and challenges of text mining in materials research
- Source :
- iScience, Vol 24, Iss 3, Pp 102155-(2021), iScience, vol 24, iss 3, iScience
- Publication Year :
- 2021
- Publisher :
- Elsevier, 2021.
-
Abstract
- Research publications are the major repository of scientific knowledge. However, their unstructured and highly heterogenous format creates a significant obstacle to large-scale analysis of the information contained within. Recent progress in natural language processing (NLP) has provided a variety of tools for high-quality information extraction from unstructured text. These tools are primarily trained on non-technical text and struggle to produce accurate results when applied to scientific text, involving specific technical terminology. During the last years, significant efforts in information retrieval have been made for biomedical and biochemical publications. For materials science, text mining (TM) methodology is still at the dawn of its development. In this review, we survey the recent progress in creating and applying TM and NLP approaches to materials science field. This review is directed at the broad class of researchers aiming to learn the fundamentals of TM as applied to the materials science publications.<br />Graphical Abstract<br />Data Analysis; Computing Methodology; Computational Materials Science; Materials Design
- Subjects :
- 0301 basic medicine
Data Analysis
Sociology of scientific knowledge
Computer science
Review
02 engineering and technology
Materials design
computer.software_genre
Field (computer science)
Terminology
03 medical and health sciences
Text mining
Materials Design
lcsh:Science
Class (computer programming)
Multidisciplinary
business.industry
021001 nanoscience & nanotechnology
Data science
Variety (cybernetics)
Information extraction
030104 developmental biology
Computational Materials Science
lcsh:Q
0210 nano-technology
business
Computing Methodology
computer
Subjects
Details
- Language :
- English
- ISSN :
- 25890042
- Volume :
- 24
- Issue :
- 3
- Database :
- OpenAIRE
- Journal :
- iScience
- Accession number :
- edsair.doi.dedup.....cc91e86506edc60397cd8585eb638b7f