An original template solution for FAIR scientific text mining

Authors :
Niels A. Zondervan
Frazen Tolentino-Zondervan
Source :
MethodsX, Vol 10, Iss , Pp 102145- (2023)
Publication Year :
2023
Publisher :
Elsevier, 2023.

Abstract

This method paper presents a template solution for text mining of scientific literature using the R tm package. Literature to be analyzed can be collected manually or automatically using the code provided with this paper. Once the literature is collected, text mining proceeds in three steps: loading and cleaning of text from articles; processing, statistical analysis, and clustering; and presentation of results using generalized and tailor-made visualizations. The text mining steps can be applied to a single, multiple, or time series groups of documents. References are provided to three published peer-reviewed articles that use the presented text mining methodology. The main advantages of our method are: (1) its suitability for both research and educational purposes, (2) its compliance with the Findable, Accessible, Interoperable, and Reusable (FAIR) principles, and (3) the availability of code and example data on GitHub under the open-source Apache 2.0 license.
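The three steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the paper uses the R tm package, whereas the sketch below uses only the Python standard library. The stop-word list, the toy documents, and all function names (`clean`, `term_matrix`, `cosine`, `nearest`, `report`) are assumptions made for illustration, and the nearest-neighbour pairing is a crude stand-in for the clustering the abstract mentions.

```python
import math
import re
from collections import Counter

# Hypothetical toy stop-word list; a real pipeline would use a much longer one.
STOPWORDS = {"the", "of", "and", "a", "in", "to", "is", "for", "on", "are", "by"}


def clean(text):
    """Step 1: lowercase, keep alphabetic tokens only, drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]


def term_matrix(docs):
    """Step 2a: build one term-frequency Counter per document."""
    return {name: Counter(clean(text)) for name, text in docs.items()}


def cosine(a, b):
    """Step 2b: cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def nearest(matrix):
    """Step 2c: pair each document with its most similar neighbour
    (a simplified stand-in for clustering)."""
    return {
        name: max(
            (other for other in matrix if other != name),
            key=lambda other: cosine(matrix[name], matrix[other]),
        )
        for name in matrix
    }


def report(matrix, k=3):
    """Step 3: plain-text summary of the k most frequent terms per document."""
    lines = []
    for name, counts in matrix.items():
        top = ", ".join(f"{term} ({n})" for term, n in counts.most_common(k))
        lines.append(f"{name}: {top}")
    return "\n".join(lines)


# Invented example texts, used only to exercise the functions above.
DOCS = {
    "paper_a": "Text mining of fisheries literature. Mining text reveals fisheries trends.",
    "paper_b": "Fisheries policy and fisheries management in the literature.",
    "paper_c": "Protein folding and protein structure prediction.",
}

if __name__ == "__main__":
    matrix = term_matrix(DOCS)
    print(report(matrix))
    print(nearest(matrix))
```

Keeping each step in its own function mirrors the load, process, present separation described in the abstract, so any one stage could be swapped out (for example, replacing `report` with a plot) without touching the others.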

Details

Language :
English
ISSN :
22150161
Volume :
10
Issue :
102145-
Database :
Directory of Open Access Journals
Journal :
MethodsX
Publication Type :
Academic Journal
Accession number :
edsdoj.603e2080bf045f5b8c7f7d07bea6145
Document Type :
article
Full Text :
https://doi.org/10.1016/j.mex.2023.102145