
A Corpus Investigation on the Journal of Social Sciences of the Turkic World

Authors :
Yilmaz, Isa
Source :
Universal Journal of Educational Research. 2018 6(6):1199-1206.
Publication Year :
2018

Abstract

In recent years, computer technologies have developed rapidly and data have become easier to access. Today, storing documents, or data in general, and transferring them to interested parties are ordinary tasks. The number of stored documents has also grown quickly, and this growth has called for new technologies that build knowledge from large data sets. Basic applications of text mining include gathering and processing text to extract the information contained in raw data. Such applications can therefore help researchers reach valuable knowledge from a mass of documents. This study investigated academic articles published in "bilig" ("Journal of Social Sciences of the Turkic World") between 1996 and 2017 to find the frequencies of words and letters used in academic Turkish. Basic text mining of 4,850,817 words in 19,437 pages from 81 "bilig" issues was completed using Zemberek, a natural language processing library, and R, a programming language.
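
The word and letter frequency counts described in the abstract can be illustrated with a minimal sketch. This is not the study's pipeline, which used Zemberek and R. It is a Python standard-library approximation, and the names `turkish_lower`, `frequencies`, and `sample` are hypothetical. It assumes plain-text input, treats any run of Unicode letters as a word, and performs no morphological analysis.

```python
import re
from collections import Counter


def turkish_lower(text: str) -> str:
    """Lowercase text using Turkish casing rules (assumed helper).

    Turkish pairs dotted İ with i and dotless I with ı, which the
    default str.lower() does not handle, so both are mapped first.
    """
    return text.replace("İ", "i").replace("I", "ı").lower()


def frequencies(text: str) -> tuple[Counter, Counter]:
    """Return (word counts, letter counts) for the given text."""
    lowered = turkish_lower(text)
    # A "word" here is any run of Unicode letters; digits,
    # underscores, and punctuation act as separators.
    words = re.findall(r"[^\W\d_]+", lowered)
    word_counts = Counter(words)
    # Letters are counted only inside the extracted words.
    letter_counts = Counter(ch for word in words for ch in word)
    return word_counts, letter_counts


# Made-up sample sentence, not data from the study.
sample = "Bilgi ve bilim; BİLGİ ve kültür."
word_counts, letter_counts = frequencies(sample)
print(word_counts.most_common(3))
print(letter_counts.most_common(3))
```

A corpus-scale run would read each issue's text from disk and sum the resulting counters. A morphological analyzer such as Zemberek would additionally allow counts by word stem rather than by surface form.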

Details

Language :
English
ISSN :
2332-3205
Volume :
6
Issue :
6
Database :
ERIC
Journal :
Universal Journal of Educational Research
Publication Type :
Academic Journal
Accession number :
EJ1181202
Document Type :
Journal Articles; Reports - Research; Information Analyses