Construction of English and American Literature Corpus Based on Machine Learning Algorithm.

Authors :: Dai, Qian
Source :: Computational Intelligence & Neuroscience; 6/2/2022, p1-9, 9p
Publication Year :: 2022
Abstract: In China, the application of corpus in language teaching, especially in English and American literature teaching, is still in the preliminary research stage, and there are various shortcomings, which have not been paid due attention by front-line educators. Constructing English and American literature corpus according to certain principles can effectively promote English and American literature teaching. The research of this paper is devoted to how to automatically build a corpus of English and American literature. In the process of keyword extraction, key phrases and keywords are effectively combined. The similarity between atomic events is calculated by the TextRank algorithm, and then the first N sentences with high similarity are selected and sorted. Based on ML (machine learning) text classification method, a combined classifier based on SVM (support vector machine) and NB (Naive Bayes) is proposed. The experimental results show that, from the point of view of accuracy and recall, the classification effect of the combined algorithm proposed in this paper is the best among the three methods. The best classification results of accuracy, recall, and F value are 0.87, 0.9, and 0.89, respectively. Experimental results show that this method can quickly, accurately, and persistently obtain high-quality bilingual mixed web pages. [ABSTRACT FROM AUTHOR]
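The abstract's sentence-selection step (score by similarity, keep the first N) can be illustrated with a minimal TextRank sketch. This is not the author's implementation: the abstract gives no detail on how "atomic events" are represented, so the sketch assumes plain sentences, a word-overlap similarity in the style of the original TextRank formulation, and a damping factor of 0.85. The function names `tokenize`, `similarity`, `textrank`, and `top_n` are hypothetical.

```python
import math
import re
from itertools import combinations


def tokenize(sentence):
    # Assumption: lowercase word sets stand in for the paper's "atomic events".
    return set(re.findall(r"[a-z']+", sentence.lower()))


def similarity(a, b):
    # Word overlap normalised by log sentence lengths (assumed measure).
    wa, wb = tokenize(a), tokenize(b)
    if len(wa) < 2 or len(wb) < 2:
        return 0.0  # avoids log(1) + log(1) = 0 in the denominator
    return len(wa & wb) / (math.log(len(wa)) + math.log(len(wb)))


def textrank(sentences, damping=0.85, iterations=50):
    # Build a symmetric weighted graph over sentences.
    n = len(sentences)
    weights = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        weights[i][j] = weights[j][i] = similarity(sentences[i], sentences[j])
    out_sum = [sum(row) for row in weights]

    # Iterate the weighted PageRank update a fixed number of times.
    scores = [1.0] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n) if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


def top_n(sentences, n):
    # Return the n highest-scoring sentences, best first.
    scores = textrank(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in ranked[:n]]
```

In this sketch a sentence that shares no words with any other receives only the baseline score of `1 - damping`, so it falls to the bottom of the ranking.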

Subjects :: AMERICAN literature
AMERICAN English language
ENGLISH literature
BRITISH Americans
SUPPORT vector machines
MACHINE translating
MACHINE learning

Details

Language :: English
ISSN :: 16875265
Database :: Complementary Index
Journal :: Computational Intelligence & Neuroscience
Publication Type :: Academic Journal
Accession number :: 157216561
Full Text :: https://doi.org/10.1155/2022/9773452