Back to Search Start Over

Automated News Classification using N-gram Model and Key Features of Nepali Language

Authors :
Rupesh Dahi Shrestha
Dinesh Dangol
Arun K. Timalsina
Source :
SCITECH Nepal. 13:64-69
Publication Year :
2018
Publisher :
Nepal Journals Online (JOL), 2018.

Abstract

With an increasing trend of publishing news online on website, automatic text processing becomes more and more important. Automatic text classification has been a focus of many researchers in different languages for decades. There is a huge amount of research repository on features of English language and their uses on automated text processing. This research implements Nepali language key features for automatic text classification of Nepali news. In particular, the study on impact of Nepali language based features, which are extremely different than English language is more challenging because of the higher level of complexity to be resolved. The research experiment using vector space model, n-gram model and key feature based processing specific to Nepali language shows promising result compared to bag-of-words model for the task of automated Nepali news classification.

Details

ISSN :
20911742
Volume :
13
Database :
OpenAIRE
Journal :
SCITECH Nepal
Accession number :
edsair.doi...........6cf0f36f465a1e4722be4d1de82b73b6