ISBN-13: 9786200785268 / English / Paperback / 2020 / 128 pp.
Document classification is one of the most important fields in natural language processing and text mining. Many algorithms can be used to perform this task, and most of them come from machine learning, such as Decision Tree, Support Vector Machine, K-Nearest Neighbors, and Naïve Bayes. These are the four most essential classification algorithms, and many researchers have tried to modify and improve them for text classification. In this book, our work is divided into two levels: (i) a comparative study of these four algorithms, and (ii) a study of how feature selection improves document classification, in which four feature selection methods are used and a new feature selection method is suggested.