ISBN-13: 9783659780394 / Angielski
About this Book: This book covers basic concepts of document pre-processing which are necessary for multistage document clustering. The practical knowledge of these concepts is required for the implementation of document clustering. This book describes the various document clustering methods and examines the similarity measure for document clustering. It also examines the few major factors for designing multistage document clustering using Rough set theory. It introduces the various document preprocessing & feature selection techniques relevant the proposed research work and also discusses the various stages required for clustering and its applicability to feature reduction. Therefore the elaboration of basic document preprocessing concept using rough set is provided in this book. It is written for the under graduate, post graduate students of Computer Science and Engineering and Information Technology. Document Preprocessing for any input text document. Feature Extraction techniques used for multistage document clustering. Dimensionality Reduction as Rough set approach is applied for document clustering. Rough Set ensemble is used for multistage document clustering