" an excellent primer on the subject of data mining with an accessible introduction to the fundamental and advanced data mining technologies." (
Journal of Electronic Imaging, January–March 2006)
"Applied statisticians and probabilists will like this book very much." (Journal of Statistical Computation and Simulation, November 2005)
" the book is an impressive and broad overview...a general roadmap of what methods are available and where to look." (Journal of Intelligent & Fuzzy Systems, Vol. 16, No. 2, 2005)
"This readable survey describes multimedia, soft computing, and bioinformatics strategies for a number of data types " (Business Horizons, September– October 2004)
" an accessible introduction to fundamental and advanced data mining technologies. It will be an excellent book for both beginners and professionals." (Computing Reviews.com, April 20, 2004)
"Overall, this is a nice, easy–to–read book for those already working in the area of data mining." (Technometrics, August 2004, Vol. 46, No. 3)
Preface.
1. Introduction to Data Mining.
2. Soft Computing.
3. Multimedia Data Compression.
4. String Matching.
5. Classification in Data Mining.
6. Clustering in Data Mining.
7. Association Rules.
8. Rule Mining with Soft Computing.
9. Multimedia Data Mining.
10. Bioinformatics: An Application.
Index.
About the Authors.
SUSHMITA MITRA, PHD, is a Professor at Machine Intelligence Unit, Indian Statistical Institute, in Calcutta. She is a coauthor of
Neuro–Fuzzy Pattern Recognition: Methods in Soft Computing, also published by Wiley.
TINKU ACHARYA, PHD, Senior Executive vice president and Chief Science Officer of Avisere Inc., Tucson, Arizona, is involved in multimedia data mining applications. He is also an adjunct professor in the Department of Electrical Engineering at Arizona State University. He was recognized as the Most Prolific Inventor of Intel Corporation Worldwide in 1999.
A primer on traditional hard and emerging soft computing approaches for mining multimedia data
While the digital revolution has made huge volumes of high dimensional multimedia data available, it has also challenged users to extract the information they seek from heretofore unthinkably huge datasets. Traditional hard computing data mining techniques have concentrated on flat–file applications. Soft computing tools such as fuzzy sets, artificial neural networks, genetic algorithms, and rough sets however, offer the opportunity to apply a wide range of data types to a variety of vital functions by handling real–life uncertainty with low–cost solutions. Data Mining: Multimedia, Soft Computing, and Bioinformatics provides an accessible introduction to fundamental and advanced data mining technologies.
This readable survey describes data mining strategies for a slew of data types, including numeric and alpha–numeric formats, text, images, video, graphics, and the mixed representations therein. Along with traditional concepts and functions of data mining like classification, clustering, and rule mining the authors highlight topical issues in multimedia applications and bioinformatics. Principal topics discussed throughout the text include:
The role of soft computing and its principles in data mining
Principles and classical algorithms on string matching and their role in data (mainly text) mining
Data compression principles for both lossless and lossy techniques, including their scope in data mining
Access of data using matching pursuits both in raw and compressed data domains