ISBN-13: 9786209589720 / English / Paperback / 2026 / 60 pp.
This book is a hands-on guide that focuses exclusively on practical implementations and executable solutions using Hadoop and its ecosystem. It is designed as a laboratory-oriented resource where readers learn Big Data technologies by directly performing configurations, commands, and analytics tasks.

The book begins with setting up and installing Hadoop in both pseudo-distributed and fully distributed operating modes. Readers then implement essential HDFS file management tasks, including adding files and directories, retrieving data from HDFS, and deleting files. These exercises help users become comfortable working with distributed storage systems.

The book provides multiple MapReduce implementations, starting with a Word Count solution to calculate the number of occurrences of each word in large input files. It further includes programs for keyword-based word search and stop word elimination from large text datasets. A dedicated section focuses on weather data mining using MapReduce, where programs are implemented to compute average, maximum, and minimum temperatures for each year and to filter records based on temperature thresholds.