ISBN-13: 9786202524384 / Angielski / Miękka / 2020 / 88 str.
The aim of this book is to understand: Types of Digital Data, Introduction to Big Data, Big Data Analytics, History of Hadoop, Apache Hadoop, Analysing Data with Unix tools, Analysing Data with Hadoop, Hadoop Streaming, Hadoop Echo System, IBM Big Data Strategy, Introduction to Info-sphere, Big Insights and Big Sheets. HDFS (Hadoop Distributed File System). The Design of HDFS, HDFS Concepts, Command Line Interface, Hadoop file system interfaces, Data flow, Data Ingest with Flume and Scoop and Hadoop archives, Hadoop I/ O: Compression, Serialization, Avro and File-Based Data structures.