Introduction to HPC with Mpi for Data Science » książka

zaloguj się | załóż konto

topmenu

Szukaj

Książki na zamówienie

Wyszukiwanie zaawansowane

Pusty koszyk

Bezpłatna dostawa dla zamówień powyżej 40 zł

Kategorie główne

• Nauka

[2950464]

• Literatura piękna

[1818042]

więcej...

Kategorie szczegółowe BISAC

Introduction to HPC with Mpi for Data Science

ISBN-13: 9783319219028 / Angielski / Miękka / 2016 / 282 str.

Frank Nielsen

Introduction to HPC with Mpi for Data Science

ISBN-13: 9783319219028 / Angielski / Miękka / 2016 / 282 str.

Frank Nielsen

cena 160,99
(netto: 153,32 VAT: 5%)

Najniższa cena z 30 dni: 154,18

Termin realizacji zamówienia:
ok. 16-18 dni roboczych.

Darmowa dostawa!

This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions.Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters.In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework.In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems.Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book.

Kategorie:

Informatyka, Bazy danych

Kategorie BISAC:

Computers > Programming - Object Oriented
Computers > Data Science - Data Analytics

Wydawca:

Springer

Seria wydawnicza:

Undergraduate Topics in Computer Science

Język:

Angielski

ISBN-13:

9783319219028

Rok wydania:

2016

Wydanie:

2016

Numer serii:

000343004

Ilość stron:

282

Waga:

0.51 kg

Wymiary:

15.9 x 24.0 x 1.9

Oprawa:

Miękka

Wolumenów:

Dodatkowe informacje:

Bibliografia
Wydanie ilustrowane

Preface.- Part 1: High Performance Computing (HPC) with the Message Passing Interface (MPI).- A Glance at High Performance Computing (HPC).- Introduction to MPI: The Message Passing Interface.- Topology of Interconnection Networks.- Parallel Sorting.- Parallel Linear Algebra.-The MapReduce Paradigm.- Part 11: High Performance Computing for Data Science.- Partition-based Clustering with k means.- Hierarchical Clustering.- Supervised Learning: Practice and Theory of Classification with k NN rule.- Fast Approximate Optimization to High Dimensions with Core-sets and Fast Dimension Reduction.- Parallel Algorithms for Graphs.- Appendix A: Written Exam.- Appendix B: SLURM: A resource manager and job scheduler on clusters of machines.- Appendix C: List of Figures.- Appendix D: List of Tables.- Appendix E: Index.

Frank Nielsen is a Professor at École Polytechnique in France where he teaches graduate (vision/graphics) and undergraduate (Java/algorithms),and a senior researcher at Sony Computer Science Laboratories Inc. His research includes Computational information geometry for imaging and learning and he is the author of 3 textbooks and 3 edited books. He is also on the Editorial Board for the Springer Journal of Mathematical Imaging and Vision.

Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters.

In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework.

In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems.

Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book.

Nielsen, Frank Frank Nielsen (Japan) is a technical director and ... więcej >

Krainaksiazek.pl w programie rzetelna firma

Krainaksiaze.pl - płatności przez paypal

Czytaj nas na:

Zobacz:

1997-2026 DolnySlask.com Agencja Internetowa