This book addresses the significant challenge in designing applications to effectively use high-performance computing architectures and offers methods to more effectively restructure applications to take advantage of these powerful nodes. The authors discuss the benefit of vectorization, providing examples run on current accelerators. They also explore issues with data locality when threading on the nodes and examine the accelerator in detail. The book describes programming environments for hybrid multicore systems, from the low level OpenCL and Cuda languages to new directive-based...
This book addresses the significant challenge in designing applications to effectively use high-performance computing architectures and offers meth...