Optimizing HPC Applications with Intel Cluster Tools

Hunting Petaflops

by Alexander Supalov, Andrey Semin, Michael Klemm, Christopher Dahnken

DescriptionTable of ContentsDetailsHashtagsReport an issue

Book Description

Optimizing HPC Applications with Intel Cluster Tools takes the reader on a tour of the fast-growing area of high performance computing and the optimization of hybrid programs. These programs typically combine distributed memory and shared memory programming models and use the Message Passing Interface (MPI) and OpenMP for multi-threading to achieve the ultimate goal of high performance at low power consumption on enterprise-class workstations and compute clusters.

The book focuses on optimization for clusters consisting of the Intel Xeon processor, but the optimization methodologies also apply to the Intel Xeon Phi coprocessor and heterogeneous clusters mixing both architectures. Besides the tutorial and reference content, the authors address and refute many myths and misconceptions surrounding the topic. The text is augmented and enriched by descriptions of real-life situations.

This open book is licensed under a Creative Commons License (CC BY-NC-ND). You can download Optimizing HPC Applications with Intel Cluster Tools ebook for free in PDF format (6.6 MB).

Table of Contents

Chapter 1
No Time to Read This Book?
Chapter 2
Overview of Platform Architectures
Chapter 3
Top-Down Software Optimization
Chapter 4
Addressing System Bottlenecks
Chapter 5
Addressing Application Bottlenecks: Distributed Memory
Chapter 6
Addressing Application Bottlenecks: Shared Memory
Chapter 7
Addressing Application Bottlenecks: Microarchitecture
Chapter 8
Application Design Considerations

Book Details

Computer Science
ISBN13 Digital
ISBN10 Digital
PDF Size
6.6 MB

Related Books

Docker Succinctly
Containers have revolutionized software development, allowing developers to bundle their applications with everything they need, from the operating system up, into a single package. Docker is one of the most popular platforms for containers, allowing them to be hosted on-premises or on the cloud, and to run on Linux, Windows, and Mac machines. With...
Android on x86
Android on x86: an Introduction to Optimizing for Intel® Architecture serves two main purposes. First, it makes the case for adapting your applications onto Intel's x86 architecture, including discussions of the business potential, the changing landscape of the Android marketplace, and the unique challenges and opportunities that arise from x86 de...
Containerized Docker Application Lifecycle with Microsoft Platform and Tools
This book provides end-to-end guidance on the Docker application development lifecycle with Microsoft tools and services while providing an introduction to Docker development concepts for readers who might be new to the Docker ecosystem. This way, anyone can understand the global picture and start planning development projects based on Docker and M...
Platform Embedded Security Technology Revealed
Platform Embedded Security Technology Revealed is an in-depth introduction to Intel's platform embedded solution: the security and management engine. The engine is shipped inside most Intel platforms for servers, personal computers, tablets, and smartphones. The engine realizes advanced security and management functionalities and protects applicati...
Deep Learning with JavaScript
Deep learning has transformed the fields of computer vision, image processing, and natural language applications. Thanks to TensorFlow.js, now JavaScript developers can build deep learning apps without relying on Python or R. Deep Learning with JavaScript shows developers how they can bring DL technology to the web. Written by the main authors of t...
CouchDB: The Definitive Guide
Three of CouchDB's creators show you how to use this document-oriented database as a standalone application framework or with high-volume, distributed applications. With its simple model for storing, processing, and accessing data, CouchDB is ideal for web applications that handle huge amounts of loosely structured data. That alone would stretch th...