Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Computation Checkpointing & Migration - Vipin Chaudhary, Hai Jiang, John Paul N Walters

Computation Checkpointing & Migration

Buch | Hardcover
141 Seiten
2010
Nova Science Publishers Inc (Verlag)
9781607418405 (ISBN)
CHF 286,10 inkl. MwSt
  • Versand in 10-20 Tagen
  • Versandkostenfrei
  • Auch auf Rechnung
  • Artikel merken
Computational clusters have long provided a mechanism for the acceleration of high performance computing (HPC) applications. This book addresses the issue of fault-tolerance through checkpointing. It presents a general overview of checkpointing and how it's implemented.
Computational clusters have long provided a mechanism for the acceleration of high performance computing (HPC) applications. With today's supercomputers now exceeding the petaflop scale, however, they are also exhibiting an increase in heterogeneity. Thisheterogeneity spans a range of technologies, from multiple operating systems to hardware accelerators and novel architectures. Because of the exceptional acceleration some of these heterogeneous architectures provide, they are being embraced as viable tools for HPC applications. Given the scale of today's supercomputers, it is clear that scientists must consider the use of fault-tolerance in their applications. This is particularly true as computational clusters with hundreds and thousands of processors become ubiquitous in large-scale scientific computing, leading to lower mean-times-to-failure. This forces the systems to effectively deal with the possibility of arbitrary and unexpected node failure. In this book the address the issue of fault-tolerance via checkpointing. They discuss the existing strategies to provide rollback recovery to applications -- both via MPI at the user level and through application-level techniques. Checkpointing itself has been studied extensively in the literature, including the authors' own works. Here they give a general overview of checkpointing and how it's implemented. More importantly, they describe strategies to improve the performance of checkpointing, particularly in the case of distributed systems.

Introduction; Background on Checkpointing; Thread States; Compile-time Support; Run-time Support; Performance Analysis; Microbenchmarks; Experimental Results; Summary; Migration Safety; Pointer Representations in C; Microbenchmarks; Experimental Results; Related Research .; Summary; Heterogeneity Support; Data Representations in Heterogeneous Environments; Data Conversion Issues; Coarse-grained Tagged RMR in MigThread; The Compile Time Support Module; The Run-time Support Module; Complexity Analysis; Microbenchmarks; Experimental Results; Related Research; Summary; User-Lever Checkpoint/Fault Tolerance; User-level LAM Checkpointing with Arbitrary Restart Structure; Checkpoint Storage, Resilience, and Performance; HPC and Virtualization; Virtualization within HPC; Virtualization Background; Performance Results; Fault Tolerant OpenVZ; Checkpointing/Restart System Design; Checkpoint/Replication Analysis; Performance Results; Index.

Erscheint lt. Verlag 1.7.2010
Zusatzinfo Illustrations
Verlagsort New York
Sprache englisch
Maße 260 x 180 mm
Gewicht 408 g
Themenwelt Mathematik / Informatik Informatik Theorie / Studium
ISBN-13 9781607418405 / 9781607418405
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
was jeder über Informatik wissen sollte

von Timm Eichstädt; Stefan Spieker

Buch | Softcover (2024)
Springer Vieweg (Verlag)
CHF 53,15
Grundlagen – Anwendungen – Perspektiven

von Matthias Homeister

Buch | Softcover (2022)
Springer Vieweg (Verlag)
CHF 53,15
Teil 2 der gestreckten Abschlussprüfung Fachinformatiker/-in …

von Dirk Hardy; Annette Schellenberg; Achim Stiefel

Buch | Softcover (2025)
Europa-Lehrmittel (Verlag)
CHF 37,90