Share

Analyzing and Evaluating the Resilience of Scheduling Scientific Applications on High Performance Computing Systems Using a Simulation-based Methodology

Download Analyzing and Evaluating the Resilience of Scheduling Scientific Applications on High Performance Computing Systems Using a Simulation-based Methodology PDF Online Free

Author :
Release : 2015
Genre :
Kind : eBook
Book Rating : /5 ( reviews)

GET EBOOK


Book Synopsis Analyzing and Evaluating the Resilience of Scheduling Scientific Applications on High Performance Computing Systems Using a Simulation-based Methodology by : Nitin Sukhija

Download or read book Analyzing and Evaluating the Resilience of Scheduling Scientific Applications on High Performance Computing Systems Using a Simulation-based Methodology written by Nitin Sukhija. This book was released on 2015. Available in PDF, EPUB and Kindle. Book excerpt: Large scale systems provide a powerful computing platform for solving large and complex scientific applications. However, the inherent complexity, heterogeneity, wide distribution, and dynamism of the computing environments can lead to performance degradation of the scientific applications executing on these computing systems. Load imbalance arising from a variety of sources such as application, algorithmic, and systemic variations is one of the major contributors to their performance degradation. In general, load balancing is achieved via scheduling. Moreover, frequently occurring resource failures drastically affect the execution of applications running on high performance computing systems. Therefore, the study of deploying support for integrated scheduling and fault-tolerance mechanisms for guaranteeing that applications deployed on computing systems are resilient to failures becomes of paramount importance. Recently, several research initiatives have started to address the issue of resilience. However, the major focus of these efforts was geared more toward achieving system level resilience with less emphasis on achieving resilience at the application level. Therefore, it is increasingly important to extend the concept of resilience to the scheduling techniques at the application level for establishing a holistic approach that addresses the performability of these applications on high performance computing systems. This can be achieved by developing a comprehensive modeling framework that can be used to evaluate the resiliency of such techniques on heterogeneous computing systems for assessing the impact of failures as well as workloads in an integrated way. This dissertation presents an experimental methodology based on discrete event simulation for the analysis and the evaluation of the resilience of scheduling scientific applications on high performance computing systems. With the aid of the methodology a wide class of dependencies existing between application and computing system are captured within a deterministic model for quantifying the performance impact expected from changes in application and system characteristics. Ideally, the results obtained by employing the proposed simulation-based performance prediction framework enabled an introspective design and investigation of scheduling heuristics to reason about how to best fully optimize various often antagonistic objectives, such as minimizing application makespan and maximizing reliability.

Scheduling in Parallel Computing Systems

Download Scheduling in Parallel Computing Systems PDF Online Free

Author :
Release : 2012-12-06
Genre : Computers
Kind : eBook
Book Rating : 653/5 ( reviews)

GET EBOOK


Book Synopsis Scheduling in Parallel Computing Systems by : Shaharuddin Salleh

Download or read book Scheduling in Parallel Computing Systems written by Shaharuddin Salleh. This book was released on 2012-12-06. Available in PDF, EPUB and Kindle. Book excerpt: Scheduling in Parallel Computing Systems: Fuzzy and Annealing Techniques advocates the viability of using fuzzy and annealing methods in solving scheduling problems for parallel computing systems. The book proposes new techniques for both static and dynamic scheduling, using emerging paradigms that are inspired by natural phenomena such as fuzzy logic, mean-field annealing, and simulated annealing. Systems that are designed using such techniques are often referred to in the literature as `intelligent' because of their capability to adapt to sudden changes in their environments. Moreover, most of these changes cannot be anticipated in advance or included in the original design of the system. Scheduling in Parallel Computing Systems: Fuzzy and Annealing Techniques provides results that prove such approaches can become viable alternatives to orthodox solutions to the scheduling problem, which are mostly based on heuristics. Although heuristics are robust and reliable when solving certain instances of the scheduling problem, they do not perform well when one needs to obtain solutions to general forms of the scheduling problem. On the other hand, techniques inspired by natural phenomena have been successfully applied for solving a wide range of combinatorial optimization problems (e.g. traveling salesman, graph partitioning). The success of these methods motivated their use in this book to solve scheduling problems that are known to be formidable combinatorial problems. Scheduling in Parallel Computing Systems: Fuzzy and Annealing Techniques is an excellent reference and may be used for advanced courses on the topic.

Scheduling Problems

Download Scheduling Problems PDF Online Free

Author :
Release : 2020-07-08
Genre : Computers
Kind : eBook
Book Rating : 533/5 ( reviews)

GET EBOOK


Book Synopsis Scheduling Problems by : Rodrigo Righi

Download or read book Scheduling Problems written by Rodrigo Righi. This book was released on 2020-07-08. Available in PDF, EPUB and Kindle. Book excerpt: Scheduling is defined as the process of assigning operations to resources over time to optimize a criterion. Problems with scheduling comprise both a set of resources and a set of a consumers. As such, managing scheduling problems involves managing the use of resources by several consumers. This book presents some new applications and trends related to task and data scheduling. In particular, chapters focus on data science, big data, high-performance computing, and Cloud computing environments. In addition, this book presents novel algorithms and literature reviews that will guide current and new researchers who work with load balancing, scheduling, and allocation problems.

Foundations of Real-Time Computing: Scheduling and Resource Management

Download Foundations of Real-Time Computing: Scheduling and Resource Management PDF Online Free

Author :
Release : 2012-12-22
Genre : Computers
Kind : eBook
Book Rating : 666/5 ( reviews)

GET EBOOK


Book Synopsis Foundations of Real-Time Computing: Scheduling and Resource Management by : André M. van Tilborg

Download or read book Foundations of Real-Time Computing: Scheduling and Resource Management written by André M. van Tilborg. This book was released on 2012-12-22. Available in PDF, EPUB and Kindle. Book excerpt: This volume contains a selection of papers that focus on the state-of the-art in real-time scheduling and resource management. Preliminary versions of these papers were presented at a workshop on the foundations of real-time computing sponsored by the Office of Naval Research in October, 1990 in Washington, D.C. A companion volume by the title Foundations of Real-Time Computing: Fonnal Specifications and Methods complements this book by addressing many of the most advanced approaches currently being investigated in the arena of formal specification and verification of real-time systems. Together, these two texts provide a comprehensive snapshot of current insights into the process of designing and building real-time computing systems on a scientific basis. Many of the papers in this book take care to define the notion of real-time system precisely, because it is often easy to misunderstand what is meant by that term. Different communities of researchers variously use the term real-time to refer to either very fast computing, or immediate on-line data acquisition, or deadline-driven computing. This text is concerned with the very difficult problems of scheduling tasks and resource management in computer systems whose performance is inextricably fused with the achievement of deadlines. Such systems have been enabled for a rapidly increasing set of diverse end-uses by the unremitting advances in computing power per constant-dollar cost and per constant-unit-volume of space. End-use applications of deadline-driven real-time computers span a spectrum that includes transportation systems, robotics and manufacturing, aerospace and defense, industrial process control, and telecommunications.

Resilience Assessment and Evaluation of Computing Systems

Download Resilience Assessment and Evaluation of Computing Systems PDF Online Free

Author :
Release : 2012-11-02
Genre : Computers
Kind : eBook
Book Rating : 329/5 ( reviews)

GET EBOOK


Book Synopsis Resilience Assessment and Evaluation of Computing Systems by : Katinka Wolter

Download or read book Resilience Assessment and Evaluation of Computing Systems written by Katinka Wolter. This book was released on 2012-11-02. Available in PDF, EPUB and Kindle. Book excerpt: The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems. Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques. The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.

You may also like...