Fault Tolerance in Computational Grids

dc.contributor.authorChopra, Inderpreet
dc.contributor.supervisorChana, Inderveer
dc.date.accessioned2007-03-08T07:16:27Z
dc.date.available2007-03-08T07:16:27Z
dc.date.issued2007-03-08T07:16:27Z
dc.description.abstractThe Grid is rapidly emerging as the means for coordinated resource sharing and problem solving in multi-institutional virtual organizations while providing dependable, consistent, pervasive access to global resources. The emergence of computational Grids and the potential for seamless aggregation and interactions between distributed services and resources, has led to the start of new era of computing. Tremendously large number and the heterogeneous nature of grid computing resource make the resource management a significantly challenging job. Resource management scenarios often include resource discovery, resource monitoring, resource inventories, resource provisioning, fault isolation, variety of autonomic capabilities and service level management activities. Out of these scenarios, fault tolerance is one of the main research areas. The probability of fault occurrence increases, as the number of resources involved in grid increases. Till today there is no system that can be fully fault tolerant. In this research our main focus is on the development of fault tolerance system for computational grids. For this we had setup a computational grid based on the Alchemi middleware. Alchemi is a .NET-based grid computing framework that provides the runtime machinery and programming environment required to construct computational grid. After setting up grid environment, we have studied existing fault tolerance in Alchemi in detail, and have ascertained the frequent causes of failures in it. To deal with some of the identified deficiencies we have proposed backup manager concept. Backup manager uses the heartbeating and replication based fault tolerant technique to monitor the central manager. In case of failure of the central manager, backup manager will take its control and avoids the grid to fail.en
dc.description.sponsorshipComputer Science & Engineering Department Thapar Institute of Engineering & Technology, Patiala.en
dc.format.extent869985 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/123456789/154
dc.language.isoenen
dc.subjectFault Toleranceen
dc.subjectOpen Gridservices Architectureen
dc.subjectGrid Computingen
dc.subjectKnids of Failuresen
dc.titleFault Tolerance in Computational Gridsen
dc.typeThesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
154.pdf
Size:
842.02 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: