Repository logo
Article

A novel adaptive checkpointing method based on information obtained from workflow structure

creativeworkseries.issn1508-2806
dc.contributor.authorKail, Eszter
dc.contributor.authorKacsuk, Péter
dc.contributor.authorKozlovszky, Miklós
dc.date.available2017-09-21T08:02:08Z
dc.date.issued2016
dc.descriptionBibliogr. s. 404-405.
dc.description.abstractScientific workflows are data- and compute-intensive, thus, they may run for days or even weeks on parallel and distributed infrastructures such as grids, supercomputers, and clouds. In these high-performance computing infrastructures, the number of failures that can arise during scientific-workflow enact- ment can be high, so the use of fault-tolerance techniques is unavoidable. The most-frequently used fault-tolerance technique is taking checkpoints from time to time, when failure is detected, the last consistent state is restored. One of the most-critical factors that has great impact on the effectiveness of the checkpointing method is the checkpointing interval. In this work, we propose a Static (Wsb) and an Adaptive (AWsb) Workflow Structure Based checkpointing algorithm. Our results showed that, compared to the optimal checkpointing strategy, the static algorithm may decrease the checkpointing overhead by as much as 33% without affecting the total processing time of workflow execution. The adaptive algorithm may further decrease this overhead while keeping the overall processing time at its necessary minimum.en
dc.description.placeOfPublicationKraków
dc.description.versionwersja wydawniczapl
dc.identifier.doihttps://doi.org/10.7494/csci.2016.17.3.387
dc.identifier.eissn2300-7036
dc.identifier.issn1508-2806
dc.identifier.nukatdd2016315127pl
dc.identifier.urihttps://repo.agh.edu.pl/handle/AGH/49505
dc.language.isoeng
dc.publisherWydawnictwa AGH
dc.relation.ispartofComputer Science
dc.rightsAttribution 4.0 International
dc.rights.accessotwarty dostęp
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/legalcode
dc.subjectscientific workflowen
dc.subjectcheckpointen
dc.subjectdynamic executionen
dc.titleA novel adaptive checkpointing method based on information obtained from workflow structureen
dc.title.relatedComputer Science
dc.typeartykuł
dspace.entity.typePublication
publicationissue.issueNumberNo. 3
publicationissue.paginationpp. 387-406
publicationvolume.volumeNumberVol. 17
relation.isJournalIssueOfPublicationae079d21-80ca-47d5-91b9-3bc482304137
relation.isJournalIssueOfPublication.latestForDiscoveryae079d21-80ca-47d5-91b9-3bc482304137
relation.isJournalOfPublication020291ee-249b-4dcf-98a3-276a2f7981aa

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
csci.2016.17.3.387.pdf
Size:
1.03 MB
Format:
Adobe Portable Document Format