Full Text:   <3129>

CLC number: O242

On-line Access: 2024-08-27

Received: 2023-10-17

Revision Accepted: 2024-05-08

Crosschecked: 2012-11-12

Cited: 0

Clicked: 7170

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Journal of Zhejiang University SCIENCE C 2012 Vol.13 No.12 P.891-900

http://doi.org/10.1631/jzus.C1200135


Optimizing checkpoint for scientific simulations


Author(s):  Xi-sheng Xiao, Ying-ping Huang, Xi-hui Zhang

Affiliation(s):  Economics & Management College, Southwest Jiaotong University, Chengdu 610031, China; more

Corresponding email(s):   davidshiau@qq.com, yhuang@una.edu, xzhang6@una.edu

Key Words:  Checkpoint, Long-running, Optimizing, Simulation



Abstract: 
It is extremely time-consuming to restart a long-running simulation from the beginning when a failure occurs. checkpointing is a viable solution that enables simulations to be resumed from the point of failure. We study three models to determine the optimal checkpoint interval between contiguous checkpoints so that the total execution time is minimized and we demonstrate that optimal checkpointing can facilitate self-optimizing. This study greatly advances our knowledge of and practice in optimizing long-running scientific simulations.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2025 Journal of Zhejiang University-SCIENCE