Academic Journal

Checkpointing strategies to tolerate non-memoryless failures on HPC platforms

Subjects: fault-tolerance; failure; non-memoryless

  • Source: ISSN: 2329-4949 ; ACM Transactions on Parallel Computing ; https://inria.hal.science/hal-04215283 ; ACM Transactions on Parallel Computing, In press, ⟨10.1145/3624560⟩.

Conference

Robustness of the Young/Daly formula for stochastic iterative applications

Subjects: Fault-tolerance; Checkpoint; Young/Daly formula
Location: Edmonton / Virtual, Canada

  • Source: ICPP 2020 - 49th International Conference on Parallel Processing ; https://inria.hal.science/hal-03024618 ; ICPP 2020 - 49th International Conference on Parallel Processing, Aug 2020, Edmonton /

Report

Robustness of the Young/Daly formula for stochastic iterative applications

Subjects: Fault-tolerance; Iterative algorithm; Checkpoint

  • Source: https://inria.hal.science/hal-02514107 ; [Research Report] RR-9332, Inria Grenoble Rhône-Alpes. 2020.

Conference

Combining backward and forward recovery to cope with silent errors in iterative solvers

Subjects: Performance model; Sparse matrix-vector multiplication; Checkpointing
Location: Hyderabad, India

  • Source: PDSEC2015 ; https://inria.hal.science/hal-01159679 ; PDSEC2015, May 2015, Hyderabad, India. pp.980--989

  • 1-10 of 31 results for "tolerance"