References
- E. N. Elnozahy, L. Alvisi, Y. -M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, oct 1996
- K. M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computing Systems, 3(1):63-75, AUG 1985 https://doi.org/10.1145/214451.214456
- R. Koo and S. Toueg. Checkpointing and rollback recovery for distributed systems. IEEE Transaction on Software Engineering, SE-13(1):23-31, 1987 https://doi.org/10.1109/TSE.1987.232562
- T. Park and H. Y. Yeom. Application controlled checkpointing coordination for fault tolerant distributed computing systems. Parallel Computing, 26(4):467-482, MAR 2000 https://doi.org/10.1016/S0167-8191(99)00112-X
- L. Alvisi and K. Marzullo. Message logging: Pessimistic, optimistic and causal. In Proceedings of the 15th International Conference on Distributed Computing Systems, pages 229-236, 1995 https://doi.org/10.1109/ICDCS.1995.500024
- N. Neves and W. K. Fuchs. RENEW: A tool for fast and efficient implementation of checkpoint protocols. In Symposium on Fault-Tolerant Computing, pages 58-67, 1998 https://doi.org/10.1109/FTCS.1998.689455
- Y. -M. Wang and W. K. Fuchs. Optimistic message logging for independent checkpointing in message-passing systems. In Symposium on Reliable Distributed Systems, pages 147-154, 1992 https://doi.org/10.1109/RELDIS.1992.235132
- L. Alvisi, E. N. Elnozahy, S. Rao, S. A. Husain, and A. D. Mel. An analysis of communication induced checkpointing. In Symposium on Fault-Tolerant Computing, pages 242-249, 1999 https://doi.org/10.1109/FTCS.1999.781058
- D. Briatico, A. Ciuffoletti, and L. Simoncini. A distributed domino-effect free recovery algorithm. In Proceedings of the IEEE International Symposium on Reliability Distributed Software wand Database, pages 207-215, DEC 1984
- J. Helary, A. Mostefaoui, R. Netzer, and M. Raynal. Preventing useless checkpoints in distributed computations. In Proceedings of IEEE International Symposium on Reliable Distributed Systems, pages 183-190, 1997 https://doi.org/10.1109/RELDIS.1997.632814
- R. Baldoni, F. Quaglia, and B. Ciciani. A VP-accordant checkpointing protocol preventing useless checkpoints. In Symposium on Reliable Distributed Systems, pages 61-67, 1998 https://doi.org/10.1109/RELDIS.1998.740475
- R. Baldoni, J. H'elary, and M. Raynal. Rollback-dependency trackability. Technical Report Report 1107, IRISA Research, MAY 1997
- L. Lamport, 'Time, Clocks, and the Ordering of Events in a Distributed System,' Comm. of the ACM, Vol.21, No.7, pp.558-564, Jul., 1978 https://doi.org/10.1145/359545.359563
- R. Netzer and J. Xu. Necessary and sufficient conditions for consistent global snapshots. IEEE Transactions on Parallel and Distributed Systems, 6(2):165-169, 1995 https://doi.org/10.1109/71.342127
- F. Quaglia, R. Baldoni, and B. Ciciani. On the no-z-cycle property in distributed executions. Journal of Computer and System Sciences, 61(3): 400-427, 2000 https://doi.org/10.1006/jcss.2000.1720
- Y. Nah. The Specification of Task Communication Patterns. PhD thesis, Seoul National University, Korea, 1997
- G. Andrews. Paradigms for process interaction in distributed programs. ACM Computing Surveys, 23(1):49-90, 1991 https://doi.org/10.1145/103162.103164