dadca7da88
This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158
35 строки
619 B
Plaintext
35 строки
619 B
Plaintext
#
|
|
# An Aggregate MCA Parameter Set to enable checkpoint/restart capabilities
|
|
# for a job.
|
|
#
|
|
# Usage:
|
|
# shell$ mpirun -am ft-enable-cr ./app
|
|
#
|
|
|
|
#
|
|
# OPAL Parameters
|
|
# - Select only checkpoint ready components
|
|
# - Enable Additional FT infrastructure
|
|
# - Auto-select OPAL CRS component
|
|
#
|
|
mca_base_component_distill_checkpoint_ready=1
|
|
ft_cr_enabled=1
|
|
crs=
|
|
|
|
#
|
|
# ORTE Parameters
|
|
# - Wrap the RML
|
|
# - Use the 'full' Snapshot Coordinator
|
|
#
|
|
rml_wrapper=ftrm
|
|
snapc=full
|
|
#filem=rsh
|
|
|
|
#
|
|
# OMPI Parameters
|
|
# - Wrap the PML
|
|
# - Use the LAM/MPI-like Coordinated Checkpoint/Restart Coordination Protocol
|
|
#
|
|
pml_wrapper=crcpw
|
|
crcp=coord
|