596062d34b
assumptions in the FT restart code for the ORTE layer. This fixes those problems by having the RML completely shutdown and restart the OOB framework (instead of just the module as before). This makes it much easier to manage, and maintainable as the OOB changes in the future. The SDS now does communication as part of its startup procedure, so we need to make sure we restart the RML before the SDS so that it can communicate properly. OOB base [close|open] used a static bool to determine if they have been called previously or not. I needed to expose this boolean so that I can close() then open() the oob base in the restart procedure. The functionality has not changed, we just now have the ability to open/close the framework as many times as we need to as long as we always call them in that order. (So calling open twice in a row is not allowed as before, it is only allowed if you open(), close(), then open() again). Things seem to be working now. This commit was SVN r14515. |
||
---|---|---|
.. | ||
help-orte-runtime.txt | ||
Makefile.am | ||
orte_abort.c | ||
orte_cr.c | ||
orte_cr.h | ||
orte_finalize.c | ||
orte_init_stage1.c | ||
orte_init_stage2.c | ||
orte_init.c | ||
orte_monitor.c | ||
orte_params.c | ||
orte_restart.c | ||
orte_setup_hnp.c | ||
orte_setup_hnp.h | ||
orte_system_finalize.c | ||
orte_system_init.c | ||
orte_universe_exists.c | ||
orte_wait.c | ||
orte_wait.h | ||
orte_wakeup.c | ||
orte_wakeup.h | ||
params.h | ||
runtime_internal.h | ||
runtime_types.h | ||
runtime.h |