openmpi/orte/mca/errmgr/base
Ralph Castain 319758e3e0 Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params:
1. orte_enable_recovery - default recovery policy, can be overridden on a per-job basis

2. orte_max_local_restarts - default max number of local restarts, can be overridden

3. orte_max_global_restarts - default max number of relocates, can be overridden
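
As a rough illustration of how these three parameters might be set: Open MPI generally accepts MCA parameters either as `OMPI_MCA_<name>` environment variables or as `--mca <name> <value>` options to mpirun. The values below are assumptions chosen only for illustration; this commit message does not state the defaults or the accepted value ranges, and `./my_app` is a hypothetical application name.

```shell
# Illustrative values only: the commit does not state defaults or valid ranges.
export OMPI_MCA_orte_enable_recovery=1      # assumed boolean: enable recovery
export OMPI_MCA_orte_max_local_restarts=2   # assumed integer: local restart limit
export OMPI_MCA_orte_max_global_restarts=1  # assumed integer: relocate limit

# Equivalent command-line form (./my_app is a hypothetical application):
#   mpirun --mca orte_enable_recovery 1 \
#          --mca orte_max_local_restarts 2 \
#          --mca orte_max_global_restarts 1 ./my_app
```

Per the descriptions above, these are defaults, so a per-job setting would be expected to take precedence over them.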

Implement the restart_proc API for the ODLS framework, and reorganize the default functions slightly to avoid duplicating code.

This commit was SVN r23057.
2010-04-28 04:06:57 +00:00
File | Last commit | Date
base.h | Add a sensor framework to ORTE that monitors applications and notifies the errmgr when they exceed specified boundaries. Two modules are included here: | 2010-04-26 22:15:57 +00:00
errmgr_base_close.c | Add a sensor framework to ORTE that monitors applications and notifies the errmgr when they exceed specified boundaries. Two modules are included here: | 2010-04-26 22:15:57 +00:00
errmgr_base_fns.c | Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params: | 2010-04-28 04:06:57 +00:00
errmgr_base_open.c | Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params: | 2010-04-28 04:06:57 +00:00
errmgr_base_select.c | Add a sensor framework to ORTE that monitors applications and notifies the errmgr when they exceed specified boundaries. Two modules are included here: | 2010-04-26 22:15:57 +00:00
errmgr_private.h | Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params: | 2010-04-28 04:06:57 +00:00
Makefile.am | Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. | 2008-02-28 01:57:57 +00:00