1
1
openmpi/orte/mca/odls/base
Ralph Castain 319758e3e0 Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params:
1. orte_enable_recovery - default recovery policy, can be overridden on a per-job basis

2. orte_max_local_restarts - default max number of local restarts, can be overridden

3. orte_max_global_restarts - default max number of relocates, can be overridden

Implement the restart_proc API for the ODLS framework, reorganize the default fns a little to avoid copying code.

This commit was SVN r23057.
2010-04-28 04:06:57 +00:00
..
base.h Revamp the errmgr framework to provide a greater range of optional behaviors, including different behaviors for daemons, and remove several looping messages across the code base: 2010-04-23 04:44:41 +00:00
help-orte-odls-base.txt Issue a warning if a requested binding operation results in processes being bound to all available processes, which is the equivalent of not being bound at all. 2010-04-17 01:02:41 +00:00
Makefile.am Complete implementation of the --without-rte-support configure option. Working with Brian, this has been tested on RedStorm. 2008-06-18 03:15:56 +00:00
odls_base_close.c Revamp the errmgr framework to provide a greater range of optional behaviors, including different behaviors for daemons, and remove several looping messages across the code base: 2010-04-23 04:44:41 +00:00
odls_base_default_fns.c Restore process recovery for procs local to mpirun (first step towards restoring full capability). Define three new MCA params: 2010-04-28 04:06:57 +00:00
odls_base_open.c Add a sensor framework to ORTE that monitors applications and notifies the errmgr when they exceed specified boundaries. Two modules are included here: 2010-04-26 22:15:57 +00:00
odls_base_select.c Fix some Coverity 'Event set_but_not_used' highlights. 2008-06-06 14:38:41 +00:00
odls_base_state.c The system headers are supposed to be protected by #ifdef and not by #if. 2009-07-16 18:27:33 +00:00
odls_private.h After hours spent chasing the stupid "abort" file, it became clear that we were always going to be plagued by that idiot contraption when trying to be good citizens and properly cleanup. So get rid of it by instead doing a messaging handshake with the local daemon. 2010-04-27 03:39:32 +00:00