1
1
openmpi/orte/mca/plm
Ralph Castain f11931306a Modify the accounting system to recycle jobids. Properly recover resources from nodes and jobs upon completion. Adjustments in several places were required to deal with sparsely populated job, node, and proc arrays as a result of this change.
Correct an error wrt how jobids were being computed. Needed to ensure that the job family field was not overrun as we increment jobids for comm_spawn.

Update the slurm plm module so it uses the new slurm termination procedure (brings trunk back into alignment with 1.3 branch).

Update the slurmd ess component so it doesn't get selected if we are running a singleton inside of a slurm allocation.

Cleanup HNP init by moving some code that had been in orte_globals.c for historical reasons into the ess hnp module, and removing the call to that code from the ess_base_std_prolog


NOTE: this change allows orte to support an infinite aggregate number of comm_spawn's, with up to 64k being alive at any one instant. HOWEVER, the MPI layer currently does -not- support re-use of jobids. I did some prototype coding to revise the ompi_proc_t structures, but the BTLs are caching their own data, and there was no readily apparent way to update it. Thus, attempts to spawn more than the 64k limit will abort to avoid causing the MPI layer to hang.

This commit was SVN r20700.
2009-03-03 16:39:13 +00:00
..
alps - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
base Modify the accounting system to recycle jobids. Properly recover resources from nodes and jobs upon completion. Adjustments in several places were required to deal with sparsely populated job, node, and proc arrays as a result of this change. 2009-03-03 16:39:13 +00:00
bproc - Header orte/mca/oob/base/base.h is probably the wrong one to include 2009-02-26 04:20:03 +00:00
ccp - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
lsf - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
process - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
rsh - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
slurm Modify the accounting system to recycle jobids. Properly recover resources from nodes and jobs upon completion. Adjustments in several places were required to deal with sparsely populated job, node, and proc arrays as a result of this change. 2009-03-03 16:39:13 +00:00
slurmd - On the way to get the BTLs split out and lessen dependency on orte: 2009-02-14 02:26:12 +00:00
submit - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
tm - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
tmd - Header orte/mca/errmgr/errmgr.h is not needed. 2009-02-26 04:05:30 +00:00
xgrid Allocate the slots for use in the xgrid plm 2009-02-06 00:55:14 +00:00
Makefile.am Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
plm_types.h - Get rid of include orte/util/proc_info.h, if not needed 2009-02-25 03:38:00 +00:00
plm.h Fixes trac:1392, #1400 2008-07-28 22:40:57 +00:00