1
1
openmpi/orte/mca/plm
Ralph Castain b110a247be Fix comm_spawn (maybe).
Comm_spawn was sticking during spawn_multiple because of a problem in the dpm - the modex there is asking processes to talk to each other in an allgather_list operation, but the procs don't have the required contact info to do so. The solution here was to ensure that all parent procs have full contact info for procs in the child job.

Admittedly, this isn't the long-term answer. We would like to have the contact info given to only the parent procs that were involved in the comm_spawn. There is a way to do that, but this will suffice to keep things working until that can be implemented and tested.

This commit was SVN r17772.
2008-03-06 21:56:00 +00:00
..
alps Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
base Fix comm_spawn (maybe). 2008-03-06 21:56:00 +00:00
ccp Set the windows components invisible. 2008-03-04 17:37:17 +00:00
gridengine Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
lsf Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
md Silence some minor compiler warnings 2008-02-29 02:39:39 +00:00
poe Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
process Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
rsh Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
slurm Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
slurmd Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
submit Add default hostfile parameter plus --default-hostfile command line option. 2008-03-05 04:54:57 +00:00
tm Cleanup recursions in ORTE caused by processing recv'd messages that can cause the system to take action resulting in receipt of another message. 2008-02-28 19:58:32 +00:00
tmd First cut at direct launch for TM. Able to launch non-ORTE procs and detect their completion for a clean shutdown. 2008-03-05 13:51:32 +00:00
xgrid Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
Makefile.am Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
plm_types.h Replace all occurences of orte_pointer_array by opal_pointer_array. Remove the 2008-02-28 05:32:23 +00:00
plm.h Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00