1
1
openmpi/orte
Ralph Castain 3fdcaeab97 Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit.
If we are aborting, then set the flags so the HNP directly sends an exit command to each daemon. Make it the halt_vm command so the remote daemon doesn't try to relay it, but instead just exits without waiting for its routed children to exit first.

cmr=v1.8.1:reviewer=jsquyres:subject=fix hangs due to abort prior to daemon wireup

This commit was SVN r31304.
2014-04-02 04:17:55 +00:00
..
etc Backing out the Open SHMEM project 2013-02-12 17:45:27 +00:00
include Fix longstanding issue with our multi-project support. Rather than using 2014-01-07 22:11:15 +00:00
mca Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit. 2014-04-02 04:17:55 +00:00
orted Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit. 2014-04-02 04:17:55 +00:00
runtime Repair builds without hwloc 2014-03-05 02:48:15 +00:00
test Shifting to an event-driven RTE raises some interesting issues during shutdown. We want the last messages to get thru, but also need to correctly shutdown the virtual machine. This requires a delicate balancing act across event priorities, and the need to check for termination conditions in places where related events get processed. 2014-03-12 16:49:58 +00:00
tools opal: add mca param to control ranks env variables 2014-04-01 21:14:31 +00:00
util Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit. 2014-04-02 04:17:55 +00:00
Doxyfile Fix the broken Doxyfile so people can generate what little code base documentation we have :-) 2006-04-13 12:52:17 +00:00
Makefile.am Something funny is going on when --without-orte, so revert the orte/Makefile.am for now while we try to figure it out 2014-03-11 23:07:21 +00:00