openmpi/orte/mca
Jeff Squyres ea4c916096 plm_slurm_module.c: don't leave the extra fd to /dev/null open
Prior to r29058, this same logic was in place (i.e., ensure that the
extra fd to /dev/null is closed).  It looks like it was accidentally
removed in the ORTE conversion to the state machine in r29058.

This ''might'' have something to do with many hangs that we're seeing
in Cisco MTT with jobs that exhibit failure (e.g., call MPI_ABORT)...?

cmr=v1.8.2:reviewer=rhc

This commit was SVN r31469.

The following SVN revision numbers were found above:
  r29058 --> open-mpi/ompi@a200e4f865
2014-04-21 20:09:15 +00:00
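
The fix described in the commit message above follows a common POSIX pattern: after duplicating a /dev/null descriptor onto the standard streams, close the original descriptor so it does not stay open in the launched process. The following is a minimal sketch of that pattern, not the actual contents of plm_slurm_module.c; the helper name `redirect_to_devnull` and its `silence_output` parameter are hypothetical and exist only for illustration.

```c
#include <fcntl.h>
#include <unistd.h>

/*
 * Hypothetical helper (not from the Open MPI source): point stdin at
 * /dev/null and, if silence_output is non-zero, stdout and stderr too.
 * Returns 0 on success, -1 on failure.
 */
int redirect_to_devnull(int silence_output)
{
    int rc = 0;
    int fd = open("/dev/null", O_RDWR);
    if (fd < 0) {
        return -1;
    }

    /* Tie stdin to /dev/null. */
    if (dup2(fd, STDIN_FILENO) < 0) {
        rc = -1;
    }

    /* Optionally tie stdout and stderr to /dev/null as well. */
    if (rc == 0 && silence_output) {
        if (dup2(fd, STDOUT_FILENO) < 0 || dup2(fd, STDERR_FILENO) < 0) {
            rc = -1;
        }
    }

    /*
     * Don't leave the extra fd open. If open() happened to return 0, 1,
     * or 2, the descriptor is itself one of the standard streams and
     * must stay open.
     */
    if (fd > STDERR_FILENO) {
        close(fd);
    }
    return rc;
}
```

The `fd > STDERR_FILENO` check matters because `open()` returns the lowest free descriptor: if a standard stream was already closed, the new descriptor is that stream, and closing it would undo the redirection.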
..
dfs Fix longstanding issue with our multi-project support. Rather than using 2014-01-07 22:11:15 +00:00
errmgr Per the RFC, remove the sensor framework from the ORTE code area, relocating it offsite to the ORCM code area. Also update some ignores to ensure we don't pickup crosstalk in components 2014-04-15 21:48:24 +00:00
ess Correctly set a non-zero exit status when mpirun is terminated by signal 2014-04-18 16:39:08 +00:00
filem build: add Automake-like silent/verbose macros for "ln -s ..." operations 2014-03-28 18:24:32 +00:00
grpcomm Silence warning 2014-03-29 19:10:26 +00:00
iof Cleanup some potential memory overruns 2014-01-19 16:31:26 +00:00
odls Per the RFC, remove the sensor framework from the ORTE code area, relocating it offsite to the ORCM code area. Also update some ignores to ensure we don't pickup crosstalk in components 2014-04-15 21:48:24 +00:00
oob Per suggestion from George, use a pipe for terminating the thread. 2014-04-14 01:02:46 +00:00
plm plm_slurm_module.c: don't leave the extra fd to /dev/null open 2014-04-21 20:09:15 +00:00
ras Prevent a segfault if we encounter an error while parsing a hostfile. Don't issue an error_log output as the hostfile code already prints an error message 2014-04-12 21:32:10 +00:00
rmaps Fix the sequential mapper - it was out-of-sync with the hostfile changes, and we missed the "seq" policy when parsing the --map-by option. Thanks to Bill Chen for reporting it 2014-04-08 03:38:25 +00:00
rml use the newly created JOB_STATE_FT_* events 2014-03-12 12:37:14 +00:00
routed Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit. 2014-04-02 04:17:55 +00:00
snapc build: add Automake-like silent/verbose macros for "ln -s ..." operations 2014-03-28 18:24:32 +00:00
sstore build: add Automake-like silent/verbose macros for "ln -s ..." operations 2014-03-28 18:24:32 +00:00
state Per the RFC, remove the sensor framework from the ORTE code area, relocating it offsite to the ORCM code area. Also update some ignores to ensure we don't pickup crosstalk in components 2014-04-15 21:48:24 +00:00