Для этого сайта требуется поддержка JavaScript.
Обзор
Помощь
Вход
ports
/
openmpi
Следить
1
В избранное
1
Форкнуть
0
Вы уже форкнули openmpi
Код
Релизы
Активность
openmpi
/
orte
/
mca
История
Ralph Castain
abedb97be4
Resolve race condition when procs call MPI_Abort. Since we go thru the errmgr instead of the normal proc termination routines, we need to ensure we mark that the proc has fired its waitpid and is no longer alive. Otherwise, the local daemon won't terminate because it thinks there is still a local proc alive and we hang.
...
Thanks to Gilles for tracking it down. cmr=v1.8.2:reviewer=rhc This commit was SVN r32460.
2014-08-08 15:58:49 +00:00
..
dfs
The final step of the RFC: convert the <foo>libdir and friends to fit their respective code areas, and equate them all at the top. Note that we can't entirely separate things as the opal_install_dirs framework can't handle separated locations for the various trees.
2014-05-08 02:01:35 +00:00
errmgr
Remove a potential race condition - we'll cleanup the local children when we are all done
2014-07-03 14:13:43 +00:00
ess
Okay, now handle the non-MPI apps
2014-08-01 14:49:25 +00:00
filem
This isn't as big a change as it appears - a change in one place caused a whole bunch of files to require updated #include's due to some arcane linkage. Rework the orte_wait code to reflect the introduction of the state machine. If we are in cleanup mode and just want to kill all our local children, then there is no reason to be polite about it as that introduces *very* long delays at scale. Just kill the procs and move on.
2014-06-17 17:57:51 +00:00
grpcomm
Revert r32222, r32210, and r32203 as they created a problem when daemon collectives did not involve app procs on every node. Instead, modify the ompi/mca/rte/orte/rte_orte.h to add a new function that allows apps to request new daemon collective ids for use in barrier and modex operations. This will only appear in ORTE-based installations, but it is only being used by a couple of researchers at the moment.
2014-07-15 03:48:00 +00:00
iof
Silence used-before-set var warning
2014-07-09 22:37:47 +00:00
odls
Resolve race condition when procs call MPI_Abort. Since we go thru the errmgr instead of the normal proc termination routines, we need to ensure we mark that the proc has fired its waitpid and is no longer alive. Otherwise, the local daemon won't terminate because it thinks there is still a local proc alive and we hang.
2014-08-08 15:58:49 +00:00
oob
Some small leak cleanups
2014-07-30 15:46:02 +00:00
plm
Single-word cmd line values for orted are dealt with in orte_plm_base_orted_append_basic_args, so protect against special characters there. Have the rsh module only deal with multi-word arguments as those were skipped by orte_plm_base_orted_append_basic_args.
2014-07-23 17:06:51 +00:00
ras
Silence warning
2014-07-03 17:29:04 +00:00
rmaps
orte_rmaps_numa_node_t -> opal_rmaps_numa_node_t
2014-07-31 19:58:47 +00:00
rml
I have no idea how they did it, but someone managed to write a test that circled around and around and eventually reached this point with a NULL pointer. So protect against that possibility.
2014-08-05 16:20:46 +00:00
routed
Now that we are using the radix routed module, teach it how to behave nicely with singletons
2014-07-24 22:46:17 +00:00
rtc
Fix static builds by renaming conflicting type
2014-06-14 17:39:28 +00:00
snapc
use the orte attributes in the FT code to fix compile errors
2014-06-26 03:19:17 +00:00
sstore
use the orte attributes in the FT code to fix compile errors
2014-06-26 03:19:17 +00:00
state
Resolve the MTT timeout problem. This appears to have largely been caused by missing sigchld notifications, thus causing the daemons to believe that not all procs had exited. Let comm failure also serve as notification of process termination, and add appropriate flags/attributes to avoid multiple reporting of proc termination.
2014-06-21 17:09:02 +00:00