rhc54
dc857a9b24
Merge pull request #1267 from rhc54/topic/oob
...
Standardize the handling of shutdown in the OOB TCP component
2015-12-29 09:10:42 -08:00
Ralph Castain
12dccaa911
Standardize the handling of shutdown in the OOB TCP component
2015-12-29 07:57:22 -08:00
Ralph Castain
ddf0f272e1
Fix typo
2015-12-29 07:04:28 -08:00
rhc54
5dfb7ac396
Merge pull request #1266 from ggouaillardet/topic/misc_pmix_fixes
...
Topic/misc pmix fixes
2015-12-29 07:02:44 -08:00
Gilles Gouaillardet
b20a219ad0
hwloc/external: abort if hwloc v2 is detected since it is not yet supported
2015-12-29 09:23:27 +09:00
rhc54
b7b4231fbb
Merge pull request #1261 from ggouaillardet/topic/rmaps_rr_oversubsription
...
rmaps/round_robin: warn if oversubscribing when manually setting the …
2015-12-28 07:03:08 -08:00
Gilles Gouaillardet
0ca1ee5156
configury: misc pmix120 fixes
2015-12-28 23:17:41 +09:00
Gilles Gouaillardet
3300d7cc00
pmix: rename pmix_munge_module
2015-12-28 23:16:27 +09:00
Ralph Castain
a5b95a0939
Continue work on error notification system
2015-12-28 23:15:59 +09:00
Ralph Castain
810f2446b7
Add pmix120 component, update the error handling functions in the PMIx API.
...
Update the configure logic for the new pmix120 component
ckpt
Get the pmix120 component to work - still not really registering or handling notifications, but infrastructure now operates
Cleanup some of the symbol scopes, and provide a more comprehensive rename.h file. Will pretty it up later - let's see how this works
Cleanup the rename files to use the pretty macros
2015-12-28 23:15:44 +09:00
Gilles Gouaillardet
c757c5c612
pmix/external: Fix error handler usage
2015-12-28 23:15:17 +09:00
Gilles Gouaillardet
1157329732
configury: misc pmix112 fixes
2015-12-28 23:15:16 +09:00
Gilles Gouaillardet
55ae6768d3
configury: use --with-pmix option instead of --with-external-pmix
2015-12-28 23:14:59 +09:00
Gilles Gouaillardet
d416c7fd8a
pmix/external: no more circular dependencies if not building shared DSO
2015-12-28 23:14:03 +09:00
Gilles Gouaillardet
4f8c0c5f0d
event/external: add missing include file
2015-12-28 17:22:59 +09:00
Gilles Gouaillardet
fec973efda
configury: test portability
...
replace test ... -o ... with test ... || test ...
and test ... -a ... with test ... && test ...
2015-12-28 13:58:45 +09:00
Gilles Gouaillardet
352b05a552
rmaps: warn if oversubscribing when manually setting the number of hosts
...
This is a port of the v1.10 series one-off open-mpi/ompi-release@8c5ce45ab6
2015-12-28 10:38:57 +09:00
Gilles Gouaillardet
47ab2fcb89
man: fix MPI_Neighbor_alltoall{v,w} prototypes
...
Thanks Willem Vermin for bringing this to our attention
2015-12-28 09:39:33 +09:00
Nathan Hjelm
700a21022a
Merge pull request #1260 from artpol84/openib_proc_account_fix
...
Openib proc accounting fix
2015-12-27 15:19:52 -07:00
Nathan Hjelm
6fd5041e48
Merge pull request #1264 from artpol84/vader_resource_leak_fix
...
Fix vader resource leak.
2015-12-27 15:19:27 -07:00
Artem Polyakov
a20826e6b4
Fix vader resource leak.
...
This nasty bug was nicely masked. It was causing `mca_btl_vader_component.vader_frags_user`
overflow and as the result rear hangs of ompi-test-suite.
2015-12-28 00:41:45 +06:00
Gilles Gouaillardet
2d9aa38e6a
btl/openib: fix heterogeneous support
2015-12-25 16:31:35 +09:00
Ralph Castain
8ab28cdc82
Fix a typo that causes segfaults on multi-node executions
2015-12-24 08:43:47 -08:00
Artem Polyakov
3031affdb7
Fix openib process accounting if procs was dynamically added.
2015-12-24 17:56:35 +06:00
Artem Polyakov
400af6c52d
openib addproc improvements:
...
1. finer grained locks;
2. separate srq creation from cq adjustments.
2015-12-24 17:56:35 +06:00
Artem Polyakov
41c325f15a
Shift common code for calculating a port count and btl_rank in openib
...
into the static function
2015-12-24 17:56:35 +06:00
Gilles Gouaillardet
0b3e3c6817
opal/runtime: add missing #include <unistd.h>
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:56 +09:00
Gilles Gouaillardet
f0e3e16f49
pmix/base: add missing #include <unistd.h>
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:52 +09:00
Gilles Gouaillardet
66d9c2daea
rcache/vma: add missing #include "opal/util/output.h"
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:49 +09:00
Gilles Gouaillardet
5fa63f086a
btl/tcp: add missing #include <unistd.h>
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:46 +09:00
Gilles Gouaillardet
15ed7ad9f5
btl/sm: add missing #include <unistd.h>
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:41 +09:00
Gilles Gouaillardet
65a081ae6a
mca/base: add missing #include "opal/util/output.h" and <unistd.h>
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:33 +09:00
Gilles Gouaillardet
ccc96ad204
fbtl/base: add missing #include "opal/util/output.h"
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:26 +09:00
Gilles Gouaillardet
cebde2a753
coll/tuned: add missing #include "opal/util/output.h"
...
Thanks Marco Atzeri for contributing the original patch
2015-12-24 14:41:17 +09:00
Gilles Gouaillardet
99d046d060
scoll/fca: add missing #include <alloca.h>
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
ad9693c604
pml/yalla: add missing #include <alloca.h>
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
b38c17dbcb
pml/cm: add missing #include <alloca.h>
...
Thanks Paul Hargrove for reporting this issue
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
071ae39a44
osc/rdma: add missing #include <alloca.h>
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
77f199d1d7
coll/fca: add missing #include <alloca.h>
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
42313acd58
btl/usnic: add missing #include <alloca.h>
2015-12-24 14:33:58 +09:00
Gilles Gouaillardet
38a8826136
opal/datatype: #include <alloca.h> when needed and nowhere else
2015-12-24 14:33:58 +09:00
rhc54
d7199dc75b
Merge pull request #1255 from annu13/fixup
...
Fixup
2015-12-22 20:54:48 -08:00
Nathan Hjelm
84d890b7e7
Merge pull request #1248 from artpol84/openib_proc_init_race
...
Openib dynamic add proc race conditions
2015-12-22 21:48:05 -07:00
Howard Pritchard
2362bf0c0c
Merge pull request #1257 from hppritcha/topic/disable_mpirun_for_native_slurm_crayxc
...
plm/alps: only use srun for Native SLURM
2015-12-22 21:07:41 -07:00
annu13
43f44f31c1
moved code to job setup first before enabling comm
2015-12-22 14:30:59 -08:00
Howard Pritchard
39367ca0bf
plm/alps: only use srun for Native SLURM
...
Turns out that the way the SLURM plm works
is not compatible with the way MPI processes
on Cray XC obtain RDMA credentials to use
the high speed network. Unlike with ALPS,
the mpirun process is on the first compute
node in the job. With the current PLM launch
system, mpirun (HNP daemon) launches the MPI
ranks on that node rather than relying on
srun.
This will probably require a significant amount
of effort to rework to support Native SLURM
on Cray XC's. As a short term alternative,
have the alps plm (which gets selected by default
again on Cray systems regardless of the launch system)
check whether or not srun or alps is being used on the
system. If alps is not being used, print a helpful
message for the user and abort the job launch.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-12-22 11:03:42 -08:00
Ryan Grant
5ec5bd08c1
Merge pull request #1256 from tkordenbrock/topic/mtl.init.nid.pid.in.logical
...
mtl-portals4: initialize endpoint nid/pid when using logical mapping
2015-12-22 11:43:03 -07:00
Todd Kordenbrock
8a3660138e
mtl-portals4: initialize endpoint nid/pid when using logical mapping
...
When mtl-portals4 is configured for logical mapping, coll-portals4
must disqualify because it does not yet support logical mapping.
coll-portals4 looks for the endpoint pid to be zero which tells it
that mtl-portals4 is configured for logical mapping. This commit
initializes the endpoint nid/pid to zero for logical mapping.
2015-12-22 11:20:18 -06:00
Artem Polyakov
08ad8357a8
Fix local process accounting in openib when dynamic add_proc is on.
2015-12-22 22:44:46 +06:00
Artem Polyakov
3c2f6d5560
Protect openib_btl->device data with explicit opal_mitex locks.
2015-12-22 18:33:26 +06:00