Gilles Gouaillardet
40424c9d0f
orte/util/hostfile: add the port=<port> option
...
add the option to pass an alternate port to plm
for example
node0 port=2222
directs the plm (via the ORTE_NODE_PORT attribute) to use
the non-default port 2222 (e.g. ssh -p 2222 node0 ...)
2016-10-19 15:04:01 +09:00
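As a rough illustration of the port=<port> hostfile option described above, the following Python sketch parses a hostfile line and builds the matching ssh argument list. It is a toy model only, not the ORTE hostfile parser or the plm launch code: the function names `parse_hostfile_line` and `ssh_argv` are hypothetical, and the assumption that 22 is the default port comes from ssh itself, not from the commit.

```python
import shlex

DEFAULT_SSH_PORT = 22  # ssh's standard default; assumed here, not taken from the commit


def parse_hostfile_line(line):
    """Split a line such as 'node0 port=2222' into (hostname, {key: value}).

    Hypothetical helper: returns None for blank or comment-only lines.
    """
    line = line.split("#", 1)[0].strip()
    if not line:
        return None
    tokens = line.split()
    host, options = tokens[0], {}
    for tok in tokens[1:]:
        key, sep, value = tok.partition("=")
        if sep:  # keep only key=value tokens
            options[key] = value
    return host, options


def ssh_argv(host, options, remote_cmd):
    """Build an ssh argv, adding '-p <port>' only for a non-default port."""
    argv = ["ssh"]
    port = int(options.get("port", DEFAULT_SSH_PORT))
    if port != DEFAULT_SSH_PORT:
        argv += ["-p", str(port)]
    argv.append(host)
    argv += shlex.split(remote_cmd)
    return argv


host, opts = parse_hostfile_line("node0 port=2222")
print(ssh_argv(host, opts, "hostname"))
```

In the actual change the port travels as the ORTE_NODE_PORT attribute on the node object; the dictionary above merely stands in for that attribute.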
Gilles Gouaillardet
73ea87800b
orte/util: add the ORTE_NODE_PORT attribute
...
this can be used to direct the plm component to use an alternate port
(e.g. ssh -p 2222 ...)
2016-10-19 15:04:01 +09:00
Gilles Gouaillardet
e78fcc4db9
coll/base: fix ompi_coll_base_{gather,scatter}_intra_binomial
...
receive type is only relevant for root with gather,
send type is only relevant for root with scatter,
so do not access these types on a non root task
2016-10-19 14:05:22 +09:00
Gilles Gouaillardet
cb76d93b4e
ompi_wrapper_script: fix $extra_ldflags
...
use @OMPI_PKG_CONFIG_LDFLAGS@ instead of @OMPI_WRAPPER_EXTRA_LDFLAGS@
so @{libdir} is substituted with ${libdir}
Thanks Manesh Nanavalla for the report
2016-10-19 09:57:55 +09:00
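The placeholder rewrite mentioned in the commit above can be pictured with a small Python sketch. It assumes, as the commit message implies, that the wrapper-style value uses `@{name}` placeholders while the pkg-config-style value uses `${name}`; the function name `to_pkg_config_style` and the sample flag string are hypothetical, and the real substitution is done by the build system, not by code like this.

```python
import re

# Matches wrapper-style placeholders such as @{libdir}
_PLACEHOLDER = re.compile(r"@\{(\w+)\}")


def to_pkg_config_style(flags):
    """Rewrite every @{name} placeholder as ${name} (illustrative only)."""
    return _PLACEHOLDER.sub(r"${\1}", flags)


print(to_pkg_config_style("-L@{libdir} -Wl,-rpath -Wl,@{libdir}"))
```

A script that receives the `${libdir}` form can expand it as an ordinary variable, whereas `@{libdir}` would be passed through to the linker literally.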
rhc54
2826da727a
Merge pull request #2244 from rhc54/topic/pmixext
...
Create PMIx v3 external component
2016-10-18 16:46:18 -05:00
Ralph Castain
8113a8d1b0
Now that we are hiding symbols in the internal PMIx component, we cannot reuse that component for integration to the external PMIx master as the symbols don't match. So create a new "ext3x" component and copy the PMIx v3 integration over there.
...
Also, remove a couple of build-product files from the pmix3x component.
2016-10-18 13:15:32 -07:00
rhc54
1884aa68e5
Merge pull request #2240 from rhc54/topic/badapp
...
Properly report failure to launch when someone mis-types the name of the application
2016-10-18 13:08:06 -05:00
Ralph Castain
16540c7422
Properly report failure to launch when someone mis-types the name of the application
...
Fixes #2233
2016-10-18 10:09:30 -07:00
Ralph Castain
7be607582e
ORTE applications need to commit any modex sends prior to calling fence
2016-10-18 09:22:56 -07:00
Ralph Castain
7910aa23eb
Set lazy_wait_in_init "on" by default for test in master
2016-10-18 08:47:04 -07:00
rhc54
0e5d46ae7a
Merge pull request #2237 from rhc54/topic/thread
...
Ensure the PMIx progress thread is stopped prior to tearing anything down.
2016-10-18 10:38:03 -05:00
Ralph Castain
50c9f3de55
Ensure the PMIx progress thread is stopped prior to tearing anything down. Thanks to Gilles for spotting this error!
2016-10-18 00:27:52 -07:00
rhc54
a659cb2fda
Merge pull request #2229 from rhc54/topic/dvm
...
Pick up the npernode and npersocket options and include them in the job object
2016-10-17 15:27:21 -05:00
Ralph Castain
57114a09ae
Pick up the npernode and npersocket options and include them in the job object
2016-10-17 12:26:21 -07:00
Gilles Gouaillardet
1e3191115b
Merge pull request #2172 from ggouaillardet/topic/ialltoall_in_place
...
support MPI_IN_PLACE in MPI_Ialltoall*
2016-10-17 17:00:47 +09:00
Gilles Gouaillardet
bd1b6fe661
rml/oob: add a missing include file
2016-10-16 10:25:00 +09:00
Gilles Gouaillardet
c530b0a07c
mpi/cxx: remove duplicate and now useless typedef
2016-10-15 14:30:00 +09:00
Ralph Castain
50bb0ded70
Update the PMIx nightly scripts to generalize locations
2016-10-14 08:40:05 -07:00
Joshua Ladd
64a15188bd
Merge pull request #2199 from vspetrov/coll_hcoll_ialltoallv
...
coll/hcoll: ialltoallv interface
2016-10-14 07:59:23 -06:00
Gilles Gouaillardet
9389de4199
topo/treematch: fix displacements in mca_topo_treematch_dist_graph_create()
2016-10-14 17:16:49 +09:00
Gilles Gouaillardet
4e19cd51b1
hwloc/external: add a missing include file
2016-10-14 09:27:33 +09:00
rhc54
ef0610dd56
Merge pull request #2223 from rhc54/topic/pmixfix
...
Repair event notification support and resync to PMIx master
2016-10-13 19:26:44 -05:00
Ralph Castain
6f65d0a173
Repair event notification support. Clean up the long-suffering "epoll: warning" coming out of libevent whenever a process abnormally terminated.
...
Add changes to test program
Sync to PMIx master
2016-10-13 16:27:39 -07:00
rhc54
952ea06f15
Merge pull request #2219 from ggouaillardet/topic/oob_vs_pmix_finalize
...
ess: tear down pmix (if any) before oob
2016-10-13 11:32:47 -05:00
Ralph Castain
6417f217e1
Turn PMIx dstore off by default as MTT was effectively broken
2016-10-13 08:14:51 -07:00
Joshua Ladd
b661307e6f
Merge pull request #2218 from yosefe/topic/ucx-pml-spml-update
...
ucx: adapt pml_ucx and spml_ucx to new UCX APIs
2016-10-13 09:23:37 -04:00
Gilles Gouaillardet
958e29f929
osc/rdma: silence a warning
...
declare a local variable volatile and silence CID 1372692
2016-10-13 16:10:07 +09:00
Gilles Gouaillardet
451b9dc467
ess: tear down pmix (if any) before oob
2016-10-13 14:08:02 +09:00
Yossi Itigin
05ca466c6b
ucx: adapt pml_ucx and spml_ucx to new UCX APIs
...
- pass field_mask to ucp_init().
- use non-blocking disconnect.
- recv() with pre-allocated request.
- call opal_progress() from iprobe() and improbe().
- use shift pattern in connect/disconnect.
2016-10-12 23:45:45 +03:00
Nathan Hjelm
1b01b6db4f
Merge pull request #2213 from hjelmn/osc_rdma
...
osc/rdma: fix warnings
2016-10-12 13:13:27 -06:00
Nathan Hjelm
5c99d097ee
Merge pull request #2214 from hjelmn/cxx_isolation
...
mpi/cxx: fix warnings
2016-10-12 13:04:30 -06:00
rhc54
4439d880fd
Merge pull request #2211 from rhc54/topic/rml
...
Some compilers apparently complain about this, so modify the typedef …
2016-10-12 13:11:47 -05:00
Nathan Hjelm
ab0a005c95
mpi/cxx: fix warnings
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 11:30:36 -06:00
Nathan Hjelm
5b40fd267f
Merge pull request #2204 from hjelmn/arm64
...
asm/arm64: ensure instruction ordering on timer
2016-10-12 11:22:28 -06:00
Nathan Hjelm
e8ef503bee
osc/rdma: fix warnings
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 10:17:25 -06:00
Ralph Castain
fca1556787
Some compilers apparently complain about this, so modify the typedef statements
2016-10-12 08:44:03 -07:00
Nathan Hjelm
9a50ce6364
asm/arm64: ensure instruction ordering on timer
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 09:25:21 -06:00
Ralph Castain
b11c9574d4
Remove debug and update copyright
2016-10-11 23:28:16 -07:00
Ralph Castain
a2326e3ba0
Update the scaling test to properly use orterun for orte-dvm tests, and extend by adding params for async mpi init/finalize
2016-10-11 23:24:52 -07:00
Gilles Gouaillardet
974dd64293
ompi/errhandler: remove useless include file
...
and avoid #including stuff that breaks C++ compilers
2016-10-12 14:49:31 +09:00
Ralph Castain
8f05beb1ec
Sync pmix/master@cb53105
2016-10-11 20:54:59 -07:00
rhc54
ad156e3e91
Merge pull request #2207 from rhc54/topic/pmixupdate
...
Update PMIx support to latest PMIx master
2016-10-11 18:57:11 -05:00
rhc54
ee9f33f08c
Merge pull request #2146 from rhc54/topic/rml2
...
Bring the RML modifications across
2016-10-11 18:54:59 -05:00
Jeff Squyres
bcbf0bc4f9
usnic: s/OMPI/OPAL/
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-10-11 16:43:35 -07:00
Ralph Castain
a2919174d0
Bring the RML modifications across. This is the first step in a revamp of the ORTE messaging subsystem to support fabric-based communications during launch and wireup phases. When completed, the grpcomm and plm frameworks will each have their own "conduit" for communication - each conduit corresponds to a particular RML messaging transport. This can be the active OOB-based component, or a provider from within the RML/OFI component. Messages sent down the conduit will flow across the associated transport.
...
Multiple conduits can exist at the same time, and can even point to the same base transport. Each conduit can have its own characteristics (e.g., flow control) based on the info keys provided to the "open_conduit" call. For ease during the transition period, the "legacy" RML interfaces remain as wrappers over the new conduit-based APIs using a default conduit opened during orte_init - this default conduit is tied to the OOB framework so that current behaviors are preserved. Once the transition has been completed, a one-time cleanup will be done to update all RML calls to the new APIs and the "legacy" interfaces will be deleted.
While we are at it: Remove oob/usock component to eliminate the TMPDIR length problem - get all working, including oob_stress
2016-10-11 16:01:02 -07:00
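The conduit design described in the commit above can be modelled loosely in Python. This is a conceptual sketch under stated assumptions, not the ORTE RML API: only the name `open_conduit` and the idea of info keys come from the commit message, while the classes `Transport` and `Rml`, the `"transport"` and `"flow_control"` keys, and the `legacy_send` wrapper are hypothetical names chosen for illustration.

```python
class Transport:
    """Stand-in for a messaging transport (e.g. the OOB component or an OFI provider)."""

    def __init__(self, name):
        self.name = name
        self.sent = []  # records (peer, payload) pairs instead of doing real I/O

    def send(self, peer, payload):
        self.sent.append((peer, payload))


class Rml:
    """Toy messaging layer: each conduit binds a transport to its own settings."""

    def __init__(self, transports, default="oob"):
        self._transports = transports
        self._conduits = {}
        self._next_id = 0
        # Default conduit opened at init time and tied to the OOB transport,
        # so callers of the legacy interface see unchanged behaviour.
        self.default_conduit = self.open_conduit({"transport": default})

    def open_conduit(self, info):
        """Create a conduit from info keys and return its identifier."""
        conduit_id = self._next_id
        self._next_id += 1
        self._conduits[conduit_id] = (self._transports[info["transport"]], dict(info))
        return conduit_id

    def send(self, conduit_id, peer, payload):
        """Conduit-based send: the message flows across the conduit's transport."""
        transport, _info = self._conduits[conduit_id]
        transport.send(peer, payload)

    def legacy_send(self, peer, payload):
        """Legacy-style call implemented as a wrapper over the default conduit."""
        self.send(self.default_conduit, peer, payload)


oob, ofi = Transport("oob"), Transport("ofi")
rml = Rml({"oob": oob, "ofi": ofi})
grpcomm = rml.open_conduit({"transport": "ofi", "flow_control": True})
plm = rml.open_conduit({"transport": "ofi"})  # second conduit on the same transport
rml.send(grpcomm, 1, b"barrier")
rml.legacy_send(2, b"hello")
```

The two conduits opened at the end share one base transport but keep separate settings, which mirrors the statement that multiple conduits can point to the same transport while having their own characteristics.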
Ralph Castain
6ce4b6d098
Eliminate -Wall from being hardcoded
2016-10-11 12:50:31 -07:00
Ralph Castain
1859b03416
Enable PMIx shared memory support by default
2016-10-11 12:18:01 -07:00
Ralph Castain
1d7d7c201b
Update PMIx support to latest PMIx master
2016-10-11 10:17:23 -07:00
Nathan Hjelm
432d79046b
Merge pull request #2197 from tkordenbrock/topic/master/osc-rdma.put.use.true_extent
...
osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master()
2016-10-11 10:42:02 -06:00
Ryan Grant
fd55204791
Merge pull request #2196 from tkordenbrock/topic/master/osc-portals4.put.use.true_extent
...
osc-portals4: fix datatype errors in put()
2016-10-10 08:57:12 -06:00