1
1
Граф коммитов

25952 Коммитов

Автор SHA1 Сообщение Дата
Gilles Gouaillardet
40424c9d0f orte/util/hostfile: add the port=<port> option
add the option to pass an alternate port to plm
for example
node0 port=2222
directs the plm (via the ORTE_NODE_PORT) attribute to use
the non default port 2222 (e.g. ssh -p 2222 node0 ...)
2016-10-19 15:04:01 +09:00
Gilles Gouaillardet
73ea87800b orte/util: add the ORTE_NODE_PORT attribute
this can be used to direct the plm component to use an alternate port
(e.g. ssh -p 2222 ...)
2016-10-19 15:04:01 +09:00
Gilles Gouaillardet
e78fcc4db9 coll/base: fix ompi_coll_base_{gather,scatter}_intra_binomial
receive type is only relevant for root with gather,
send type is only relevant for root with scatter,
so do not access these types on a non root task
2016-10-19 14:05:22 +09:00
Gilles Gouaillardet
cb76d93b4e ompi_wrapper_script: fix $extra_ldflags
use @OMPI_PKG_CONFIG_LDFLAGS@ instead of @OMPI_WRAPPER_EXTRA_LDFLAGS@
so @{libdir} is substitued with ${libdir}

Thanks Manesh Nanavalla for the report
2016-10-19 09:57:55 +09:00
rhc54
2826da727a Merge pull request #2244 from rhc54/topic/pmixext
Create PMIx v3 external component
2016-10-18 16:46:18 -05:00
Ralph Castain
8113a8d1b0 Now that we are hiding symbols in the internal PMIx component, we cannot reuse that component for integration to the external PMIx master as the symbols don't match. So create a new "ext3x" component and copy the PMIx v3 integration over there.
Also, remove a couple of build-product files from the pmix3x component.
2016-10-18 13:15:32 -07:00
rhc54
1884aa68e5 Merge pull request #2240 from rhc54/topic/badapp
Properly report failure to launch when someone mis-types the name of the application
2016-10-18 13:08:06 -05:00
Ralph Castain
16540c7422 Properly report failure to launch when someone mis-types the name of the application
Fixes #2233
2016-10-18 10:09:30 -07:00
Ralph Castain
7be607582e ORTE applications need to commit any modex send's prior to calling fence 2016-10-18 09:22:56 -07:00
Ralph Castain
7910aa23eb Set lazy_wait_in_init "on" by default for test in master 2016-10-18 08:47:04 -07:00
rhc54
0e5d46ae7a Merge pull request #2237 from rhc54/topic/thread
Ensure the PMIx progress thread is stopped prior to tearing anything down.
2016-10-18 10:38:03 -05:00
Ralph Castain
50c9f3de55 Ensure the PMIx progress thread is stopped prior to tearing anything down. Thanks to Gilles for spotting this error! 2016-10-18 00:27:52 -07:00
rhc54
a659cb2fda Merge pull request #2229 from rhc54/topic/dvm
Pickup the npernode and npersocket options and include them in the job object
2016-10-17 15:27:21 -05:00
Ralph Castain
57114a09ae Pickup the npernode and npersocket options and include them in the job object 2016-10-17 12:26:21 -07:00
Gilles Gouaillardet
1e3191115b Merge pull request #2172 from ggouaillardet/topic/ialltoall_in_place
support MPI_IN_PLACE in MPI_Ialltoall*
2016-10-17 17:00:47 +09:00
Gilles Gouaillardet
bd1b6fe661 rml/oob: add a missing include file 2016-10-16 10:25:00 +09:00
Gilles Gouaillardet
c530b0a07c mpi/cxx: remove duplicate and now useless typedef 2016-10-15 14:30:00 +09:00
Ralph Castain
50bb0ded70 Update the PMIx nightly scripts to generalize locations 2016-10-14 08:40:05 -07:00
Joshua Ladd
64a15188bd Merge pull request #2199 from vspetrov/coll_hcoll_ialltoallv
coll/hcoll: ialltoallv interface
2016-10-14 07:59:23 -06:00
Gilles Gouaillardet
9389de4199 topo/treematch: fix displacements in mca_topo_treematch_dist_graph_create() 2016-10-14 17:16:49 +09:00
Gilles Gouaillardet
4e19cd51b1 hwloc/external: add a missing include file 2016-10-14 09:27:33 +09:00
rhc54
ef0610dd56 Merge pull request #2223 from rhc54/topic/pmixfix
Repair event notification support and resync to PMIx master
2016-10-13 19:26:44 -05:00
Ralph Castain
6f65d0a173 Repair event notification support. Cleanup the long-suffering "epoll: warning" coming out of libevent whenever a process abnormally terminated.
Add changes to test program

Sync to PMIx master
2016-10-13 16:27:39 -07:00
rhc54
952ea06f15 Merge pull request #2219 from ggouaillardet/topic/oob_vs_pmix_finalize
ess: tear down pmix (if any) before oob
2016-10-13 11:32:47 -05:00
Ralph Castain
6417f217e1 Turn PMIx dstore off by default as MTT was effectively broken 2016-10-13 08:14:51 -07:00
Joshua Ladd
b661307e6f Merge pull request #2218 from yosefe/topic/ucx-pml-spml-update
ucx: adapt pml_ucx and spml_ucx to new UCX APIs
2016-10-13 09:23:37 -04:00
Gilles Gouaillardet
958e29f929 osc/rdma: silence a warning
declare a local variable volatile and silence CID 1372692
2016-10-13 16:10:07 +09:00
Gilles Gouaillardet
451b9dc467 ess: tear down pmix (if any) before oob 2016-10-13 14:08:02 +09:00
Yossi Itigin
05ca466c6b ucx: adapt pml_ucx and spml_ucx to new UCX APIs
- pass field_mask to ucp_init().
- use non-blocking disconnect.
- recv() with pre-allocated request.
- call opal_progress() from iprobe() and improbe().
- use shift pattern in connect/disconnect.
2016-10-12 23:45:45 +03:00
Nathan Hjelm
1b01b6db4f Merge pull request #2213 from hjelmn/osc_rdma
osc/rdma: fix warnings
2016-10-12 13:13:27 -06:00
Nathan Hjelm
5c99d097ee Merge pull request #2214 from hjelmn/cxx_isolation
mpi/cxx: fix warnings
2016-10-12 13:04:30 -06:00
rhc54
4439d880fd Merge pull request #2211 from rhc54/topic/rml
Some compilers apparently complain about this, so modify the typedef …
2016-10-12 13:11:47 -05:00
Nathan Hjelm
ab0a005c95 mpi/cxx: fix warnings
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 11:30:36 -06:00
Nathan Hjelm
5b40fd267f Merge pull request #2204 from hjelmn/arm64
asm/arm64: ensure instruction ordering on timer
2016-10-12 11:22:28 -06:00
Nathan Hjelm
e8ef503bee osc/rdma: fix warnings
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 10:17:25 -06:00
Ralph Castain
fca1556787 Some compilers apparently complain about this, so modify the typedef statements 2016-10-12 08:44:03 -07:00
Nathan Hjelm
9a50ce6364 asm/arm64: ensure instruction ordering on timer
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 09:25:21 -06:00
Ralph Castain
b11c9574d4 Remove debug and update copyright 2016-10-11 23:28:16 -07:00
Ralph Castain
a2326e3ba0 Update the scaling test to properly use orterun for orte-dvm tests, and extend by adding params for async mpi init/finalize 2016-10-11 23:24:52 -07:00
Gilles Gouaillardet
974dd64293 ompi/errhandler: remove useless include file
and avoid #includ'ing stuff that breaks C++ compilers
2016-10-12 14:49:31 +09:00
Ralph Castain
8f05beb1ec Sync pmix/master@cb53105 2016-10-11 20:54:59 -07:00
rhc54
ad156e3e91 Merge pull request #2207 from rhc54/topic/pmixupdate
Update PMIx support to latest PMIx master
2016-10-11 18:57:11 -05:00
rhc54
ee9f33f08c Merge pull request #2146 from rhc54/topic/rml2
Bring the RML modifications across
2016-10-11 18:54:59 -05:00
Jeff Squyres
bcbf0bc4f9 usnic: s/OMPI/OPAL/
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-10-11 16:43:35 -07:00
Ralph Castain
a2919174d0 Bring the RML modifications across. This is the first step in a revamp of the ORTE messaging subsystem to support fabric-based communications during launch and wireup phases. When completed, the grpcomm and plm frameworks will each have their own "conduit" for communication - each conduit corresponds to a particular RML messaging transport. This can be the active OOB-based component, or a provider from within the RML/OFI component. Messages sent down the conduit will flow across the associated transport.
Multiple conduits can exist at the same time, and can even point to the same base transport. Each conduit can have its own characteristics (e.g., flow control) based on the info keys provided to the "open_conduit" call. For ease during the transition period, the "legacy" RML interfaces remain as wrappers over the new conduit-based APIs using a default conduit opened during orte_init - this default conduit is tied to the OOB framework so that current behaviors are preserved. Once the transition has been completed, a one-time cleanup will be done to update all RML calls to the new APIs and the "legacy" interfaces will be deleted.

While we are at it: Remove oob/usock component to eliminate the TMPDIR length problem - get all working, including oob_stress
2016-10-11 16:01:02 -07:00
Ralph Castain
6ce4b6d098 Eliminate -Wall from being hardcoded 2016-10-11 12:50:31 -07:00
Ralph Castain
1859b03416 Enable PMIx shared memory support by default 2016-10-11 12:18:01 -07:00
Ralph Castain
1d7d7c201b Update PMIx support to latest PMIx master 2016-10-11 10:17:23 -07:00
Nathan Hjelm
432d79046b Merge pull request #2197 from tkordenbrock/topic/master/osc-rdma.put.use.true_extent
osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master()
2016-10-11 10:42:02 -06:00
Ryan Grant
fd55204791 Merge pull request #2196 from tkordenbrock/topic/master/osc-portals4.put.use.true_extent
osc-portals4: fix datatype errors in put()
2016-10-10 08:57:12 -06:00