Gilles Gouaillardet
9389de4199
topo/treematch: fix displacements in mca_topo_treematch_dist_graph_create()
2016-10-14 17:16:49 +09:00
Gilles Gouaillardet
4e19cd51b1
hwloc/external: add a missing include file
2016-10-14 09:27:33 +09:00
rhc54
ef0610dd56
Merge pull request #2223 from rhc54/topic/pmixfix
...
Repair event notification support and resync to PMIx master
2016-10-13 19:26:44 -05:00
Ralph Castain
6f65d0a173
Repair event notification support. Cleanup the long-suffering "epoll: warning" coming out of libevent whenever a process abnormally terminated.
...
Add changes to test program
Sync to PMIx master
2016-10-13 16:27:39 -07:00
rhc54
952ea06f15
Merge pull request #2219 from ggouaillardet/topic/oob_vs_pmix_finalize
...
ess: tear down pmix (if any) before oob
2016-10-13 11:32:47 -05:00
Ralph Castain
6417f217e1
Turn PMIx dstore off by default as MTT was effectively broken
2016-10-13 08:14:51 -07:00
Joshua Ladd
b661307e6f
Merge pull request #2218 from yosefe/topic/ucx-pml-spml-update
...
ucx: adapt pml_ucx and spml_ucx to new UCX APIs
2016-10-13 09:23:37 -04:00
Gilles Gouaillardet
958e29f929
osc/rdma: silence a warning
...
declare a local variable volatile and silence CID 1372692
2016-10-13 16:10:07 +09:00
Gilles Gouaillardet
451b9dc467
ess: tear down pmix (if any) before oob
2016-10-13 14:08:02 +09:00
Yossi Itigin
05ca466c6b
ucx: adapt pml_ucx and spml_ucx to new UCX APIs
...
- pass field_mask to ucp_init().
- use non-blocking disconnect.
- recv() with pre-allocated request.
- call opal_progress() from iprobe() and improbe().
- use shift pattern in connect/disconnect.
2016-10-12 23:45:45 +03:00
Nathan Hjelm
1b01b6db4f
Merge pull request #2213 from hjelmn/osc_rdma
...
osc/rdma: fix warnings
2016-10-12 13:13:27 -06:00
Nathan Hjelm
5c99d097ee
Merge pull request #2214 from hjelmn/cxx_isolation
...
mpi/cxx: fix warnings
2016-10-12 13:04:30 -06:00
rhc54
4439d880fd
Merge pull request #2211 from rhc54/topic/rml
...
Some compilers apparently complain about this, so modify the typedef …
2016-10-12 13:11:47 -05:00
Nathan Hjelm
ab0a005c95
mpi/cxx: fix warnings
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 11:30:36 -06:00
Nathan Hjelm
5b40fd267f
Merge pull request #2204 from hjelmn/arm64
...
asm/arm64: ensure instruction ordering on timer
2016-10-12 11:22:28 -06:00
Nathan Hjelm
e8ef503bee
osc/rdma: fix warnings
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 10:17:25 -06:00
Ralph Castain
fca1556787
Some compilers apparently complain about this, so modify the typedef statements
2016-10-12 08:44:03 -07:00
Nathan Hjelm
9a50ce6364
asm/arm64: ensure instruction ordering on timer
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-10-12 09:25:21 -06:00
Ralph Castain
b11c9574d4
Remove debug and update copyright
2016-10-11 23:28:16 -07:00
Ralph Castain
a2326e3ba0
Update the scaling test to properly use orterun for orte-dvm tests, and extend by adding params for async mpi init/finalize
2016-10-11 23:24:52 -07:00
Gilles Gouaillardet
974dd64293
ompi/errhandler: remove useless include file
...
and avoid #includ'ing stuff that breaks C++ compilers
2016-10-12 14:49:31 +09:00
Ralph Castain
8f05beb1ec
Sync pmix/master@cb53105
2016-10-11 20:54:59 -07:00
rhc54
ad156e3e91
Merge pull request #2207 from rhc54/topic/pmixupdate
...
Update PMIx support to latest PMIx master
2016-10-11 18:57:11 -05:00
rhc54
ee9f33f08c
Merge pull request #2146 from rhc54/topic/rml2
...
Bring the RML modifications across
2016-10-11 18:54:59 -05:00
Jeff Squyres
bcbf0bc4f9
usnic: s/OMPI/OPAL/
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-10-11 16:43:35 -07:00
Ralph Castain
a2919174d0
Bring the RML modifications across. This is the first step in a revamp of the ORTE messaging subsystem to support fabric-based communications during launch and wireup phases. When completed, the grpcomm and plm frameworks will each have their own "conduit" for communication - each conduit corresponds to a particular RML messaging transport. This can be the active OOB-based component, or a provider from within the RML/OFI component. Messages sent down the conduit will flow across the associated transport.
...
Multiple conduits can exist at the same time, and can even point to the same base transport. Each conduit can have its own characteristics (e.g., flow control) based on the info keys provided to the "open_conduit" call. For ease during the transition period, the "legacy" RML interfaces remain as wrappers over the new conduit-based APIs using a default conduit opened during orte_init - this default conduit is tied to the OOB framework so that current behaviors are preserved. Once the transition has been completed, a one-time cleanup will be done to update all RML calls to the new APIs and the "legacy" interfaces will be deleted.
While we are at it: Remove oob/usock component to eliminate the TMPDIR length problem - get all working, including oob_stress
2016-10-11 16:01:02 -07:00
Ralph Castain
6ce4b6d098
Eliminate -Wall from being hardcoded
2016-10-11 12:50:31 -07:00
Ralph Castain
1859b03416
Enable PMIx shared memory support by default
2016-10-11 12:18:01 -07:00
Ralph Castain
1d7d7c201b
Update PMIx support to latest PMIx master
2016-10-11 10:17:23 -07:00
Nathan Hjelm
432d79046b
Merge pull request #2197 from tkordenbrock/topic/master/osc-rdma.put.use.true_extent
...
osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master()
2016-10-11 10:42:02 -06:00
Ryan Grant
fd55204791
Merge pull request #2196 from tkordenbrock/topic/master/osc-portals4.put.use.true_extent
...
osc-portals4: fix datatype errors in put()
2016-10-10 08:57:12 -06:00
Todd Kordenbrock
05f86b5df7
osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master()
...
Instead of ompi_datatype_get_extent(), use ompi_datatype_get_true_extent()
to get the local and remote lower bound. For derived types like
subarray, true_lb is the correct offset for RDMA operations.
2016-10-10 06:45:28 -05:00
Todd Kordenbrock
cc863ff9fb
osc-portals4: fix datatype errors in put()
...
Instead of ompi_datatype_get_extent(), use ompi_datatype_get_true_extent()
to get the origin and target lower bound. For derived types like
subarray, true_lb is the correct offset for RDMA operations. Also,
instead of the extent use the size of the datatype.
2016-10-10 06:45:14 -05:00
rhc54
b703f2e167
Merge pull request #2195 from rhc54/topic/notify
...
Implement the backend support for process-generated event notification
2016-10-08 13:55:19 -05:00
Ralph Castain
5b1484a836
Implement the backend support for process-generated event notification
2016-10-08 09:24:28 -07:00
Gilles Gouaillardet
315a622723
ompi: invokes opal_cleanup() if ompi_mpi_finalize() when possible
...
As long as it is illegal to call MPI_T_init_thread() after MPI_Finalize(),
be gentle and release as much memory as possible in MPI_Finalize().
opal_cleanup() will be invoked again by the OPAL destructor, but will
do nothing since classes was set to NULL
2016-10-08 16:58:20 +09:00
Gilles Gouaillardet
0d24fad307
opal: always run opal_class_finalize in the opal_cleanup destructor
...
if MPI_Init[_thread]/MPI_Finalize and MPI_T_init_thread/MPI_T_finalize
are balanced, opal_initialized is zero, and hence opal_cleanup destructor
never invokes opal_class_finalize.
if MPI_Init[_thread] nor MPI_T_init_thread have been called, classes is NULL,
so opal_class_finalize does nothing
2016-10-08 16:58:20 +09:00
Gilles Gouaillardet
b55dd2442a
libevent2022: rename _event_strlcpy
2016-10-08 16:58:20 +09:00
Gilles Gouaillardet
c92e9a5406
use the new OPAL_HASH_TABLE_FOREACH convenience macro
2016-10-08 16:58:20 +09:00
Gilles Gouaillardet
23a8f764bd
opal: add the OPAL_HASH_TABLE_FOREACH macro
...
this is a convenience macro similar to the OPAL_LIST_FOREACH macro,
that can be used to iterate on all the key/value pairs of an opal_hash_table_t
2016-10-08 16:58:20 +09:00
Gilles Gouaillardet
014f917462
opal: fix comment in OPAL_LIST_FOREACH macro. no code change.
2016-10-08 16:58:19 +09:00
Gilles Gouaillardet
13d49c135f
Merge pull request #2193 from ggouaillardet/topic/pmix_misc_plugs_and_fixes
...
pmix3x: plugs misc memory leaks and misc fixes
2016-10-08 16:57:06 +09:00
Gilles Gouaillardet
f1f1fb15eb
pmix3x: configury: output major, minor and release version after checking them
...
and hence fix the configure output
(back-ported from upstream commit pmix/master@7b7cdda2de )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
f3af799608
pmix3x: misc fixes to get pmix build on Solaris
...
- replace MAXHOSTNAMELEN with hardcoded 1024.
unlike Linux, Solaris #define MAXHOSTNAMELEN in <netdb.h>,
so use a hard coded value to keep the test simpl
- stdout cannot be assigned on Solaris, so use freopen instead
(back-ported from upstream commit pmix/master@a63f6e53f4 )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
5cbfddb8f1
pmix3x: fix misc memory leaks
...
(back-ported from upstream commit pmix/master@1eff526929 )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
b4e4e4a5f1
pmix3x: enhance pmix_nspace_t destructor
...
PMIX_RELEASE all elements stored in the internal and modex hash tables
(back-ported from upstream commit pmix/master@b90674fc52 )
2016-10-08 13:01:27 +09:00
Gilles Gouaillardet
f1dc033767
pmix3x: add the PMIX_HASH_TABLE_FOREACH macro
...
this is a convenience macro similar to the PMIX_LIST_FOREACH macro,
that can be used to iterate on all the key/value pairs of a pmix_hash_table_t
(back-ported from upstream commit pmix/master@349971c68c )
2016-10-08 13:01:27 +09:00
rhc54
73298ad4e2
Merge pull request #2192 from rhc54/topic/showhelp
...
Send show_help out thru stderr
2016-10-07 22:43:44 -05:00
Ralph Castain
51b2bb1d41
Send show_help out thru stderr
2016-10-07 19:23:52 -07:00
Joshua Ladd
4f1b63d9a2
Merge pull request #2188 from jladd-mlnx/topic/oshmem-bump-to-v1.3
...
OSHMEM Specification version: Bump to v1.3.
2016-10-07 07:15:43 -04:00