Ralph Castain
2f966bf3bf
Cleanup external PMIx v3 component for copy/paste errors - component and module require unique names
2016-10-20 09:11:46 -07:00
Ralph Castain
8113a8d1b0
Now that we are hiding symbols in the internal PMIx component, we cannot reuse that component for integration to the external PMIx master as the symbols don't match. So create a new "ext3x" component and copy the PMIx v3 integration over there.
...
Also, remove a couple of build-product files from the pmix3x component.
2016-10-18 13:15:32 -07:00
Ralph Castain
50c9f3de55
Ensure the PMIx progress thread is stopped prior to tearing anything down. Thanks to Gilles for spotting this error!
2016-10-18 00:27:52 -07:00
Ralph Castain
6f65d0a173
Repair event notification support. Cleanup the long-suffering "epoll: warning" coming out of libevent whenever a process abnormally terminated.
...
Add changes to test program
Sync to PMIx master
2016-10-13 16:27:39 -07:00
Ralph Castain
6417f217e1
Turn PMIx dstore off by default as MTT was effectively broken
2016-10-13 08:14:51 -07:00
Ralph Castain
8f05beb1ec
Sync pmix/master@cb53105
2016-10-11 20:54:59 -07:00
Ralph Castain
6ce4b6d098
Eliminate -Wall from being hardcoded
2016-10-11 12:50:31 -07:00
Ralph Castain
1859b03416
Enable PMIx shared memory support by default
2016-10-11 12:18:01 -07:00
Ralph Castain
1d7d7c201b
Update PMIx support to latest PMIx master
2016-10-11 10:17:23 -07:00
Ralph Castain
5b1484a836
Implement the backend support for process-generated event notification
2016-10-08 09:24:28 -07:00
Gilles Gouaillardet
f1f1fb15eb
pmix3x: configury: output major, minor and release version after checking them
...
and hence fix the configure output
(back-ported from upstream commit pmix/master@7b7cdda2de )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
f3af799608
pmix3x: misc fixes to get pmix build on Solaris
...
- replace MAXHOSTNAMELEN with hardcoded 1024.
unlike Linux, Solaris #define MAXHOSTNAMELEN in <netdb.h>,
so use a hard coded value to keep the test simpl
- stdout cannot be assigned on Solaris, so use freopen instead
(back-ported from upstream commit pmix/master@a63f6e53f4 )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
5cbfddb8f1
pmix3x: fix misc memory leaks
...
(back-ported from upstream commit pmix/master@1eff526929 )
2016-10-08 13:01:28 +09:00
Gilles Gouaillardet
b4e4e4a5f1
pmix3x: enhance pmix_nspace_t destructor
...
PMIX_RELEASE all elements stored in the internal and modex hash tables
(back-ported from upstream commit pmix/master@b90674fc52 )
2016-10-08 13:01:27 +09:00
Gilles Gouaillardet
f1dc033767
pmix3x: add the PMIX_HASH_TABLE_FOREACH macro
...
this is a convenience macro similar to the PMIX_LIST_FOREACH macro,
that can be used to iterate on all the key/value pairs of a pmix_hash_table_t
(back-ported from upstream commit pmix/master@349971c68c )
2016-10-08 13:01:27 +09:00
Gilles Gouaillardet
7601e783cc
pmix3x: sec/munge: add a missing include file
...
(cherry picked from upstream pmix/master@f7cfb11f6b )
2016-10-03 16:09:10 +09:00
Ralph Castain
e773c17cf3
Put show_help thru the PMIx "log" API. This pushes the show_help output from apps into the pmix thread, thus avoiding conflicts in the RML thread, which should help with thread lock situations.
2016-10-02 16:02:23 -07:00
Gilles Gouaillardet
871ade9231
pmix/{cray,s1,s2}: make pmi_opcaddy_t class static
...
theses three pmix components use the same class name,
declare it as static so Open MPI can be built with --disable-dlopen
Thanks Limin Gu for the report
2016-09-28 09:18:36 +09:00
Gilles Gouaillardet
1fbc9a5431
pmix3x: dstore/pmix: flock portability
...
Using the fcntl-locking instead of the flock
(back-ported from upstream pmix/master@3030a0cca1 )
2016-09-27 13:21:03 +09:00
Gilles Gouaillardet
362a5886de
pmix3x: client: fix PMIx_Finalize() sequence
...
pmix_progress_thread_finalize() invokes libevent event_base_free,
so all libevent stuff cannot be used after.
Hence, pmix_client_globals.myserver must be PMIX_DESTRUCT'ed
before invoking pmix_progress_thread_finalize()
2016-09-24 00:01:23 +09:00
Gilles Gouaillardet
5479c6cca7
pmix3x: add missing #include
...
and get Open MPI build on OpenBSD 6.0
2016-09-23 11:23:18 +09:00
Gilles Gouaillardet
fbf03299c3
Merge pull request #2079 from ggouaillardet/topic/pmix_configury_dlopen
...
pmix3x: configury: correctly handle --disable-dlopen
2016-09-21 10:59:33 +09:00
Gilles Gouaillardet
6c1e25b76e
pmix/ext11: fix pmix1_value_unload() prototype and call
...
pmix1_value_unload() was added a "key" argument which is unused,
and pmix1_value_unload() was sometimes invoked with two arguments instead of three.
since the "key" argument is unused, simply remove it from the
subroutine prototype and calls.
2016-09-20 14:34:41 +09:00
Gilles Gouaillardet
041a431966
pmix3x: configury: correctly handle --disable-dlopen
...
the LT_* macros do overwrite the enable_dlopen variable,
so it must be tested and saved before invoking LT_INIT.
delay the invokation of the LT_* macros and use the
PMIX_ENABLE_DLOPEN_SUPPORT variable to figure out whether
--disable-dlopen was invoked
2016-09-15 13:26:20 +09:00
Artem Polyakov
dc0ab674de
Add PMIx key to provide RM with ability to indicate that it will cleanup
...
session directories provided at through OPAL_PMIX_TMPDIR,
OPAL_PMIX_NSDIR, OPAL_PMIX_PROCDIR
2016-09-05 07:48:44 +03:00
Ralph Castain
34f04a7924
Remove spurious Makefile.am line
2016-09-01 15:31:09 -07:00
Ralph Castain
0ea1cff733
Implement notification of completion on comm_spawn'd child jobs. Add a configure flag to enable PMIx 3's shared memory datastore, and set it disable by default so that comm_spawn functions again. Will reverse the default once that feature is fully functional
2016-09-01 13:10:10 -07:00
Ralph Castain
39992d1ad7
Silence trivial Coverity warnings
2016-08-31 09:42:33 -07:00
Ralph Castain
cfa784c9a6
Since we changed storage to pointers in pmix_value_t, we need to allocate space for those values when unpacking
2016-08-29 20:22:24 -07:00
rhc54
b90a64e734
Merge pull request #2022 from rhc54/topic/nnodes
...
Provide the number of nodes in the job
2016-08-26 18:15:24 -05:00
Ralph Castain
2f6e0fec90
Provide the number of nodes in the job
2016-08-26 14:50:41 -07:00
Jeff Squyres
e03a40a0e9
pmix3x: remove generated file
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-26 10:30:47 -07:00
Gilles Gouaillardet
e4bf915e75
pmix3x: remove auto-generated file
...
remove opal/mca/pmix/pmix3x/pmix/src/include/pmix_config.h.in
.gitignore is correct, so it seems this file was added before .gitignore was updated
2016-08-26 15:00:18 +09:00
Ralph Castain
af67f16422
Update configury to support multiple PMIx versions, rename pmix2x component to pmix3x for support of PMIx master
...
Update support for external v1.1.x and v2.x libraries. Minor corrections to the v3.x component
2016-08-25 18:19:05 -07:00
Gilles Gouaillardet
02847d9e7b
pmix2x: dstore: add missing <fcntl.h> include file in pmix_esh.c
...
(back-ported from upstream pmix/master@5c66ffe0f0 )
2016-08-24 11:18:46 +09:00
Gilles Gouaillardet
c11e8163f8
pmix2x: sec/native: fix the pmix_native module under solaris by using getpeerucred()
...
and fail with a user friendly message if no method is available:
"sec: native cannot validate_cred on this system"
(back-ported from upstream pmix/master@c474a1fc60 )
2016-08-24 11:18:40 +09:00
Gilles Gouaillardet
e91292aa41
pmix2x: configury: add missing check for <netdb.h> header file
...
(back-ported from upstream pmix/master@e54ce6d423 )
2016-08-24 11:18:32 +09:00
Ralph Castain
639dbdb7ea
For maintainability, fold the external PMIx 2.x integration into the internal PMIx 2.x library component. This ensures that we always stay in sync with the two as that is becoming a problem.
2016-08-22 13:28:55 -07:00
Ralph Castain
61ffba668b
Roll in the latest PMIx version - includes shared memory datastore and reduced memory footprint
2016-08-20 07:53:06 -07:00
Artem Polyakov
6ea8cccdab
Merge pull request #1969 from artpol84/pmix_jobid_fix
...
Pmix jobid fix
2016-08-18 17:24:58 +07:00
Ralph Castain
7da9793fef
Support the PMIX_TIMEOUT key at the PMIx server when timeout=0 - this indicates that the user doesn't want a lookup of any data from the host RM.
2016-08-17 16:26:58 -05:00
Gilles Gouaillardet
3126ff77e2
pmix2x: common syms: whitelist bison-generated common symbols
...
Bison generates some common symbols that we can't do anything about,
so whitelist them.
2016-08-16 11:29:06 +09:00
Artem Polyakov
c5a91c5c9d
opal/pmix: fix pmix jobid calculation if external PMIx server is used.
2016-08-15 21:13:51 +03:00
Ralph Castain
ecbedee8bb
Fix typo
2016-08-15 07:32:00 -07:00
Artem Polyakov
f3c816b52e
opal/pmix: fix indentation in some files.
2016-08-15 18:21:50 +07:00
Gilles Gouaillardet
483685eb6a
update .gitignore
...
remove autogenerated opal/mca/pmix/pmix2x/pmix/src/include/pmix_config.h.in
2016-08-15 17:00:20 +09:00
Ralph Castain
be8424b691
Provide backward compatible keys so that the non-PMIx components in the opal/pmix framework don't have to adjust as we continue to work on finalizing the PMIx reference scheme. Activate and utilize the new PMIx show_help capability to provide more meaningful error output when the server cannot start.
...
Add a contrib script to cleanup permissions incorrectly modified due to things like smb mounts
dd
2016-08-13 12:13:04 -07:00
rhc54
ddde154d28
Merge pull request #1962 from rhc54/topic/notify
...
Ensure we properly convert pmix status to ORTE state before activatin…
2016-08-13 06:59:50 -07:00
Ralph Castain
48d35a9627
Ensure we properly convert pmix status to ORTE state before activating an error state upon notification. Cleanup some conversion issues on notification info. Add a new orte_notify.c test program
2016-08-12 21:14:29 -07:00
Ralph Castain
4a4c9703a9
Setup the job list in the PMIx integration so that static ports can run
2016-08-12 13:27:10 -07:00