
28043 Commits

Author SHA1 Message Date
Aravind Gopalakrishnan
285fc42b4e Fix OFI MTL to recognize correct CQ empty scenario
Currently, the progress function incorrectly interprets any return
value other than a positive value or -FI_EAVAIL to mean the CQ is empty.
The CQ is empty only if the fi_cq_read() call returned the -EAGAIN error
code. Fix that here.

While at it, fix help text output for calls made to OFI API.

Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>
2017-10-30 12:13:44 -07:00
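
The distinction drawn in the commit message above can be sketched as a small classifier over the return value of a completion-queue read. This is only an illustration, not the actual MTL code: the names `classify_cq_read`, `cq_state_t`, `SKETCH_FI_EAGAIN` and `SKETCH_FI_EAVAIL` are hypothetical, and the constants are local placeholders so the block compiles without libfabric headers.

```c
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>

/* Hypothetical stand-ins for the libfabric error codes, defined locally so
 * the sketch builds on its own. Real code would take both values from the
 * libfabric headers; 259 is just a placeholder here. */
#define SKETCH_FI_EAGAIN EAGAIN
#define SKETCH_FI_EAVAIL 259

typedef enum {
    CQ_HAS_COMPLETIONS, /* ret > 0: that many entries were read */
    CQ_EMPTY,           /* ret == -EAGAIN: nothing to read right now */
    CQ_ERROR_ENTRY,     /* ret == -FI_EAVAIL: an error entry is waiting */
    CQ_FATAL            /* anything else is a real failure, not "empty" */
} cq_state_t;

/* Classify the return value of a CQ read. Only -EAGAIN means "empty";
 * every other non-positive value is reported instead of being ignored. */
cq_state_t classify_cq_read(ssize_t ret)
{
    if (ret > 0) {
        return CQ_HAS_COMPLETIONS;
    }
    if (ret == -SKETCH_FI_EAGAIN) {
        return CQ_EMPTY;
    }
    if (ret == -SKETCH_FI_EAVAIL) {
        return CQ_ERROR_ENTRY;
    }
    return CQ_FATAL;
}
```

A progress loop built on this would simply return on `CQ_EMPTY` and surface `CQ_FATAL` to the caller rather than treating it as an idle queue.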
Thananon Patinyasakdikul
e3b267a8fe pml/ob1: match callback will now queue wrong sequence frag and return.
In the multithreaded case, it is expensive to release the lock, call the slow match,
and retake the lock just to queue the frag. This patch reduces the number of
locks taken by queueing the frag right away and returning.

Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>
2017-10-30 12:01:50 -04:00
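
The idea in the commit message above, queueing an out-of-sequence fragment while the lock is already held, can be sketched with a toy model. Everything here is hypothetical (`peer_t`, `on_fragment`, and the `lock_count` field that stands in for a real mutex so acquisitions can be counted); it is not the ob1 data structure.

```c
#include <stddef.h>

#define MAX_PENDING 16

typedef struct {
    unsigned expected_seq;         /* next sequence number we can match */
    unsigned pending[MAX_PENDING]; /* out-of-order sequence numbers, parked */
    size_t   npending;
    unsigned lock_count;           /* counts acquisitions; stand-in for a mutex */
} peer_t;

static void peer_lock(peer_t *p)   { p->lock_count++; }
static void peer_unlock(peer_t *p) { (void)p; }

/* Handle one incoming fragment under a single lock acquisition.
 * Returns 1 if it matched in order, 0 if it was queued, -1 if the
 * pending queue is full. */
int on_fragment(peer_t *p, unsigned seq)
{
    int rc;

    peer_lock(p);
    if (seq == p->expected_seq) {
        p->expected_seq++;               /* in order: match it now */
        rc = 1;
    } else if (p->npending < MAX_PENDING) {
        p->pending[p->npending++] = seq; /* wrong sequence: queue right away */
        rc = 0;
    } else {
        rc = -1;
    }
    peer_unlock(p);
    return rc;
}
```

The point of the sketch is that the out-of-order path costs one lock acquisition, instead of an unlock followed by a second lock inside a separate slow path.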
Gilles Gouaillardet
582aa6c0f0
Merge pull request #4422 from ggouaillardet/topic/configury_zlib_misc_fixes
configury: misc fixes in zlib detection
2017-10-30 17:10:31 +09:00
Gilles Gouaillardet
48a39e34f2 configury: misc fixes in zlib detection
- push extra local variables
- remove unnecessary AC_MSG_RESULT

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-10-30 16:19:37 +09:00
Ralph Castain
56067f38e3
Merge pull request #4418 from rhc54/topic/pinterlib
Add another test program for cross-lib coordination, this one based on native PMIx commands
2017-10-29 16:41:40 -05:00
Ralph Castain
6be74bfa7e Add another test program for cross-lib coordination, this one based on native PMIx commands
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-29 11:33:25 -07:00
Ralph Castain
21474a6933
Merge pull request #4413 from rhc54/topic/event
Address external libevent once again
2017-10-29 13:16:26 -05:00
Ralph Castain
36d7e752b6 I think we have all concluded that there is no good answer to locating the external libevent library, so surrender to the situation and simply remove that requirement. Users wanting to utilize the embedded PMIx library can install it, but will have to use mpicc _and_ add an explicit -lpmix to their cmd line to compile their application.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-29 07:39:02 -07:00
Ralph Castain
ae4f310c0b
Merge pull request #4406 from rhc54/topic/update
Sync to PMIx master (now v3.0)
2017-10-27 12:22:44 -05:00
Ralph Castain
df48ddd2a1 Merge pull request #4323 from aravindksg/fix_help_text
Move help text output regarding PSM2_CUDA environment variable
2017-10-27 10:10:01 -05:00
Gilles Gouaillardet
5d208a177d Merge pull request #4407 from ggouaillardet/topic/configury_zlib
configury: fix handling of --with-zlib=DIR and --with-zlib-libdir=DIR…
2017-10-27 15:54:55 +09:00
Gilles Gouaillardet
5c61a4e3a5 configury: fix handling of external libevent library
Search external libevent library in both DIR/lib64 and DIR/lib
when --with-libevent=DIR is specified but --with-libevent-libdir=DIR is not

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-10-27 15:52:18 +09:00
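
The search order described in the commit message above can be sketched as follows. The real logic lives in the configure scripts; this C rendering is only an illustration, and `pick_libdir`, `has_lib_fn` and `only_in_plain_lib` are hypothetical names. Using an explicitly given libdir as-is is an assumption of the sketch.

```c
#include <stdio.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical predicate: returns nonzero if the library is found in dir. */
typedef int (*has_lib_fn)(const char *dir);

/* Choose the library directory. An explicit libdir wins (assumption);
 * otherwise try DIR/lib64 first and then DIR/lib.
 * Writes the result into out and returns 0, or returns -1 if nothing fits. */
int pick_libdir(const char *with_dir, const char *with_libdir,
                has_lib_fn has_lib, char *out, size_t outlen)
{
    static const char *const subdirs[] = { "lib64", "lib" };
    size_t i;
    int n;

    if (with_libdir != NULL) {
        n = snprintf(out, outlen, "%s", with_libdir);
        return (n >= 0 && (size_t)n < outlen) ? 0 : -1;
    }
    if (with_dir == NULL) {
        return -1;
    }
    for (i = 0; i < sizeof(subdirs) / sizeof(subdirs[0]); i++) {
        n = snprintf(out, outlen, "%s/%s", with_dir, subdirs[i]);
        if (n < 0 || (size_t)n >= outlen) {
            return -1;
        }
        if (has_lib(out)) {
            return 0;
        }
    }
    return -1;
}

/* Example predicate for illustration only: pretends the library lives
 * in any directory whose path ends in "/lib". */
int only_in_plain_lib(const char *dir)
{
    size_t n = strlen(dir);
    return n >= 4 && strcmp(dir + n - 4, "/lib") == 0;
}
```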
Gilles Gouaillardet
5a816a98df configury: fix handling of --with-zlib=DIR and --with-zlib-libdir=DIR option
- if both --with-zlib=DIR and --with-zlib-libdir=LIBDIR are given, do not search for
  libs in DIR/lib[64], and do not abort if libs are not there
- if --with-zlib=DIR is given but not --with-zlib-libdir, then append
  -LDIR/lib[64] to LDFLAGS

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-10-27 14:47:47 +09:00
Ralph Castain
ea3508b26b Sync to PMIx master (now v3.0)
Fix an apparent typo in external libevent configury
Require external libevent for install of separate libpmix

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-26 21:05:17 -07:00
Aravind Gopalakrishnan
bea4503f95 Move help text output regarding PSM2_CUDA envvar to component init phase
The messages should be printed only in the event of CUDA builds and in the
presence of supporting hardware and when PSM2 MTL has actually been selected
for use. To this end, move help text output to component init phase.

Also use opal_setenv/unsetenv() for safer setting and unsetting of the environment
variable, and sanitize the help text message.

Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>
2017-10-26 16:01:01 -07:00
Thomas Naughton
86d282d6dd fix PML monitoring configury to compile DSOs
Signed-off-by: Thomas Naughton <naughtont@ornl.gov>
2017-10-26 15:53:11 -04:00
Ralph Castain
01ed7548c4 Update to PMIx v3.0a
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-25 12:25:27 -07:00
Ralph Castain
05f98d66a6 Merge pull request #4396 from rhc54/topic/pmixconfig
Alter the PMIx embedded configuration
2017-10-25 10:10:50 -05:00
Gilles Gouaillardet
c4650b5904 Merge pull request #4383 from ggouaillardet/topic/configury_ucx
configury: revamp ucx detection
2017-10-25 15:33:38 +09:00
Ralph Castain
8fbfe68754 Alter the PMIx embedded configuration so that we can build static with devel headers - if the builder requests that we install a separate libpmix, then don't prefix the PMIx variables.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-24 21:45:27 -07:00
Ralph Castain
cf3bc4f55b Merge pull request #4346 from matcabral/psm2_mtl_mq_thread_fix
MTL PSM2: add a thread lock while peeking and completing the psm2 requests.
2017-10-24 16:41:29 -05:00
Ralph Castain
14a0701949 Merge pull request #4391 from rhc54/topic/scale
Add timeout option to scaling script
2017-10-24 14:34:48 -05:00
Ralph Castain
e7c6718d29 Add timeout option to scaling script
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-24 12:33:22 -07:00
Ralph Castain
987aac1268 Merge pull request #4387 from rhc54/topic/dmodx
We should never block when requesting dmodex data from the PMIx server
2017-10-24 10:46:04 -05:00
Ralph Castain
292983261a We should never block when requesting dmodex data from the PMIx server as this will block it from being able to accept connections from local clients. Do not deregister standing dmodx requests when a fence completes unless we actually collected the data in the fence
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-24 07:51:10 -07:00
Ralph Castain
70c455938b Merge pull request #4382 from rhc54/topic/scaling
Update the scaling script to avoid use of "system" command
2017-10-23 22:43:25 -05:00
Ralph Castain
0353be9704 Update MPI init to properly skip barriers
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-23 19:28:34 -07:00
Gilles Gouaillardet
af03f55aa8 configury: revamp ucx detection
- when --with-ucx=DIR is not set, try the default path and fall back to /opt/ucx
- when --with-ucx-libdir is not set, try the lib64 and then the lib directories
- do not handle --with-ucx-libdir (this is a user mistake, no need to over-complicate our logic)

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-10-24 09:27:57 +09:00
Ralph Castain
3b71be4db4 Update the scaling script to avoid use of "system" command, thus ensuring that each command sees the same environment. Fix prun to pickup and propagate OMPI MCA params
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-23 16:27:41 -07:00
bosilca
ac348da13a Merge pull request #4374 from bosilca/topic/osx_syslog
Topic/osx syslog
2017-10-23 18:06:36 -04:00
Ralph Castain
0721d933fc Merge pull request #4376 from rhc54/topic/interlib
Update the interlib example to show an alternative method for model declaration
2017-10-23 14:53:13 -05:00
Ralph Castain
e33f319380 Update example to show tests of various APIs
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-23 12:02:54 -07:00
Ralph Castain
6ea3c8a0bd Update the interlib example to show an alternative method for model declaration. Add a missing range value to the OPAL layer. Make it easier to see OMPI model callbacks
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-23 11:27:42 -07:00
George Bosilca
8f32b345de
Address syslog issues on OSX 10.13 with gcc 7.x
gcc 7.[1,2] (at least) fails to correctly parse the OSX 10.13 sys/syslog.h
header. As a result, we need to protect syslog support in OPAL, PMIX and
ORTE.

Signed-off-by: George Bosilca <bosilca@icl.utk.edu>
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-10-23 14:02:10 -04:00
Ralph Castain
79aef9369e Merge pull request #4371 from rhc54/topic/xvr
Updates to support cross-version operations with OMPI v2.x
2017-10-22 11:41:18 -05:00
Ralph Castain
a63904d47f Updates to support cross-version operations with OMPI v2.x
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-22 08:38:33 -07:00
Matias Cabral
b81bcd4b0d MTL PSM2: add a thread lock while peeking and completing the psm2 requests.
Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com>
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2017-10-20 14:46:48 -07:00
Edgar Gabriel
defe73984a Merge pull request #4362 from edgargabriel/topic/fbtl-locking-support
Add file locking support in posix fbtl
2017-10-19 23:21:17 -05:00
Ralph Castain
f374ba161c Merge pull request #4366 from rhc54/topic/notify
Fix event registration so OpenMP/MPI coordination sides can both get notification of model declarations
2017-10-19 21:05:57 -05:00
Ralph Castain
f8ce31f13c Fix event registration so OpenMP/MPI coordination sides can both get notification of model declarations
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-19 18:06:38 -07:00
Ralph Castain
4eee95358b Merge pull request #4365 from rhc54/topic/routed
Ensure we update the routing plan so that tree spawn works!
2017-10-19 17:27:47 -05:00
Ralph Castain
75d411f3ea Ensure we update the routing plan so that tree spawn works!
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-19 14:02:06 -07:00
Edgar Gabriel
be0de21e6f fs/ufs and fbtl/posix: cleanup lock management
This commit looks large, but it's really mostly a cleanup step.
1. introduce proper error handling for the return values of fcntl and the fbtl_posix_lock function
2. rename a parameter to more accurately reflect what it does
3. introduce an mca parameter in the fs/ufs component that controls
   the level of locking the user would like to enforce
4. move the initialization of the fs_block_size parameter from fs/ufs into the
   common/ompio component. An fs component might be allowed to overwrite this
   value, but none of the actual fs components do that.

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 14:56:28 -05:00
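
Item 1 of the commit message above, checking the return value of fcntl when locking, can be sketched with plain POSIX calls. This is a minimal illustration and not the fbtl_posix_lock function from the commit: `lock_range` is a hypothetical helper, and retrying on EINTR and returning a negated errno are design choices made for the sketch.

```c
#include <errno.h>
#include <fcntl.h>
#include <unistd.h>

/* Hypothetical helper (not Open MPI's fbtl_posix_lock): apply a byte-range
 * lock of the given type (F_RDLCK, F_WRLCK or F_UNLCK) to [offset, offset+len).
 * Returns 0 on success or a negated errno value on failure. */
int lock_range(int fd, short type, off_t offset, off_t len)
{
    struct flock fl = {0};
    int rc;

    fl.l_type = type;
    fl.l_whence = SEEK_SET;
    fl.l_start = offset;
    fl.l_len = len;

    /* F_SETLKW blocks until the lock is granted; retry if a signal
     * interrupts the wait instead of reporting a spurious failure. */
    do {
        rc = fcntl(fd, F_SETLKW, &fl);
    } while (rc == -1 && errno == EINTR);

    return (rc == -1) ? -errno : 0;
}
```

A caller would wrap each read or write in a `lock_range(fd, F_WRLCK, ...)` and `lock_range(fd, F_UNLCK, ...)` pair and propagate any nonzero result instead of ignoring it.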
Edgar Gabriel
e62f9d2e52 fs/ufs: ensure that the never-lock flag is set if not on NFS
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 13:32:40 -05:00
Edgar Gabriel
f66c55f77a fbtl/posix: fixes in the offset calculation and for aio operations
Our own internal test suite now passes correctly. More testing to follow.

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 13:32:39 -05:00
Edgar Gabriel
a3c638bc38 fbtl/posix: add support for file locking for the non-blocking operations
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 13:32:38 -05:00
Edgar Gabriel
415e76514d fbtl/posix: make the code compile
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 13:32:37 -05:00
Edgar Gabriel
f5e158c869 fbtl/posix: first cut in adding locking support
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-10-19 13:32:37 -05:00
Yossi Itigin
689f1be9b7 Merge pull request #4350 from alex-mikheev/topic/oshmem_spml_selection_fix
OSHMEM: add ucx to the list of default spmls
2017-10-19 17:59:52 +03:00
Gilles Gouaillardet
9771c575f5 Merge pull request #4352 from edgargabriel/pr/sem_close_fix
sharedfp/sm: close the named semaphore
2017-10-19 17:04:43 +09:00