1
1
Граф коммитов

29423 Коммитов

Автор SHA1 Сообщение Дата
Joseph Schuchart
91885f5876 Plug two memory leaks in rdma osc
Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>
2018-11-14 14:31:54 -05:00
matcabral
5f58453e63 MTL/OFI: Lower priority when all procs are local
So far Vader is faster than OFI MTL for doing shared memory.
Therefore, let it run by default when all procs are local.

Reviewed-by: Spruit, Neil R <neil.r.spruit@intel.com>
Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com>
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2018-11-14 11:01:33 -08:00
Sergey Oblomov
e91f214982 OSC/UCX: added UCX version evaluation
- added UCX version evaluation to set OSC UCX priority

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-14 10:03:13 +02:00
KAWASHIMA Takahiro
4e6403ac0f configure: Remove $ac_cv_type_[TYPE] checks for C99 types
Now Open MPI requires a C99 compiler. Checking availability of
the following types is no more needed.

- `long long` (`signed` and `unsigned`)
- `long double`
- `float _Complex`
- `double _Complex`
- `long double _Complex`

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-14 09:57:10 +09:00
KAWASHIMA Takahiro
cacd6f389c datatype: Remove #if HAVE_[TYPE] for C99 types
Now Open MPI requires a C99 compiler. Checking availability of
the following types is no more needed.

- `long long` (`signed` and `unsigned`)
- `long double`
- `float _Complex`
- `double _Complex`
- `long double _Complex`

Furthermore, the `#if HAVE_[TYPE]` style checking is not correct.
Availability of C types is checked by `AC_CHECK_TYPES` in `configure.ac`.
`AC_CHECK_TYPES` defines macro `HAVE_[TYPE]` as `1` in `opal_config.h`
if the `[TYPE]` is available. But it does not define `HAVE_[TYPE]`
(instead of defining as `0`) if it is not available. So even if we
need `HAVE_[TYPE]` checking, it should be `#if defined(HAVE_[TYPE])`.

I didn't remove `AC_CHECK_TYPES` for these types in `configure.ac`
since someone may use `HAVE_[TYPE]` macros somewhere.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-14 09:32:52 +09:00
KAWASHIMA Takahiro
592e2cc0d9
Merge pull request #6072 from kawashima-fj/pr/timing
ess/pmi: Fix `--enable-timing` compilation error
2018-11-13 10:07:45 +09:00
Gilles Gouaillardet
fb6655a5ae
Merge pull request #6071 from ggouaillardet/topic/mpiext_cuda_fix
mpiext/cuda: fix mpiext_cuda_c.h install path
2018-11-12 14:11:11 -06:00
Sergey Oblomov
36934a8bb2 OSC: set UCX module used by default
- OSC/UCX module set priority to 200 to be used by default

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-12 15:08:22 +02:00
KAWASHIMA Takahiro
8e7d874e14 ess/pmi: Fix --enable-timing compilation error
This commit fixes an compilation error when configured
with `--enable-timing`.

Procedures in the function `orte_ess_base_app_setup`
in `orte/mca/ess/base/ess_base_std_app.c` are moved
to `orte/mca/ess/pmi/ess_pmi_module.c`
and `orte/mca/ess/singleton/ess_singleton_module.c`
in the recent commit 57f6b94fa5.

In `ess_pmi_module.c`, the first argument of the
`OPAL_TIMING_ENV_NEXT` macro should have been adapted
to the destination function but was not.

In `ess_singleton_module.c`, `OPAL_TIMING_ENV_INIT`
was not used in the destination function originally.
So `OPAL_TIMING_ENV_NEXT` cannot be used in the function.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-12 16:10:48 +09:00
Gilles Gouaillardet
b3ce25af95 mpiext/cuda: fix mpiext_cuda_c.h install path
This fixes a regression introduced in commit open-mpi/ompi@f8318f0a8f.

Fixes open-mpi/ompi#6069

Thanks Kawashima-san for the heads up !

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-12 00:58:19 -06:00
Thananon Patinyasakdikul
3dc1629771
Merge pull request #5241 from thananon/opal_progress
Add MCA param for multithread opal_progress().
2018-11-09 12:30:07 -05:00
Matias Cabral
30b6435897
Merge pull request #6015 from aravindksg/proc-threshold-fix
MTL/OFI: Check threshold number of peers allowed per rank
2018-11-08 15:47:45 -08:00
Jeff Squyres
2805b8a1d7
Merge pull request #6066 from jsquyres/pr/lsf-readme-update
README: Make LSF text more accurate
2018-11-08 18:31:47 -05:00
Jeff Squyres
25d15cd458
Merge pull request #6065 from jsquyres/pr/moar-updates-to-the-slots-show-help-message
orte-rmaps-base: slightly amend help message
2018-11-08 18:17:04 -05:00
Jeff Squyres
419852ab43 README: Make LSF text more accurate
Also remove a now-outdated LSF reference.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 17:38:26 -05:00
Jeff Squyres
e9bf318dcb orte-rmaps-base: slightly amend help message
Follow on to 430c659908: clarify the help message and fix one typo.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 14:21:47 -08:00
Jeff Squyres
168a56f65e
Merge pull request #6061 from jsquyres/pr/update-out-of-slots-show-help-message
orte-rmaps-base: update out-of-slots show_help message
2018-11-08 15:56:35 -05:00
Jeff Squyres
430c659908 orte-rmaps-base: update out-of-slots show_help message
Update the show_help message for when there are not enough slots to
run an application.

Also, remove a bunch of copies of this message in various show_help
text files that aren't used/referred to anywhere in the code.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 15:02:57 -05:00
Gilles Gouaillardet
efe72d3d92
Merge pull request #6055 from ggouaillardet/topic/c11_atomics
Misc C11 atomics related fixes
2018-11-08 09:11:42 +09:00
Jeff Squyres
2f479d11cc
Merge pull request #6054 from thananon/ofi_fix_warnings
btl/ofi: fixed compiler warning on OSX.
2018-11-07 10:07:53 -05:00
Jeff Squyres
7b82fb834c
Merge pull request #6051 from jsquyres/pr/README-openib-ob1-clarification
master: README: Clarify that only IB->openib is deprecated
2018-11-07 10:05:36 -05:00
Gilles Gouaillardet
72eb53e064 test: remove obsolete tests from orte/test/mpi
Those tests were likely built on a previous Open MPI version
and cannot even build.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:40 +09:00
Gilles Gouaillardet
03eec1ee95 configury: abort when configure'd with --enable-c11-atomics but C11 atomics are not supported
Refs. open-mpi/ompi#6053

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:36 +09:00
Gilles Gouaillardet
07970f192a atomic/c11: fix include header path
simply #include "opal_stdint.h" in order to work with --devel-headers

Refs open-mpi/ompi#6053

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:28 +09:00
Thananon Patinyasakdikul
d9bd54c628 btl/ofi: fixed compiler warning on OSX.
This commit closes #6049

Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>
2018-11-06 15:37:25 -05:00
Jeff Squyres
6cb4159826 README: Clarify that only IB->openib is deprecated
Per feedback from https://github.com/open-mpi/ompi/pull/6028, remove
"+ob1" from the sentence to emphasize that it's only IB usage through
openib that is deprecated/superceded (i.e., ob1 is definitely not
deprecated).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-06 10:04:15 -08:00
Jeff Squyres
aaa165f52c
Merge pull request #6028 from jsquyres/pr/v4.0.0-more-text-changes
README: More updates for v4.0.0
2018-11-06 12:59:08 -05:00
Nathan Hjelm
a968fae504
Merge pull request #6033 from hjelmn/6014_fix
opal/asm: work around possible gcc compiler bug
2018-11-06 10:23:49 -07:00
Gilles Gouaillardet
064a778115
Merge pull request #6034 from ggouaillardet/topic/mpiext_cuda
mpiext/cuda: do not include automatically generated file into dist ta…
2018-11-06 14:55:30 +09:00
Gilles Gouaillardet
f8318f0a8f mpiext/cuda: do not include automatically generated file into dist tarball
ompi/mpiext/cuda/c/mpiext_cuda_c.h is automatically generated from
ompi/mpiext/cuda/c/mpiext_cuda_c.h.in at configure time.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-06 13:57:31 +09:00
Nathan Hjelm
30119ee339 opal/asm: work around possible gcc compiler bug
It seems in some cases (gcc older than v6.0.0) the __atomic_thread_fence is a
no-op with __ATOMIC_ACQUIRE. This appears to be the case with X86_64 so go
ahead and use __ATOMIC_SEQ_CST for the x86_64 read memory barrier. This should
not cause any performance issues as it is equivalent to the memory barrier
in the hand-written atomics.

References #6014

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2018-11-05 11:45:42 -07:00
Geoff Paulsen
db60f699d5
Merge pull request #6031 from gpaulsen/topic/master/README_lsf
README: updating LSF version supported to 9.1.1 or later
2018-11-05 12:40:01 -06:00
Jeff Squyres
4ec8e6fe22 README: More updates for v4.0.0
Move the UCX and MXM text up to flow better with the rest of the
text+content.  Also emphasize that MXM is deprecated.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-05 10:27:54 -05:00
Geoffrey Paulsen
0100595898 README: updating LSF version supported to 9.1.1 or later
Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>
2018-11-05 02:28:42 -06:00
Jeff Squyres
e3eb01fd18
Merge pull request #6026 from jsquyres/pr/ompi-4.0.0-text-updates
v4.0.0 text updates
2018-11-03 17:23:02 -04:00
Jeff Squyres
e2ab41efac README: Add extensive information about deleted MPI-1 syms
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-03 16:25:29 -04:00
Jeff Squyres
78552e81c1 README: Update information about UCX
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-03 16:25:16 -04:00
Jeff Squyres
65eb118e08 MPI_Type_get_envelope: remove MPI-1 deleted names
Several names are now no longer returned by MPI_Type_get_envelope.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-03 16:20:45 -04:00
Howard Pritchard
6494626b6e
Merge pull request #6009 from hppritcha/topic/swat_issue5810
btl/openib: fix a problem with ib query
2018-11-02 10:39:29 -06:00
Geoff Paulsen
c79a4c55a2
Merge pull request #6013 from gpaulsen/task/restore_removed_mpi_items
MPI.h: restore some removed deprecated items
2018-11-01 17:38:04 -05:00
Aravind Gopalakrishnan
5cf43de445 MTL/OFI: Check threshold number of peers allowed per rank
When the provider does not support FI_REMOTE_CQ_DATA, the OFI tag does not have
sizeof(int) bits for the rank. Therefore, unexpected behavior will occur when
this limit is crossed.

Check the max allowed number of ranks during add_procs() and return if there is
danger of exceeding this threshold.

Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>
2018-11-01 14:03:00 -07:00
Geoffrey Paulsen
b03a39d359 mpi.h: restore some MPI-deprecated items to default builds
Commit 89da9651b inadvertantly #if'ed out both deprecated *and*
removed items from mpi.h.  The intent was only to #if out items that
have been *removed* from the MPI specification and leave all items
that are merely deprecated.

This commit also re-orders the deleted typedef+functions to be in the
same order as they are listed in MPI-3.1 chapter 17, just to make
verifying/checking the code easier.

Note that --enable-mpi1-compatibility can still be used to restore
prototypes for the items that have been removed from the MPI
specification (e.g., MPI_Address()).

Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-01 13:36:48 -07:00
Thananon Patinyasakdikul
4e23fedccb opal progress: Added MCA for multithreaded opal_progress.
This commit added MCA param `opal_max_thread_in_progress` to set the
number of threads allowed to do opal_progress concurrently. The default
value is 1.

Component with multithreaded design can benefit from this change to
parallelize their component progress function.

Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>
2018-11-01 13:34:20 -04:00
Aurelien Bouteiller
37954b5fda
Merge pull request #6010 from ICLDisco/export/orte_crashfini
Export/orte crashfini
2018-11-01 13:04:42 -04:00
Yossi Itigin
241b424bd3
Merge pull request #6000 from hoopoepg/topic/added-missing-amo-datatypes
OSHMEM/AMO: added missing C11 macro datatypes
2018-11-01 15:29:56 +02:00
Sergey Oblomov
6e78102089 OSHMEM/AMO: code beautify
- added <cr> to split API groups to simplify human processing

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-01 11:33:34 +02:00
Howard Pritchard
8126779a35 btl/openib: fix a problem with ib query
Under certain circumstances, ibv_exp_query_device was
returning an error due to uninitialized fields in the
extended attributes struct.

Fixes: #5810
Fixes: #5914

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2018-10-31 14:50:15 -06:00
Aurélien Bouteiller
43bd232fd0
Resolve a recursive destruct on the iof proct in finalize
Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu>
2018-10-31 16:38:42 -04:00
Aurelien Bouteiller
348bf8e13f
Prevent errmgr invokation from crashing in finalize
Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>
2018-10-31 16:28:04 -04:00
Matias Cabral
2da31706bf
Merge pull request #5970 from aravindksg/coll-tuned-fix
coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms
2018-10-31 11:20:07 -07:00