1
1
Граф коммитов

29491 Коммитов

Автор SHA1 Сообщение Дата
Yossi Itigin
e98ce2b36b
Merge pull request #6108 from yosefe/topic/pml-ucx-init-req_mpi_object
pml_ucx: initialize req_mpi_object.comm for error handler
2018-11-26 11:54:10 +02:00
KAWASHIMA Takahiro
829e4e78f7
Merge pull request #6111 from kawashima-fj/pr/update-pcoll-doc
README & man: Update pcollreq documentation
2018-11-26 18:26:58 +09:00
Sergey Oblomov
ff2fd0679e OSHMEM/COLL: optimization on zero-length ops
- removed barrier call on zero-length operations

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-26 10:41:33 +02:00
KAWASHIMA Takahiro
5f0fcf0f45 README & man: Update pcollreq documentation
The feature of persistent collectives is approved in the Sept. 2018
MPI Forum meeting and 2018 Draft Specification of the MPI standard is
published during SC18.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-26 17:27:43 +09:00
Gilles Gouaillardet
475d0355d7 configury: fix auto-generated mpi-[f08-]ext-module.F90
prefer #include vs include in order to correctly handle long Fortran lines.
We use the full path, and it can be very long, this is why
it cannot be passed to the Fortran compiler.

Thanks Igor Andriyash and Axel Huebl for reporting this issue.

Refs open-mpi/ompi#6106

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-26 10:25:08 +09:00
Yossi Itigin
f36eeef4c5 pml_ucx: initialize req_mpi_object.comm for error handler
without this fix, an error handler invoked on pml_ucx request would
segfault while trying to dereference requests[i]->req_mpi_object.comm

Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2018-11-25 19:37:54 +02:00
Sergey Oblomov
9de128afaf OSHMEM: added processing of zero-length collectives
- according spec 1.4, annex C shmem collectives should process
  calls where number of elements is zero independently from pointer
  value
- added zero-count processing - it just call barrier to
  sync ranks

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-23 14:34:46 +02:00
Yossi Itigin
f05ebe82d3
Merge pull request #6096 from hoopoepg/topic/amo-int32-64-datatypes
OSHMEM/AMO: added int/uint/32/64 atomics calls
2018-11-22 11:21:17 +02:00
Yossi Itigin
ed967d867b
Merge pull request #6073 from hoopoepg/topic/set-osc-ucx-level-200
OSC: set UCX module used by default
2018-11-22 10:53:37 +02:00
Sergey Oblomov
4c071da565 OSHMEM/AMO: added int/uint/32/64 atomics calls
- added int/uint/32/64 atomics calls
- added SHMEM_SYNC_SIZE macro

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-22 10:18:16 +02:00
Josh Hursey
3339e2aa33
Merge pull request #5986 from jjhursey/vpid-unpack
Add OPAL_VPID to unpacking
2018-11-21 11:42:02 -06:00
Jeff Squyres
748d8b6b4b
Merge pull request #6092 from jsquyres/pr/readme-updates
README updates
2018-11-20 13:25:19 -05:00
KAWASHIMA Takahiro
303d7842d9
Merge pull request #6074 from kawashima-fj/pr/remove-c99-type-check
Remove `#if HAVE_[TYPE]` for types available in C99
2018-11-20 11:42:13 +09:00
Jeff Squyres
ff44ca3fd2 README: Add a "quick start" section.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-17 03:28:39 -08:00
Jeff Squyres
078dc0a9bd README: Add text about network filesystem builds
Per https://github.com/open-mpi/ompi/issues/6058, add some text about
building on network filesystem, particularly with respect to
filesystem timestamps.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-17 03:27:47 -08:00
Aurelien Bouteiller
8086a34b34
Merge pull request #6085 from abouteiller/cleanup/nbc
Someone left a debug printf in NBC
2018-11-16 10:58:40 -06:00
Aurelien Bouteiller
20447be744
Someone left a debug printf in NBC
Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>
2018-11-16 10:37:04 -05:00
Aurelien Bouteiller
65660e5999
Manage errors in NBC collective ops
Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>

Correctly bubble up errors in NBC collective operations

Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>

The error field of requests needs to be rearmed at start, not at create

Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>
2018-11-15 16:43:56 -05:00
Joseph Schuchart
91885f5876 Plug two memory leaks in rdma osc
Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>
2018-11-14 14:31:54 -05:00
matcabral
5f58453e63 MTL/OFI: Lower priority when all procs are local
So far Vader is faster than OFI MTL for doing shared memory.
Therefore, let it run by default when all procs are local.

Reviewed-by: Spruit, Neil R <neil.r.spruit@intel.com>
Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com>
Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>
2018-11-14 11:01:33 -08:00
Sergey Oblomov
e91f214982 OSC/UCX: added UCX version evaluation
- added UCX version evaluation to set OSC UCX priority

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-14 10:03:13 +02:00
KAWASHIMA Takahiro
4e6403ac0f configure: Remove $ac_cv_type_[TYPE] checks for C99 types
Now Open MPI requires a C99 compiler. Checking availability of
the following types is no more needed.

- `long long` (`signed` and `unsigned`)
- `long double`
- `float _Complex`
- `double _Complex`
- `long double _Complex`

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-14 09:57:10 +09:00
KAWASHIMA Takahiro
cacd6f389c datatype: Remove #if HAVE_[TYPE] for C99 types
Now Open MPI requires a C99 compiler. Checking availability of
the following types is no more needed.

- `long long` (`signed` and `unsigned`)
- `long double`
- `float _Complex`
- `double _Complex`
- `long double _Complex`

Furthermore, the `#if HAVE_[TYPE]` style checking is not correct.
Availability of C types is checked by `AC_CHECK_TYPES` in `configure.ac`.
`AC_CHECK_TYPES` defines macro `HAVE_[TYPE]` as `1` in `opal_config.h`
if the `[TYPE]` is available. But it does not define `HAVE_[TYPE]`
(instead of defining as `0`) if it is not available. So even if we
need `HAVE_[TYPE]` checking, it should be `#if defined(HAVE_[TYPE])`.

I didn't remove `AC_CHECK_TYPES` for these types in `configure.ac`
since someone may use `HAVE_[TYPE]` macros somewhere.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-14 09:32:52 +09:00
KAWASHIMA Takahiro
592e2cc0d9
Merge pull request #6072 from kawashima-fj/pr/timing
ess/pmi: Fix `--enable-timing` compilation error
2018-11-13 10:07:45 +09:00
Gilles Gouaillardet
fb6655a5ae
Merge pull request #6071 from ggouaillardet/topic/mpiext_cuda_fix
mpiext/cuda: fix mpiext_cuda_c.h install path
2018-11-12 14:11:11 -06:00
Sergey Oblomov
36934a8bb2 OSC: set UCX module used by default
- OSC/UCX module set priority to 200 to be used by default

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-11-12 15:08:22 +02:00
KAWASHIMA Takahiro
8e7d874e14 ess/pmi: Fix --enable-timing compilation error
This commit fixes an compilation error when configured
with `--enable-timing`.

Procedures in the function `orte_ess_base_app_setup`
in `orte/mca/ess/base/ess_base_std_app.c` are moved
to `orte/mca/ess/pmi/ess_pmi_module.c`
and `orte/mca/ess/singleton/ess_singleton_module.c`
in the recent commit 57f6b94fa5.

In `ess_pmi_module.c`, the first argument of the
`OPAL_TIMING_ENV_NEXT` macro should have been adapted
to the destination function but was not.

In `ess_singleton_module.c`, `OPAL_TIMING_ENV_INIT`
was not used in the destination function originally.
So `OPAL_TIMING_ENV_NEXT` cannot be used in the function.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-11-12 16:10:48 +09:00
Gilles Gouaillardet
b3ce25af95 mpiext/cuda: fix mpiext_cuda_c.h install path
This fixes a regression introduced in commit open-mpi/ompi@f8318f0a8f.

Fixes open-mpi/ompi#6069

Thanks Kawashima-san for the heads up !

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-12 00:58:19 -06:00
Thananon Patinyasakdikul
3dc1629771
Merge pull request #5241 from thananon/opal_progress
Add MCA param for multithread opal_progress().
2018-11-09 12:30:07 -05:00
Matias Cabral
30b6435897
Merge pull request #6015 from aravindksg/proc-threshold-fix
MTL/OFI: Check threshold number of peers allowed per rank
2018-11-08 15:47:45 -08:00
Jeff Squyres
2805b8a1d7
Merge pull request #6066 from jsquyres/pr/lsf-readme-update
README: Make LSF text more accurate
2018-11-08 18:31:47 -05:00
Jeff Squyres
25d15cd458
Merge pull request #6065 from jsquyres/pr/moar-updates-to-the-slots-show-help-message
orte-rmaps-base: slightly amend help message
2018-11-08 18:17:04 -05:00
Jeff Squyres
419852ab43 README: Make LSF text more accurate
Also remove a now-outdated LSF reference.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 17:38:26 -05:00
Jeff Squyres
e9bf318dcb orte-rmaps-base: slightly amend help message
Follow on to 430c659908: clarify the help message and fix one typo.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 14:21:47 -08:00
Jeff Squyres
168a56f65e
Merge pull request #6061 from jsquyres/pr/update-out-of-slots-show-help-message
orte-rmaps-base: update out-of-slots show_help message
2018-11-08 15:56:35 -05:00
Jeff Squyres
430c659908 orte-rmaps-base: update out-of-slots show_help message
Update the show_help message for when there are not enough slots to
run an application.

Also, remove a bunch of copies of this message in various show_help
text files that aren't used/referred to anywhere in the code.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-08 15:02:57 -05:00
Gilles Gouaillardet
efe72d3d92
Merge pull request #6055 from ggouaillardet/topic/c11_atomics
Misc C11 atomics related fixes
2018-11-08 09:11:42 +09:00
Jeff Squyres
2f479d11cc
Merge pull request #6054 from thananon/ofi_fix_warnings
btl/ofi: fixed compiler warning on OSX.
2018-11-07 10:07:53 -05:00
Jeff Squyres
7b82fb834c
Merge pull request #6051 from jsquyres/pr/README-openib-ob1-clarification
master: README: Clarify that only IB->openib is deprecated
2018-11-07 10:05:36 -05:00
Gilles Gouaillardet
72eb53e064 test: remove obsolete tests from orte/test/mpi
Those tests were likely built on a previous Open MPI version
and cannot even build.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:40 +09:00
Gilles Gouaillardet
03eec1ee95 configury: abort when configure'd with --enable-c11-atomics but C11 atomics are not supported
Refs. open-mpi/ompi#6053

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:36 +09:00
Gilles Gouaillardet
07970f192a atomic/c11: fix include header path
simply #include "opal_stdint.h" in order to work with --devel-headers

Refs open-mpi/ompi#6053

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-07 13:23:28 +09:00
Thananon Patinyasakdikul
d9bd54c628 btl/ofi: fixed compiler warning on OSX.
This commit closes #6049

Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>
2018-11-06 15:37:25 -05:00
Jeff Squyres
6cb4159826 README: Clarify that only IB->openib is deprecated
Per feedback from https://github.com/open-mpi/ompi/pull/6028, remove
"+ob1" from the sentence to emphasize that it's only IB usage through
openib that is deprecated/superceded (i.e., ob1 is definitely not
deprecated).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-11-06 10:04:15 -08:00
Jeff Squyres
aaa165f52c
Merge pull request #6028 from jsquyres/pr/v4.0.0-more-text-changes
README: More updates for v4.0.0
2018-11-06 12:59:08 -05:00
Nathan Hjelm
a968fae504
Merge pull request #6033 from hjelmn/6014_fix
opal/asm: work around possible gcc compiler bug
2018-11-06 10:23:49 -07:00
Gilles Gouaillardet
064a778115
Merge pull request #6034 from ggouaillardet/topic/mpiext_cuda
mpiext/cuda: do not include automatically generated file into dist ta…
2018-11-06 14:55:30 +09:00
Gilles Gouaillardet
f8318f0a8f mpiext/cuda: do not include automatically generated file into dist tarball
ompi/mpiext/cuda/c/mpiext_cuda_c.h is automatically generated from
ompi/mpiext/cuda/c/mpiext_cuda_c.h.in at configure time.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-11-06 13:57:31 +09:00
Nathan Hjelm
30119ee339 opal/asm: work around possible gcc compiler bug
It seems in some cases (gcc older than v6.0.0) the __atomic_thread_fence is a
no-op with __ATOMIC_ACQUIRE. This appears to be the case with X86_64 so go
ahead and use __ATOMIC_SEQ_CST for the x86_64 read memory barrier. This should
not cause any performance issues as it is equivalent to the memory barrier
in the hand-written atomics.

References #6014

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2018-11-05 11:45:42 -07:00
Geoff Paulsen
db60f699d5
Merge pull request #6031 from gpaulsen/topic/master/README_lsf
README: updating LSF version supported to 9.1.1 or later
2018-11-05 12:40:01 -06:00