1
1
Граф коммитов

29310 Коммитов

Автор SHA1 Сообщение Дата
Geoff Paulsen
752bbd195f
Merge pull request #6102 from hoopoepg/topic/set-osc-ucx-level-200-v4.0
OSC: set UCX module used by default - v4.0
2018-12-04 10:26:37 -06:00
Howard Pritchard
9fc4782b36
Merge pull request #6124 from hoopoepg/topic/oshmem-zero-len-coll-v4.0
OSHMEM: added processing of zero-length collectives - v4.0
2018-12-04 05:44:16 -07:00
Geoff Paulsen
03cf3e4400
Merge pull request #6112 from kawashima-fj/pr/v4.0.x/update-pcoll-doc
v4.0.x: README & man: Update pcollreq documentation
2018-11-30 13:58:34 -06:00
Geoff Paulsen
240582f600
Merge pull request #6121 from ggouaillardet/topic/v4.0.x/ompi_ext_fortran
v4.0.x: configury: fix auto-generated mpi-[f08-]ext-module.F90
2018-11-30 13:57:29 -06:00
Geoff Paulsen
bd2990f502
Merge pull request #6131 from devreal/rdma-plug-memleak-v4.0.x
v4.0.x: Plug two memory leaks in rdma osc
2018-11-30 13:54:51 -06:00
Geoff Paulsen
51d20915fd
Merge pull request #6139 from rhc54/cmr40/rmap
v4.0.x: Fix typo for rmaps_base_oversubscribe
2018-11-30 13:51:36 -06:00
Geoff Paulsen
937cf86077
Merge pull request #6135 from jsquyres/pr/v4.0.x/README-typo-fix
v4.0.x: README: Fix a typo
2018-11-29 14:05:08 -06:00
Ralph Castain
98c8492057 Fix typo for rmaps_base_oversubscribe
Causes the MCA param to be ignored, while the cmd line option still
works.

Thanks to @iassiour for the report!

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-11-29 07:40:51 -08:00
Joseph Schuchart
c5346751e6 Plug two memory leaks in rdma osc
Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>
(cherry picked from commit 91885f5876)
2018-11-29 10:19:26 -05:00
Jeff Squyres
e56c179d58 README: Fix a typo
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit e6241eaf6b)
2018-11-28 14:52:08 -08:00
Sergey Oblomov
6651672711 OSC/UCX: set max level value to 60
Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 2d230b3aac)
2018-11-27 20:35:30 +02:00
Sergey Oblomov
5838760a3a OSHMEM/COLL/BCAST: removed unnecessary bcast call
- removed unnecessary bcast call on zero-length request

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit c93927e27a)
2018-11-27 14:26:56 +02:00
Sergey Oblomov
0a064d8c8d OSHMEM/COLL: optimization on zero-length ops
- removed barrier call on zero-length operations

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit ff2fd0679e)
2018-11-27 14:26:52 +02:00
Sergey Oblomov
dea9cf6b63 OSHMEM: added processing of zero-length collectives
- according spec 1.4, annex C shmem collectives should process
  calls where number of elements is zero independently from pointer
  value
- added zero-count processing - it just call barrier to
  sync ranks

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 9de128afaf)
2018-11-27 14:26:44 +02:00
Gilles Gouaillardet
cb6841ac64 configury: fix auto-generated mpi-[f08-]ext-module.F90
prefer #include vs include in order to correctly handle long Fortran lines.
We use the full path, and it can be very long, this is why
it cannot be passed to the Fortran compiler.

Thanks Igor Andriyash and Axel Huebl for reporting this issue.

Refs open-mpi/ompi#6106

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
(cherry picked from commit 475d0355d7)
2018-11-27 09:31:56 +09:00
Howard Pritchard
7fc0841791
Merge pull request #6117 from yosefe/topic/pml-ucx-init-req_mpi_object-v4.0.x
pml_ucx: initialize req_mpi_object.comm for error handler
2018-11-26 13:30:53 -07:00
Howard Pritchard
176206fe8c
Merge pull request #6098 from jjhursey/enh/v4.0.x/vpid-unpack
Add OPAL_VPID to unpacking
2018-11-26 13:30:20 -07:00
Yossi Itigin
a112d10c93 pml_ucx: initialize req_mpi_object.comm for error handler
without this fix, an error handler invoked on pml_ucx request would
segfault while trying to dereference requests[i]->req_mpi_object.comm

(picked from master f36eeef)

Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2018-11-26 11:57:34 +02:00
KAWASHIMA Takahiro
6f68483fd5 README & man: Update pcollreq documentation
The feature of persistent collectives is approved in the Sept. 2018
MPI Forum meeting and 2018 Draft Specification of the MPI standard is
published during SC18.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
(cherry picked from commit 5f0fcf0f45)
2018-11-26 18:28:08 +09:00
Sergey Oblomov
63cbe36cab OSHMEM/AMO: added int/uint/32/64 atomics calls
- added int/uint/32/64 atomics calls
- added SHMEM_SYNC_SIZE macro

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 4c071da565)
2018-11-26 09:29:55 +02:00
Sergey Oblomov
38a4953707 OSC/UCX: added UCX version evaluation
- added UCX version evaluation to set OSC UCX priority

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit e91f214982)
2018-11-22 11:31:53 +02:00
Sergey Oblomov
012e27af77 OSC: set UCX module used by default
- OSC/UCX module set priority to 200 to be used by default

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 36934a8bb2)
2018-11-22 10:59:43 +02:00
Joshua Hursey
e1f75d5ff1 Add OPAL_VPID to unpacking
* Needed to properly read PMIx job data like the following
   - `OPAL_PMIX_LOCALLDR`
   - `OPAL_PMIX_RANK`
   - `OPAL_PMIX_GLOBAL_RANK`
   - `OPAL_PMIX_APPLDR`
   - `OPAL_PMIX_APP_RANK`

Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
(cherry picked from commit a557c4130c)
2018-11-21 11:48:58 -06:00
Geoff Paulsen
206b574bb3
Merge pull request #6076 from hppritcha/topic/on_to_v4.0.1
roll to v4.0.1a1
2018-11-20 09:55:55 -06:00
Geoff Paulsen
6898de01a7
Merge pull request #6012 from hoopoepg/topic/added-missing-amo-datatypes-v4.0
OSHMEM/AMO: added missing C11 macro datatypes - v4.0
2018-11-19 14:20:15 -06:00
Howard Pritchard
8adaeb1536
Merge pull request #6007 from aravindksg/coll-tuned-fix-40x
coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms
2018-11-19 13:15:40 -07:00
Howard Pritchard
3369b0d10f
Merge pull request #6011 from hoopoepg/topic/fixed-oshmem-profile-build-v4.0
OSHMEM/PROFILE: fixed profile build - v4.0
2018-11-19 13:15:09 -07:00
Howard Pritchard
24dae8609e
Merge pull request #5926 from hjelmn/v4.0.x_need_to_unblock_sigchld_in_some_cases
v4.0.x: Ensure SIGCHLD is unblocked
2018-11-19 13:13:10 -07:00
Howard Pritchard
d6bab4e26d
Merge pull request #6064 from jsquyres/pr/v4.0.x/rmaps-help-message-update
v4.0.x: orte-rmaps-base: update out-of-slots show_help message
2018-11-19 13:12:41 -07:00
Howard Pritchard
ec79631ba2
Merge pull request #5936 from edgargabriel/pr/testmpio-v4.0.x
Pr/testmpio v4.0.x
2018-11-19 13:11:50 -07:00
Howard Pritchard
116a140be8 roll to v4.0.1a1
long live 4.0.1!

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2018-11-13 07:18:57 -07:00
Gilles Gouaillardet
9366c6eb2e mpiext/cuda: do not include automatically generated file into dist tarball
ompi/mpiext/cuda/c/mpiext_cuda_c.h is automatically generated from
ompi/mpiext/cuda/c/mpiext_cuda_c.h.in at configure time.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>

(cherry picked from commit open-mpi/ompi@f8318f0a8f)
(cherry picked from commit open-mpi/ompi@b3ce25af95)
2018-11-13 00:09:01 -06:00
Howard Pritchard
725f62554e
Merge pull request #6067 from jsquyres/pr/v4.0.x/fix-readme-lsf-references
v4.0.x: README: Make LSF text more accurate
2018-11-10 14:47:40 -07:00
Jeff Squyres
c6d8caf302 README: Make LSF text more accurate
Also remove a now-outdated LSF reference.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 419852ab43)
2018-11-08 18:32:34 -05:00
Jeff Squyres
8be14b9b07 orte-rmaps-base: slightly amend help message
Follow on to 430c659908: clarify the help message and fix one typo.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit e9bf318dcb)
2018-11-08 18:20:28 -05:00
Howard Pritchard
e47c3eaa8d
Merge pull request #6003 from yosefe/topic/scoll-basic-fix-pSync-v4.0.x
SCOLL/BASIC: Fix invalid pSync pointer passed to barrier func
2018-11-08 15:45:39 -07:00
Jeff Squyres
76d4c1843e orte-rmaps-base: update out-of-slots show_help message
Update the show_help message for when there are not enough slots to
run an application.

Also, remove a bunch of copies of this message in various show_help
text files that aren't used/referred to anywhere in the code.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 430c659908)
2018-11-08 16:03:28 -05:00
Howard Pritchard
9275b692b7
Merge pull request #6043 from hjelmn/v4.0.x_fix_this_damn_memory_barrier_bug_that_is_referenced_in_github_bug_6014_now_lets_get_this_release_out_the_door
v4.0.x: opal/asm: work around possible gcc compiler bug
2018-11-08 08:01:33 -07:00
Geoff Paulsen
d99518c0d4
Merge pull request #6046 from hjelmn/v4.0.x_fix_a_memory_barrier_bug_that_is_totally_related_to_6014_but_in_the_pmix_code
pmix3x: fix potential memory barrier bug with __atomic builtin atomics
2018-11-07 10:46:35 -06:00
Geoff Paulsen
f17dcd5961
Merge pull request #6027 from jsquyres/pr/v4.0.0-text-updates
v4.0.0: text updates
2018-11-07 10:45:34 -06:00
Jeff Squyres
9a7320fdab README: Clarify that only IB->openib is deprecated
Per feedback from https://github.com/open-mpi/ompi/pull/6028, remove
"+ob1" from the sentence to emphasize that it's only IB usage through
openib that is deprecated/superceded (i.e., ob1 is definitely not
deprecated).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 6cb4159826)
2018-11-06 10:07:32 -08:00
Jeff Squyres
7cb6cbc80f README: More updates for v4.0.0
Move the UCX and MXM text up to flow better with the rest of the
text+content.  Also emphasize that MXM is deprecated.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 4ec8e6fe22)
2018-11-06 10:07:15 -08:00
Jeff Squyres
740567ff92 README: Add extensive information about deleted MPI-1 syms
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit e2ab41efac)
2018-11-06 10:07:05 -08:00
Jeff Squyres
f149f64f7e README: Update information about UCX
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 78552e81c1)
2018-11-06 10:07:05 -08:00
Jeff Squyres
d0efdfd9c8 MPI_Type_get_envelope: remove MPI-1 deleted names
Several names are now no longer returned by MPI_Type_get_envelope.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 65eb118e08)
2018-11-06 10:07:05 -08:00
Nathan Hjelm
5efc76ef44 pmix3x: fix potential memory barrier bug with __atomic builtin atomics
See open-mpi/ompi#6014 for more information.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2018-11-06 10:37:14 -07:00
Nathan Hjelm
e57c3fb3c9 opal/asm: work around possible gcc compiler bug
It seems in some cases (gcc older than v6.0.0) the __atomic_thread_fence is a
no-op with __ATOMIC_ACQUIRE. This appears to be the case with X86_64 so go
ahead and use __ATOMIC_SEQ_CST for the x86_64 read memory barrier. This should
not cause any performance issues as it is equivalent to the memory barrier
in the hand-written atomics.

References #6014

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
(cherry picked from commit 30119ee339)
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2018-11-06 10:28:13 -07:00
Geoff Paulsen
4008c46e84
Merge pull request #6032 from gpaulsen/topic/v4.0.x/README_lsf
README: updating LSF version supported to 9.1.1 or later
2018-11-05 14:16:01 -06:00
Geoffrey Paulsen
f5dbecd5e7 README: updating LSF version supported to 9.1.1 or later
Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>
(cherry picked from commit 0100595898)
2018-11-05 02:33:42 -06:00
Geoff Paulsen
e720a9d31d
Merge pull request #6018 from gpaulsen/topic/v4.0.x/api_removal_for_v4.0.0
mpi.h: restore some MPI-deprecated items to default builds
2018-11-02 16:17:04 -05:00