1
1

29426 Коммитов

Автор SHA1 Сообщение Дата
Yossi Itigin
9a365555b0
Merge pull request #5879 from hoopoepg/topic/fixed-zero-size-window
OSC/UCX: fixed zero-size window processing
2018-10-10 16:28:55 +03:00
Yossi Itigin
40ac9e4771 pml_ucx: fix return code from mca_pml_ucx_init()
Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2018-10-10 14:41:05 +03:00
Yossi Itigin
dc6809495d osc_ucx: fix hang/timeout in component finalize
Add barrier to make sure all endpoints are destroyed before destroying
the worker.

Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2018-10-10 14:38:06 +03:00
Sergey Oblomov
ae6f81983f OSC/UCX: fixed zero-size window processing
- added processing of zero-size MPI window

Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
2018-10-10 13:08:01 +03:00
Gilles Gouaillardet
98ad78e0bc
Merge pull request #5883 from ggouaillardet/topic/pmix_ext
pmix/ext: misc fixes
2018-10-10 14:26:38 +09:00
Gilles Gouaillardet
ca9009e3d0 pmix/ext3x: fix minor typos
refer PMIx 3x instead of previous 2x

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-10-10 13:18:36 +09:00
Gilles Gouaillardet
b3bd785ce0 pmix/ext3x: add missing header files into the dist tarball
Refs. open-mpi/ompi#5871

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-10-10 13:17:54 +09:00
Gilles Gouaillardet
67402f5f19 pmix/ext2x: add missing header files into the dist tarball
Refs. open-mpi/ompi#5871

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-10-10 13:17:42 +09:00
Nathan Hjelm
32682aa2c0
Merge pull request #5772 from mkurnosov/coll-ibcast-knomial
coll/libnbc: add knomial tree algorithm for MPI_Ibcast
2018-10-09 16:26:13 -06:00
Nathan Hjelm
121b4928c4
Merge pull request #5837 from hjelmn/uct_update
btl/uct: bug fixes and general improvements
2018-10-09 16:14:25 -06:00
Nathan Hjelm
39be6ec15c btl/uct: bug fixes and general improvements
This commit updates the uct btl to change the transports parameter
into a priority list. The dc_mlx5, rc_mlx5, and ud transports to the
priority list. This will give better out of the box performance for
multi-threaded codes beacuse the *_mlx5 transports can avoid the mlx5
lock inside libmlx5_rdmav2.

This commit also fixes a number of leaks and a possible deadlock when
using RDMA.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2018-10-09 15:15:45 -06:00
Jeff Squyres
bb13941b69
Merge pull request #5811 from ggouaillardet/topic/mpi_f08_c_types
fortran/use-mpi-f08: add MPI C types
2018-10-09 13:17:30 -04:00
Yossi Itigin
4763822a64 pml_ucx: add ompi datatype attribute to release ucp_datatype
Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2018-10-09 17:34:34 +03:00
Mikhail Kurnosov
b0429d25df coll/libnbc: add knomial tree algorithm for MPI_Ibcast
Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>
2018-10-09 20:43:04 +07:00
Mikhail Kurnosov
7bd63e79c8 coll/libnbc: add Rabenseifner's algorithm for MPI_Ireduce
An implementation of R. Rabenseifner's algorithm for MPI_Ireduce.
This algorithm is a combination of a reduce-scatter implemented with recursive vector halving
and recursive distance doubling, followed either by a gather.

Limitations:
-- count >= 2^{\floor{\log_2 p}}
-- commutative operations only
-- intra-communicators only

Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>
2018-10-09 20:27:09 +07:00
KAWASHIMA Takahiro
b0e6d1fefc
Merge pull request #5870 from kawashima-fj/pr/javadoc-tag
java: Fix javadoc build failure with OpenJDK 11
2018-10-09 19:11:48 +09:00
KAWASHIMA Takahiro
b491b454dc java: Fix javadoc build failure with OpenJDK 11
OpenJDK 11 changed the default javadoc output HTML version to HTML 5
from HTML 4.01. It causes an error on building Open MPI configured
with `--enable-mpi-java` (default: disable). This fix is compatible
with older OpenJDK.

I don't know whether this problem exists with other vender's JDKs.
But this fix should be compatible with other JDKs because the new
syntax is used in other places in the same file.

Thanks to Siegmar Gross for the bug report.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-10-09 17:49:30 +09:00
Gilles Gouaillardet
0bce306194
Merge pull request #5863 from ggouaillardet/topic/misc_fixes
fix double free & missing header
2018-10-09 13:02:33 +09:00
Gilles Gouaillardet
5803385d44 util/hostfile: fix a double free error
As reported at https://stackoverflow.com/questions/52707242/mpirun-segmentation-fault-whenever-i-use-a-hostfile
mpirun crashes when the hostfile contains a "user@host" line.
The root cause is username was not strdup'ed and free'd twice by opal_argv_free() and free()

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-10-09 11:09:17 +09:00
Gilles Gouaillardet
1ef45b7f1d plm/tm: add a missing include file
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2018-10-09 11:08:36 +09:00
Brian Barrett
e9e4d2a4bc Handle asprintf errors with opal_asprintf wrapper
The Open MPI code base assumed that asprintf always behaved like
the FreeBSD variant, where ptr is set to NULL on error.  However,
the C standard (and Linux) only guarantee that the return code will
be -1 on error and leave ptr undefined.  Rather than fix all the
usage in the code, we use opal_asprintf() wrapper instead, which
guarantees the BSD-like behavior of ptr always being set to NULL.
In addition to being correct, this will fix many, many warnings
in the Open MPI code base.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
2018-10-08 16:43:53 -07:00
Brian Barrett
c087365ead opal/util: Always support BSD behavior of asprintf
Open MPI's developers like to assume that asprintf() always sets
the ptr to NULL on error, but the standard (and Linux glibc) do
not guarantee this.  As a result, we're making opal_asprintf()
always available for developers, which will guarantee that
ptr is set to NULL on error.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
2018-10-08 16:43:53 -07:00
Nathan Hjelm
a7964bf1ad
Merge pull request #5852 from hjelmn/vader_fix_for_real_this_time
btl/vader: fix race condition in writing header
2018-10-08 08:44:10 -06:00
Yossi Itigin
6c9a95df3e
Merge pull request #5858 from amaslenn/mlnx-no-verbs
platform/mellanox: disable openib/verbs
2018-10-08 14:08:09 +03:00
Andrey Maslennikov
7180ab144a platform/mellanox: disable openib/verbs
Signed-off-by: Andrey Maslennikov <andreyma@mellanox.com>
2018-10-08 12:13:44 +03:00
Ralph Castain
784290944b
Merge pull request #5857 from rhc54/topic/plat
Add intel/bend platform file
2018-10-07 05:58:12 -07:00
Ralph Castain
952090854a Add intel/bend platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-06 20:26:26 -07:00
Ralph Castain
925edf3561
Merge pull request #5855 from rhc54/topic/orte
Replace asprintf with opal_asprintf
2018-10-06 13:34:02 -07:00
Ralph H Castain
fc81d0d519 Replace asprintf with opal_asprintf
Silence the flood of warnings from ORTE

Signed-off-by: Ralph H Castain <rhc@open-mpi.org>
2018-10-06 19:32:37 +00:00
Ralph Castain
80ee5c858d
Merge pull request #5854 from rhc54/topic/map
Fix map-by node for comm_spawn
2018-10-06 09:56:15 -07:00
Ralph H Castain
51acbf738e Fix map-by node for comm_spawn
Do not reorder the available host list as this causes the head node process assignment to differ from those computed on the other nodes

Signed-off-by: Ralph H Castain <rhc@open-mpi.org>
2018-10-06 15:58:45 +00:00
Ralph Castain
67b5057448
Merge pull request #5853 from rhc54/topic/subdir
Ignore --with-foo=external arguments in subdirs
2018-10-06 07:56:58 -07:00
Ralph Castain
08109acf8c Ignore --with-foo=external arguments in subdirs
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-06 06:48:13 -07:00
Nathan Hjelm
f11fea07e3
Merge pull request #5718 from mkurnosov/coll-iexscan-recursivedoubling
coll/libnbc: add recursive doubling algorithm for MPI_Iexscan
2018-10-05 22:23:30 -06:00
Nathan Hjelm
8291f6722d btl/vader: fix race condition in writing header
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2018-10-05 16:30:06 -06:00
Ralph Castain
1a14785f3a
Merge pull request #5851 from rhc54/topic/plat
Update intel/bend platform files
2018-10-05 15:07:20 -07:00
Ralph Castain
1624f8090b Update intel/bend platform files
[skip ci]
bot:notest

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-05 15:06:20 -07:00
Ralph Castain
07e159e316
Merge pull request #5850 from rhc54/topic/fix
Remove build product - fix autogen.pl mode
2018-10-05 14:38:33 -07:00
Ralph Castain
76c11a1496 Remove build product - fix autogen.pl mode
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-05 13:38:39 -07:00
Ralph Castain
39a54377e9
Merge pull request #5847 from jsquyres/pr/fix-subdir-args-typo
opal_config_subdir_args.m4: fix typo
2018-10-05 08:55:19 -07:00
Ralph Castain
5499d89a02
Merge pull request #5846 from rhc54/topic/pmix
Fail configure if pmix won't build
2018-10-05 08:54:53 -07:00
Mikhail Kurnosov
9557fa087f Resolve merge conflicts
Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>
2018-10-05 21:40:27 +07:00
Jeff Squyres
7675956b8f opal_config_subdir_args.m4: fix typo
A typo inadvertantly crept in to e836dbd506.  Add the extra '-' to
make it correctly search for --with-*=internal.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2018-10-05 07:28:25 -07:00
Ralph Castain
c498a7e77a Protect PMIx from bad configure entry
Ignore with-hwloc=internal or external as those are meaningless to pmix
(will upstream)

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-05 07:07:05 -07:00
Ralph Castain
f379ba9c8e Fail configure if pmix won't build
If we are using the internal PMIx component and the embedded library fails to configure, then fail - don't silently fail to build and then fail in execution

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-10-05 06:29:36 -07:00
KAWASHIMA Takahiro
5f1c940c8b
Merge pull request #5840 from kawashima-fj/pr/pcollreq-f08-signatures
mpiext/pcollreq: Correct f08 routine signatures
2018-10-05 08:59:03 +09:00
Geoff Paulsen
56a55cdb4b
Merge pull request #5786 from jsquyres/pr/string-madness
Replace strcpy() and strncpy() with (new) opal_string_copy()
2018-10-04 16:12:46 -05:00
KAWASHIMA Takahiro
69f9da91fd
Merge pull request #5841 from kawashima-fj/pr/pcollreq-man
mpiext/pcollreq: Add Fortran bindings in man
2018-10-04 22:32:22 +09:00
KAWASHIMA Takahiro
43d85dbc81 mpiext/pcollreq: Add Fortran bindings in man
Fortran bindings were added to persistent collectives in 9e0115c980
but man was not updated.

Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-10-04 21:05:38 +09:00
KAWASHIMA Takahiro
994b345253 man: Correct markup of MPI_Neighbor_allgather
Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2018-10-04 21:02:35 +09:00