openmpi

Автор	SHA1	Сообщение	Дата
Mikhail Kurnosov	b0429d25df	coll/libnbc: add knomial tree algorithm for MPI_Ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-09 20:43:04 +07:00
Brian Barrett	e9e4d2a4bc	Handle asprintf errors with opal_asprintf wrapper The Open MPI code base assumed that asprintf always behaved like the FreeBSD variant, where ptr is set to NULL on error. However, the C standard (and Linux) only guarantee that the return code will be -1 on error and leave ptr undefined. Rather than fix all the usage in the code, we use opal_asprintf() wrapper instead, which guarantees the BSD-like behavior of ptr always being set to NULL. In addition to being correct, this will fix many, many warnings in the Open MPI code base. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-10-08 16:43:53 -07:00
Mikhail Kurnosov	9557fa087f	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-05 21:40:27 +07:00
Nathan Hjelm	88a560fa3c	Merge pull request #5744 from mkurnosov/coll-iscan-recursivedoubling coll/libnbc: add recursive doubling algorithm for MPI_Iscan	2018-10-03 09:02:05 -06:00
Brian Barrett	2e24e6ec08	coll libnbc: Remove dead code Remove dead code that was causing warnings about unused static functions. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-10-02 13:35:15 -04:00
Jeff Squyres	6bb356ab87	Squash a bunch of harmless compiler warnings. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-09-26 12:15:21 -07:00
Mikhail Kurnosov	dfe203e167	coll/libnbc: add recursive doubling algorithm for MPI_Iexscan Implements recursive doubling algorithm for MPI_Iexscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter 'coll_libnbc_iexscan_algorithm' was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-09-23 19:54:27 +07:00
Mikhail Kurnosov	3d43ff0f32	coll/libnbc: add recursive doubling algorithm for MPI_Iscan Implements recursive doubling algorithm for MPI_Iscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter coll_libnbc_iscan_algorithm was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-09-22 21:09:12 +07:00
bosilca	3f598e9e83	Merge pull request #5450 from mkurnosov/coll-base-allgather-fix-in-place coll-base-allgather: fix MPI_IN_PLACE processing	2018-09-21 14:51:45 -04:00
bosilca	17f1684438	Merge pull request #5491 from mkurnosov/coll-base-allgatherv-fix-mpi-in-place coll/base/allgatherv: fix MPI_IN_PLACE processing	2018-09-21 14:48:16 -04:00
bosilca	441727fcb0	Merge pull request #5680 from ggouaillardet/topic/nbc_unpack coll/libnbc: fix NBC_Unpack()	2018-09-19 10:17:21 -04:00
bosilca	2ae3cfd9bc	Merge pull request #5699 from ICLDisco/export/coll_errors Error cases in base collectives	2018-09-19 09:47:24 -04:00
Gilles Gouaillardet	8b51862fb2	coll/libnbc: fix various error paths The parameter passed to NBC_Return_handle() was incorrectly casted and not dereferenced. Thanks Yossi for the bug report. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-09-18 13:17:43 +09:00
Jeff Squyres	06c1bf73da	libnbc: remove some stale/dead code Gcc 8 identified hb_tree_csearch() as an infinite recursion, and it turns out that we never call this function, anyway. So just remove it. Fixes #5670. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-09-15 06:04:13 -07:00
Aurelien Bouteiller	466217fadd	Always return a valid error code from collective operations Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2018-09-14 13:46:35 -04:00
Nathan Hjelm	000f9eed4d	opal: add types for atomic variables This commit updates the entire codebase to use specific opal types for all atomic variables. This is a change from the prior atomic support which required the use of the volatile keyword. This is the first step towards implementing support for C11 atomics as that interface requires the use of types declared with the _Atomic keyword. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-09-14 10:48:55 -06:00
Gilles Gouaillardet	ff48e92864	coll/libnbc: fix NBC_Unpack() always initialize 'size'. Only the a2a_sched_diss() alltoall algorithm is impacted, and this algo is currently unused, so there is no need to backport nor update the NEWS file for now. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-09-13 10:55:29 +09:00
Todd Kordenbrock	e9f378e851	Merge pull request #5500 from tkordenbrock/topic/master/fix.PtlMEUnlink.in.use coll-portals4: retry PtlMEUnlink() if PTL_IN_USE	2018-08-07 11:21:00 -05:00
George Bosilca	6d11a45f44	Remove few warnings identified by @rhc in #5514 . Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-08-03 16:21:06 -04:00
Todd Kordenbrock	f3f2a826b4	coll-portals4: retry PtlMEUnlink() if PTL_IN_USE In the cleanup phase, it is possible for PtlMEUnlink() to return PTL_IN_USE if the NIC is not done with the ME. This should not be considered an error. This commit adds a retry loop around PtlMEUnlink(). In some cases, the return value of PtlMEUnlink() and PtlCTFree() was not checked at all. Check them with the same retry loop as above. Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com>	2018-07-31 10:20:55 -05:00
Mikhail Kurnosov	b45e190e66	coll/base/allgatherv: fix MPI_IN_PLACE processing The call of MPI_Allgatherv with sendbuf and sendtype parameters equal to MPI_IN_PLACE and NULL correspondingly, produces the segmentation fault. The problem is that sendtype is used even when sendbuf value is MPI_IN_PLACE. But according to the standard, sendtype and sendcount parameters should be ignored in this case. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-27 09:34:17 +07:00
Mikhail Kurnosov	540c2d1617	coll-base-allgather: fix MPI_IN_PLACE processing The call of MPI_Allgather with sendbuf and sendtype parameters equal to MPI_IN_PLACE and NULL correspondingly, produces the segmentation fault. The problem is that sendtype is used even when sendbuf value is MPI_IN_PLACE. But according to the standard, sendtype and sendcount parameters should be ignored in this case. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-18 10:27:00 +07:00
Gilles Gouaillardet	1a41482720	coll/libnbc: do not recursively call opal_progress() instead of invoking ompi_request_test_all(), that will end up calling opal_progress() recursively, manually check the status of the requests. the same method is used in ompi_comm_request_progress() Refs open-mpi/ompi#3901 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-07-17 09:45:08 -06:00
Mikhail Kurnosov	ba83cc91eb	coll/base: add MPI_Bcast based on a binomial tree scatter followed by a ring allgather Implements MPI_Bcast using a binomial tree scatter followed by a ring allgather. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-16 08:56:09 -06:00
KAWASHIMA Takahiro	37a05e74aa	coll/libnbc: Suppress compiler warnings Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-07-12 14:42:39 +09:00
Howard Pritchard	34bc77747c	Merge pull request #5388 from mkurnosov/base-gather-bmtree-fix-mpi-in-place coll/base/gather_intra_binomial: fix MPI_IN_PLACE processing	2018-07-11 18:34:35 -05:00
Gilles Gouaillardet	76292951e5	coll/libnbc: fix integer overflow Use internal pack/unpack subroutines that operate on MPI_Aint instead of int and hence solve some integer overflows. Thanks Clyde Stanfield for reporting this issue. Refs open-mpi/ompi#5383 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-07-09 10:08:33 -06:00
Mikhail Kurnosov	22fa5a8a67	coll/base/scatter: replaces right skewed binomial tree (in order) with left skewed binomial tree Current implementation of `coll/base/MPI_Scatter` is based on in-order binomial tree. This tree is right skewed and it provides good performance for a MPI_Gather operation. But for a MPI_Scatter operation left skewed binomial tree is effective. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-09 10:04:41 -06:00
Mikhail Kurnosov	b9e14cd7d0	coll/base/gather_intra_binomial: fix MPI_IN_PLACE processing The call of MPI_Gather with sendbuf and sendtype parameters equal to MPI_IN_PLACE and NULL correspondingly, produces the segmentation fault in the root process. The problem is that sendtype is used even when sendbuf value is MPI_IN_PLACE. But according to the standard (page 150, line 37), sendtype and sendcount parameters should be ignored in this case. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-07 20:59:39 +07:00
KAWASHIMA Takahiro	a8da78eeaa	Merge pull request #4618 from ggouaillardet/topic/pcoll Add the persistent collectives feature	2018-06-26 12:36:34 +09:00
Joshua Ladd	98afc838aa	Merge pull request #5294 from yosefe/topic/coll-hcoll-progress-fn coll_hcoll: register progress callback directly without a proxy	2018-06-25 15:07:26 -04:00
Yossi Itigin	e3ee11608b	coll_hcoll: register progress callback directly without a proxy Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2018-06-24 18:06:07 +03:00
Mikhail Kurnosov	c500739293	coll/base: Add MPI_Bcast based on a scatter followed by an allgather Implements MPI_Bcast using a binomial tree scatter followed by an recursive doubling allgather. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-21 11:47:07 -06:00
Mikhail Kurnosov	66bc86a25b	Change the tree_next to a flexible array member Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-19 13:01:26 -06:00
Mikhail Kurnosov	6547b58316	coll/base: add knomial tree algorithm for MPI_Bcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-19 13:01:26 -06:00
KAWASHIMA Takahiro	a38e9e064f	coll: Update COLL module interface version to 2.3.0 Members for persistent operations are added to the module structure in a prior commit. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	e12a5056f1	coll/libnbc: Rename internal functions The `nbc_i` functions don't start communication, but create a request. `nbc__init` are appropriate names for them. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	5c21903477	coll/libnbc: Add assertion for `NBC_A2A_DISS` Persistent operation for `NBC_A2A_DISS` is not supported currently. Though the algorithm is not selected at all currently, I put an assertion not to select it by mistake. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	0b8b0f8393	coll/libnbc: Implement `MPI_STARTALL` Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	ed0144bad4	coll/libnbc: Adapt local copy for persistent request `NBC_Copy` shoud not be called in `MPI_*_INIT`. `NBC_Sched_copy` should be called instead. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	5c5de3a4fb	coll/libnbc: Fix handling of completed request Because a persistent reuqest does not free its `schedule` object when the communication completes, the `NBC_Progress` function cannot determine the completion using `schedule`. Without this change, a hang occurs when the `NBC_Progress` function is called recursively through the `NBC_Start_round` function. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	8e5690bf5c	coll/libnbc: Correct persistent request handling Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	e69e99575e	coll: Enable func check in `mca_coll_base_comm_select` Now libnbc COLL supports persistent collectives and all `*_init` functions of the COLL interface are available. So let's enable the check of availability of those functions on a communicator creation. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	a9609b6bf8	coll/libnbc: add persistent collectives implementation Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
KAWASHIMA Takahiro	a9fdea51aa	coll: Add persistent collective communication request feature Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	c753e9baff	coll/libnbc: code refactoring prepare the upcoming persistent collectives by pre-factoring some code Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> fixup 808c3c62cd9475edd91ecde9d2d53b12e28b2c04	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	fe0bb6c310	coll/libnbc: misc revamp - merge NBC_Init_handle() into NBC_Schedule_request() - set schedule in NBC_Schedule_request instead of NBC_Start() - update NBC_Start() prototype Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	360a76f440	coll/libnbc: revamp ibcast and use NBC_Schedule_request() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Mikhail Kurnosov	3adf96fdb8	coll/base: add butterfly algorithm for MPI_Reduce_scatter Implements butterfly algorithm for MPI_Reduce_scatter. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-05 15:53:13 +07:00
Jeff Squyres	35438ae9b5	mpi/finalized: revamp INITIALIZED/FINALIZED Per MPI-3.1:8.7.1 p361:11-13, it's valid for MPI_FINALIZED to be invoked during an attribute destruction callback (e.g., during the destruction of keyvals on MPI_COMM_SELF during the very beginning of MPI_FINALIZE). In such cases, MPI_FINALIZED must return "false". Prior to this commit, we hung in FINALIZED if it were invoked during a COMM_SELF attribute destruction callback in FINALIZE. See https://github.com/open-mpi/ompi/issues/5084. This commit converts the MPI_INITIALIZED / MPI_FINALIZED infrastructure to use a single enum (ompi_mpi_state, set atomically) to represent the state of MPI: - not initialized - init started - init completed - finalize started - finalize past COMM_SELF destruction - finalize completed The "finalize past COMM_SELF destruction" state is what allows us to return "false" from MPI_FINALIZED before COMM_SELF has been fully destroyed / all attribute callbacks have been invoked. Since this state is checked at nearly every MPI API call (to see if we're outside of the INIT/FINALIZE epoch), care was taken to use atomics to set the ompi_mpi_state value in ompi_mpi_init() and ompi_mpi_finalize(), but performance-critical code paths can simply read the variable without needing to use a slow call to an opal_atomic_*() function. Thanks to @AndrewGaspar for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-06-01 13:36:29 -07:00

1 2 3 4 5 ...

1129 Коммитов