openmpi

Автор	SHA1	Сообщение	Дата
Alex Anenkov	2891a23329	coll/libnbc: add recursive doubling algorithm for MPI_Iallreduce Signed-off-by: Alex Anenkov <anenkov.ru@gmail.com> (cherry picked from commit 77d466edf369c9851476b7ec7392f3dfd4cdc0b1) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	ba11f31fc8	coll/libnbc: remove debug output 1. Remove debug output in iallgather (I have forgotten to remove it). 2. Remove an incorrect comment in description of ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 64abd0f405be91b927cd8f37d30cdf41aa6685c2) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	bf1c8bb394	coll/libnbc/ireduce: silence Coverity warning CID 1440360 Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 8b511c788965e6467b5fd834f1adbdcca5012f55) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	5ee1fb62b9	coll/libnbc: add Rabenseifner's algorithm for MPI_Iallreduce An implementation of R. Rabenseifner's algorithm for MPI_Iallreduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by an allgather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 73e048b62a92325fc3fca80c2ade5f5e9bf3192a) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
George Bosilca	fd29cce114	Remove few warnings in libnbc identified by clang-1000.11.45.2 Signed-off-by: George Bosilca <bosilca@icl.utk.edu> (cherry picked from commit 66182a294d5e8cf03a00fba579b05f59e764133c) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	91a4b4c799	coll/libnbc: add recursive doubling algorithm for MPI_Iallgather Implements recursive doubling algorithm for MPI_Iallgather. The algorithm can be used only for power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit a7386c1e09fb274991ca5b50d9d418a0d6b77b6c) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	6971dab943	coll/libnbc: add knomial tree algorithm for MPI_Ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit b0429d25dfb1ed3f9adfa45169478cada0ba2675) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	a318f117f6	coll/libnbc: add Rabenseifner's algorithm for MPI_Ireduce An implementation of R. Rabenseifner's algorithm for MPI_Ireduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by a gather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 7bd63e79c865080c801a45ed852602bdc4eb4d8f) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Brian Barrett	6f6d8180a3	coll libnbc: Remove dead code Remove dead code that was causing warnings about unused static functions. Signed-off-by: Brian Barrett <bbarrett@amazon.com> (cherry picked from commit 2e24e6ec082d29f76dcbc75c6f214d2d0d647701) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	de5e435dee	coll/libnbc: add recursive doubling algorithm for MPI_Iexscan Implements recursive doubling algorithm for MPI_Iexscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter 'coll_libnbc_iexscan_algorithm' was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit dfe203e167f5d8abc3b55226c6f17a468c9567dd) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	65990af3ad	coll/libnbc: add recursive doubling algorithm for MPI_Iscan Implements recursive doubling algorithm for MPI_Iscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter coll_libnbc_iscan_algorithm was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 3d43ff0f3209d5bf4713c6696acb0acb8f1756e4) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Jeff Squyres	547fb3d933	libnbc: remove some stale/dead code Gcc 8 identified hb_tree_csearch() as an infinite recursion, and it turns out that we never call this function, anyway. So just remove it. Fixes #5670. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit 06c1bf73da875f4a6449f38a993530d6fae7817d) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Gilles Gouaillardet	d9d84d5dd6	coll/libnbc: fix NBC_Unpack() always initialize 'size'. Only the a2a_sched_diss() alltoall algorithm is impacted, and this algo is currently unused, so there is no need to backport nor update the NEWS file for now. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit ff48e9286430b37aac3146efe2b355f255db94d5) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Joseph Schuchart	7b1beb0f6c	Harmonize return values of progress callbacks Signed-off-by: Joseph Schuchart <schuchart@hlrs.de> (cherry picked from commit 2c97187ee05e592346206a697ca3d9531d600fcc)	2020-03-30 18:58:57 +02:00
Maxwell Coil	84a67bd6cf	libnbc: fixed uninitialized variable Squash compiler warning. Signed-off-by: Maxwell Coil <mcoil@nd.edu> (cherry picked from commit 52241dbbcdcbf3605c8098d0cfbcf3c5a75a1c9c)	2019-12-14 12:25:18 -05:00
Gilles Gouaillardet	b37c85dcca	coll/libnbc: fixes ompi ompi_coll_libnbc_request_t parent base ompi_coll_libnbc_request_t on top of ompi_coll_base_nbc_request_t to correctly support the retention of datatypes/operators This fixes a regression introduced in open-mpi/ompi@0fe756d416 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@f8eef0fde9)	2019-08-13 00:13:40 +09:00
Gilles Gouaillardet	c9e4240e70	mpi: retain operation and datatype in non blocking collectives MPI standard states a user MPI_Op and/or user MPI_Datatype can be free'd after a call to a non blocking collective and before the non-blocking collective completes. Retain user (only) MPI_Op and MPI_Datatype when the non blocking call is invoked, and set a request callback so they are free'd when the MPI_Request completes. Thanks Thomas Ponweiser for reporting this Fixes open-mpi/ompi#2151 Fixes open-mpi/ompi#1304 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@0fe756d416)	2019-07-12 10:27:04 +09:00
Aurelien Bouteiller	9499dcfe41	Manage errors in NBC collective ops Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Correctly bubble up errors in NBC collective operations Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> The error field of requests needs to be rearmed at start, not at create Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> (cherry picked from commit open-mpi/ompi@65660e5999)	2019-07-12 10:26:08 +09:00
George Bosilca	4946570b24	Remove few warnings identified by @rhc in #5514 . Signed-off-by: George Bosilca <bosilca@icl.utk.edu> (cherry picked from commit open-mpi/ompi@6d11a45f44)	2019-05-11 16:38:31 +09:00
Gilles Gouaillardet	ece18aed45	coll/libnbc: fix various error paths The parameter passed to NBC_Return_handle() was incorrectly casted and not dereferenced. Thanks Yossi for the bug report. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@8b51862fb2)	2018-09-18 15:29:33 +09:00
Gilles Gouaillardet	1a41482720	coll/libnbc: do not recursively call opal_progress() instead of invoking ompi_request_test_all(), that will end up calling opal_progress() recursively, manually check the status of the requests. the same method is used in ompi_comm_request_progress() Refs open-mpi/ompi#3901 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-07-17 09:45:08 -06:00
KAWASHIMA Takahiro	37a05e74aa	coll/libnbc: Suppress compiler warnings Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-07-12 14:42:39 +09:00
Gilles Gouaillardet	76292951e5	coll/libnbc: fix integer overflow Use internal pack/unpack subroutines that operate on MPI_Aint instead of int and hence solve some integer overflows. Thanks Clyde Stanfield for reporting this issue. Refs open-mpi/ompi#5383 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-07-09 10:08:33 -06:00
KAWASHIMA Takahiro	a38e9e064f	coll: Update COLL module interface version to 2.3.0 Members for persistent operations are added to the module structure in a prior commit. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	e12a5056f1	coll/libnbc: Rename internal functions The `nbc_i` functions don't start communication, but create a request. `nbc__init` are appropriate names for them. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	5c21903477	coll/libnbc: Add assertion for `NBC_A2A_DISS` Persistent operation for `NBC_A2A_DISS` is not supported currently. Though the algorithm is not selected at all currently, I put an assertion not to select it by mistake. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	0b8b0f8393	coll/libnbc: Implement `MPI_STARTALL` Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	ed0144bad4	coll/libnbc: Adapt local copy for persistent request `NBC_Copy` shoud not be called in `MPI_*_INIT`. `NBC_Sched_copy` should be called instead. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	5c5de3a4fb	coll/libnbc: Fix handling of completed request Because a persistent reuqest does not free its `schedule` object when the communication completes, the `NBC_Progress` function cannot determine the completion using `schedule`. Without this change, a hang occurs when the `NBC_Progress` function is called recursively through the `NBC_Start_round` function. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	8e5690bf5c	coll/libnbc: Correct persistent request handling Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
Gilles Gouaillardet	a9609b6bf8	coll/libnbc: add persistent collectives implementation Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	c753e9baff	coll/libnbc: code refactoring prepare the upcoming persistent collectives by pre-factoring some code Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> fixup 808c3c62cd9475edd91ecde9d2d53b12e28b2c04	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	fe0bb6c310	coll/libnbc: misc revamp - merge NBC_Init_handle() into NBC_Schedule_request() - set schedule in NBC_Schedule_request instead of NBC_Start() - update NBC_Start() prototype Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Gilles Gouaillardet	360a76f440	coll/libnbc: revamp ibcast and use NBC_Schedule_request() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Nathan Hjelm	0e83568466	coll/libnbc: do not take lock in progress if there are no requests This commit fixes a flaw in the progress function for libnbc. The function was unconditionally taking a lock even if there are no requests to process. This lock was showing up in vtune traces of multi-threaded benchmarks. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-02-13 09:51:01 -07:00
Nathan Hjelm	1282e98a01	opal/asm: rename existing arithmetic atomic functions This commit renames the arithmetic atomic operations in opal to indicate that they return the new value not the old value. This naming differentiates these routines from new functions that return the old value. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-30 10:41:22 -07:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Carlos Bederián	1767b218fb	coll/libnbc: demote progress_lock to regular flag Signed-off-by: Carlos Bederián <bc@famaf.unc.edu.ar>	2017-07-24 20:19:55 -03:00
Gilles Gouaillardet	9ba85b85e1	coll/libnbc: revisit NBC_Handle usage make NBC_Handle (almost) an internal structure created by NBC_Schedule_request() use a local variable instead of what was previously handle->tmpbuf Refs open-mpi/ompi#3487 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-06-20 17:24:16 +09:00
Gilles Gouaillardet	fa5cd0dbe5	use ptrdiff_t instead of OPAL_PTRDIFF_TYPE since Open MPI now requires a C99, and ptrdiff_t type is part of C99, there is no more need for the abstract OPAL_PTRDIFF_TYPE type. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-19 13:41:56 +09:00
Josh Hursey	0b273c2561	Merge pull request #2808 from jjhursey/fix/ibm/reduce-local-to-coll coll: Move reduce_local into the coll framework	2017-02-14 15:54:15 -06:00
Joshua Hursey	78006f93a4	coll: Move reduce_local into the coll framework * Since we are adding a new function to `mca_coll_base_module_2_1_0_t` we need to increase the version of the module structure to `2_2_0`. * Add a comment just above the PREDEFINED_COMMUNICATOR_PAD describing it's purpose and when it should change. To help future developers trying to answer the question noted in the comment. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-14 08:56:07 -06:00
Gilles Gouaillardet	e70a30cca4	coll/libnbc: optimize zero size ialltoall{v,w} with MPI_IN_PLACE and incidentally avoids malloc(0) Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#2945 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Gilles Gouaillardet	12949547f4	coll/libnbc: fix a2aw_sched_linear() with zero size datatype or zero count Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Howard Pritchard	acaecb2448	swat some compiler warnings Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-02-03 08:28:15 -07:00
Gilles Gouaillardet	9bcadbd51b	coll/libnbc: fix the red_schain algo of ireduce with MPI_IN_PLACE this fixes a regression introduced in open-mpi/ompi@045d0c5f4c Fixes open-mpi/ompi#2879 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-30 14:19:45 +09:00
Geoffrey Paulsen	d2527cff46	Fixing comment only in MPI_IN_PLACE case for ireduce in libnbc. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-26 10:58:51 -08:00
Geoffrey Paulsen	045d0c5f4c	Fix for Ireduce + MPI_IN_PLACE. Fixes a wrong answer from MPI_Ireduce when the red_sched_chain() path was taken (which only happens for np<=4 and mesgsize>=64k). The way libnbc treats MPI_IN_PLACE is to set sbuf == rbuf, and whether an algorithm will work cleanly or not after that depends on the details. In this case the last steps of the algorithm amounted to (right neighbor is sending us reduction results from ranks 1..n-1) recv into rbuf from right neighbor add the contribution from our sbuf into rbuf this would be fine in general, but if sbuf==rbuf, that recv overwrites the sbuf. I changed it to recv into a tmpbuf if MPI_IN_PLACE was used. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-25 18:08:08 -08:00
Gilles Gouaillardet	d0629f18c2	coll/libnbc: optimize size one communicators simply "return" with ompi_request_empty if the communicator size is 1 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-24 09:12:47 +09:00
Ralph Castain	dadc6fbaf6	Merge pull request #2448 from thananon/remove_request_lock Completely removed ompi_request_lock and ompi_request_cond	2017-01-03 19:31:46 -08:00

1 2 3

150 Коммитов