openmpi

Author	SHA1	Message	Date
Jeff Squyres	edf03e52f3	Merge pull request #7944 from bosilca/4.1/adapt Import the ADAPT collective into the 4.1	2020-09-23 16:42:49 -04:00
Xi Luo	e65fa4ff5c	Bring ADAPT collective to 4.1 This is a meta commit, that encapsulate all the ADAPT commits in the master into a single PR for 4.1. The master commits included here are: fe73586, a4be3bb, d712645, c2970a3, e59bde9, ee592f3 and c98e387. Here is a detailed list of added capabilities: * coll/adapt: Fix naming conventions and C11 atomic use * coll/adapt: Remove unused component field in module * Consistent handling of zero counts in the MPI API. * Correctly handle non-blocking collectives tags * As it is possible to have multiple outstanding non-blocking collectives provided by different collective modules, we need a consistent mechanism to allow them to select unique tags for each instance of a collective. * Add support for fallback to previous coll module on non-commutative operations (#30) * Replace mutexes by atomic operations. * Use the correct nbc request type (for both ibcast and ireduce) * coll/base: document type casts in ompi_coll_base_retain_* * add module-wide topology cache * use standard instead of synchronous send and add mca parameter to control mode of initial send in ireduce/ibcast * reduce number of memory allocations * call the default request completion. * Remove the requests from the Fortran lookup conversion tables before completing and free it. * piggybacking Bull functionalities Signed-off-by: Xi Luo <xluo12@vols.utk.edu> Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Marc Sergent <marc.sergent@atos.net> Co-authored-by: Joseph Schuchart <schuchart@hlrs.de> Co-authored-by: Lemarinier, Pierre <pierre.lemarinier@atos.net> Co-authored-by: pierrele <31764860+pierrele@users.noreply.github.com>	2020-09-23 11:45:45 -04:00
William Zhang	7922430c66	coll/tuned: Fix dynamic message size for gather and scatter The gather and scatter operations did not use the correct message size (Only did datatype size * com size). This did not correctly reflect the total message size and prevents fine tuning within a com size. This patch multiplies the value by the number of elements sent. Signed-off-by: William Zhang <wilzhang@amazon.com> (cherry picked from commit 50823fe9a9ef4f93e55ee2087b311303d49f90a8)	2020-09-15 08:56:56 -07:00
William Zhang	dceea5ad87	coll/tuned: Revert RSB and RS default algorithms Reduce scatter block and reduce scatter algorithms were hitting correctness issues for non commutative strided tests. We will revert to the original default algorithms for those two collectives (basic linear and non overlapping respectively) in the non commutative op case. See #8010 Signed-off-by: William Zhang <wilzhang@amazon.com> (cherry picked from commit 57b95bcb45d5ce3ae1a1e00bd17ceeaa206526fe)	2020-08-25 15:48:45 -07:00
William Zhang	5a13c5352f	coll/tuned: Change the default collective algorithm selection The default algorithm selections were out of date and not performing well. After gathering data from OMPI developers, new default algorithm decisions were selected for: allgather allgatherv allreduce alltoall alltoallv barrier bcast gather reduce reduce_scatter_block reduce_scatter scatter These results were gathered using the ompi-collectives-tuning package and then averaged amongst the results gathered from multiple OMPI developers on their clusters. You can access the graphs and averaged data here: https://drive.google.com/drive/folders/1MV5E9gN-5tootoWoh62aoXmN0jiWiqh3 Signed-off-by: William Zhang <wilzhang@amazon.com> (cherry picked from commit ce40cfbaa53406be71319041e13e893b0def7ad9)	2020-07-28 15:48:20 -07:00
Jeff Squyres	36bcc48dc1	Merge pull request #7902 from vspetrov/v4.1.x_hcoll_reduce_scatter V4.1.x hcoll reduce scatter	2020-07-27 15:50:09 -04:00
Valentin Petrov	6f401186f7	coll/hcoll: compile warning fix Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2020-07-02 08:44:25 +03:00
Valentin Petrov	2441fb2baf	coll/hcoll: reduce_scatter(block) interface Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2020-07-02 08:44:22 +03:00
Todd Kordenbrock	20f9ed98f2	mtl-portals4: replace abort() with ompi_rte_abort() coll-portals4: replace abort() with ompi_rte_abort() Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com> (cherry picked from commit 04b94637dd2c5e05edf0917d02b9d1e48316d063)	2020-06-29 10:06:12 -05:00
William Zhang	db6ed187b2	coll/tuned: Add NULL check to prevent segfault Signed-off-by: William Zhang <wilzhang@amazon.com> cr https://code.amazon.com/reviews/CR-23837553 (cherry picked from commit 771f9c011d2a4daf78a4b26f88c971b3868fe132) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
William Zhang	03758b1ef7	coll/tuned: Fix typos Signed-off-by: William Zhang <wilzhang@amazon.com> (cherry picked from commit 50640402ab5765a0dfde71628adfbbaa686555bd) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Brinskii	7eb94164a0	COLL/TUNED: Add linear scatter using isend for mlnx platform Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com> (cherry picked from commit f2cbd4806e9a38b5e58c0fc69b41624af79fb99b) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Gilles Gouaillardet	221fad6862	coll/cuda: remove unnecessary references to ORTE Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit 531171ca50955f8b7762f932388f633852454e6f) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Tomislav Janjusic	f51bd8ca0c	Coll/hcoll: adding scatterv interface Signed-off-by: Valentin Petrov valentinp@mellanox.com (cherry picked from commit 6ea920e225c7ed905949c6afec554b6bf2705f94) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Alex Anenkov	2891a23329	coll/libnbc: add recursive doubling algorithm for MPI_Iallreduce Signed-off-by: Alex Anenkov <anenkov.ru@gmail.com> (cherry picked from commit 77d466edf369c9851476b7ec7392f3dfd4cdc0b1) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	ba11f31fc8	coll/libnbc: remove debug output 1. Remove debug output in iallgather (I have forgotten to remove it). 2. Remove an incorrect comment in description of ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 64abd0f405be91b927cd8f37d30cdf41aa6685c2) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	bf1c8bb394	coll/libnbc/ireduce: silence Coverity warning CID 1440360 Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 8b511c788965e6467b5fd834f1adbdcca5012f55) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	5ee1fb62b9	coll/libnbc: add Rabenseifner's algorithm for MPI_Iallreduce An implementation of R. Rabenseifner's algorithm for MPI_Iallreduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by an allgather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 73e048b62a92325fc3fca80c2ade5f5e9bf3192a) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
George Bosilca	fd29cce114	Remove few warnings in libnbc identified by clang-1000.11.45.2 Signed-off-by: George Bosilca <bosilca@icl.utk.edu> (cherry picked from commit 66182a294d5e8cf03a00fba579b05f59e764133c) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	91a4b4c799	coll/libnbc: add recursive doubling algorithm for MPI_Iallgather Implements recursive doubling algorithm for MPI_Iallgather. The algorithm can be used only for power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit a7386c1e09fb274991ca5b50d9d418a0d6b77b6c) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	6971dab943	coll/libnbc: add knomial tree algorithm for MPI_Ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit b0429d25dfb1ed3f9adfa45169478cada0ba2675) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	a318f117f6	coll/libnbc: add Rabenseifner's algorithm for MPI_Ireduce An implementation of R. Rabenseifner's algorithm for MPI_Ireduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by a gather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 7bd63e79c865080c801a45ed852602bdc4eb4d8f) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Brian Barrett	6f6d8180a3	coll libnbc: Remove dead code Remove dead code that was causing warnings about unused static functions. Signed-off-by: Brian Barrett <bbarrett@amazon.com> (cherry picked from commit 2e24e6ec082d29f76dcbc75c6f214d2d0d647701) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	de5e435dee	coll/libnbc: add recursive doubling algorithm for MPI_Iexscan Implements recursive doubling algorithm for MPI_Iexscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter 'coll_libnbc_iexscan_algorithm' was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit dfe203e167f5d8abc3b55226c6f17a468c9567dd) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	65990af3ad	coll/libnbc: add recursive doubling algorithm for MPI_Iscan Implements recursive doubling algorithm for MPI_Iscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter coll_libnbc_iscan_algorithm was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 3d43ff0f3209d5bf4713c6696acb0acb8f1756e4) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Jeff Squyres	547fb3d933	libnbc: remove some stale/dead code Gcc 8 identified hb_tree_csearch() as an infinite recursion, and it turns out that we never call this function, anyway. So just remove it. Fixes #5670. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit 06c1bf73da875f4a6449f38a993530d6fae7817d) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Aurelien Bouteiller	2692840d40	Always return a valid error code from collective operations Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> (cherry picked from commit 466217fadda0391698b383f2792de7bcbdff7e97) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Gilles Gouaillardet	d9d84d5dd6	coll/libnbc: fix NBC_Unpack() always initialize 'size'. Only the a2a_sched_diss() alltoall algorithm is impacted, and this algo is currently unused, so there is no need to backport nor update the NEWS file for now. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit ff48e9286430b37aac3146efe2b355f255db94d5) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Mikhail Kurnosov	ba221e1a08	coll/base/allgatherv: fix MPI_IN_PLACE processing The call of MPI_Allgatherv with sendbuf and sendtype parameters equal to MPI_IN_PLACE and NULL correspondingly, produces the segmentation fault. The problem is that sendtype is used even when sendbuf value is MPI_IN_PLACE. But according to the standard, sendtype and sendcount parameters should be ignored in this case. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit b45e190e6629b664872f7f872cfffd916180bb9a) Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2020-06-25 23:06:51 +00:00
Geoff Paulsen	b1eb3d7530	Merge pull request #7579 from devreal/progress-returns-v4.0.x Harmonize return values of progress callbacks (v4.0.x)	2020-04-03 13:37:56 -05:00
Joseph Schuchart	7b1beb0f6c	Harmonize return values of progress callbacks Signed-off-by: Joseph Schuchart <schuchart@hlrs.de> (cherry picked from commit 2c97187ee05e592346206a697ca3d9531d600fcc)	2020-03-30 18:58:57 +02:00
Mikhail Kurnosov	d7857d000a	Fix Bcast scatter_allgather (issue #7410 ) Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 66b6b8d34e9bb50d34145096a9a2b210290510ca)	2020-03-30 22:07:46 +07:00
George Bosilca	be58cf7982	Fix the communication ordering for all cartesian neighbor collectives. This work is rooted in the [MPI Forum issue 153](https://github.com/mpi-forum/mpi-issues/issues/153). Signed-off-by: George Bosilca <bosilca@icl.utk.edu> (cherry picked from commit 86acdee4606c1ac3b38070d1b7973a00a991f1d6)	2019-12-17 14:25:22 -08:00
Nathan Hjelm	21221eb70a	coll/basic: fix neighbor alltoall message ordering This commit updates the coll/basic component to correctly order sends and receives for cartesian communicators with cyclic boundaries. This addresses an issue identified by mpi-forum/mpi-issues#153. This issue occurs when the size in any dimension is 1. This gives the same neighbor in the positive and negative directions. The old code was sending and receiving in the same order so the -1 buffer contained the +1 result and vice versa. The problem is addressed by using unique tags for each send. This should cover both the case where overtaking is allowed and is not allowed. The former case will be possible if a MPI_Cart_create_with_info() call is added to the standard. Signed-off-by: Nathan Hjelm <hjelmn@google.com> (cherry picked from commit 196a91e604885d7aae9ac9dfbd9b2e846b3015b7)	2019-12-17 14:25:22 -08:00
Maxwell Coil	84a67bd6cf	libnbc: fixed uninitialized variable Squash compiler warning. Signed-off-by: Maxwell Coil <mcoil@nd.edu> (cherry picked from commit 52241dbbcdcbf3605c8098d0cfbcf3c5a75a1c9c)	2019-12-14 12:25:18 -05:00
Valentin Petrov	83a2518994	Coll/hcoll: fixes hcoll non-blocking colls support open-mpi/ompi@0fe756d416 Introduced a bug in coll/hcoll component. The ompi_requests allocated by libhcoll would be treated as coll_base_nbc_request during ompi_coll_base_retain_<> call. Afterwards this would lead to a segv in the request cleanup. Fix: since libhcoll interface does not distinguish between the blocking/non-blocking requests use coll_base_nbc_request all the time and initialize it properly in coll/hcoll/get_coll_handle(). It is still within 2 cache lines. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2019-08-27 17:23:52 +03:00
Gilles Gouaillardet	39ec580b76	coll/base: only retain datatypes/op if the request has not yet completed a non blocking collective might return ompi_request_null, so we should not retain anything in that case. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@63d3ccde9d)	2019-08-13 00:13:40 +09:00
Gilles Gouaillardet	ae26957619	coll/base: cleanup ompi_coll_base_nbc_request_t elements Since ompi_coll_base_nbc_request_t is to be used in an opal_free_list_t, it must be returned into a "clean" state. So cleanup some data in the callback completion subroutines. This fixes a regression introduced in open-mpi/ompi@0fe756d416 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@0862c409f1)	2019-08-13 00:13:40 +09:00
Gilles Gouaillardet	b37c85dcca	coll/libnbc: fixes ompi ompi_coll_libnbc_request_t parent base ompi_coll_libnbc_request_t on top of ompi_coll_base_nbc_request_t to correctly support the retention of datatypes/operators This fixes a regression introduced in open-mpi/ompi@0fe756d416 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@f8eef0fde9)	2019-08-13 00:13:40 +09:00
Mikhail Brinskii	b9998a14dc	COLL/TUNED: Minor var names/comments fixes Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com> (cherry picked from commit 65618f8db848613c95cbe112033df94721d326a8)	2019-07-26 11:29:12 +03:00
Mikhail Brinskii	3d5b7b4a1b	COLL/TUNED: Update alltoall selection rule for mlx Use linear with sync alltoall algorithm for certain message/comm size ranges. Does not affect default fixed decision, unless HPCX (with its custom parameters) is used or corresponding mca is set. Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com> (cherry picked from commit 404c4800688548b021bda68bdf10792424e6b1c5)	2019-07-26 11:28:47 +03:00
Gilles Gouaillardet	c9e4240e70	mpi: retain operation and datatype in non blocking collectives MPI standard states a user MPI_Op and/or user MPI_Datatype can be free'd after a call to a non blocking collective and before the non-blocking collective completes. Retain user (only) MPI_Op and MPI_Datatype when the non blocking call is invoked, and set a request callback so they are free'd when the MPI_Request completes. Thanks Thomas Ponweiser for reporting this Fixes open-mpi/ompi#2151 Fixes open-mpi/ompi#1304 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@0fe756d416)	2019-07-12 10:27:04 +09:00
Aurelien Bouteiller	9499dcfe41	Manage errors in NBC collective ops Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Correctly bubble up errors in NBC collective operations Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> The error field of requests needs to be rearmed at start, not at create Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> (cherry picked from commit open-mpi/ompi@65660e5999)	2019-07-12 10:26:08 +09:00
Mikhail Brinskii	adba7f55f7	COLL/BASE: Fix linear sync all2all Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com> (cherry picked from commit 79006f4e5a578d32bfa08de7b98e747ae18706f6)	2019-06-09 21:31:19 +03:00
Valentin Petrov	8f82c899bc	Coll/hcoll: don't init opal memhooks unless explicitly requested by user If user sets HCOLL_EXTERNAL_UCM_EVENTS=1 then we try init opal memory framework and register a mem release cb. Otherwise, rely on ucx. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2019-05-20 14:00:50 +03:00
George Bosilca	4946570b24	Remove few warnings identified by @rhc in #5514 . Signed-off-by: George Bosilca <bosilca@icl.utk.edu> (cherry picked from commit open-mpi/ompi@6d11a45f44)	2019-05-11 16:38:31 +09:00
Aravind Gopalakrishnan	5a74ddb34d	coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms PR #5450 addresses MPI_IN_PLACE processing for basic collective algorithms. But in conjunction with that, we need to check for MPI_IN_PLACE in tuned paths as well before calling ompi_datatype_type_size() as otherwise we segfault. MPI spec also stipulates to ignore sendcount and sendtype for Alltoall and Allgatherv operations. So, extending the check to these algorithms as well. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com> (cherry picked from commit 88d781056f43934a93e16db556b340e72cdd3742)	2018-10-31 11:37:29 -07:00
Gilles Gouaillardet	ece18aed45	coll/libnbc: fix various error paths The parameter passed to NBC_Return_handle() was incorrectly cast and not dereferenced. Thanks Yossi for the bug report. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> (cherry picked from commit open-mpi/ompi@8b51862fb2)	2018-09-18 15:29:33 +09:00
Todd Kordenbrock	36369f9133	coll-portals4: retry PtlMEUnlink() if PTL_IN_USE In the cleanup phase, it is possible for PtlMEUnlink() to return PTL_IN_USE if the NIC is not done with the ME. This should not be considered an error. This commit adds a retry loop around PtlMEUnlink(). In some cases, the return value of PtlMEUnlink() and PtlCTFree() was not checked at all. Check them with the same retry loop as above. Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com> (cherry picked from commit f3f2a826b40cc0d4a45a63614835162ec6eef78e)	2018-08-07 11:23:51 -05:00
Mikhail Kurnosov	c540dfb18c	coll-base-allgather: fix MPI_IN_PLACE processing The call of MPI_Allgather with sendbuf and sendtype parameters equal to MPI_IN_PLACE and NULL correspondingly, produces the segmentation fault. The problem is that sendtype is used even when sendbuf value is MPI_IN_PLACE. But according to the standard, sendtype and sendcount parameters should be ignored in this case. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> (cherry picked from commit 540c2d1)	2018-07-25 08:11:28 +07:00

1157 Commits