openmpi

Автор	SHA1	Сообщение	Дата
KAWASHIMA Takahiro	c4ca5e703d	coll: Update `ompi/mca/coll/base/coll_base_functions.h` - Support MPI-2.2 and MPI-3.0 COLL features. * `MPI_REDUCE_SCATTER_BLOCK` * neighborhood collective communication * nonblocking collective communication - Add `_BASE_ARGS` and `_BASE_ARG_NAMES` for convenience. - Use parameter names used in the MPI Standard. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-03-02 17:58:02 +09:00
Yossi Itigin	33471c44ee	pml_yalla/mtl_mxm/hcoll: open memory component to activate memory hooks. Memory hooks are now set-up on demand. pml/yalla, mtl/mxm and coll/hcoll need the memory hooks, so make sure those are installed. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2017-03-01 12:12:20 +02:00
George Bosilca	366d64b7e5	Move the collective structure outside the communicator. As we changed the ABI (forcing a major release), we can limit the size of the predefined communicators by moving the collective structure outside the communicator. This might have a minimal, but unnoticeable, impact on performance. This approach has been discussed during the January 2017 devel meeting. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-27 11:54:17 -06:00
Josh Hursey	0b273c2561	Merge pull request #2808 from jjhursey/fix/ibm/reduce-local-to-coll coll: Move reduce_local into the coll framework	2017-02-14 15:54:15 -06:00
Joshua Hursey	78006f93a4	coll: Move reduce_local into the coll framework * Since we are adding a new function to `mca_coll_base_module_2_1_0_t` we need to increase the version of the module structure to `2_2_0`. * Add a comment just above the PREDEFINED_COMMUNICATOR_PAD describing it's purpose and when it should change. To help future developers trying to answer the question noted in the comment. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-14 08:56:07 -06:00
Gilles Gouaillardet	e70a30cca4	coll/libnbc: optimize zero size ialltoall{v,w} with MPI_IN_PLACE and incidentally avoids malloc(0) Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#2945 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Gilles Gouaillardet	12949547f4	coll/libnbc: fix a2aw_sched_linear() with zero size datatype or zero count Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Joshua Hursey	383330a50d	coll/basic: Expand check for negative input values * Negative values are parameter errors for neighborhood collectives - Add checks to the mpi/c interface `MPI_PARAM_CHECK` * Fix a success check for neighbor_alltoallw with dist_graph Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-08 14:26:32 -06:00
Howard Pritchard	f4ad119693	Merge pull request #2914 from hppritcha/topic/nbc_compiler_warning swat some compiler warnings	2017-02-04 11:56:52 -05:00
Howard Pritchard	acaecb2448	swat some compiler warnings Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-02-03 08:28:15 -07:00
Gilles Gouaillardet	e879d2910a	coll/tuned: make coll_tuned_gather_algorithms MCA settable Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-02 11:00:38 +09:00
Gilles Gouaillardet	02558134ef	coll/base: remove unused local variable Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-01 11:54:17 +09:00
bosilca	c331e6794c	Allow all tuned MCA parameters to be modified programatically. (#2829 ) Fix a comment in the MCA header. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-01-31 21:47:36 -05:00
Gilles Gouaillardet	9bcadbd51b	coll/libnbc: fix the red_schain algo of ireduce with MPI_IN_PLACE this fixes a regression introduced in open-mpi/ompi@045d0c5f4c Fixes open-mpi/ompi#2879 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-30 14:19:45 +09:00
Josh Hursey	0408c116eb	Merge pull request #2805 from jjhursey/fix/ibm/base-allgatherv coll/base: Allgatherv MPI_IN_PLACE Bug	2017-01-26 14:21:57 -06:00
Geoffrey Paulsen	d2527cff46	Fixing comment only in MPI_IN_PLACE case for ireduce in libnbc. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-26 10:58:51 -08:00
Geoffrey Paulsen	045d0c5f4c	Fix for Ireduce + MPI_IN_PLACE. Fixes a wrong answer from MPI_Ireduce when the red_sched_chain() path was taken (which only happens for np<=4 and mesgsize>=64k). The way libnbc treats MPI_IN_PLACE is to set sbuf == rbuf, and whether an algorithm will work cleanly or not after that depends on the details. In this case the last steps of the algorithm amounted to (right neighbor is sending us reduction results from ranks 1..n-1) recv into rbuf from right neighbor add the contribution from our sbuf into rbuf this would be fine in general, but if sbuf==rbuf, that recv overwrites the sbuf. I changed it to recv into a tmpbuf if MPI_IN_PLACE was used. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-25 18:08:08 -08:00
Mark Allen	a3452adfa9	coll/base: Allgatherv MPI_IN_PLACE Bug MPI_Allgatherv with MPI_IN_PLACE reads data from wrong location. They were locating the MPI_IN_PLACE send buffer as ```c send_buf = (char)rbuf; for (i = 0; i < rank; ++i) { send_buf += ((ptrdiff_t)rcounts[i] extent); } ``` when it should be ```c send_buf = (char)rbuf; send_buf += ((ptrdiff_t)disps[rank] extent); ``` because disps[] specifies where things are in the v-style buffers. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-01-24 15:52:36 -06:00
Gilles Gouaillardet	d0629f18c2	coll/libnbc: optimize size one communicators simply "return" with ompi_request_empty if the communicator size is 1 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-24 09:12:47 +09:00
Ralph Castain	66131b4183	Remove the bcol, coll/ml, and sbgp code as stale and lacking a maintainer Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-01-03 19:32:48 -08:00
Ralph Castain	dadc6fbaf6	Merge pull request #2448 from thananon/remove_request_lock Completely removed ompi_request_lock and ompi_request_cond	2017-01-03 19:31:46 -08:00
Ralph Castain	585540bcee	Reduce the flood of warnings due to uninitialized variables, mismatched types, and unused things to a more bearable trickle Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-12-14 16:33:50 -08:00
Valentin Petrov	e13e264185	Detect hcoll_context_free at config Needed for better flexibility with versioning Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-12-02 22:09:20 +02:00
Gilles Gouaillardet	fe4c4e95eb	coll/libnbc: fix MPI_IN_PLACE handling in i{gather,scatter}[v] MPI_IN_PLACE is only relevant on the root task, so only test is there Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:25 +09:00
Gilles Gouaillardet	1a8a276914	coll/libnbc: use zero-size messages in ibarrier and silence a valgrind warning Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:25 +09:00
Gilles Gouaillardet	2eec6a08b5	coll/base: fix ompi_coll_base_reduce_scatter_intra_nonoverlapping() with MPI_IN_PLACE invoke underlying scatterv with MPI_IN_PLACE when appropriate Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:24 +09:00
Gilles Gouaillardet	8b7999469b	coll/base: fix MPI_IN_PLACE in ompi_coll_base_reduce_generic() avoid copying data to itself when MPI_IN_PLACE is used Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:24 +09:00
Gilles Gouaillardet	15098161a3	coll/libnbc: add some comments on how locks are used no code change Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-30 17:29:51 +09:00
Valentin Petrov	4cdb8ecaad	coll/hcoll: hcoll_context_free Adds the new API hcoll_conetxt_free that resolves the issues observed with the ctx cache and group_destroy_notify. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-11-29 07:33:05 +02:00
Ralph Castain	1e2019ce2a	Revert "Update to sync with OMPI master and cleanup to build" This reverts commit cb55c88a8b7817d5891ff06a447ea190b0e77479.	2016-11-22 15:03:20 -08:00
Thananon Patinyasakdikul	b25a8c3fa5	Completely removed ompi_request_lock and ompi_request_cond as we dont need them anymore. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2016-11-22 17:58:31 -05:00
Ralph Castain	cb55c88a8b	Update to sync with OMPI master and cleanup to build Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-11-22 14:24:54 -08:00
Gilles Gouaillardet	2c94a3a6f3	coll/libnbc: fix race condition with multi threaded apps protect the mca_coll_libnbc_component.active_requests list with the new mca_coll_libnbc_component.lock mutex. Thanks Jie Hu for the report Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-21 10:21:47 +09:00
Gilles Gouaillardet	fc776e3fa5	coll: code cleanup - instead of coll_base_comm_get_reqs(2) for irecv/isend, use only one request allocated in the stack and do a irecv/send - instead of ompi_request_wait_all(2), simpy ompi_request_wait Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-13 22:35:33 -07:00
Gilles Gouaillardet	99d30353af	coll: Don't allocate space for zero requests Refs #2402 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-13 22:20:58 -07:00
George Bosilca	725277bc26	Don't allocate space for the requests if the underlying topology has no neighbors. This commit fixes issue #2402. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2016-11-12 18:01:09 -05:00
Joshua Hursey	350ef67fe0	coll/libnbc: Work around for non-uniform data types in ibcast * If (legal) non-uniform data type signatures are used in ibcast then the chosen algorithm may fail on the request, and worst case it could produce wrong answers. * Add an MCA parameter that, by default, protects the user from this scenario. If the user really wants to use it then they have to 'opt-in' by setting the following parameter to false: - `-mca coll_libnbc_ibcast_skip_dt_decision f` * Once the following Issues are resolved then this parameter can be removed. - https://github.com/open-mpi/ompi/issues/2256 - https://github.com/open-mpi/ompi/issues/1763 Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-11-01 13:33:23 -05:00
Valentin Petrov	2b7e362e56	coll/hcoll fortran pair types Adds mapping of the MPI Fortran pair types (2INTEGER, 2REAL, 2DBLPREC) to the corresponding hcoll dtypes. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-10-27 18:24:07 +03:00
George Bosilca	028e747470	Do not alter ompi_coll_tuned_use_dynamic_rules. This is set globally as an MCA parameter and should be never altered based on a single communicator setting.	2016-10-25 12:17:25 -04:00
George Bosilca	253eb80e26	Code cleaning of the tuned module.	2016-10-25 12:17:25 -04:00
Gilles Gouaillardet	6714f6aee7	coll/libnbc: fix MPI_Ialltoallv with MPI_IN_PLACE and without MPI param check	2016-10-24 09:29:06 +09:00
Josh Hursey	d1ecc83e14	Merge pull request #2245 from jjhursey/topic/libnbc-error-path coll/libnbc: Fix error path on internal error	2016-10-21 13:27:17 -05:00
Joshua Hursey	8748e54c11	coll/libnbc: Fix error path on internal error * If an error is detected internal to libnbc (e.g., PML truncation error) this patch makes sure that the request is completed and the `MPI_ERROR` field is set approprately. * Make an attempt to cleanup outstanding requests before returning. - This is a "best attempt" since not all PMLs support canceling requests.	2016-10-21 11:41:08 -04:00
Gilles Gouaillardet	45336d0bea	libnbc: fix iallgather[v] In order to optimize for MPI_IN_PLACE, data is sent from the receive buffer. consequently, it should be sent with the receive type and count. Thanks Josh Hursey for the report and test case Refs open-mpi/ompi#2256	2016-10-21 10:24:25 +09:00
Gilles Gouaillardet	e78fcc4db9	coll/base: fix ompi_coll_base_{gather,scatter}_intra_binomial receive type is only relevant for root with gather, send type is only relevant for root with scatter, so do not access these types on a non root task	2016-10-19 14:05:22 +09:00
Gilles Gouaillardet	1e3191115b	Merge pull request #2172 from ggouaillardet/topic/ialltoall_in_place support MPI_IN_PLACE in MPI_Ialltoall*	2016-10-17 17:00:47 +09:00
Valentin Petrov	9747a9ea9b	coll/hcoll: ialltoallv interface	2016-10-10 15:09:07 +03:00
Gilles Gouaillardet	1e0f591811	coll/libnbc: implement support for MPI_IN_PLACE in MPI_Ialltoall* Thanks Chris Ward for the report Many thanks to George for the guidance	2016-10-08 19:44:01 +09:00
Gilles Gouaillardet	80beb30c58	coll/sync: plug a memory leak	2016-09-27 16:29:57 +09:00
Ralph Castain	7f3fac48ab	Fix typo on the COLL_SYNC macro	2016-09-06 12:43:07 -07:00

1 2 3 4 5 ...

1021 Коммитов