openmpi

Автор	SHA1	Сообщение	Дата
William Zhang	ce40cfbaa5	coll/tuned: Change the default collective algorithm selection The default algorithm selections were out of date and not performing well. After gathering data from OMPI developers, new default algorithm decisions were selected for: allgather allgatherv allreduce alltoall alltoallv barrier bcast gather reduce reduce_scatter_block reduce_scatter scatter These results were gathered using the ompi-collectives-tuning package and then averaged amongst the results gathered from multiple OMPI developers on their clusters. You can access the graphs and averaged data here: https://drive.google.com/drive/folders/1MV5E9gN-5tootoWoh62aoXmN0jiWiqh3 Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-07-28 10:41:48 -07:00
William Zhang	50823fe9a9	coll/tuned: Fix dynamic message size for gather and scatter The gather and scatter operations did not use the correct message size (Only did datatype size * com size). This did not correctly reflect the total message size and prevents fine tuning within a com size. This patch multiplies the value by the number of elements sent. Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-05-14 12:17:52 -07:00
William Zhang	771f9c011d	coll/tuned: Add NULL check to prevent segfault Signed-off-by: William Zhang <wilzhang@amazon.com> cr https://code.amazon.com/reviews/CR-23837553	2020-04-21 17:53:46 +00:00
William Zhang	50640402ab	coll/tuned: Fix typos Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-04-21 17:39:37 +00:00
Austen Lauria	b65ec27307	Fix some compiler warnings. Silence unused variables, incompatible pointer types, un-initialized variables, and signed/unsigned comparisons. Signed-off-by: Austen Lauria <awlauria@us.ibm.com>	2020-01-10 13:10:53 -05:00
Mikhail Brinskii	f2cbd4806e	COLL/TUNED: Add linear scatter using isend for mlnx platform Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-11-07 11:04:39 +02:00
Mikhail Brinskii	65618f8db8	COLL/TUNED: Minor var names/comments fixes Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-07-24 10:23:38 +00:00
Mikhail Brinskii	404c480068	COLL/TUNED: Update alltoall selection rule for mlx Use linear with sync alltoall algorithm for certain message/comm size ranges. Does not affect default fixed decision, unless HPCX (with its custom parameters) is used or corresponding mca is set. Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-07-13 23:27:40 +03:00
Aravind Gopalakrishnan	88d781056f	coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms PR #5450 addresses MPI_IN_PLACE processing for basic collective algorithms. But in conjunction with that, we need to check for MPI_IN_PLACE in tuned paths as well before calling ompi_datatype_type_size() as otherwise we segfault. MPI spec also stipulates to ignore sendcount and sendtype for Alltoall and Allgatherv operations. So, extending the check to these algorithms as well. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2018-10-24 15:31:33 -07:00
Mikhail Kurnosov	ba83cc91eb	coll/base: add MPI_Bcast based on a binomial tree scatter followed by a ring allgather Implements MPI_Bcast using a binomial tree scatter followed by a ring allgather. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-07-16 08:56:09 -06:00
Mikhail Kurnosov	c500739293	coll/base: Add MPI_Bcast based on a scatter followed by an allgather Implements MPI_Bcast using a binomial tree scatter followed by an recursive doubling allgather. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-21 11:47:07 -06:00
Mikhail Kurnosov	6547b58316	coll/base: add knomial tree algorithm for MPI_Bcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-19 13:01:26 -06:00
Mikhail Kurnosov	3adf96fdb8	coll/base: add butterfly algorithm for MPI_Reduce_scatter Implements butterfly algorithm for MPI_Reduce_scatter. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-05 15:53:13 +07:00
Mikhail Kurnosov	28d5837dd9	coll: reduce_scatter_block: add butterfly algorithm Implements butterfly algorithm for MPI_Reduce_scatter_block. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-27 14:17:41 +07:00
bosilca	d13b9a2e25	Merge pull request #5156 from ggouaillardet/topic/reduce_scatter_block coll: reduce_scatter_block: rename and MCA parameter description fix	2018-05-15 11:13:26 -04:00
Mikhail Kurnosov	82299a9c04	coll: reduce_scatter_block: add recursive halving algorithm Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-15 08:20:32 +07:00
Gilles Gouaillardet	ce7b3113f6	coll: reduce_scatter_block: rename and MCA parameter description fix - rename ompi_coll_base_reduce_scatter_block_basic to more self descriptive ompi_coll_base_reduce_scatter_block_basic_linear - fix the description of the coll_tuned_reduce_scatter_block_algorithm MCA param this fixes and documents previous open-mpi/ompi@0e8b35b615 MPI_Reduce_scatter_block used to be implemented by the coll/basic module only. A new algo (recursive doubling) was recently introduced and can be used via the coll/tuned module, but we never intended to make it the default algo. In order to "restore" the previous default, the initial algo was moved from coll/basic to coll/base, and is now used by default by coll/tuned. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-09 08:54:48 +09:00
Gilles Gouaillardet	0e8b35b615	coll/tuned: use basic algo for reduce_scatter_block by default Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:44 +09:00
Mikhail Kurnosov	8cf8553abd	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-03 07:28:32 +07:00
Mikhail Kurnosov	4cbcff7fcd	coll/base: add recursive doubling algorithm for MPI_Reduce_scatter_block Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-23 11:02:31 +07:00
Mikhail Kurnosov	82a3a5bdb5	Fix dynamic decision for Scan and bug in Allreduce Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-06 11:03:17 +07:00
Gilles Gouaillardet	e85fa469f3	coll/tuned: add recursive doubling algo for [ex]scan Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-04 14:56:23 +09:00
Gilles Gouaillardet	65fa0b59c3	coll/tuned: add Rabenseifner algo for [all]reduce Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-04 13:25:41 +09:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Gilles Gouaillardet	5dfd4ab6ca	coll/tuned: remove set-but-not-used variables Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-04 13:18:11 +09:00
Gilles Gouaillardet	e879d2910a	coll/tuned: make coll_tuned_gather_algorithms MCA settable Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-02 11:00:38 +09:00
bosilca	c331e6794c	Allow all tuned MCA parameters to be modified programatically. (#2829 ) Fix a comment in the MCA header. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-01-31 21:47:36 -05:00
Ralph Castain	585540bcee	Reduce the flood of warnings due to uninitialized variables, mismatched types, and unused things to a more bearable trickle Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-12-14 16:33:50 -08:00
Ralph Castain	1e2019ce2a	Revert "Update to sync with OMPI master and cleanup to build" This reverts commit cb55c88a8b7817d5891ff06a447ea190b0e77479.	2016-11-22 15:03:20 -08:00
Ralph Castain	cb55c88a8b	Update to sync with OMPI master and cleanup to build Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-11-22 14:24:54 -08:00
George Bosilca	028e747470	Do not alter ompi_coll_tuned_use_dynamic_rules. This is set globally as an MCA parameter and should be never altered based on a single communicator setting.	2016-10-25 12:17:25 -04:00
George Bosilca	253eb80e26	Code cleaning of the tuned module.	2016-10-25 12:17:25 -04:00
George Bosilca	d577e12dd0	Fix comment.	2016-06-03 00:57:31 +09:00
Gilles Gouaillardet	cebde2a753	coll/tuned: add missing #include "opal/util/output.h" Thanks Marco Atzeri for contributing the original patch	2015-12-24 14:41:17 +09:00
George Bosilca	88492a1e12	Consistently use the request array for all modules (single array stored in the base). Correctly deal with persistent requests (they must be always freed when they are stored in the request array associated with the communicator). Always use MPI_STATUS_IGNORE for single request waiting functions.	2015-10-08 12:00:41 -04:00
Gilles Gouaillardet	de8de65b07	coll/tuned: remove unused prototypes from coll_tuned.h	2015-10-06 09:07:48 +09:00
Gilles Gouaillardet	e01bac962f	coll: do not cast way the const modifier when this is not necessary update the coll framework and mpi c bindings	2015-09-09 09:18:57 +09:00
Ralph Castain	869041f770	Purge whitespace from the repo	2015-06-23 20:59:57 -07:00
Gilles Gouaillardet	9d56b85b55	initialize common symbols from ompi	2015-05-08 10:11:58 +09:00
Nathan Hjelm	df75d0382f	ompi: use C99 subobject naming for component initialization This commit helps future-proof ompi components by initializing each component member by name. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-18 10:29:58 -06:00
Nathan Hjelm	b68d66bb9b	MCA: Add the project/project version to the MCA base component This commit adds support for project_framework_component_* parameter matching. This is the first step in allowing the same framework name in multiple projects. This change also bumps the MCA component version to 2.1.0. All master frameworks have been updated to use the new component versioning macro. An mca.h has been added to each project to add a project specific versioning macro of the form PROJECT_MCA_VERSION_2_1_0. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2015-03-27 10:59:04 -06:00
Gilles Gouaillardet	757b40e56a	coll/tuned: remove dead code as reported by Coverity with CID 1271638 that looks like a multiple paste error ...	2015-03-06 15:02:56 +09:00
Gilles Gouaillardet	71ac1331f1	coll/tuned: remove unused variables	2015-02-27 17:26:48 +09:00
George Bosilca	ced44e12da	Update copyright.	2015-02-26 15:54:58 -05:00
George Bosilca	d126c2e6f8	Fix few COVERITY reported issues.	2015-02-26 15:53:42 -05:00
George Bosilca	d6e69ecab3	Do not preallocate any requests. They are instead automatically preallocated on the first collective that needs them. Remove the ompi_coll_tuned_preallocate_memory_comm_size_limit MCA parameter.	2015-02-26 15:52:27 -05:00
George Bosilca	0445670bb9	Fix the automatic handling of communicator associated requests. If the array doesn't exist, or if it's size is not adequate then we reallocate it. Otherwise just keep using the same array of requests.	2015-02-26 15:52:18 -05:00
George Bosilca	67d01bd8cd	Redirect most of the basic module functions to base.	2015-02-26 15:52:00 -05:00
George Bosilca	211f05fb09	Complete the dismantle of the tuned module.	2015-02-26 15:50:55 -05:00
George Bosilca	8fbcdf685d	Split the tuned framework in two. Move all the functions down in the base, so that they can now be used by all modules. Keep the decision functions in tuned.	2015-02-26 15:46:13 -05:00

1 2 3 4 5 ...

277 Коммитов