openmpi

Автор	SHA1	Сообщение	Дата
Joseph Schuchart	2c97187ee0	Harmonize return values of progress callbacks Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-01-28 20:15:03 +01:00
Gilles Gouaillardet	f8eef0fde9	coll/libnbc: fixes ompi ompi_coll_libnbc_request_t parent base ompi_coll_libnbc_request_t on top of ompi_coll_base_nbc_request_t to correctly support the retention of datatypes/operators This fixes a regression introduced in open-mpi/ompi@0fe756d416 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-08-08 10:47:48 +09:00
Gilles Gouaillardet	0fe756d416	mpi: retain operation and datatype in non blocking collectives MPI standard states a user MPI_Op and/or user MPI_Datatype can be free'd after a call to a non blocking collective and before the non-blocking collective completes. Retain user (only) MPI_Op and MPI_Datatype when the non blocking call is invoked, and set a request callback so they are free'd when the MPI_Request completes. Thanks Thomas Ponweiser for reporting this Fixes open-mpi/ompi#2151 Fixes open-mpi/ompi#1304 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-12 09:15:45 +09:00
Alex Anenkov	77d466edf3	coll/libnbc: add recursive doubling algorithm for MPI_Iallreduce Signed-off-by: Alex Anenkov <anenkov.ru@gmail.com>	2019-05-19 18:39:11 +07:00
Mikhail Kurnosov	73e048b62a	coll/libnbc: add Rabenseifner's algorithm for MPI_Iallreduce An implementation of R. Rabenseifner's algorithm for MPI_Iallreduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by an allgather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-18 08:50:16 +07:00
Nathan Hjelm	43547ade4c	Merge pull request #5663 from mkurnosov/coll-ireduce-rabenseifner coll/libnbc: add Rabenseifner's algorithm for MPI_Ireduce	2018-10-17 09:02:06 -06:00
Mikhail Kurnosov	a7386c1e09	coll/libnbc: add recursive doubling algorithm for MPI_Iallgather Implements recursive doubling algorithm for MPI_Iallgather. The algorithm can be used only for power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-11 21:43:13 +07:00
Mikhail Kurnosov	b0429d25df	coll/libnbc: add knomial tree algorithm for MPI_Ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-09 20:43:04 +07:00
Mikhail Kurnosov	7bd63e79c8	coll/libnbc: add Rabenseifner's algorithm for MPI_Ireduce An implementation of R. Rabenseifner's algorithm for MPI_Ireduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by a gather. Limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators only Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-09 20:27:09 +07:00
Mikhail Kurnosov	9557fa087f	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-05 21:40:27 +07:00
Mikhail Kurnosov	dfe203e167	coll/libnbc: add recursive doubling algorithm for MPI_Iexscan Implements recursive doubling algorithm for MPI_Iexscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter 'coll_libnbc_iexscan_algorithm' was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-09-23 19:54:27 +07:00
Mikhail Kurnosov	3d43ff0f32	coll/libnbc: add recursive doubling algorithm for MPI_Iscan Implements recursive doubling algorithm for MPI_Iscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. The MCA parameter coll_libnbc_iscan_algorithm was added for dynamic algorithm selection. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-09-22 21:09:12 +07:00
KAWASHIMA Takahiro	37a05e74aa	coll/libnbc: Suppress compiler warnings Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-07-12 14:42:39 +09:00
KAWASHIMA Takahiro	0b8b0f8393	coll/libnbc: Implement `MPI_STARTALL` Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
KAWASHIMA Takahiro	8e5690bf5c	coll/libnbc: Correct persistent request handling Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-06-11 17:22:16 +09:00
Gilles Gouaillardet	a9609b6bf8	coll/libnbc: add persistent collectives implementation Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-11 09:53:37 +09:00
Nathan Hjelm	0e83568466	coll/libnbc: do not take lock in progress if there are no requests This commit fixes a flaw in the progress function for libnbc. The function was unconditionally taking a lock even if there are no requests to process. This lock was showing up in vtune traces of multi-threaded benchmarks. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-02-13 09:51:01 -07:00
Nathan Hjelm	1282e98a01	opal/asm: rename existing arithmetic atomic functions This commit renames the arithmetic atomic operations in opal to indicate that they return the new value not the old value. This naming differentiates these routines from new functions that return the old value. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-30 10:41:22 -07:00
Carlos Bederián	1767b218fb	coll/libnbc: demote progress_lock to regular flag Signed-off-by: Carlos Bederián <bc@famaf.unc.edu.ar>	2017-07-24 20:19:55 -03:00
Ralph Castain	dadc6fbaf6	Merge pull request #2448 from thananon/remove_request_lock Completely removed ompi_request_lock and ompi_request_cond	2017-01-03 19:31:46 -08:00
Gilles Gouaillardet	15098161a3	coll/libnbc: add some comments on how locks are used no code change Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-30 17:29:51 +09:00
Ralph Castain	1e2019ce2a	Revert "Update to sync with OMPI master and cleanup to build" This reverts commit cb55c88a8b7817d5891ff06a447ea190b0e77479.	2016-11-22 15:03:20 -08:00
Thananon Patinyasakdikul	b25a8c3fa5	Completely removed ompi_request_lock and ompi_request_cond as we dont need them anymore. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2016-11-22 17:58:31 -05:00
Ralph Castain	cb55c88a8b	Update to sync with OMPI master and cleanup to build Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-11-22 14:24:54 -08:00
Gilles Gouaillardet	2c94a3a6f3	coll/libnbc: fix race condition with multi threaded apps protect the mca_coll_libnbc_component.active_requests list with the new mca_coll_libnbc_component.lock mutex. Thanks Jie Hu for the report Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-21 10:21:47 +09:00
Joshua Hursey	350ef67fe0	coll/libnbc: Work around for non-uniform data types in ibcast * If (legal) non-uniform data type signatures are used in ibcast then the chosen algorithm may fail on the request, and worst case it could produce wrong answers. * Add an MCA parameter that, by default, protects the user from this scenario. If the user really wants to use it then they have to 'opt-in' by setting the following parameter to false: - `-mca coll_libnbc_ibcast_skip_dt_decision f` * Once the following Issues are resolved then this parameter can be removed. - https://github.com/open-mpi/ompi/issues/2256 - https://github.com/open-mpi/ompi/issues/1763 Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-11-01 13:33:23 -05:00
Joshua Hursey	8748e54c11	coll/libnbc: Fix error path on internal error * If an error is detected internal to libnbc (e.g., PML truncation error) this patch makes sure that the request is completed and the `MPI_ERROR` field is set approprately. * Make an attempt to cleanup outstanding requests before returning. - This is a "best attempt" since not all PMLs support canceling requests.	2016-10-21 11:41:08 -04:00
bosilca	b90c83840f	Refactor the request completion (#1422 ) * Remodel the request. Added the wait sync primitive and integrate it into the PML and MTL infrastructure. The multi-threaded requests are now significantly less heavy and less noisy (only the threads associated with completed requests are signaled). * Fix the condition to release the request.	2016-05-24 18:20:51 -05:00
Nathan Hjelm	d42e0968b1	coll/libnbc: rewrite parts of libnbc This commit rewrites parts of libnbc to fix issues identified by coverity and myself. The changes are as follows: - libnbc function would return invalid error codes (internal to libnbc) to the mpi layer. These codes names are of the form NBC_. They do not match up with the error codes expected by the mpi layer. I purged the use of all these error codes with the exception of NBC_OK and NBC_CONTINUE in progress. These codes are used to identify when a request handle is complete. - Handles and schedules were leaked by all collective routines on error. A new routine was added to return a collective handle (NBC_Return_handle). - Temporary buffers containting in/out neighbors for neighborhood collectives were always leaked. - Neigborhood collectives contained code to handle MPI_IN_PLACE which is never a valid input for the send or receive buffer. Stipped this code out. - Files were inconsistently named. Most are nbc_isomething.c but one was named coll_libnbc_ireduce_scatter_block.c. - Made the NBC_Schedule "structure" and object so it can be retained/released. This may enable the use of schedule caching at a later time. More testing will be needed to ensure the caching code works. If it doesn't the code should be stripped out completely. - Added code to simply common case of scheduling send/recv + barrier. - Code cleanup for readability. The code now passes the clang static analyzer. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-08-10 11:53:25 -06:00
Ralph Castain	869041f770	Purge whitespace from the repo	2015-06-23 20:59:57 -07:00
Nathan Hjelm	df75d0382f	ompi: use C99 subobject naming for component initialization This commit helps future-proof ompi components by initializing each component member by name. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-18 10:29:58 -06:00
Nysal Jan K.A	ded408f485	Fix a crash while closing libnbc If the free list initialization fails in libnbc_open() mca_coll_libnbc_component.active_requests remain uninitialized, resulting in a crash while closing the component	2015-02-25 17:26:28 +05:30
Nathan Hjelm	5f1254d710	Update code base to use the new opal_free_list_t Use of the old ompi_free_list_t and ompi_free_list_item_t is deprecated. These classes will be removed in a future commit. This commit updates the entire code base to use opal_free_list_t and opal_free_list_item_t. Notes: OMPI_FREE_LIST__MT -> opal_free_list_ (uses opal_using_threads ()) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-24 10:05:45 -07:00
Nathan Hjelm	6232ef3bfb	At coll_select time we can not check whether the communicator has a virtual topology. Remove code checking for a virtual topology until this flag is set before coll_select. This commit was SVN r29344.	2013-10-03 03:37:46 +00:00
Nathan Hjelm	7bedf62dd8	Add basic algorithms for the remaining non-blocking collectives. The algorithms are intended for MPI-3.0 compliance and are not optimized. We should aim to add better algorithms in the future through cheetah. MPI_Iallreduce and MPI_Igatherv on intercommunicators are required for MPI_Comm_idup support. cmr=v1.7.4:reviewer=brbarret:ticket=trac:2715 This commit was SVN r29333. The following Trac tickets were found above: Ticket 2715 --> https://svn.open-mpi.org/trac/ompi/ticket/2715	2013-10-02 14:26:23 +00:00
Nathan Hjelm	4f12406436	Don't check for neighborhood collective routines on non-virtual topology communicators This commit was SVN r29319.	2013-10-01 19:59:18 +00:00
Nathan Hjelm	c5596548b2	MPI-3: Add support for neighborhood collectives Blocking versions are simple linear algorithms implemented in coll/basic. Non- blocking versions are from libnbc 1.1.1. All algorithms have been tested with simple test cases. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r29265.	2013-09-26 21:55:08 +00:00
Nathan Hjelm	cf377db823	MCA/base: Add new MCA variable system Features: - Support for an override parameter file (openmpi-mca-param-override.conf). Variable values in this file can not be overridden by any file or environment value. - Support for boolean, unsigned, and unsigned long long variables. - Support for true/false values. - Support for enumerations on integer variables. - Support for MPIT scope, verbosity, and binding. - Support for command line source. - Support for setting variable source via the environment using OMPI_MCA_SOURCE_<var name>=source (either command or file:filename) - Cleaner API. - Support for variable groups (equivalent to MPIT categories). Notes: - Variables must be created with a backing store (char *, int , or bool *) that must live at least as long as the variable. - Creating a variable with the MCA_BASE_VAR_FLAG_SETTABLE enables the use of mca_base_var_set_value() to change the value. - String values are duplicated when the variable is registered. It is up to the caller to free the original value if necessary. The new value will be freed by the mca_base_var system and must not be freed by the user. - Variables with constant scope may not be settable. - Variable groups (and all associated variables) are deregistered when the component is closed or the component repository item is freed. This prevents a segmentation fault from accessing a variable after its component is unloaded. - After some discussion we decided we should remove the automatic registration of component priority variables. Few component actually made use of this feature. - The enumerator interface was updated to be general enough to handle future uses of the interface. - The code to generate ompi_info output has been moved into the MCA variable system. See mca_base_var_dump(). opal: update core and components to mca_base_var system orte: update core and components to mca_base_var system ompi: update core and components to mca_base_var system This commit also modifies the rmaps framework. The following variables were moved from ppr and lama: rmaps_base_pernode, rmaps_base_n_pernode, rmaps_base_n_persocket. Both lama and ppr create synonyms for these variables. This commit was SVN r28236.	2013-03-27 21:09:41 +00:00
Brian Barrett	c0f1775620	Fix warnings in nbc This commit was SVN r27514.	2012-10-29 19:52:43 +00:00
Brian Barrett	8b40c0de9b	* Lock around tag management, so that it's thread safe * Only register the progress function on first call to a non-blocking collective operation, to try to reduce overall performance impact * Fix tag management in roll-over case This commit was SVN r27498.	2012-10-26 15:36:09 +00:00
Brian Barrett	58413fa1e4	* properly setup communication infrastructure for libnbc. * Prevent infinite recursion in progress loop. Should fix improper barrier eugene was seeing. This commit was SVN r26758.	2012-07-06 13:59:03 +00:00
Brian Barrett	e0ceabd486	Need to set MPI_ERROR in the status before calling ompi_request_complete. This commit was SVN r26757.	2012-07-06 01:14:35 +00:00
Brian Barrett	27d45ad550	Implement reduce_scatter_block and ireduce_scatter_block, although possibly not nearly as optimal as they should be. This commit was SVN r26756.	2012-07-05 22:11:48 +00:00
Brian Barrett	32e70b691a	Re-enable non-blocking collectives in libnbc after finding issue with the definition of NBC_CACHE_SCHEDULE not being propogated to all uses. This commit was SVN r26686.	2012-06-27 22:08:19 +00:00
Brian Barrett	d85fdd2605	temporarily back out r26682 and r26683 until I can figure out why they cause crashes during shutdown This commit was SVN r26684. The following SVN revision numbers were found above: r26682 --> open-mpi/ompi@15a30af11f r26683 --> open-mpi/ompi@f6ea4b7234	2012-06-27 19:32:53 +00:00
Brian Barrett	15a30af11f	Turn on all the non-blocking collectives provided by libnbc... This commit was SVN r26682.	2012-06-27 18:32:57 +00:00
Brian Barrett	3933d0a8f0	Ibarrier works! :) This commit was SVN r26680.	2012-06-27 15:58:17 +00:00
Brian Barrett	7bdeafb772	Start bringing in libnbc. .ompi_ignored, as there's still a long way to go This commit was SVN r26658.	2012-06-25 22:38:06 +00:00

48 Коммитов