The new proc group is created from the "world_group" based on the rank
mapping, which can be taken directly from proc_name->vpid.
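A minimal sketch of the idea, with illustrative type and variable names (the actual oshmem structures differ):
```
#include <stdint.h>

/* Illustrative sketch: the rank of each member of the new group is taken
 * directly from the vpid in its process name, i.e. its rank in
 * world_group. */
struct example_proc_name {
    uint32_t jobid;
    uint32_t vpid;
};

static void example_fill_group_ranks(struct example_proc_name **members,
                                     int group_size, int *ranks)
{
    for (int i = 0; i < group_size; ++i) {
        ranks[i] = (int) members[i]->vpid;   /* vpid == rank in world_group */
    }
}
```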
Signed-off-by: Valentin Petrov <valentinp@mellanox.com>
Fixes scoll_basic failures with shmem_verifier, caused by recent changes
in handling of zero-size collectives.
- Check for zero length only for the fixed-size collect (shmem_fcollect),
  but not for the variable-size collect (shmem_collect)
- Add an 'nlong_type' parameter to the internal broadcast function, to
  indicate whether the 'nlong' parameter is valid on non-root PEs, since
  it is used by the shmem_collect algorithm. Before this change, some
  components assumed it was valid (scoll_mpi) while others assumed it was
  not (scoll_basic).
- In scoll_basic, if nlong_type==false, do not exit when nlong==0, since
  this parameter may not be the same on all PEs.
- In scoll_mpi, fall back to scoll_basic if nlong_type==false, since MPI
  requires the 'count' argument of MPI_Bcast to be valid on all ranks
  (see the sketch below).
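A minimal sketch of the intended semantics, using illustrative names rather than the real scoll_basic/scoll_mpi signatures:
```
#include <stdbool.h>
#include <stddef.h>

/* Illustrative sketch: the internal broadcast gains an 'nlong_type' flag
 * that says whether 'nlong' is valid on non-root PEs. */
static int example_broadcast(int root, void *target, const void *source,
                             size_t nlong, long *pSync, bool nlong_type)
{
    (void) root; (void) target; (void) source; (void) pSync;

    if (!nlong_type) {
        /* 'nlong' may differ across PEs (shmem_collect case): do not
         * early-exit on nlong == 0, and an MPI-based component should
         * fall back to the basic algorithm here, since MPI_Bcast needs
         * a consistent count on every rank. */
    } else if (0 == nlong) {
        return 0;   /* fixed-size collective, zero elements: nothing to do */
    }
    /* ... broadcast 'nlong' bytes from 'root' ... */
    return 0;
}
```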
(Picked from master 939162e)
Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
- according to spec 1.4, annex C, shmem collectives should process
  calls where the number of elements is zero, regardless of the pointer
  value
- added zero-count processing - it just calls a barrier to sync the
  ranks (see the sketch below)
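A minimal sketch of the zero-count path, assuming a collective whose element count is the same on every PE and using the public shmem_barrier purely for illustration:
```
#include <stddef.h>
#include <shmem.h>

/* Illustrative sketch: a zero count degenerates to a synchronization of
 * the active set; the data pointers are never dereferenced, so their
 * values do not matter. */
static void example_fixed_size_collective(void *target, const void *source,
                                          size_t nelems, int PE_start,
                                          int logPE_stride, int PE_size,
                                          long *pSync)
{
    (void) target; (void) source;

    if (0 == nelems) {
        shmem_barrier(PE_start, logPE_stride, PE_size, pSync);
        return;
    }
    /* ... exchange nelems elements as usual ... */
}
```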
Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>
(cherry picked from commit 9de128afaf)
mca_scoll_basic_alltoall() passed (pSync + 1) to the barrier function, but
the value of _SHMEM_ALLTOALL_SYNC_SIZE is 1, so the barrier function used
an invalid memory location. In particular, this location was not
initialized to _SHMEM_SYNC_VALUE, which broke the barrier algorithm so
that it did not complete: one PE could read 0 from its peer, assume the
peer had already started the barrier, and then write 1 to the peer. The
peer then entered the barrier, overwrote that 1 with 0, and waited forever
to see '1' in its pSync.
Found with shmem_verifier test suite.
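A minimal standalone sketch of the fix, with illustrative names rather than the exact scoll_basic call:
```
/* Illustrative sketch: with a sync array of size 1 the barrier must reuse
 * element 0; pSync + 1 is out of bounds and was never initialized to the
 * sync value, which is what hung the barrier. */
enum { EXAMPLE_ALLTOALL_SYNC_SIZE = 1 };

static void example_alltoall_sync(long pSync[EXAMPLE_ALLTOALL_SYNC_SIZE])
{
    long *barrier_psync = pSync;    /* was: pSync + 1 (past the array) */
    (void) barrier_psync;           /* ... run the internal barrier on it ... */
}
```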
(picked from master 6754bf1)
Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
+ Add a quiet method to the SPML, so its implementation can differ from
  fence.
+ Use ucp_worker_fence for the spml_fence method of the UCX SPML (see the
  sketch below).
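A minimal sketch of the split, with illustrative function names (not the actual spml_ucx code); mapping quiet to a full worker flush is an assumption here:
```
#include <ucp/api/ucp.h>

/* Illustrative sketch: fence only orders previously issued operations,
 * while quiet has to complete them (assumed here to be a worker flush). */
static int example_spml_fence(ucp_worker_h worker)
{
    return (UCS_OK == ucp_worker_fence(worker)) ? 0 : -1;
}

static int example_spml_quiet(ucp_worker_h worker)
{
    return (UCS_OK == ucp_worker_flush(worker)) ? 0 : -1;
}
```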
Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>
* Resolves #3705
* Components should link against the project level library to better
support `dlopen` with `RTLD_LOCAL`.
* Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am`
with the appropriate project level library:
```
MCA components in ompi/
$(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la
MCA components in orte/
$(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la
MCA components in opal/
$(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la
MCA components in oshmem/
$(top_builddir)/oshmem/liboshmem.la
```
Note: The changes in this commit were automated by the
`libadd_mca_comp_update.py` script from the accompanying commit. Some
components were not included in this change because they are only built
statically.
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
As we changed the ABI (forcing a major release), we can limit
the size of the predefined communicators by moving the collective
structure outside the communicator. This might have a minimal,
but unnoticeable, impact on performance. This approach has been
discussed during the January 2017 devel meeting.
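A minimal sketch of the layout change, with illustrative names (not the actual ompi_communicator_t definition):
```
/* Illustrative sketch: referencing the collective function table through a
 * pointer instead of embedding it keeps the predefined communicator
 * objects small; the extra indirection is the possible, but unnoticeable,
 * performance cost. */
struct example_coll;                    /* table of collective callbacks */

struct example_communicator {
    /* ... other communicator fields ... */
    struct example_coll *c_coll;        /* was: struct example_coll c_coll; */
};
```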
Signed-off-by: George Bosilca <bosilca@icl.utk.edu>
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
Store oshmem-related per-proc data in an oshmem_proc_data_t struct, which
is placed in the padding section of an ompi_proc_t. This data can be
accessed via the OSHMEM_PROC_DATA(proc) macro.
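A minimal sketch of the overlay, with illustrative member names and padding size (the real accessor is the OSHMEM_PROC_DATA(proc) macro):
```
/* Illustrative sketch: the oshmem per-proc data is overlaid on the padding
 * reserved at the end of ompi_proc_t, so ompi_proc_t itself keeps its size
 * and layout. */
typedef struct example_proc_data {
    int   num_transports;               /* illustrative members only */
    char *transport_ids;
} example_proc_data_t;

struct example_proc {
    /* ... ompi_proc_t fields ... */
    char padding[64];                   /* reserved space for add-on data */
};

#define EXAMPLE_PROC_DATA(proc) \
    ((example_proc_data_t *)((proc)->padding))
```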
Fixes open-mpi/ompi#2023
* do not add -I/.../include/fca -I /.../include/fca_core to CPPFLAGS
* allow configure --with-fca
* search fca libs in both DIR/lib and DIR/lib64
* fix the description of the --with-fca option
This commit adds support for project_framework_component_* parameter
matching. This is the first step in allowing the same framework name
in multiple projects. This change also bumps the MCA component version
to 2.1.0.
All master frameworks have been updated to use the new component
versioning macro. An mca.h has been added to each project to add a
project specific versioning macro of the form
PROJECT_MCA_VERSION_2_1_0.
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
We recognize that this means other users of OPAL will need to "wrap" the opal_process_name_t if they desire to abstract it in some fashion. This is regrettable, and we are looking at possible alternatives that might mitigate that requirement. Meantime, however, we have to put the needs of the OMPI community first, and are taking this step to restore hetero and SPARC support.
WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down to OPAL
All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have expressed interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purposes.
UTK, with support from Sandia, developed a version of Open MPI where the entire communication infrastructure has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with a few exceptions (mainly BTLs that I have no way of compiling/testing). Thus, the completion of this RFC is tied to being able to complete this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic.
This commit was SVN r32317.
Deadlock when using the shmem_collect32()/shmem_collect64() routines and
any of the non-root PEs passes 0 as the number of elements.
The algorithm in _algorithm_central_collector() uses 0 as a special value
and therefore does not break out of its wait loop.
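A minimal standalone sketch of the failure mode, with illustrative names (not the exact _algorithm_central_collector() code):
```
/* Illustrative sketch: the root polls a pSync entry for each PE's element
 * count and treats 0 as "not posted yet", so a PE that legitimately
 * contributes 0 elements leaves the root spinning forever. */
static long example_wait_for_pe_size(volatile long *pSync_entry)
{
    long wait_pe_size = 0;
    while (0 == wait_pe_size) {         /* never exits for a zero-size PE */
        wait_pe_size = *pSync_entry;
    }
    return wait_pe_size;
}
```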
fixed by IgorI, reviewed by MikeD
cmr=v1.8.2:reviewer=ompi-rm1.8
This commit was SVN r31814.