openmpi

Автор	SHA1	Сообщение	Дата
Joseph Schuchart	52b52b8ebb	OSC RDMA: only touch pages before memory registration, don't fill them Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-09-29 07:45:07 +02:00
Joseph Schuchart	d11ccbada9	OSC RDMA: put memory of each process into separate pages Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-09-29 07:44:17 +02:00
Joseph Schuchart	1fdf05f634	pml/ob1: fix SPC potential over-counting when sending ack and requesting put mca_pml_ob1_recv_request_put_frag is used to request a put from the peer if get fails mca_pml_ob1_recv_request_ack_send_btl is used to send an acknowledgement, not data Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-09-28 15:38:47 +02:00
Joseph Schuchart	a9ed53aa66	pml/ob1: add SPC instrumentation of sent fin messages Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-09-28 15:19:29 +02:00
Joseph Schuchart	91a94201d2	PML UCX: add SPC instrumentation for message size sent/received Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-09-28 15:12:24 +02:00
George Bosilca	96fea22cdd	Don't allow some rank to don't count the collective if they have no data to exchange. This is the same logic as in `77eaa5c` applied to ialltoallw. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-09-24 13:29:01 -04:00
bosilca	21c9c666ab	Merge pull request #8039 from bosilca/fix/adapt Fix some corner cases with ADAPT	2020-09-18 17:18:41 -04:00
George Bosilca	77eaa5c8b8	Keep the non-blocking collective tags globally in sync. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-09-18 12:52:14 -04:00
George Bosilca	c98e387a53	Many fixes and improvements to ADAPT - Add support for fallback to previous coll module on non-commutative operations (#30) - Replace mutexes by atomic operations. - Use the correct nbc request type (for both ibcast and ireduce) * coll/base: document type casts in ompi_coll_base_retain_* - add module-wide topology cache - use standard instead of synchronous send and add mca parameter to control mode of initial send in ireduce/ibcast - reduce number of memory allocations - call the default request completion. - Remove the requests from the Fortran lookup conversion tables before completing and free it. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Joseph Schuchart <schuchart@hlrs.de> Co-authored-by: Joseph Schuchart <schuchart@hlrs.de>	2020-09-18 12:50:17 -04:00
Harumi Kuno	18baa5e291	use sync_send mask for ofi_create_recv_tag The upper 2 bits of an ompi tag encode the synchronize send and synchronize send ack. Because the mtl_ofi_create_recv_tag_CQD and mtl_ofi_create_recv_tag functions both use ompi_mtl_ofi.sync_proto_mask instead of ompi_mtl_ofi.sync_send when generating their "ignore" masks, they hide the ack bit, turning the tag into an "any tag receive" This is an issue because ssend is implemented by doing a send and receive internally. So if there happens to be an outstanding posted receive posted before the ssend, that receive will end up consuming the internal message intended for the ssend's internal receive. Updating mtl_ofi_create_recv_tag_CQD and mtl_ofi_create_recv_tag functions to use ompi_mtl_ofi.sync_send fixes this. Authored-by: John L. Byrne <john.l.byrne@hpe.com> Signed-off-by: Harumi Kuno <harumi.kuno@hpe.com>	2020-09-16 18:12:22 -06:00
Gilles Gouaillardet	fb8bfccb83	mpif-h: fix a typo in MPI_Status_f2f08() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2020-09-14 13:56:16 +09:00
Jeff Squyres	c04dc355de	mpi/man: convert MPI_Status conversion man pages to Markdown Convert the MPI_Status_f082f, MPI_Status_f082c, and MPI_Status_f2c man pages to Markdown. Fix some typos and improve the text a bit along the way. Left the raw NROFF redirect pages MPI_Status_f2f08, MPI_Status_c2f08, and MPI_Status_c2f files as they were -- they're 1-line redirects, and it seems simpler to leave those (vs. duplicating the Markdown). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-09-09 06:59:12 -07:00
Gilles Gouaillardet	e97d3ce645	Add missing MPI_Status conversion subroutines Only in C bindings: - MPI_Status_c2f08() - MPI_Status_f082c() In all bindings but mpif.h - MPI_Status_f082f() - MPI_Status_f2f08() and the PMPI_* related subroutines As initially inteded by the MPI forum, the Fortran to/from Fortran 2008 conversion subtoutines are not implemented in the mpif.h bindings. See the discussion at https://github.com/mpi-forum/mpi-issues/issues/298 Refs. open-mpi/ompi#1475 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2020-09-09 06:59:12 -07:00
Gilles Gouaillardet	7fce2f3057	update MPI_F08_status type Make the C MPI_F08_status type definition match the updated mpi_f08 type(MPI_Status) definition. This fix the inconsistency introduced in open-mpi/ompi@98bc7af7d4 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2020-09-09 06:59:12 -07:00
Jeff Squyres	560ebc5780	Merge pull request #7716 from bosilca/coll/adapt ADAPT: Event-driven collective implementation	2020-09-01 11:29:53 -04:00
Joseph Schuchart	4d420348f7	Fix MPI versions in MPI.3 manpage Thanks to Andy Riebs for reporting that on the Open MPI user mailing list (https://www.mail-archive.com/users@lists.open-mpi.org/msg34103.html) Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-08-31 09:21:26 +02:00
Aurelien Bouteiller	4df5fcf48c	errors_are_fatal_comm_handler takes a pointer to the error constant as input. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-08-26 16:05:30 -04:00
Howard Pritchard	c1c71b22b9	Merge pull request #8002 from hppritcha/topic/ofi_gni_prov_patch_for_mtl OFI: patch OFI MTL for GNI provider	2020-08-26 12:30:50 -06:00
Howard Pritchard	d6ac41cbbd	OFI: patch OFI MTL for GNI provider Uncovered a problem using the GNI provider with the OFI MTL. See https://github.com/ofiwg/libfabric/issues/6194. Related to #8001 Signed-off-by: Howard Pritchard <hppritcha@gmail.com>	2020-08-26 08:25:53 -06:00
William Zhang	57b95bcb45	coll/tuned: Revert RSB and RS default algorithms Reduce scatter block and reduce scatter algorithms were hitting correctness issues for non commutative strided tests. We will revert to the original default algorithms for those two collectives (basic linear and non overlapping respectively) in the non commutative op case. See #8010 Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-08-25 08:44:24 -07:00
George Bosilca	ee592f3672	Address the comments on the PR. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
Xi Luo	e59bde912e	Remove the code handling zero count cases in ADAPT. Set request in ibcast.c to empty when the count is 0. Signed-off-by: Xi Luo <xluo12@vols.utk.edu> Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
George Bosilca	c2970a3695	Correctly handle non-blocking collectives tags As it is possible to have multiple outstanding non-blocking collectives provided by different collective modules, we need a consistent mechanism to allow them to select unique tags for each instance of a collective. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
George Bosilca	8582e10d2b	Consistent handling of zero counts in the MPI API. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
George Bosilca	d71264569e	Fix the atomic management of the bcast and reduce freelist API consistent with other collective modules Add comments Other minor cleanups. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
bsergentm	a4be3bb93d	Coll/adapt Bull (#15 ) * piggybacking Bull functionalities * coll/adapt: Fix naming conventions and C11 atomic use This commit fixes some naming convention issues, such as function names which should follow the naming ompi_coll_adapt instead of mca_coll_adapt, reserved for component and module naming (cf. tuned collective component); It also fixes the use of _Atomic construct, which is only valid in C11. OPAL constructs have already been adapted to that use, so use opal_atomic_* types instead. * coll/adapt: Remove unused component field in module This commit removes an unneeded field referencing the component in the module of adapt, as it is already available through the mca_coll_adapt_component global variable. Signed-off-by: Marc Sergent <marc.sergent@atos.net> Co-authored-by: Lemarinier, Pierre <pierre.lemarinier@atos.net> Co-authored-by: pierrele <31764860+pierrele@users.noreply.github.com>	2020-08-24 12:13:38 -07:00
Xi Luo	fe73586808	Add ADAPT module Add comments in the ADAPT module Signed-off-by: Xi Luo <xluo12@vols.utk.edu> Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-08-24 12:13:38 -07:00
Howard Pritchard	eefaadf7f1	Merge pull request #8012 from hppritcha/topic/mprobe_with_ofi_fix ofi mtl: fix problem with mrecv	2020-08-18 17:21:37 -06:00
Howard Pritchard	e6f81ed6d6	ofi mtl: fix problem with mrecv the ofi mtl mrecv was not properly setting the message in/out arg to MPI_MRECV to MPI_MESSAGE_NULL. Signed-off-by: Howard Pritchard <hppritcha@gmail.com>	2020-08-18 15:39:19 -06:00
Jeff Squyres	20c772e733	Cleanup language about MPI exceptions --> errors MPI-4 is finally cleaning up its language: an MPI "exception" does not actually exist. The only thing that exists is an MPI "error" (and associated handlers). This commit replaces all relevant uses of the word "exception" with "error". Note that this is still applicable in versions of the MPI standard less than MPI-4.0 (indeed, nearly all the cases fixed in this commit are just changes to comments, anyway). One exception to this is the Java bindings, where there's an MPIException class. In hindsight, it probably should have been named MPIError, but changing it now would break anyone who is using the Java bindings. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-08-17 13:57:47 -04:00
Jeff Squyres	1e11933660	ompi: cleanup C++ MPI::ERRORS_THROW_EXCEPTIONS The C++ bindings were removed a while ago; MPI::ERRORS_THROW_EXCEPTIONS and MPI_ERRORS_THROW_EXCEPTIONS no longer exist. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-08-17 13:52:02 -04:00
Jeff Squyres	9a0f661a66	Merge pull request #7975 from wckzhang/btlcommonlist btl/ofi: Use common provider include/exclude list	2020-08-10 14:41:53 -04:00
Aurelien Bouteiller	e6c7731d9b	Missing function to populate java error handler abort Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-08-06 10:29:42 -04:00
Aurelien Bouteiller	efbc6ff6a5	Merge pull request #7798 from abouteiller/mpi-next/unbounderr-self MPI-4 error handling: 'unbound' errors to MPI_COMM_SELF	2020-08-03 15:59:14 -04:00
Aurelien Bouteiller	ee149fcfcb	MPI3 (unchanged in 4) says that errors after MPI_REQUEST_FREE are FATAL Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-31 17:49:38 -04:00
Aurelien Bouteiller	bec7dfc1b1	Errors in non-api calls remain fatal Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-31 17:49:35 -04:00
Aurelien Bouteiller	e0df0f4bd9	Make errors_mpi3 compat a global mpi-3 compatibility flag Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-31 17:48:47 -04:00
Aurelien Bouteiller	7dfe6c1adc	Thread-shift errors reported by PMIx to the main MPI progress engine Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> make things happen before the terminal call Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-31 17:48:44 -04:00
William Zhang	9b8f463a76	btl/ofi: Use common provider include/exclude list The btl/ofi does not currently utilize the common ofi include/exclude list. Added verification code similar to the mtl/ofi that will check if the info object is in the include or exclude list. If it isn't in the include list or is in the exclude list, validate_info will return OPAL_ERROR. The btl/ofi will no longer pass a provider name as a hint when calling getinfo, instead filtering the provider during validate_info. This patch also moves the is_in_list MTL function into common code and adds additional debugging output to the BTL to match the MTL standard. Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-07-31 12:13:00 -07:00
Tomislav Janjusic	cbfc9a3263	opal/mca/common/ucx: Use new TSD api Co-authored-by: Artem Y. Polyakov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-30 00:21:26 +03:00
Tomislav Janjusic	27ba4b612f	ompi/osc/ucx: Remove workerpool's global thread storage tables. Co-authored-by: Artem Y. Polyakov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-30 00:21:26 +03:00
Brian Barrett	41df122083	Merge pull request #7730 from wckzhang/newdefaults coll/tuned: Change the default collective algorithm selection	2020-07-28 15:27:46 -07:00
William Zhang	ce40cfbaa5	coll/tuned: Change the default collective algorithm selection The default algorithm selections were out of date and not performing well. After gathering data from OMPI developers, new default algorithm decisions were selected for: allgather allgatherv allreduce alltoall alltoallv barrier bcast gather reduce reduce_scatter_block reduce_scatter scatter These results were gathered using the ompi-collectives-tuning package and then averaged amongst the results gathered from multiple OMPI developers on their clusters. You can access the graphs and averaged data here: https://drive.google.com/drive/folders/1MV5E9gN-5tootoWoh62aoXmN0jiWiqh3 Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-07-28 10:41:48 -07:00
Jeff Squyres	c07d77fbf2	Merge pull request #7957 from bosilca/fix/avx_alignment Use the unaligned SSE memory access primitive.	2020-07-27 15:50:40 -04:00
Tomislav Janjusic	d809f6ba27	New TSD API interface fix for various components Co-authored by: Artem Polykaov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:40 +03:00
Tomislav Janjusic	cba5a0e117	Rename tsd interface function calls Co-authored by: Artem Polykaov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:07 +03:00
Aurélien Bouteiller	b37202c74e	Add compliance mode with MPI-4 routing of errors to MPI_COMM_SELF by default And other streamlining of aborting behavior. Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Remove OMPI_COMM_ERRORS and use NOHANDLE macros instead. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> route unbound errors to self error handler Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Do not raise the error handler from within components Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-23 05:09:29 -04:00
Joshua Ladd	366e92ce54	Merge pull request #7860 from vspetrov/hcoll_reduce_scatter Coll/Hcoll: reduce_scatter(block) interface	2020-07-22 09:45:34 -04:00
George Bosilca	b6d71aa893	Use the unaligned SSE memory access primitive. Alter the test to validate misaligned data. Fixes #7954. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-22 01:19:12 -04:00
Aurelien Bouteiller	816acbdfb1	Merge pull request #7840 from abouteiller/mpi-next/init-errh MPI-4: Initial error handler	2020-07-21 11:55:14 -04:00
Joseph Schuchart	60aa97b301	Merge pull request #7948 from devreal/osc-rdma-check-endpoints osc/rdma: fail query_btls if no endpoint for non-local peer is found	2020-07-20 15:14:25 +02:00
bosilca	1139d9ecae	Merge pull request #7931 from bosilca/fix/7928 Fix the BTL API conversion for the SMCUDA BTL	2020-07-18 17:35:39 -04:00
Joseph Schuchart	eebc451ec8	osc/rdma: fail query_btls if no endpoint for non-local peer is found Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-07-16 17:06:35 +02:00
Aurelien Bouteiller	5f1f7fe313	route errors to self/initial error handler depending upon the state of MPI initialization Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	bed909c3ba	Read the info key mpi_initial_errhandler from spawn/spawn_multiple Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Use the same env to transmit the initial error handler to spawnees Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	83d0f92152	Set the initial error handler onto predefined communicators Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> update to the predefined initial error handler selection Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	3cd85a9ec5	Add the initial_errhandler info key to MPI_INFO_ENV and populate the value from prun populated paremeters Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Allow errhandlers to invoke the initial error handler before MPI_INIT Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Indentation Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	703b8c356f	Make error_class and error_string callable before/after MPI_INIT/FINALIZE Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> make lazy initialization opal unlikely Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Ralph Castain	a574addce9	Ensure we init and protect values Scrub the entire ompi_rte.c file to initialize and protect values received from PMIx. Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-07-14 15:25:14 -07:00
George Bosilca	96e8cbe25f	First step on fixing the BTL API conversion for the SMCUDA BTL Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-13 14:46:10 -04:00
Joshua Ladd	aa8f7f4ede	Merge pull request #7893 from bureddy/cuda-ucx UCX: initialize cuda from ucx pml component	2020-07-13 14:18:48 -04:00
bosilca	1f237f5fc9	Merge pull request #7419 from bosilca/topic/avx512 Add support for AVX512/AVX2/SSE/MMX	2020-07-13 11:56:50 -04:00
Devendar Bureddy	2547e24c55	UCX: initialize cuda from ucx pml component Signed-off-by: Devendar Bureddy <devendar@mellanox.com>	2020-07-12 18:41:40 +03:00
Nathan Hjelm	d0c0cb7144	Merge pull request #7913 from hjelmn/btl_base_atomics_are_awesome btl: change argument type of BTL receive callbacks	2020-07-11 12:13:26 -06:00
dongzhong	14b3c70628	Add supports for MPI_OP using AVX512, AVX2 and MMX Add logic to handle different architectural capabilities Detect the compiler flags necessary to build specialized versions of the MPI_OP. Once the different flavors (AVX512, AVX2, AVX) are built, detect at runtime which is the best match with the current processor capabilities. Add validation checks for loadu 256 and 512 bits. Add validation tests for MPI_Op. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: dongzhong <zhongdong0321@hotmail.com> Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-10 21:25:35 -04:00
Jeff Squyres	98bc7af7d4	mpi_f08: fix Fortran-8-byte-INTEGER vs. C-4-byte-int issue It is important to have the mpi_f08 Type(MPI_Status) be the same length (in bytes) as the mpif.h status (which is an array of MPI_STATUS_SIZE INTEGERs). The reason is because MPI_Status_ctof() basically does the following: MPI_Fint f_status = ...; int s = (int*) &c_status; for i=0..sizeof(MPI_Status)/sizeof(int) f_status[i] = c_status[i]; Meaning: the Fortran status needs to be able to hold as many INTEGERs are there are C int's that can fit in sizeof(MPI_Status) bytes. This is because a Fortran INTEGER may be larger than a C int (e.g., Fortran 8 bytes vs. C 4 bytes). Hence, the assignment on the Fortran side will take sizeof(INTEGER) bytes for each sizeof(int) bytes in the C MPI_Status. This commit pads out the mpi_f08 Type(MPI_Status) with enough INTEGERs to make it the same size as an array of MPI_TYPE_SIZE INTEGERs. Hence, MPI_Status_ctof() will work properly, regardless of whether it is assinging to an mpi_f08 Type(MPI_Status) or an mpif.h array of MPI_STATUS_SIZE INTEGERs. Thanks to @ahaichen for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-07-09 15:12:43 -07:00
Nathan Hjelm	88f51fbb8e	btl: change argument type of BTL receive callbacks This commit updates the btl interface to change the parameters passed to receive callbacks. The interface used to pass the tag, a btl base descriptor, and the callback context. Most of the values in the btl base descriptor were unused and only helped simplify the callbacks from the self btl. All of the arguments have now been replaced with a single receive callback descriptor. This descriptor contains the incoming endpoint, data segment(s), tag, and callback context. All btls have been updated to use the new callback and the btl interface version has been bumped to v3.2.0. As part of this change the descriptor argument (and the segments contained within it) have been marked as const. The were treated as const before but this change could allow the compiler to make better optimization decisions and will enforce that the callback does not attempt to change the data in the descriptor. Signed-off-by: Nathan Hjelm <hjelmn@google.com>	2020-07-08 07:38:46 -07:00
Austen Lauria	dbc56758b6	Merge pull request #7802 from badgerious/mtl_ofi_cqread_break mtl/ofi: break from progress loop when events are read	2020-07-06 09:20:07 -04:00
Austen Lauria	9b86f1442a	Merge pull request #7823 from jsquyres/pr/put-osc-pt2pt-back Fix typos in OSC RDMA BTL allowlist	2020-06-30 10:55:16 -04:00
Todd Kordenbrock	4358e75a75	Merge pull request #7866 from tkordenbrock/topic/master/portals4.fix-inappropriate-use-of-abort portals4: fix inappropriate use of abort() in mtl-portals4 and coll-portals4 components	2020-06-30 08:46:03 -05:00
Valentin Petrov	d366812030	coll/hcoll: compile warning fix Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2020-06-30 09:44:46 +03:00
Valentin Petrov	1d54071fc1	coll/hcoll: reduce_scatter(block) interface Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2020-06-30 09:44:46 +03:00
Austen Lauria	3ed466e629	Merge pull request #7800 from abouteiller/mpi-next/errors_abort MPI4: Add ERRORS_ABORT infrastructure	2020-06-29 15:45:29 -04:00
Austen Lauria	a26e494953	Merge pull request #7882 from devreal/osc-rdma-noncontig-requests osc rdma: check for outstanding fragments before completing a request (II)	2020-06-29 09:51:47 -04:00
Joseph Schuchart	caed3b2eed	osc rdma: check for outstanding fragments before completing a request in ompi_osc_rdma_put_complete_flush as well Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-26 22:19:21 +02:00
Austen Lauria	5fa7ca7c15	Merge pull request #7858 from tkordenbrock/topic/master/portals4.call-pml-add_procs mtl-portals4: use the active PML to call add_procs()	2020-06-26 14:56:57 -04:00
Joseph Schuchart	2c36d37033	Merge pull request #7871 from devreal/osc-ucx-rget-rput-fetch-alignment OSC UCX: make sure no-op fetch in rget/rput is properly aligned	2020-06-26 15:58:51 +02:00
Joseph Schuchart	1314ef7668	OSC UCX: Remove stale free from merge conflict Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-25 19:01:53 +02:00
Joseph Schuchart	634f67b216	Merge pull request #7843 from devreal/clang-tidy-free Some fixups for issues detected by clang-tidy	2020-06-25 17:30:04 +02:00
Artem Polyakov	907f4e196a	Merge pull request #6980 from devreal/ucx-acc-singel-intrinsics UCX osc: add support for acc_single_intrinsic	2020-06-25 07:39:42 -07:00
Joseph Schuchart	c1f7776341	OSC UCX: make sure no-op fetch in rget/rput is properly aligned Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-25 16:16:58 +02:00
Austen Lauria	7814f4195c	Merge pull request #7845 from devreal/stack-fixes Fix unexpected optimizations detected by STACK	2020-06-25 08:15:09 -04:00
Todd Kordenbrock	04b94637dd	mtl-portals4: replace abort() with ompi_rte_abort() coll-portals4: replace abort() with ompi_rte_abort() Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com>	2020-06-24 11:31:26 -05:00
Joseph Schuchart	e3b417c776	Add missing copyright header Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	e215eff43d	UCX osc: atomic fetch-and-op only on 32 and 64bit values Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	434c9055ee	UCX osc: fall back to get-compare-put for unsupported datatypes Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	7d5a6e3e8b	UCX osc: safely load/store 64bit integer from variable size pointer Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	5f786bcce4	UCX osc: make MPI_Fetch_and_op non-blocking if possible Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	d8696aa8c4	UCX osc: centralize decision on whether to use AMOs Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	427d4bd226	UCX osc: do not acquire accumulate lock if exclusive lock was taken Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	471d76777a	UCX osc: fence active operations before releasing accumulate lock and free memory if required Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	4d7a3856fa	UCX osc: Use accumulate for operations/datatypes that are not covered by UCX Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	899f58cef5	UCX osc: simplify output address computation Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	d888b4fd76	UCX osc: correctly handle MPI_NO_OP Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	7cfc0e71da	UCX osc: allow to asynchronously compare-and-swap Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	557ae80858	UCX osc: allow for overlap with (some) request-based atomic operations Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	1a3c6bbf35	UCX osc: re-use value returned by cswap to save additional get Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	8606a02b87	UCX osc: fix macro parameter name usage in OMPI_OSC_UCX_REQUEST_RETURN Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	d448efd49c	UCX osc: properly clean up requests in case of errors Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00
Joseph Schuchart	73a183408f	UCX osc: add support for acc_single_intrinsic info key / mca param Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-06-23 12:41:52 +02:00

1 2 3 4 5 ...

10893 Коммитов