openmpi

Автор	SHA1	Сообщение	Дата
William Zhang	ce40cfbaa5	coll/tuned: Change the default collective algorithm selection The default algorithm selections were out of date and not performing well. After gathering data from OMPI developers, new default algorithm decisions were selected for: allgather allgatherv allreduce alltoall alltoallv barrier bcast gather reduce reduce_scatter_block reduce_scatter scatter These results were gathered using the ompi-collectives-tuning package and then averaged amongst the results gathered from multiple OMPI developers on their clusters. You can access the graphs and averaged data here: https://drive.google.com/drive/folders/1MV5E9gN-5tootoWoh62aoXmN0jiWiqh3 Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-07-28 10:41:48 -07:00
Austen Lauria	d0152eb51e	Merge pull request #7940 from awlauria/revert_libevent_commit Revert "Address a race condition in libevent select."	2020-07-28 11:34:59 -04:00
Jeff Squyres	c07d77fbf2	Merge pull request #7957 from bosilca/fix/avx_alignment Use the unaligned SSE memory access primitive.	2020-07-27 15:50:40 -04:00
Artem Polyakov	e5ef80fe8c	Merge pull request #7936 from janjust/master-new-tsd-thread-api Master: new thread-specific-data (tsd) api	2020-07-24 14:58:03 -07:00
Ralph Castain	863a058f8d	Merge pull request #7964 from rhc54/topic/sync Sync to PRRTE master	2020-07-24 14:57:32 -07:00
Ralph Castain	8c0269cd4f	Sync to PRRTE master Pickup the FT and libev cleanups Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-07-24 14:11:34 -07:00
Tomislav Janjusic	d809f6ba27	New TSD API interface fix for various components Co-authored by: Artem Polykaov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:40 +03:00
Tomislav Janjusic	cba5a0e117	Rename tsd interface function calls Co-authored by: Artem Polykaov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:07 +03:00
Tomislav Janjusic	cb1955bb53	Fix renamed interface functions for argo, q, and pthreads Co-authored by: Artem Polykaov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:07 +03:00
Tomislav Janjusic	07dc86eb3a	opal/thread: New TSD API Co-authored-by: Artem Polyakov <artemp@mellanox.com> Signed-off-by: Tomislav Janjusic <tomislavj@mellanox.com>	2020-07-24 18:29:07 +03:00
Ralph Castain	06c585c316	Merge pull request #7962 from rhc54/topic/sync Sync to PMIx and PRRTE master	2020-07-23 16:22:32 -07:00
Ralph Castain	c0bc89dc50	Sync to PMIx and PRRTE master Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-07-23 12:35:17 -07:00
Aurelien Bouteiller	06c563625a	Add a test for mpi_errors_mpi3 behavior and non-catastrophic errors Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-23 05:09:29 -04:00
Aurélien Bouteiller	b37202c74e	Add compliance mode with MPI-4 routing of errors to MPI_COMM_SELF by default And other streamlining of aborting behavior. Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Remove OMPI_COMM_ERRORS and use NOHANDLE macros instead. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> route unbound errors to self error handler Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Do not raise the error handler from within components Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-23 05:09:29 -04:00
George Bosilca	c4e88a43a3	Check unaligned ops for correctness. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-22 11:26:07 -04:00
Joshua Ladd	366e92ce54	Merge pull request #7860 from vspetrov/hcoll_reduce_scatter Coll/Hcoll: reduce_scatter(block) interface	2020-07-22 09:45:34 -04:00
George Bosilca	b6d71aa893	Use the unaligned SSE memory access primitive. Alter the test to validate misaligned data. Fixes #7954. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-22 01:19:12 -04:00
Jeff Squyres	30ba603c2c	Merge pull request #7953 from cniethammer/configure-leak-fix Fix memory leak in configure, which prevents leak sanitizer usage	2020-07-21 16:34:27 -04:00
Christoph Niethammer	6564c1b942	Fix memory leak in configure, which prevents leak sanitizer usage If building Open MPI with sanitizers, e.g $ configure CC=clang CFLAGS=-fsanitize=address .... configure test programs are also build with the sanitizers and will report errors resulting in configure to fail. Signed-off-by: Christoph Niethammer <niethammer@hlrs.de>	2020-07-21 21:28:29 +02:00
Aurelien Bouteiller	816acbdfb1	Merge pull request #7840 from abouteiller/mpi-next/init-errh MPI-4: Initial error handler	2020-07-21 11:55:14 -04:00
Joseph Schuchart	60aa97b301	Merge pull request #7948 from devreal/osc-rdma-check-endpoints osc/rdma: fail query_btls if no endpoint for non-local peer is found	2020-07-20 15:14:25 +02:00
bosilca	1139d9ecae	Merge pull request #7931 from bosilca/fix/7928 Fix the BTL API conversion for the SMCUDA BTL	2020-07-18 17:35:39 -04:00
Joseph Schuchart	eebc451ec8	osc/rdma: fail query_btls if no endpoint for non-local peer is found Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-07-16 17:06:35 +02:00
Aurelien Bouteiller	7118755ae8	Add a tester for the initial error handler Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurelien Bouteiller	5f1f7fe313	route errors to self/initial error handler depending upon the state of MPI initialization Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	bed909c3ba	Read the info key mpi_initial_errhandler from spawn/spawn_multiple Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Use the same env to transmit the initial error handler to spawnees Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	83d0f92152	Set the initial error handler onto predefined communicators Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> update to the predefined initial error handler selection Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	3cd85a9ec5	Add the initial_errhandler info key to MPI_INFO_ENV and populate the value from prun populated paremeters Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> Allow errhandlers to invoke the initial error handler before MPI_INIT Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Indentation Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Aurélien Bouteiller	703b8c356f	Make error_class and error_string callable before/after MPI_INIT/FINALIZE Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu> make lazy initialization opal unlikely Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-07-16 03:10:32 -04:00
Ralph Castain	7702dfcdd2	Merge pull request #7942 from rhc54/topic/init Ensure we init and protect values	2020-07-15 07:59:01 -07:00
George Bosilca	8bc1f3d8fb	Don't allow any asynchronous CUDA operations. There are 2 reasons for this: - pending CUDA events are not progressed by this BTL, so anything that becomes asychronous will never be completed. - we use the packed data on the shared memory backing file, and this will be returned to the peer process upon return (thus if we copy asynchronously we might not copy the right data). Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-15 01:37:09 -04:00
George Bosilca	0e32b0acef	Avoid a lock if no CUDA IPC operations are pending. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-15 01:35:34 -04:00
Ralph Castain	a574addce9	Ensure we init and protect values Scrub the entire ompi_rte.c file to initialize and protect values received from PMIx. Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-07-14 15:25:14 -07:00
Austen Lauria	67d90166cf	Revert "Address a race condition in libevent select." We do not want to be patching upstream components anymore. The proper method is to get this merged upstream, then pull it in the next upstream release. This reverts commit c39fb5758a772c062e20db9b42f2b06805884802. Signed-off-by: Austen Lauria <awlauria@us.ibm.com>	2020-07-14 16:23:21 -04:00
George Bosilca	fd4ca394e2	Make the smcuda BTL great again. It has been broken for months because of the lack of initialization of the HWLOC library. The smcuda process creating the backing file (local rank 0) uses opal_cache_line_size to align the objects in the backing file, and the opal_cache_line_size is initialized by default to 128. Later on, when the rest of the processes attach the same backing file, HWLOC has been called and the cache size has now been updated to the correct value. If this value is different than the default one (and they are as most cache sizes are 64 bytes right now) the objects in the backing file will be misaligned. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-14 01:48:08 -04:00
George Bosilca	96e8cbe25f	First step on fixing the BTL API conversion for the SMCUDA BTL Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-13 14:46:10 -04:00
Joshua Ladd	aa8f7f4ede	Merge pull request #7893 from bureddy/cuda-ucx UCX: initialize cuda from ucx pml component	2020-07-13 14:18:48 -04:00
bosilca	1f237f5fc9	Merge pull request #7419 from bosilca/topic/avx512 Add support for AVX512/AVX2/SSE/MMX	2020-07-13 11:56:50 -04:00
Devendar Bureddy	2547e24c55	UCX: initialize cuda from ucx pml component Signed-off-by: Devendar Bureddy <devendar@mellanox.com>	2020-07-12 18:41:40 +03:00
Nathan Hjelm	d0c0cb7144	Merge pull request #7913 from hjelmn/btl_base_atomics_are_awesome btl: change argument type of BTL receive callbacks	2020-07-11 12:13:26 -06:00
dongzhong	14b3c70628	Add supports for MPI_OP using AVX512, AVX2 and MMX Add logic to handle different architectural capabilities Detect the compiler flags necessary to build specialized versions of the MPI_OP. Once the different flavors (AVX512, AVX2, AVX) are built, detect at runtime which is the best match with the current processor capabilities. Add validation checks for loadu 256 and 512 bits. Add validation tests for MPI_Op. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: dongzhong <zhongdong0321@hotmail.com> Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-10 21:25:35 -04:00
Howard Pritchard	677b662295	Merge pull request #7912 from tomhers/fix_opal_ofi_compile_bug BTL/OFI: Fix missing include file.	2020-07-10 14:43:38 -06:00
Jeff Squyres	b62545ef4c	Merge pull request #7921 from jsquyres/pr/fix-fortran8integer-vs-c4int-issue mpi_f08: fix Fortran-8-byte-INTEGER vs. C-4-byte-int issue	2020-07-10 09:15:55 -04:00
Jeff Squyres	62d1f89622	Merge pull request #7917 from jmciver/fix/rpmbuild-rpm-build-parameter Fix buildrpm.sh "-r" option used for RPM options specification	2020-07-09 20:58:07 -04:00
Brian Barrett	4a6e1629e8	Merge pull request #7906 from dancejic/multi common/ofi: added address format check to fix provider selection	2020-07-09 16:59:02 -07:00
Jeff Squyres	98bc7af7d4	mpi_f08: fix Fortran-8-byte-INTEGER vs. C-4-byte-int issue It is important to have the mpi_f08 Type(MPI_Status) be the same length (in bytes) as the mpif.h status (which is an array of MPI_STATUS_SIZE INTEGERs). The reason is because MPI_Status_ctof() basically does the following: MPI_Fint f_status = ...; int s = (int*) &c_status; for i=0..sizeof(MPI_Status)/sizeof(int) f_status[i] = c_status[i]; Meaning: the Fortran status needs to be able to hold as many INTEGERs are there are C int's that can fit in sizeof(MPI_Status) bytes. This is because a Fortran INTEGER may be larger than a C int (e.g., Fortran 8 bytes vs. C 4 bytes). Hence, the assignment on the Fortran side will take sizeof(INTEGER) bytes for each sizeof(int) bytes in the C MPI_Status. This commit pads out the mpi_f08 Type(MPI_Status) with enough INTEGERs to make it the same size as an array of MPI_TYPE_SIZE INTEGERs. Hence, MPI_Status_ctof() will work properly, regardless of whether it is assinging to an mpi_f08 Type(MPI_Status) or an mpif.h array of MPI_STATUS_SIZE INTEGERs. Thanks to @ahaichen for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-07-09 15:12:43 -07:00
Nathan Hjelm	ad81839dd5	Merge pull request #7916 from hjelmn/allow_splitting_ompi_c_compiler_and_end_user_c_compiler_so_eventually_we_can_require_a_sane_standards_compliance_compiler_for_building_open_mpi config: add support for setting the wrapper C compiler	2020-07-09 14:15:46 -06:00
Jeff Squyres	27d30f3d17	Merge pull request #7910 from jsquyres/pr/mr-clean-for-submodules Trivial helper script to git clean submodules	2020-07-09 09:58:53 -04:00
Nathan Hjelm	88f51fbb8e	btl: change argument type of BTL receive callbacks This commit updates the btl interface to change the parameters passed to receive callbacks. The interface used to pass the tag, a btl base descriptor, and the callback context. Most of the values in the btl base descriptor were unused and only helped simplify the callbacks from the self btl. All of the arguments have now been replaced with a single receive callback descriptor. This descriptor contains the incoming endpoint, data segment(s), tag, and callback context. All btls have been updated to use the new callback and the btl interface version has been bumped to v3.2.0. As part of this change the descriptor argument (and the segments contained within it) have been marked as const. The were treated as const before but this change could allow the compiler to make better optimization decisions and will enforce that the callback does not attempt to change the data in the descriptor. Signed-off-by: Nathan Hjelm <hjelmn@google.com>	2020-07-08 07:38:46 -07:00
George Bosilca	ddfb4def2d	Second take on fixing the Inel _Atomic atomic operation warning. We completely disable C11 atomic op support for _Atomic for all Intel compiler prior to 20200310 (which is currently the latest released), by switching to our pre-C11 atomic operations. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-07-08 09:58:43 -04:00

1 2 3 4 5 ...

30961 Коммитов