openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	fcf2766a03	AVX code generation improvements 1. Allow fallback to a lesser AVX support during make Due to the fact that some distro restrict the compiule architecture during make (while not setting any restrictions during configure) we need to detect the target architecture also during make in order to restrict the code we generate. 2. Add comments and better protect the arch specific code. Identify all the vectorial functions used and clasify them according to the neccesary hardware capabilities. Use these requirements to protect the code for load and stores (the rest of the code being automatically generated it is more difficult to protect). 3. Correctly check for AVX* support. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2021-01-04 13:43:18 -05:00
George Bosilca	31068e063b	Major update to the AVX* detection and support 1. Consistent march flag order between configure and make. 2. op/avx: give the option to skip some tests it is possible to skip some intrinsic tests by setting some environment variables to "no" before invoking configure: - ompi_cv_op_avx_check_avx512 - ompi_cv_op_avx_check_avx2 - ompi_cv_op_avx_check_avx - ompi_cv_op_avx_check_sse41 - ompi_cv_op_avx_check_sse3 3. op/avx: update AVX512 flags try -mavx512f -mavx512bw -mavx512vl -mavx512dq instead of -march=skylake-avx512 since the former is less likely to conflict with user provided CFLAGS (e.g. -march=...) Thanks Bart Oldeman for pointing this. 4. op/avx: have the op/avx library depend on libmpi.so Refs. open-mpi/ompi#8323 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2021-01-04 13:41:39 -05:00
Ralph Castain	cd49049ab9	Merge pull request #8320 from rhc54/topic/warn Silence warnings	2020-12-27 17:10:37 -08:00
Ralph Castain	607052252f	Update PMIx and PRRTE pointers Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-27 14:47:59 -08:00
Ralph Castain	194e66b3e2	Silence warnings Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-27 12:18:40 -08:00
Raghu Raja	c8e1847372	Merge pull request #8244 from dancejic/oversub_message common/ofi: fixing error message to be a debug output	2020-12-23 11:18:22 -08:00
Raghu Raja	c752534e43	Merge pull request #8316 from wzamazon/master_fix_btl_ofi_leak btl/ofi: fix memory leaks in error handling path	2020-12-23 11:17:57 -08:00
Wei Zhang	01f5d6896d	btl/ofi: fix memory leaks in error handling path Currently, mca_btl_ofi_put (get, aop, afop, acswp) will allocate a mca_btl_ofi_rdma_completion_t object and use it as the context for fi_write/fi_read/fi_atomic/fi_fetch_atomic/fi_compare_atomic. In normal code path, this completion object when processing completion entry. However, when error happened when calling fi_write/fi_read/fi_atomic/fi_fetch_atomic/fi_compare_atomic, there will be no completion entry from libfabric, in this case the completion object's memory is leaked. This patch address the issue by calling opal_free_list_return() in the error handling code path. Signed-off-by: Wei Zhang <wzam@amazon.com>	2020-12-22 21:42:24 +00:00
Ralph Castain	8ff2277b7e	Merge pull request #8298 from rhc54/topic/prte Update PRRTE pointer to include ULFM fixes	2020-12-17 18:05:28 -08:00
Ralph Castain	745f4930fc	Update PRRTE pointer to include ULFM fixes Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-17 16:44:28 -08:00
Edgar Gabriel	38e3936721	Merge pull request #8293 from edgargabriel/topic/lustre-perf-fix Topic/lustre perf fix	2020-12-17 14:22:45 -06:00
Edgar Gabriel	56dbd096d3	io/ompio: remove the special handling of Lustre in the selection logic ompio is now the default on Lustre as well Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-17 13:26:51 -06:00
Edgar Gabriel	aa2d21ee50	lustre_file_open: avoid explicit locking on lustre file systems Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-17 12:22:54 -06:00
Edgar Gabriel	2c61074739	dynamic_gen2: code cleanup remove now unused mca parameter, get rid of an unnecesary if-else part, and move setting the flag outside of the while loop. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-17 11:43:23 -06:00
Jeff Squyres	d36977c925	Merge pull request #8296 from rhc54/topic/pmixv4 Update PMIx/PRRTE pointers	2020-12-17 11:59:17 -05:00
Ralph Castain	76ac72fd24	Update PMIx/PRRTE pointers Switch PMIx to v4.0 branch Update PRRTE to current master Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-17 07:53:43 -08:00
Edgar Gabriel	d65480df35	fbtl_posix_pwritev: add datasieving support for write its however restricted to collective I/O operations, at this point only from vulcan and dynamic_gen2. required some more infrastructure to be added to recognize individual I/O and multi-threaded environments. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-16 15:34:45 -06:00
Edgar Gabriel	90d8c8c39c	fbtl_posix_preadv: limit the size of the temporary buffer when using data sieving. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-16 11:17:15 -06:00
Edgar Gabriel	dbf0d6e5a3	fbtl_posix: add control logic for data sieving only implemented for read at the moment, but the parameters for write are also in place. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-15 18:12:07 -06:00
Ralph Castain	3f6877b510	Merge pull request #8291 from rhc54/topic/up2 Update PMIx/PRRTE pointers again	2020-12-15 12:29:35 -08:00
Ralph Castain	35bcda788a	Update PMIx/PRRTE pointers again Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-15 09:12:58 -08:00
Jeff Squyres	8d814cf511	Merge pull request #8286 from jsquyres/pr/disable-libevent-checks libevent: disable checks	2020-12-15 11:47:32 -05:00
Ralph Castain	545ebe2c27	Merge pull request #8287 from rhc54/topic/up Advance PMIx and PRRTE pointers	2020-12-15 06:22:23 -08:00
Ralph Castain	b27158a479	Advance PMIx and PRRTE pointers Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-12-14 18:38:27 -08:00
Jeff Squyres	59d98373b7	libevent: disable checks Disable libevent regression checks so that we do not need to require Python for Open MPI builds. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-12-14 14:04:49 -08:00
Gilles Gouaillardet	26aeb59f6d	Merge pull request #8280 from ggouaillardet/topic/dist_cleanup do not include automatically generated file into the dist tarball	2020-12-11 23:08:09 +09:00
Gilles Gouaillardet	6f282bd385	do not include automatically generated file into the dist tarball - ompi/mpi/fortran/use-mpi-f08/mod/mpi-f08-interfaces.h - ompi/mpiext/shortfloat/c/mpiext_shortfloat_c.h - ompi/mpiext/shortfloat/mpif-h/mpiext_shortfloat_mpifh.h - ompi/mpiext/shortfloat/use-mpi-f08/mpiext_shortfloat_usempif08.h Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2020-12-11 16:01:33 +09:00
Edgar Gabriel	5385e5f85f	fbtl/posix/preadv.c: first cut on adding data sieving the lack of performing data sieving has been identified as a main reason for the poor performance in some instances on the Lustre file system. This commit introduces the fundamental ability to perform data sieving for read operations (which should not be controversial). The code itself is correct, what is still lacking is a) the logic when and how to activate data sieving and b) the logic to limit the size of the temporary buffer when doing data sieving. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-10 14:16:07 -06:00
Edgar Gabriel	f70bb4774a	dynamic_gen2/file_write_all: fix chunk assignment per stride the dynamic_gen_file_write_all component distinguishes between the amount of data communicated to aggregators, and the amount of data written in a cycle by the aggregator (in contrary e.g. to the vulcan component). There was a bug in calculating which chunks have to be written in a cycle by an aggregator: we added as many elements into the io_array until we filled one stripe. Unfortuantely, the metric used was the amount of data instead of ensuring that all offsets fall within a single stripe. This commit fixes this issue. Note, the bug did not create a correctness problem, just a performance problem in case there were gaps in the file view. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2020-12-10 08:53:11 -06:00
Jeff Squyres	2d5e666808	Merge pull request #8267 from hpcraink/man-pages Fix man-page for MPI_File_set_view	2020-12-07 12:16:58 -05:00
Rainer Keller	3202a16e7d	Fix man-page for MPI_File_set_view The datarep may not be NULL, as check was added in ba955883329. Signed-off-by: Rainer Keller <rainer.keller@hs-esslingen.de>	2020-12-04 22:05:53 +01:00
Yossi Itigin	47fb05f82a	Merge pull request #8184 from gleon99/master Replace usage of the deprecated NB API of UCX with NBX	2020-11-25 13:02:48 +02:00
Leonid Genkin	7f9a305a64	Replace usage of the deprecated NB API of UCX with NBX Signed-off-by: Leonid Genkin <lgenkin@nvidia.com>	2020-11-25 12:21:06 +02:00
Jeff Squyres	a8f883a73a	Merge pull request #8250 from devreal/fix-han-commselect-new coll/han: fix coll preference selection in mca_coll_han_comm_create_new	2020-11-24 09:21:57 -05:00
Joseph Schuchart	33105b031b	coll/han: fix coll preference selection in mca_coll_han_comm_create_new Exclude HAN, don't include it. Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-11-24 10:09:14 +01:00
Ralph Castain	7dc9815340	Merge pull request #8245 from rhc54/topic/up Update PMIx/PRRTE pointers	2020-11-23 12:07:08 -08:00
Ralph Castain	3c2dbac64f	Update PMIx/PRRTE pointers Resolves the PRRTE launch scale limitation Signed-off-by: Ralph Castain <rhc@pmix.org>	2020-11-23 11:13:03 -08:00
Nikola Dancejic	5af8497a05	common/ofi: fixing error message to be a debug output A path that was being used in oversubscribed cases caused a help message to output for each process. This replaces the help message with a debug output to prevent excessive output unless the user enables debug output. Signed-off-by: Nikola Dancejic <dancejic@amazon.com>	2020-11-23 08:35:33 -08:00
Jeff Squyres	61dc29f0f6	Merge pull request #8239 from ggouaillardet/topic/OPAL_GET_VERSION configury: fix OPAL_GET_VERSION	2020-11-23 10:56:05 -05:00
Jeff Squyres	3edd62e568	Merge pull request #8203 from jsquyres/pr/fix-warnings Fix many compiler warnings	2020-11-23 10:15:15 -05:00
Gilles Gouaillardet	c134664c02	Merge pull request #8222 from ggouaillardet/topic/libtool_bigsur autogen.pl: patch libtool.m4 for OSX Big Sur	2020-11-23 08:10:50 +09:00
Gilles Gouaillardet	3f45ceda1b	autogen.pl: patch libtool.m4 for OSX Big Sur Thanks FX Coudert for reporting this issue and pointing to a solution. Refs. open-mpi/ompi#8218 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-11-22 10:21:46 -08:00
Gilles Gouaillardet	930d3c4695	configury: fix OPAL_GET_VERSION - fix path to getdate.sh - do not prepend "date" to the revision - support git worktree Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2020-11-22 21:05:32 +09:00
Artem Polyakov	159856f9aa	Merge pull request #8219 from hoopoepg/topic/fixed-coverity-warning-wpool PML/UCX/WPOOL: fixed coverity issue	2020-11-20 17:31:04 -08:00
Raghu Raja	38d2f12112	Merge pull request #8220 from devreal/fix-coll-base-preference Fix preference treatment in coll/base	2020-11-20 08:14:37 -08:00
Jeff Squyres	28779321a7	Merge pull request #8233 from Webcretaire/typo Typo in ob1 comments, and uniform conditions	2020-11-19 17:07:49 -05:00
Julien EMMANUEL	7d493c6bcd	Uniform conditions in ob1 recv In ob1 we have four similar conditions but they are not written in a uniform way Signed-off-by: Julien EMMANUEL <julien.emmanuel@inria.fr>	2020-11-19 22:27:49 +01:00
Joseph Schuchart	1cdc85564e	coll/han: reduce default segment size for reduce/allreduce to 64k This has shown to be more effective in achieving overlap of inter- and intra-node communication and reduces the inital delay before hitting the network. Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-11-19 19:23:32 +01:00
Joseph Schuchart	971d58c524	coll/han: remove references to experimental solo and shared collective components Also make coll/tuned the default for shared memory communication as coll/sm has shown performance issues that need investigation. Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-11-19 19:22:06 +01:00
Joseph Schuchart	09c2f4af94	coll/[sm\|han\|adapt]: don't disqualify on priority 0 Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>	2020-11-19 19:10:09 +01:00

1 2 3 4 5 ...

31189 Коммитов