openmpi

Автор	SHA1	Сообщение	Дата
Nysal Jan K.A	3529d44702	osc/ucx: Fix data corruption with non-contiguous accumulates Signed-off-by: Nysal Jan K.A <jnysal@in.ibm.com>	2019-07-24 13:07:59 +05:30
bosilca	94f26f5a51	Merge pull request #6695 from bosilca/fix/vector_stride_0 A big refresh of the datatype engine	2019-07-23 15:20:14 -04:00
Ralph Castain	8f32a59304	Merge pull request #6830 from rhc54/topic/dpm Provide locality for all procs on node	2019-07-23 08:10:57 -07:00
Nysal Jan K A	20dd06c151	Merge pull request #6826 from nysal/ucx_nolocks_infokey osc/ucx: Add support for the no_locks info key	2019-07-23 15:33:39 +05:30
Gilles Gouaillardet	102a46e28a	Merge pull request #6812 from ggouaillardet/topic/mpifh_c_ierr fortran/mpif-h: fix C to Fortran error code conversion	2019-07-23 17:07:26 +09:00
KAWASHIMA Takahiro	facf8c5e98	pcollreq/mpif-h: fix MPIX_Alltoallw_init() binding These issues were introduced in the recent commit `b71af0eca0`. This commit fixes Coverity CID 1451661 and 1451660. Though `c_info` part was an actual bug, the `c_sendtypes` part was not. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-07-23 08:45:17 +09:00
Ralph Castain	d202e10c14	Provide locality for all procs on node Update PMIx to latest master to get supporting updates. For connect/accept (part of comm_spawn as well), lookup locality for all participating procs on the node and compute the relative locality so it can be used for MPI operations. Signed-off-by: Ralph Castain <rhc@pmix.org>	2019-07-22 09:23:38 -07:00
Nysal Jan K.A	14808922cf	osc/ucx: Add support for the no_locks info key Signed-off-by: Nysal Jan K.A <jnysal@in.ibm.com>	2019-07-18 17:29:01 +05:30
Gilles Gouaillardet	b71af0eca0	pcollreq/mpif-h: fix MPIX_Alltoallw_init() binding Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-17 11:58:18 +09:00
Gilles Gouaillardet	ed703bec1b	fortran/mpif-h: fix [i]alltoallw bindings Fix a regression introduced in open-mpi/ompi@cdaed89d04 Fixes CID 1451610, 1451611 and 1451612 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-17 11:14:35 +09:00
Mikhail Brinskii	404c480068	COLL/TUNED: Update alltoall selection rule for mlx Use linear with sync alltoall algorithm for certain message/comm size ranges. Does not affect default fixed decision, unless HPCX (with its custom parameters) is used or corresponding mca is set. Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-07-13 23:27:40 +03:00
Gilles Gouaillardet	cdaed89d04	fortran/mpif-h: fix MPI_[I]Alltoallw() binding - ignore sendcounts, sendispls and sendtypes arguments when MPI_IN_PLACE is used - use the right size when an inter-communicator is used. Thanks Markus Geimer for reporting this. Refs. open-mpi/ompi#5459 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-13 22:34:30 +09:00
Gilles Gouaillardet	223e6cc537	fortran/mpif-h: fix C to Fortran error code conversion - remove incorrect use of OMPI_INT_2_FINT() - use homogenous syntax (e.g. c_ierr = PMPI_...()) Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-13 18:36:12 +09:00
Gilles Gouaillardet	5de5e751ed	fortran/use-mpi-f08: do not slurp the sentinel module files A sentinel is only an internal Fortran module and hence should not be slurped into libmpi_usempif08.so Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-13 16:50:55 +09:00
Gilles Gouaillardet	020a5918af	Merge pull request #2154 from ggouaillardet/topic/retain_op_and_datatypes non-blocking collectives: retain MPI_op and MPI_Datatype(s)	2019-07-13 10:20:36 +09:00
Jeff Squyres	a985a0d7d1	Merge pull request #6809 from wkliao/man_vector man page of MPI_Type_vector	2019-07-12 21:04:04 -04:00
Geoff Paulsen	4b696dca5b	Merge pull request #6660 from gpaulsen/task/master/revert-mpi1-removal-commits Add --enable-mpi1-compatibility configure option back	2019-07-12 14:42:28 -05:00
Wei-keng Liao	56f45b2aeb	stride size should be 4 x 16, as extent of oldtype is 16 bytes Signed-off-by: wkliao	2019-07-12 13:55:22 -05:00
Gilles Gouaillardet	0fe756d416	mpi: retain operation and datatype in non blocking collectives MPI standard states a user MPI_Op and/or user MPI_Datatype can be free'd after a call to a non blocking collective and before the non-blocking collective completes. Retain user (only) MPI_Op and MPI_Datatype when the non blocking call is invoked, and set a request callback so they are free'd when the MPI_Request completes. Thanks Thomas Ponweiser for reporting this Fixes open-mpi/ompi#2151 Fixes open-mpi/ompi#1304 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-12 09:15:45 +09:00
guserav	3c9f4e6823	Fix osc sm posts when only 32 bit atomics support Signed-off-by: guserav <erik.zeiske@hpe.com>	2019-07-09 15:13:25 -07:00
George Bosilca	f25674291b	Optimized datatype description. Move toward a base type of vector (count, type, blocklen, extent, disp) with disp and extent applying toward the count repertition and blocklen being a contiguous memory of type type. Implement 2 optimizations on this description used during type_commit: - collapse: successive similar datatype descriptions are collapsed together with an increased count. - fusion: fuse successive datatype descriptions in order to minimize the number of resulting memcpy during pack/unpack. Fixes at the OMPI datatype level including: - Fix the create_hindexed and vector creation. - Fix the handling of [get\|set]_elements and _count. - Correctly compute the dispacement for block indexed types. - Support the MPI_LB and MPI_UB deprecation, aka. OMPI_ENABLE_MPI1_COMPAT. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-07-09 14:50:08 -04:00
Jeff Squyres	506d0b104d	Merge pull request #6793 from jmbr/patch-1 Add missing argument name.	2019-07-09 11:00:09 -04:00
Juan M. Bello-Rivas	24c018fa22	Add missing argument name. Signed-off-by: Juan M. Bello-Rivas <jbellorivas@rigetti.com>	2019-07-08 17:00:09 -07:00
Gilles Gouaillardet	db760c508d	man: fix MPI_Allgather[v] man pages - remove incorrect reference to MPI_ROOT - fix MPI_IN_PLACE description no code change [skip ci] Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-08 13:45:35 +09:00
Gilles Gouaillardet	c2d35aaadc	Merge pull request #6790 from ggouaillardet/topic/ompi_comm_spawn_f fortran/mpif-h: correctly handle array_of_errcodes in ompi_comm_spawn…	2019-07-08 09:06:21 +09:00
Nysal Jan K.A	fe4ef147f8	pml/ucx: Fix the max tag and context id values Signed-off-by: Nysal Jan K.A <jnysal@in.ibm.com>	2019-07-03 14:33:01 +05:30
Gilles Gouaillardet	07830d05a7	fortran/mpif-h: correctly handle array_of_errcodes in ompi_comm_spawn[_multiple]_f Since array_of_errcodes is only allocated when MPI_ERRCODES_IGNORE is not used, it should not be cleaned when MPI_ERRCODES_IGNORE is used. Correctly allocate array_of_errcodes with the right size (e.g. maxprocs). Thanks Gyevi-Nagy Laszlo for reporting this issue. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-07-03 09:53:46 +09:00
Geoff Paulsen	f1b2a09675	Merge pull request #6649 from devreal/rdma-fetchop-local OSC rdma: make sure accumulating in shared memory is safe	2019-06-28 14:46:21 -05:00
Gilles Gouaillardet	5655d64bd3	mpi/c: fix param checks in [I]Neighbor_alltoall{v,w} do not check some input parameters when an {in,out}degree is zero Thanks Junchao Zhang for analyzing and reporting this issue. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-06-28 13:39:28 +09:00
Artem Polyakov	6678ac0f55	osc/ucx: Fix possible win creation/destruction race condition To avoid fully initializing the osc/ucx component for MPI application that are not using One-Sided functionality, the initialization happens at the first MPI window creation. This commit ensures atomicity of global state modifications. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-06-20 09:05:03 -07:00
Artem Polyakov	0857742624	osc/ucx: Fix worker pool finalization Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-06-20 09:05:03 -07:00
Nathan Hjelm	560886f095	Merge pull request #6746 from devreal/osc_winalloc_err OSC rdma win allocate: propagate errors to avoid deadlocks	2019-06-18 17:57:53 -07:00
Geoffrey Paulsen	54a286ee9d	Revert "ompi_info: report MPI1 compat is disabled" This reverts commit `61ccc65302`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Geoffrey Paulsen	ca4b70913e	Revert "man: remove man pages of removed MPI1 subroutines" This reverts commit `26c1b833c7`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Geoffrey Paulsen	ed9a670074	Revert "mpi.h.in: delete removed MPI1 functions/datatypes (API change!)" This reverts commit `a6d6be2853`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Geoffrey Paulsen	5cc0141675	Revert "MPI_Type_get_envelope: remove MPI-1 deleted names" This reverts commit `65eb118e08`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Geoffrey Paulsen	e036941ab5	Revert "mpi.h: remove MPI_UB/MPI_LB when not enabling MPI-1 compat" This reverts commit `7223334d4d`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Geoffrey Paulsen	6de263fc29	Revert "mpi: make C++ bindings compile when MPI-1 compat is disabled" This reverts commit `b323655809`. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-06-14 13:22:21 -05:00
Harald Klimach	e222a04ae5	Suggestion to fix division by zero in file view. In common_ompi_aggregators calc_cost routine: do not cast the real division to an int intermediately. This patch removes the obsolete int variable c and assigns the result of the P_a/P_x division directly to n_as. With the intermediate int c variable, n_as gets 0 if P_a < P_x, resulting in a division by 0 when computing n_s. Signed-off-by: Harald Klimach <harald.klimach@uni-siegen.de>	2019-06-13 18:47:32 +02:00
Jeff Squyres	7c3aeb3061	Merge pull request #6686 from alex-anenkov/coll-iallreduce-recursivedoubling coll/libnbc: add recursive doubling algorithm for MPI_Iallreduce	2019-06-10 10:09:51 -04:00
Yossi Itigin	a46e5da3ca	Merge pull request #6744 from brminich/topic/all2all_linear_sync_fix COLL/BASE: Fix linear sync all2all	2019-06-09 21:23:38 +03:00
Joseph Schuchart	8f27cc26d9	OSC rdma win allocate: synchronize error codes across shared memory group Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2019-06-07 11:03:21 +02:00
KAWASHIMA Takahiro	85c3311b7d	Merge pull request #6726 from yanagibashi/pr/add-f08-procedure-names mpiext/pcollreq: Add `_f08` to procedure names	2019-06-07 09:10:58 +09:00
Mikhail Brinskii	79006f4e5a	COLL/BASE: Fix linear sync all2all Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-06-06 19:22:42 +03:00
Yossi Itigin	8535dd570b	Merge pull request #6732 from dmitrygladkov/topic/pml/ucx_init PML/UCX: Don't destroy UCP worker if it wasn't created	2019-06-06 10:41:33 +03:00
KAWASHIMA Takahiro	2b856573b2	Merge pull request #6699 from t-kurita/pr/java-alltoallw-arrays java: Fix compilation error in allToAllw using Java arrays	2019-06-04 11:33:17 +09:00
Dmitry Gladkov	c864ca51d2	PML/UCX: Don't destroy UCP worker if it wasn't created Signed-off-by: Dmitry Gladkov <dmitrygla@mellanox.com>	2019-06-03 10:49:36 +03:00
Tsubasa Yanagibashi	3148b0cfaa	mpiext/pcollreq: Add `_f08` to procedure names The procedure names don't contain "_f08" of Fortran 2008 bindings of Persistent Collective Operations(mpiext/pcollreq/use-mpi-f08). This fix adds "_f08" to the procedure names of pcollreq/use-mpi-f08, same as other Fortran 2008 routines in `ompi/mpi/fortran/use-mpi-f08/mod`. Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>	2019-05-31 15:22:42 +09:00
George Bosilca	a0fce4eac2	Fix the man pages for some of the MPI_T_* functions. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-05-29 00:23:35 -04:00
George Bosilca	eed770ce5c	Fix the SPC initialization. Use the PVAR ctx to save the SPC index, so that no lookup nor restriction on the SPC vars position is imposed. Make sure the PVAR are always registered. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-05-29 00:23:18 -04:00
George Bosilca	7dab8c002b	Fixed SPC/MPI_T initialization error. Signed-off-by: Yong Qin <yongq@mellanox.com>	2019-05-28 15:10:32 -04:00
Tomislav Janjusic	6ea920e225	Coll/hcoll: adding scatterv interface Signed-off-by: Valentin Petrov valentinp@mellanox.com	2019-05-27 12:27:43 +03:00
Edgar Gabriel	8eda9f2ecd	common/ompio: fix coverty warnings this commmit fixes coverty warnings CID 1445198 and CID 1445197 For a reason that is a bit unclear to me, coverty only complained about the read files, but the write operations had the same issue, so I fixed that within the same commit as well. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-23 13:40:39 -05:00
Kurita, Takehiro	7ece564978	java: Fix compilation error in allToAllw using Java arrays Java bindings in Open MPI support Java arrays and direct buffers as buffers. All non-blocking methods must use direct buffers and only blocking methods can choose between Java arrays and direct buffers. Though Comm.allToAllw() is a blocking method, Java applications using Java arrays as buffers get compilation errors. This fix enables using Java arrays in Comm.allToAllw(). Signed-off-by: Kurita, Takehiro <fj6370fp@aa.jp.fujitsu.com>	2019-05-22 10:00:16 +09:00
Edgar Gabriel	27b2ec71a7	common/ompio: add support for read operations and collective I/O external32 data representation is now support by ompio for everything but non-blocking collective I/O operations. The support can further be improved in a second step to limit the temporary buffer size (at least for blocking operations), but it does work now for many scenarios. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 17:56:16 -05:00
Edgar Gabriel	ab56e6f0db	common/ompio: make individual read operations work. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 17:22:33 -05:00
Edgar Gabriel	f6b3a0af52	common/ompio: individual write of external32 works both blocking and non-blocking. collective write and read operations not yet. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 16:26:14 -05:00
Edgar Gabriel	d955753cb8	common/ompio: abstraction for different convertor types introduce separate convertors for memory vs. file representation. Adjust the interfaces for decode_datatype to provide the convertor to be used for that. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 13:35:38 -05:00
Edgar Gabriel	35be18b266	common/ompio: rename ompio_cuda* to ompio_buffer* the infrastructure put in place to manage cuda buffers is actually a lot more generic than just for cuda buffers. Specifically, we ca reuse much of the code to implement the external32 data representation. This commit converts the code from common_ompio_cuda* to common_ompio_buffer*. There are just very few places where we actually need to keep the OPAL_CUDA_SUPPORT ifdef in place. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 12:50:04 -05:00
Edgar Gabriel	a96efb7620	common/ompio: add comm_ompio_read_all/write_all functions in preparation for adding support for the external32 data representation. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-05-20 12:49:36 -05:00
Valentin Petrov	f19f6f432a	Coll/hcoll: don't init opal memhooks unless explicitely requested by user If user sets HCOLL_EXTERNAL_UCM_EVENTS=1 then we try init opal memory framework and register a mem release cb. Otherwise, rely on ucx. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2019-05-20 11:17:44 +03:00
Yossi Itigin	9d1994b906	OSC/UCX: Fix deadlock with atomic lock Atomic lock must progress local worker while obtaining the remote lock, otherwise an active message which actually releases the lock might not be processed while polling on local memory location. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2019-05-19 20:10:09 +03:00
Alex Anenkov	77d466edf3	coll/libnbc: add recursive doubling algorithm for MPI_Iallreduce Signed-off-by: Alex Anenkov <anenkov.ru@gmail.com>	2019-05-19 18:39:11 +07:00
Sergey Oblomov	a3578d9ece	PML/UCX: disable PML UCX if MT is requested but not supported - in case if multithreading requested but not supported disable PML UCX Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2019-05-17 11:25:23 +03:00
bosilca	6089608858	Merge pull request #6647 from bosilca/fix/length_0 Fix/length 0	2019-05-14 17:59:15 -04:00
Jeff Squyres	9442989e2c	Merge pull request #6382 from jsquyres/pr/ofi-mtl-gitignore mtl/ofi: add a .gitignore	2019-05-13 12:00:41 -04:00
George Bosilca	42119254c7	Fix incorrect behavior with length == 0 Fixes #6575. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-05-10 19:53:34 -04:00
George Bosilca	d141bf7912	Update the datatype dump to match the actual types. Update the comments to better reflect what is going on. Minor indentations. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-05-10 18:03:57 -04:00
Joseph Schuchart	c67e229193	OSC rdma: make sure accumulating in shared memory is safe Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2019-05-10 17:27:10 +02:00
Nathan Hjelm	4345308dfd	osc/rdma: fix CAS 32-bit network atomic compatibility check When checking for btl compatibility with 32-bit CAS osc/rdma was checking the incorrect flag field. Signed-off-by: Nathan Hjelm <hjelmn@cs.unm.edu>	2019-05-10 07:27:53 -06:00
KAWASHIMA Takahiro	dabad084b5	Merge pull request #6621 from bosilca/topic/persistent_req_leak Fix the leak of fragments for persistent sends (issue #6565)	2019-05-03 15:21:42 +09:00
George Bosilca	a16cf0e4dd	Fix the leak of fragments for persistent sends. The rdma_frag attached to the send request was not correctly released upon request completion, leaking until MPI_Finalize. A quick solution would have been to add RDMA_FRAG_RETURN at different locations on the send request completion, but it would have unnecessarily made the sendreq completion path more complex. Instead, I added the length to the RDMA fragment so that it can be completed during the remote ack. Be more explicit on the comment. The rdma_frag can only be freed once when the peer forced a protocol change (from RDMA GET to send/recv). Otherwise the fragment will be returned once all data pertaining to it has been trasnferred. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-05-02 09:40:11 -04:00
Jeff Squyres	ac54d771ec	mtl/ofi: add a .gitignore Ignore generated files. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-05-01 14:00:00 -07:00
Yossi Itigin	5d2200a7d6	Merge pull request #6605 from brminich/topic/shmem_all2all_put SPML/UCX: Add shmemx_alltoall_global_nb routine to shmemx.h	2019-05-01 12:00:21 +03:00
bosilca	399b7133ab	Merge pull request #6556 from EmmanuelBRELLE/PR_fix_local_handle_in_PUT_message pml/ob1: fixed local handle sent during PUT control message	2019-04-27 13:51:22 -04:00
Mikhail Brinskii	2ef5bd8b36	SPML/UCX: Add shmemx_alltoall_global_nb routine to shmemx.h The new routine transfers the data asynchronously from the source PE to all PEs in the OpenSHMEM job. The routine returns immediately. The source and target buffers are reusable only after the completion of the routine. After the data is transferred to the target buffers, the counter object is updated atomically. The counter object can be read either using atomic operations such as shmem_atomic_fetch or can use point-to-point synchronization routines such as shmem_wait_until and shmem_test. Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-04-26 14:47:58 +03:00
Mark Allen	d85cac8f1a	fixing an unsafe usage of integer disps[] (romio321 gpfs) There are a couple MPI_Alltoallv calls in ad_gpfs_aggrs.c where the send/recv data comes from places like req[r].lens, and the send buffer and send displacements for example were being calculated as sbuf = pick one of the reqs: req[bottom].lens sdisps[r] = req[r].lens - req[bottom].lens which might be okay if the .lens was data inside of req[] so they'd all be close to each other. But each .lens field is just a pointer that's malloced, so those addresses can be all over the place, so the integer-sized sdisps[] isn't safe. I changed it to have a new extra array sbuf and rbuf for those two Alltoallv calls, and copied the data into the sbuf from the same locations it used to be setting up the sdisps[] at, and after the Alltoallv I copy the data out of the new rbuf into the same locations it used to be setting up the rdisps[] at. For what it's worth I was able to get this to fail -np 2 on a GPFS filesystem with hints romio_cb_write enable. I didn't whittle the test down to something small, but it was failing in an MPI_File_write_all call. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2019-04-23 16:01:55 -04:00
Jeff Squyres	9a9d106296	Merge pull request #6555 from EmmanuelBRELLE/PR-pmlob1_fix_rc_for_putfrag_when_get_failed pml/ob1: fixed exit from get_frag_fail when falling back on btl_put	2019-04-22 17:19:12 -04:00
Gilles Gouaillardet	251477c518	Merge pull request #6431 from ggouaillardet/topic/mpiext_nolib mpiext/shortfloat: do not create empty libraries	2019-04-22 11:23:19 +09:00
Edgar Gabriel	c80a842036	Merge pull request #6602 from edgargabriel/topic/io_array_refactor common/ompio: refactor the build_io_array function	2019-04-18 13:44:48 -05:00
Gilles Gouaillardet	e1098dae4b	mpiext/shortfloat: do not build an empty library the shortfloat extension is only made of header files, and hence do not require a library to be built. Refs. open-mpi/ompi#6205 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-04-18 13:42:18 -04:00
Gilles Gouaillardet	e70780b762	configury: allow mpi extensions with no libraries Do not require an archive when the OMPI_MPIEXT_<ext>_HAVE_OBJECT macro is defined to 0. See `ompi/mpiext/example/configure.m4`. Allow some extensions to be built on OS X since the creation of archives with no files is not permitted. Refs. open-mpi/ompi#6205 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp> Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-04-18 13:42:01 -04:00
Gilles Gouaillardet	232055fc7a	fortran/use-mpi-f08: fix intent of the internal ompi_*_f bindings Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-04-18 13:29:19 +09:00
Edgar Gabriel	d43427fc76	common/ompio: refactor the build_io_array function abstract out the io_array structure to be used in common_ompio_build_io_array function. This is preparation for a future component that would like to use the same function, but not modify the io_array stored on the file handle itself. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-04-17 14:42:33 -05:00
Valentin Petrov	30970bdfdf	OSC/UCX: correctly handle NULL origin addr and MPI_NO_OP Addtional bugfix: origin_addr -> result_addr for no_op, replace_op and sum_op fetch destination. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2019-04-17 10:30:21 +03:00
bosilca	8cf7a7e87d	Merge pull request #6538 from bosilca/topic/issue6522 Prevent a segfault when accessing a rank outside a communicator.	2019-04-09 18:08:49 -04:00
David Eberius	461d8bc77b	Fixed a potential name collision. Signed-off-by: David Eberius <deberius@vols.utk.edu>	2019-04-03 16:43:48 -04:00
markalle	98fdeeeb41	Merge pull request #6448 from markalle/macro_writing_input_arg in-place conversion macro writes into INPUT argument	2019-04-02 11:33:18 -05:00
Brelle Emmanuel	e630046a4b	pml/ob1: fixed local handle sent during PUT control message In case of using a btl_put in ob1, the handle of the locally registered memory is sent with a PUT control message. In the current master code the sent handle is necessary the handle in the frag but if the handle has been successfully registered in the request, the frag structure does not have any valid handle and all fragments use the request one. I suggest to check if the handle in the fragment is valid and if not to send the handle from the request. Signed-off-by: Brelle Emmanuel <emmanuel.brelle@atos.net>	2019-04-01 18:45:05 +02:00
Brelle Emmanuel	9c689f2225	pml/ob1: fixed exit from get_frag_fail when falling back on btl_put In the case the btl_get fails Ob1 tries to fallback on btl_put first but the return code was ignored. So the code fell back on both btl_put and btl_send. Signed-off-by: Brelle Emmanuel <emmanuel.brelle@atos.net>	2019-04-01 18:17:10 +02:00
Mark Allen	0a7f1e3cc5	in-place conversion macro writes into INPUT argument In fint_2_int.h there are some conversion macros for logicals. It has one path for OMPI_SIZEOF_FORTRAN_LOGICAL != SIZEOF_INT where a new array would be allocated and the conversions then might expand to c_array[i] = (array[i] == 0 ? 0 : 1) and another path for OMPI_SIZEOF_FORTRAN_LOGICAL == SIZEOF_INT where it does things "in place", so the same conversion there would just be array[i] = (array[i] == 0 ? 0 : 1) The problem is some of the logical arrays being converted are INPUT arguments. And it's possible for some compilers to even put the argument in read-only memory so the above "in place" conversion SEGV's. A testcase I have used call MPI_CART_SUB(oldcomm, (/.true.,.false./), newcomm, ierr) and gfortran put the second arg in read-only mem. In cart_sub_f.c you can trace the ompi_fortran_logical_t *remain_dims arg. remain_dims[] is for input only, but the file uses OMPI_LOGICAL_ARRAY_NAME_DECL(remain_dims); OMPI_ARRAY_LOGICAL_2_INT(remain_dims, ndims); PMPI_Cart_sub(..., OMPI_LOGICAL_ARRAY_NAME_CONVERT(remain_dims), ...); OMPI_ARRAY_INT_2_LOGICAL(remain_dims, ndims); to convert it to c-ints make a C call then restore it to Fortran logicals before returning. It's not always wrong to convert purely in-place, eg cart_get_f.c has a periods[] that's exclusively for OUTPUT and it would be fine with the macros as they were. But I still say the macros are invalid because they don't distinguish whether they're being used on INPUT or OUTPUT args and thus they can't be used in a way that's legal for both cases. It might be possible to fix the macros by adding more of them so that cart_create_f.c and cart_get_f.c would use different macros that give more context. But my fix here is just to turn off the first block and make all paths run as if OMPI_SIZEOF_FORTRAN_LOGICAL != SIZEOF_INT. The main macros that get enlarged by this change are define OMPI_ARRAY_LOGICAL_2_INT_ALLOC : mallocs now define OMPI_ARRAY_LOGICAL_2_INT : also mallocs now But these are only used in 4 places, three of which are the purpose of this checkin, to avoid the former in-place expansion of an INPUT arg: cart_create_f.c cart_map_f.c cart_sub_f.c and one of which is an OUPUT arg that was fine and that gets unnecessarily expanded into a separate array by this checkin. cart_get_f.c So I think an unnecessary malloc in cart_get_f.c is the only downside to this change, where the logicals array argument could have been used and converted in place. Signed-off-by: Mark Allen <markalle@us.ibm.com> Update provided by Gilles Gouaillardet to keep the in-place option if OMPI_FORTRAN_VALUE_TRUE == 1 where no conversion is needed. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-04-01 10:38:05 -04:00
KAWASHIMA Takahiro	63a1968459	man: Fix typo of MPI_TYPE_GET_NAME Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-03-29 13:01:52 +09:00
Jeff Squyres	05c5e2034b	Merge pull request #6527 from James-A-Clark/master Add compilation flag to allow unwinding through files that are present in the stack when attaching with MPIR	2019-03-28 18:16:02 -04:00
George Bosilca	6ea0c4eab9	Prevent a segfault when accessing a rank outside a communicator. This is not fixing any issue, it is simply preventing a sefault if the communicator creation has not happened as expected. Thus, this code path should never really be hit in a correct MPI application with a valid communicator creation support. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-03-28 12:03:29 -04:00
Jeff Squyres	3c1b33c93a	Merge pull request #6140 from bertwesarg/fix-cpp-condition Fix use of bitwise operation in CPP condition	2019-03-28 10:06:20 -04:00
James Clark	20f5840cbb	Add a compilation flag that adds unwind info to all files that are present in the stack starting from MPI_Init. This is so when a debugger attaches using MPIR, it can step out of this stack back into main. This cannot be done with certain aggressive optimisations and missing debug information. Signed-off-by: James Clark <james.clark@arm.com> Signed-off-by: Jeff Squyres <jsquyres@cisco.com> Co-authored-by: Jeff Squyres <jsquyres@cisco.com>	2019-03-27 14:32:15 +00:00
Ralph Castain	dfbc14430d	Merge pull request #6440 from ggouaillardet/topic/yield_when_idle schizo/ompi: correctly handle the yield_when_idle option	2019-03-25 12:17:34 -07:00
Artem Polyakov	bfff5783f9	Merge pull request #6371 from artpol84/osc/select_dbg osc/base: Add debug output stating a selected component	2019-03-22 22:24:04 -07:00
Yossi Itigin	9b91cf09cc	Merge pull request #6481 from hoopoepg/topic/check-ucx-params PML/SPML/UCX: added evaluation of mmap events	2019-03-14 11:53:42 +02:00
Austen Lauria	b61e6242d3	Fix integer overflows with indexed datatype creation. The types of count, disp, and extent passed into ompi_datatype_add() should be size_t, ptrdiff_t and ptrdiff_t, respectively. This prevents integer overflows and errors in computing the size of large indexed datatypes. Signed-off-by: Austen Lauria <awlauria@us.ibm.com>	2019-03-13 09:39:57 -04:00
Sergey Oblomov	d8e3562bae	PML/SPML/UCX: added evaluation of mmap events - there was a set of UCX related issues reported which caused by mmap API hooks conflicts. We added diagnostic of such problems to simplify bug-resolving pipeline Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2019-03-12 21:14:27 +02:00
Geoff Paulsen	a14bb4bc89	Merge pull request #6471 from hppritcha/topic/issue_6470 ompi_info: report whether MPI1 compat is enabled	2019-03-11 21:11:55 -05:00
Howard Pritchard	61ccc65302	ompi_info: report MPI1 compat is disabled MPI1 compat disabled beyond v4.0.x Related to #6470 Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2019-03-11 13:50:29 -06:00
Gilles Gouaillardet	26c1b833c7	man: remove man pages of removed MPI1 subroutines Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-03-05 15:01:07 +09:00
Gilles Gouaillardet	cc97c0f611	schizo/ompi: correctly handle the yield_when_idle option in schizo/ompi, sets the new OMPI_MCA_mpi_oversubscribe environment variable according to the node oversubscription state. This MCA parameter is used to set the default value of the mpi_yield_when_idle parameter. This two steps tango is needed so the mpi_yield_when_idle setting is always honored when set in a config file. Refs. open-mpi/ompi#6433 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-28 09:53:29 +09:00
Nathan Hjelm	73085e9ce3	Merge pull request #6413 from nuriallv/issue_osc_rdma osc/rdma: fix when determining the node with the rank_array info for a peer	2019-02-27 16:30:06 -07:00
Geoffrey Paulsen	a6d6be2853	mpi.h.in: delete removed MPI1 functions/datatypes (API change!) This commit DELETES the removed MPI1 functions and datatypes from both the mpi.h header and from the library (they were deleted from the MPI standard in MPI-3.0). WARNING: This changes the MPI API in a non-backwards compatible way. This also removes the configure option that was added in Open MPI v4.0.x, requiring users to change their apps if they are using any of these almost 20 year old APIs. This commit removes the following MPI1 removed functions and datatypes: MPI_Address MPI_Errhandler_create MPI_Errhandler_get MPI_Errhandler_set MPI_Type_extent MPI_Type_hindexed MPI_Type_hvector MPI_Type_struct MPI_Type_UB MPI_Type_LB Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-02-27 08:24:11 -08:00
Geoffrey Paulsen	3136a1706c	mpi.h.in: Revamp MPI-1 removed function warnings Refs https://github.com/open-mpi/ompi/issues/6278. This commit is intended to be cherry-picked to v4.0.x and the following commit will ammend to this functionality for master's removal. Changes the prototypes for MPI removed functions in the following ways: There are 4 cases: 1) User wants MPI-1 compatibility (--enable-mpi1-compatibility) MPI_Address (and friends) are declared in mpi.h with deprecation notice 2) User does not want MPI-1 compatibility, and has a C11-capable compiler Declare an MPI_Address (etc.) macro in mpi.h, which will cause a compile-time error using _Static_assert C11 feature 3) User does not want MPI-1 compatibility, and does not have a C11-capable compiler, but the compiler supports error function attributes. Declare an MPI_Address (etc.) macro in mpi.h, which will cause a compile-time error using error function attribute. 4) User does not want MPI-1 compatibility, and does not have a C11-capable compiler, or a compiler that supports error function attributes. Do not declare MPI_Address (etc.) in mpi.h at all. Unless the user is compiling with something like -Werror, this will allow the user's code to compile. We are choosing this because it seems like a losing battle to make some kind of compile time error that is friendly to the user (and doesn't make it look like mpi.h itself is broken). On v4.0.x, this will allow the user code to both compile (albeit with a warning) and link (because the MPI_Address will be in the MPI library because we are preserving ABI back to 3.0.x). On master/v5.0.x, this will allow the user code to compile, but it will fail to link (because the MPI_Address symbol will not be in the MPI library). Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2019-02-27 08:24:11 -08:00
bosilca	8400502d8a	Merge pull request #6353 from bosilca/topic/fix_monitoring_pvar Fix the PVAR allocation usage.	2019-02-25 16:03:56 -05:00
Howard Pritchard	9b3a9c2579	Merge pull request #6417 from abouteiller/bugfix/cart_create_cid Cart/Graph create would not run the next_cid algorithm	2019-02-22 13:05:59 -07:00
Howard Pritchard	d6cdbdfd39	Merge pull request #6412 from hppritcha/topic/fix_pgi_usempif08 fortran:fix for PGI linking	2019-02-21 20:31:14 -07:00
Aurelien Bouteiller	fb17115ba9	Cart/Graph create would not run the next_cid algorithm and create disjoint communicator with inconsistent cid. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2019-02-21 11:40:22 -05:00
Howard Pritchard	266bc3aced	fortran:use mpif08 fix for PGI linking commit `c6070fd2e` broke building fortran bindings with PGI compilers. Turns out PGI compilers need to link in the *.o from a module file whether or not there are module subroutines defined or not in the module file. Related to #6411 Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2019-02-20 12:33:25 -07:00
Nuria Losada	3cae149262	osc/rdma: fix when determining the node with the rank_array info for a peer Signed-off-by: Nuria Losada <nlosada@icl.utk.edu>	2019-02-20 13:12:00 -05:00
Artem Polyakov	13a8e42108	Merge pull request #6163 from artpol84/osc/mt_submission Refactoring of osc/ucx component for MT	2019-02-20 09:41:27 -08:00
Gilles Gouaillardet	ad114be28c	configury: automatically select rte/pmix runtime if ORTE project is not built Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-20 13:55:55 +09:00
Gilles Gouaillardet	69d136ae5e	ompi/pmix: fix misc OPAL function calls Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-20 13:55:55 +09:00
Gilles Gouaillardet	18f679efac	Merge pull request #6401 from ggouaillardet/topic/osc_rdma_self osc/rdma: correctly handle communications to self	2019-02-20 11:43:22 +09:00
KAWASHIMA Takahiro	7095ad10a5	man: fix more typos in MPI_Win_attach man page Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-20 11:22:38 +09:00
Gilles Gouaillardet	7c0596819b	man: fix typos in MPI_Win_{attach,detach} man pages no code change [skip ci] bot:notest Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-20 11:09:45 +09:00
Gilles Gouaillardet	fe05fcc11a	osc/rdma: correctly handle communications to self mark the "self" peer OMPI_OSC_RDMA_PEER_LOCAL_BASE when the window is dynamically created and use_cpu_atomics is set in order to correctly handle communications to self. Thanks Bart Janssens for reporting this issue. Refs. open-mpi/ompi#6394 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-20 09:52:17 +09:00
Artem Polyakov	19e2ae2efb	opal/common/ucx: Switch to opal/tsd Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-02-19 14:22:07 -08:00
Artem Polyakov	7984d7d997	opal/common/ucx: Remove unused debugging macro Will be reintroduced later if needed and after adaptation to the OMPI infrastructure. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-02-19 14:22:07 -08:00
Artem Polyakov	43f16d8796	opal/common/ucx: Remove common_ucx_int.h Place the content of common_ucx_int.h back to the common_ucx.h and include common_ucx_wpool.h explicitly. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-02-19 14:22:07 -08:00
Xin Zhao	c6de09940f	ompi/osc/ucx: Switch osc/ucx code to use Worker Pool. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-02-19 14:22:07 -08:00
Yossi Itigin	91d05f91e2	Merge pull request #6384 from brminich/topic/ucx_worker_net_address PML/UCX: Use net worker address for remote peers	2019-02-17 12:21:00 +02:00
Matias A Cabral	25bdd118ac	MTL_OFI: Changed Recv cancel to be non-blocking Updated the OFI MTL's Recv cancel to be a non-blocking call to match the MPI spec. Given fi_cancel succeeded, then it is expected that the user will wait on the request to read the result of if the cancel has completed. Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com	2019-02-14 17:07:20 -05:00
Mikhail Brinskii	751d88192d	PML/UCX: Use net worker address for remote peers For remote node peers pack smaller worker address, which contains network device addresses only. This would reduce amount of OOB traffic during startup. Signed-off-by: Mikhail Brinskii <mikhailb@mellanox.com>	2019-02-14 18:06:36 +02:00
Brian Barrett	7a593cea4a	Merge pull request #6361 from aravindksg/fix_tg_segfault mtl/ofi: Fix segfault when not using Thread-Grouping feature	2019-02-12 12:04:26 -08:00
Ralph Castain	125d236173	Move from the use of regex to compression We've been fighting the battle of trying to create a regex generator and parser that can handle arbitrary hostname schemes - without long-term success. The worst of it is that there is no way of checking to see if the computed regex is correct short of parsing it and doing a character-by-character comparison with the original string. Ugh...there has to be a better solution. One option is to investigate using 3rd-party regex libraries as those are coming from communities whose sole focus is resolving that problem. However, someone would need to spend the time to investigate it, and we'd have to find a license-friendly implementation. Another option is to quit beating our heads against the wall and just compress the information. It won't be as much of a reduction, but we also won't keep hitting scenarios where things break. In this case, it seems that "perfection" is definitely the enemy of "good enough". This PR implements the compression option while retaining the possibility of people adding regex-generating components. The compression code used in ORTE is consolidated into the opal/compress framework. That framework currently held bzip and gzip components for use in compressing checkpoint files - since we no longer support C/R, I have .opal_ignore'd those components. However, I have left the original framework APIs alone in case someone ever decides to redo C/R. The APIs of interest here are added to the framework - specifically, the "compress_block" and "decompress_block" functions. I then moved the ORTE zlib compression code into a new component in this framework. Unfortunately, the framework currently is a single-select one - i.e., only one active component at a time. Since I .opal_ignore'd the other two and made the priority of zlib high, this isn't a problem. However, if someone wants to re-enable bzip/gzip or add another component, they might need to transition opal/compress to a multi-select framework. Included changes: * Consolidate the compression code into the opal/compress framework * Move the ORTE zlib compression code into a new opal/compress/zlib component * Ignore the bzip and gzip components in opal/compress framework * Add a "compress_base_limit" MCA param to set the threshold above which we compress data - defaults to 4096 bytes * Delete stale brucks and rcd components from orte/grpcomm framework * Delete the orte/regx framework * Update the launch system to use opal/compress instead of string regex * Provide a default module if no zlib is available * Fix some misc multi-node issues * Properly generate the nidmap in response to a "connection warmup" message so the remote daemon knows the children it needs to launch. * Remove stale references to orte_node_regex * opal_byte_object_t's are not OPAL objects - properly release allocated memory. * Set the topology * Currently only handling homogeneous case * Update the compress framework files to conform * Consolidate open/close into one "frame" file. Ensure we open/close the framework Signed-off-by: Ralph Castain <rhc@pmix.org>	2019-02-08 11:11:14 -08:00
KAWASHIMA Takahiro	8bbd201029	Merge pull request #6205 from kawashima-fj/pr/fp16 Add FP16 datatypes	2019-02-08 14:52:13 +09:00
Artem Polyakov	35090b69f1	osc/base: Add debug output stating a selected component Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2019-02-07 15:54:20 -08:00
Aravind Gopalakrishnan	6edcc479c4	mtl/ofi: Fix segfault when not using Thread-Grouping feature For the non thread-grouping paths, only the first (0th) OFI context should be used for communication. Otherwise this would access a non existant array item and cause segfault. While at it, clarifiy some content regarding SEPs in README (Credit to Matias Cabral for README edits). Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2019-02-07 11:52:53 -08:00
Jeff Squyres	f5e1a672cc	ofi: revamp OPAL_CHECK_OFI configury Update the OPAL_CHECK_OFI configury macro: - Make it safe to call the macro multiple times: - The checks only execute the first time it is invoked - Subsequent invocations, it just emits a friendly "checking..." message so that configure output is sensible/logical - With the goal of ultimately removing opal/mca/common/ofi, rename the output variables from OPAL_CHECK_OFI to be opal_ofi_{happy\|CPPFLAGS\|LDFLAGS\|LIBS}. - Update btl/ofi, btl/usnic, and mtl/ofi for these new conventions. - Also, don't use AC_REQUIRE to invoke OPAL_CHECK_OFI because that causes the macro to be invoked at a fairly random time, which makes configure stdout confusing / hard to grok. - Remove a little left-over kruft in OPAL_CHECK_OFI, too (which resulted in an indenting change, making the change to opal_check_ofi.m4 look larger than it really is). Thanks Alastair McKinstry for the report and initial fix. Thanks Rashika Kheria for the reminder. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-02-07 06:29:58 -08:00
Jeff Squyres	aba2571881	mtl/ofi/Makefile.am: down with tabs! Replace all tabs with spaces. No code or logic changes. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-02-07 06:29:58 -08:00
Gilles Gouaillardet	945f830f7a	mtl/ofi: fix configury when VPATH is used Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-02-07 06:29:58 -08:00
Matias Cabral	0601b3e982	Merge pull request #6325 from aravindksg/fix_help_reference mtl/ofi: Fix reference to help text object	2019-02-05 07:22:51 -08:00
George Bosilca	e42b573cd3	Fix the PVAR allocation usage. According to the MPI standard the obj_handle is a pointer to an MPI object, and therefore cannot be MPI_COMM_WORLD. The MPI standard example 14.6 highlight this usage. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-02-02 19:03:43 -05:00
KAWASHIMA Takahiro	f8a441957a	mpiext/shortfloat: Add `MPIX_C_FLOAT16` datatype `MPIX_C_FLOAT16` is defined as a synonym for `MPIX_SHORT_FLOAT` if the C compiler supports `_Float16`, which is defined in ISO/IEC JTC 1/SC 22/WG 14 N1945 (ISO/IEC TS 18661-3:2015). This name and meaning are same as that of MPICH. This may be a transitional datatype until the MPI Forum decides a proper name for the type. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 14:55:52 +09:00
KAWASHIMA Takahiro	c44599ec13	mpiext/shortfloat: Add `shortfloat` MPI extension This extension provides additional MPI datatypes `MPIX_SHORT_FLOAT`, `MPIX_C_SHORT_FLOAT_COMPLEX`, and `MPIX_CXX_SHORT_FLOAT_COMPLEX` for `short float` (C/C++), `short float _Complex` (C), and `std::complex<short float>` (C++), respectively, or their alternate types like `_Float16`. See `ompi/mpiext/shortfloat/README.txt` for details. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 13:01:14 +09:00
KAWASHIMA Takahiro	4d7bde27fb	ompi/datatype: Use `short float` for `MPI_REAL2` ... and add `MPI_COMPLEX4`. This commit changes values of existing `OMPI_DATATYPE_MPI_*` macros. This change does not affect ABI compatibility of `libmpi.so` and the like because these values are only used in OMPI internal code. On the other hand, `ompi_datatype_t::id` values of existing datatypes are not changed and 73 is newly assigned to for `MPI_COMPLEX4` to retain ABI compatibility. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 13:01:10 +09:00
KAWASHIMA Takahiro	4375c11a58	ompi/datatype: Add `ompi_mpi_short_float` ... and `ompi_mpi_c_short_float_complex` and `ompi_mpi_cxx_sfltcplex`. These are Open MPI internal variables intended to be defined as `MPI_SHORT_FLOAT`, `MPI_C_SHORT_FLOAT_COMPLEX`, and `MPI_CXX_SHORT_FLOAT_COMPLEX` in the future. `OMPI_DATATYPE_MPI_C_SHORT_FLOAT_COMPLEX` is also required to support `MPI_COMPLEX4` in the next commit. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 12:43:13 +09:00
Sergey Lebedev	829846dbcc	fp16 hcoll bindings Signed-off-by: Sergey Lebedev <sergeyle@mellanox.com>	2019-02-01 12:40:14 +09:00
KAWASHIMA Takahiro	2ad1c09848	opal/datatype: Add `opal_short_float_t` The type `short float`, which is proposed in ISO/IEC JTC 1/SC 22 WG 14 (C WG), is not supported by most compilers yet. But some compilers (including gcc 7 for AArch64 and clang 6) support `_Float16`, which is defined in ISO/IEC TS 18661-3:2015 (ISO/IEC JTC 1/SC 22/WG 14 N1945) as an extensions for C. If it is detected in `configure`, it is used as an alternate type of `short float` in Open MPI internal code. This commit adds a `configure` option `--enable-alt-short-float=TYPE`. It can be used to specify a type other than `short float` and `_Float16` as the alternate type. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 12:40:14 +09:00
KAWASHIMA Takahiro	f6b39452f6	opal/datatype: Support `short float` The type `short float` is proposed for the C language in ISO/IEC JTC 1/SC 22 WG 14 (C WG) for mainly IEEE 754-2008 binary16, a.k.a. half-precision floating point or FP16. By this commit, `short float` and `short float _Complex` are detected in `configure` and used in Open MPI internal code. `MPI_SHORT_FLOAT` and its complex number version are not added yet. This commit changes values of existing `OPAL_DATATYPE_*` macros. This change does not affect ABI compatibility of `libmpi.so` and the like because these values are only used in OPAL and OMPI internal code. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-02-01 12:40:14 +09:00
Jeff Squyres	4c64322db4	Merge pull request #6334 from jsquyres/pr/make-mpi-h-a-little-more-c++-friendly mpi.h.in: use C++ static_cast<> where appropriate	2019-01-31 07:14:34 -05:00
Jeff Squyres	30afdcead9	mpi.h.in: use C++ static_cast<> where appropriate When compiling mpi.h with a modern C++ compiler and a high degree of pickyness (e.g., -Wold-style-cast), casting using (void) in the OMPI_PREDEFINED_GLOBAL and MPI_STATUS_IGNORE macros will emit warnings. So if we're compiling with a C++ compiler, use C++'s static_cast<> instead of (void*). Thanks to @shadow-fax for identifying the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-01-31 03:22:26 -08:00
Thananon Patinyasakdikul	782ec851ea	Merge pull request #6319 from thananon/pr/allow_overtake pml/ob1: fix deadlock with communicator flag ALLOW_OVERTAKE.	2019-01-30 15:32:04 -05:00
Jeff Squyres	2203f8d900	Merge pull request #6185 from ggouaillardet/topic/hwloc_macros hwloc: remove public hwloc macros from opal_config.h	2019-01-30 07:32:22 -05:00
Gilles Gouaillardet	0aeb27f776	topo/treematch: silence a hwloc related warning treematch/km_partitioning.c #include "config.h", but there is no such file when the embedded treematch is used. In order to prevent the embedded treematch from incorrectly using the config.h from the embedded hwloc, generate a dummy config.h. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-01-30 14:51:38 +09:00
Aravind Gopalakrishnan	9cabcfdbba	mtl/ofi: Fix reference to help text object When we exceed the threshold number of contexts created, print appropriate help text Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2019-01-29 15:10:06 -08:00
Thananon Patinyasakdikul	0263456cf4	pml/ob1: fix deadlock with communicator flag ALLOW_OVERTAKE. We missed an assert to check if ALLOW_OVERTAKE is set or not before validating the sequence number and this will cause deadlock. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2019-01-29 14:55:06 -05:00
Nathan Hjelm	f9338dac93	Merge pull request #6312 from ggouaillardet/topic/op ompi/op: fix support of non predefined datatypes with predefined oper…	2019-01-29 10:55:00 -07:00
Brian Barrett	23da9fac23	Merge pull request #6294 from bwbarrett/mtl-ofi-no-device-warning mtl/ofi: Print descriptive error message on modex failure	2019-01-29 08:32:49 -08:00
Brian Barrett	1bb7a73a9c	Merge pull request #6302 from bwbarrett/feature/ofi-av-count mtl/ofi: Provide av count hint during initialization	2019-01-29 08:32:24 -08:00
Edgar Gabriel	7023357843	Merge pull request #6286 from edgargabriel/pr/floating-point-division-problem common/ompio: fix a floating point division problem	2019-01-29 10:07:09 -06:00
Gilles Gouaillardet	bc1cab5498	ompi/op: fix support of non predefined datatypes with predefined operators ACCUMULATE, unlike REDUCE, can use with derived datatypes with predefinied operations, with some restrictions outlined in MPI-3:11.3.4. The derived datatype must be composed entierly from one predefined datatype (so you can do all the construction you want, but at the bottom, you can only use one datatype, say, MPI_INT). Refs. open-mpi/ompi#6275 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-01-29 09:33:39 +09:00
Gilles Gouaillardet	45fb69b2b9	ompi/datatype: fix how we compute the space needed for the args Refs. open-mpi/ompi#6275 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2019-01-28 15:26:11 +09:00
Brian Barrett	44be7f139a	mtl/ofi: Provide av count hint during initialization Provide the av_attr.count hint (number of addresses that will be inserted into the address vector through the life of the process) at initialization of the address vector. It's ok to be a bit wrong, but some endpoints (RxR) can benefit by not going through the slow growth realloc churn. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2019-01-24 15:47:24 -08:00
Edgar Gabriel	c0f8ce0fff	common/ompio: fix a floating point division problem This commit fixes a problem reported on the mailing list with individual writes larger than 512 MB. The culprit is a floating point division of two large, close values. Changing the datatypes from float to double (which is what is being used in the fcoll components) fixes the problem. See issue #6285 and https://forum.hdfgroup.org/t/cannot-write-more-than-512-mb-in-1d/5118 Thanks for Axel Huebl and René Widera for reporting the issue. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2019-01-21 17:59:12 -06:00
Brian Barrett	fe25097194	mtl/ofi: Print descriptive error message on modex failure With MTLs, there's no "other transport" when the remote side does not have an active NIC, so we should print a useful error message when the modex failed (indicating lack of a NIC on the remote side). Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2019-01-21 23:50:31 +00:00
KAWASHIMA Takahiro	352b667323	Merge pull request #6210 from kawashima-fj/pr/mpiext-use-mod Use mpi_f08 module in mpi_f08_ext module	2019-01-21 11:56:41 +09:00
René Widera	a91fab80a1	common/ompio: possible rounding issue Similar to #6286 rounding number of bytes into a single precision floating point value to round up the result of a division is a potential risk due to rounding errors. - remove floating point operations for `round up` - removes floating point conversion for round down (native behavior of integer division) Signed-off-by: René Widera <r.widera@hzdr.de>	2019-01-18 14:05:23 +01:00
Yossi Itigin	387b2ff56f	Merge pull request #6260 from hoopoepg/topic/removed-fca COLL: removed FCA component	2019-01-17 00:05:07 +08:00
KAWASHIMA Takahiro	b380dd58b5	config/ompi_ext: use mpi module in mpi_ext module If MPI extensions are enabled, all `ompi/mpiext/pcollreq/use-mpi/mpiext__usempi.h` are included in `ompi/mpi/fortran/mpiext-use-mpi/mpi-ext-module.F90` and all `ompi/mpiext/pcollreq/use-mpi/mpiext__usempif08.h` are included in `ompi/mpi/fortran/mpiext-use-mpi-f08/mpi-f08-ext-module.F90` using `#include` directives. In `mpiext__usempi.h` and `mpiext__usempif08.h`, some MPI extension may want to use constants or handles defined in the `mpi` module and the `mpi_f08` module. For example, if you want to define a new datatype in `mpi_f08_ext`, you'll need the definition of `type(mpi_datatype)`. However, putting `use mpi_f08` line in thier `mpiext_*_usempif08.h` may cause a compilation error if more than one MPI extensions are enabled because the `use` statement must be put prior to any variable declarations. To resolve this problem, this commit puts `use mpi` and `use mpi_f08` as first lines of `mpi-ext-module.F90` and `mpi-f08-ext-module.F90` respectively. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-01-16 11:55:55 +09:00
KAWASHIMA Takahiro	2220623f34	config/ompi_ext: Don't include mpiext__mpifh.h in mpi_f08_ext Including `mpiext__mpifh.h` in the source file of the `mpi_f08_ext` module is not always appropriate. For example, if you want to define a new datatype in an MPI extension, the `include 'mpif-ext.h'` binding defines the datatype as `integer` but the `use mpi_f08_ext` binding defines it as `type(mpi_datatype)`. They conflict. This commit allows each MPI extension to declare whether it wants to include its `mpiext_*_mpifh.h` in `mpi_f08` and `mpi_f08_ext` respectively. The default (no declaration) is 'want'. See `ompi/mpiext/example/configure.m4` for an example. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2019-01-16 11:55:55 +09:00
Aravind Gopalakrishnan	37f9aff2a0	mtl/ofi: Add MCA variables to enable SEP and to request number of OFI contexts Moving to a model where we have users actively _enable_ SEP feature for use rather than opening SEP by default if provider supports it. This allows us to not regress (either functionally or for performance reasons) any apps that were working correctly on regular endpoints. Also, providing MCA to specify number of OFI contexts to create and default this value to 1 (Given btl/ofi also creates one by default, this reduces the incidence of a scenario where we allocate all available contexts by default and if btl/ofi asks for one more, then provider breaks as it doesn't support it). While at it, spruce up README on SEP content. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2019-01-14 09:58:36 -08:00
Ralph Castain	d1fd1f4cce	Merge pull request #6151 from nrspruit/ns_ompi_mtl_ofi_specializations MTL_OFI: Generation of specialized functions at build time	2019-01-14 09:31:54 -08:00
Sergey Oblomov	0759bb8561	COLL: removed FCA component - removed FCA collectives from coll/scoll Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2019-01-09 16:51:40 +02:00
Risto Toijala	f14a0f4fc9	mpi/fortran: Fix valgrind warnings for type create Valgrind warns that newtype is uninitialized when calling from Fortran as e.g. use mpi integer :: t, err call MPI_Type_create_f90_integer(5, t, err) Since newtype is intent(out), this should not happen. There is no reason to convert the type using PMPI_Type_f2c, only to over- write it immediately afterwards. The other type_create_ functions did not convert newtype. The valgrind warnings: ==28441== Conditional jump or move depends on uninitialised value(s) ==28441== at 0x581B555: PMPI_Type_f2c (in [...]/lib/libmpi.so.0.0.0) ==28441== by 0x4E87AB7: MPI_TYPE_CREATE_F90_INTEGER (in [...]/lib/libmpi_mpifh.so.0.0.0) ==28441== by 0x400BA1: MAIN__ (in [...]) ==28441== by 0x400C46: main (in [...]) ==28441== ==28441== Conditional jump or move depends on uninitialised value(s) ==28441== at 0x581B563: PMPI_Type_f2c (in [...]/lib/libmpi.so.0.0.0) ==28441== by 0x4E87AB7: MPI_TYPE_CREATE_F90_INTEGER (in [...]/lib/libmpi_mpifh.so.0.0.0) ==28441== by 0x400BA1: MAIN__ (in [..]) ==28441== by 0x400C46: main (in [...]) ==28441== ==28441== Use of uninitialised value of size 8 ==28441== at 0x581B577: PMPI_Type_f2c (in [...]/lib/libmpi.so.0.0.0) ==28441== by 0x4E87AB7: MPI_TYPE_CREATE_F90_INTEGER (in [...]/lib/libmpi_mpifh.so.0.0.0) ==28441== by 0x400BA1: MAIN__ (in [...]) ==28441== by 0x400C46: main (in [...]) ==28441== Signed-off-by: Risto Toijala <risto.toijala@gmail.com>	2019-01-08 22:00:00 +02:00
Aurelien Bouteiller	e54496bf2a	Merge pull request #6087 from ICLDisco/export/errors_cid Manage errors in communicator creations (cid)	2018-12-31 15:01:55 -05:00
Jeff Squyres	17be4c6d1f	Merge pull request #6229 from jsquyres/pr/fix-enable-grequest-extension-in-a-tarball romio321: ensure to distribute ompi_grequestx.h	2018-12-28 16:15:23 -05:00
Jeff Squyres	62321be186	romio321: ensure to distribute ompi_grequestx.h Refs https://github.com/open-mpi/ompi/issues/6227. Thanks to @georgemarselis for reporting. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-12-27 15:39:47 -08:00
bosilca	96f88052e9	Merge pull request #5948 from mkurnosov/coll-ireduce-silence-coverity coll/libnbc/ireduce: silence Coverity warning CID 1440360	2018-12-24 12:59:16 -05:00
bosilca	593db292da	Merge pull request #5644 from mkurnosov/coll-iallreduce-rabenseifner coll/libnbc: add Rabenseifner's algorithm for MPI_Iallreduce	2018-12-24 12:58:21 -05:00
Jeff Squyres	efcaef74d8	MPI_Type_set_name: fix string length at target opal_string_copy() takes care of all the string computations. Specifically: when we converted to opal_string_copy(), we accidentally left the source length as the argument, not the target length, which resulted in one less character being copied than intended (as was showing up in MTT C++ testing results). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-12-23 13:00:01 -08:00
Aurelien Bouteiller	bd0d2b832e	Merge pull request #6086 from ICLDisco/export/errors_nbc Manage errors in NBC collective ops	2018-12-21 02:34:00 -05:00
Jeff Squyres	1be5358834	Merge pull request #6212 from jsquyres/pr/fix-treematch-common-symbol treematch: fix global common symbol	2018-12-20 15:20:41 -05:00
Jeff Squyres	e9a6246b90	treematch: fix global common symbol Despite its name, this symbol doesn't need to be global. So just make it static. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-12-20 11:06:14 -08:00
Jeff Squyres	81bfb5f5e5	Remove some IMPI attributes that were never implemented. This is a holdover from LAM/MPI that was never implemented here in Open MPI (and never will be). Might as well remove this dead code. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-12-20 10:12:32 -08:00
Nathan Hjelm	4944508603	Merge pull request #6136 from hjelmn/opal_cleanup opal: clean up init/finalize	2018-12-18 15:23:32 -07:00
Nathan Hjelm	a39cb747dd	ompi/datatype: don't call opal_datatype_finalize directly Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-12-18 14:37:04 -07:00
Nathan Hjelm	06baa518f7	rte/pmix: fill in opal_process_info when using prrte/pmix This commit fixes a bug when launching with prun where the process info structures used by the btls are not populated. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-12-13 16:04:31 -07:00
bosilca	804a517929	Merge pull request #6146 from bosilca/topic/treematch_update Update to the latest TreeMatch (v1.3).	2018-12-13 13:26:40 -05:00
Spruit, Neil R	bef5f50a42	MTL_OFI: Generation of specialized functions at build time -> Added new targets in Makefile.am to call a new build script generate-opt-funcs.pl to generate specialized functions for each .pm file. -> Added new perl module .pm files for send,isend,irecv,iprobe,improbe which are loaded by generate-opt-funcs.pl to create new source files that correspond to the name of the .pm file to be used as part of MTL OFI. -> Added mtl_ofi_opt.pm.template and updated README with details on the specialization features and how to add additional specialization support. -> Added new opt_common/mtl_ofi_opt_common.pm containing common functions for generating the specialized functions used by all other *.pm modules. -> Added new mtl_ofi.h which includes the definitions for the function symbol table for storing the specialized functions along with the definitions for the initialization functions for the corresponding function pointers. -> Based off the OFI provider capabilities the specialized function pointers are assigned at mtl_ofi_component_init to the corresponding MTL OFI function. -> mca_mtl_ofi_module_t has been updated with the symbol table struct which is assigned at component init. Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>	2018-12-13 00:35:19 -08:00
Aravind Gopalakrishnan	e5e19dfcf7	Fix for SEP when num local procs is greater than available contexts For cases when the number of local processes is greater than the number of available contexts, the SEP initialization phase would calculate the number of contexts to provision for each rank to be 0 and would eventually crash. Fix the issue here by using regular endpoints in the event the number of local processes is more than available contexts. This fixes issue #6182. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2018-12-12 16:49:04 -08:00
KAWASHIMA Takahiro	adc05f705e	Merge pull request #6174 from kawashima-fj/pr/f08-missing-handles fortran/use-mpi-f08: Add C++ datatypes and MPI_NO_OP	2018-12-12 14:13:36 +09:00
Brian Barrett	6e15128d96	mtl/ofi: Fix crash if no providers found Commit `109d0569ff` introduced a crash when an error occurred before ofi_ctxt was allocated, including when no providers passed the selection logic. Properly check that the pointer is not NULL in the error cleanup code before dereferencing the pointer. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-12-11 15:46:18 -08:00
Jeff Squyres	6f7fbd1676	Merge pull request #6158 from ggouaillardet/topic/mpiext-path-updates mpiext: updates for header file locations	2018-12-11 13:01:46 -05:00
KAWASHIMA Takahiro	63ecf01610	fortran/use-mpi-f08: Add C++ datatypes and MPI_NO_OP Though the MPI standard does not have `MPI_CXX_COMPLEX`, `mpi.h`, `mpif.h`, and `mpi.mod` have it. So I added it for consistency. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-12-11 13:08:29 +09:00
KAWASHIMA Takahiro	e0c5bad195	fortran/use-mpi-f08: Remove unnecessary `;` Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-12-11 09:06:21 +09:00
Matias Cabral	cdb952f66d	Merge pull request #6170 from matcabral/remove_psm2_lower_p MTL/PSM2: add missing default priority	2018-12-07 16:11:45 -08:00
Matias A Cabral	c76c6d8b28	MTL/PSM2: add missing default priority Missing default priority after PR #6153 Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2018-12-07 14:46:34 -08:00
Matias Cabral	0b821f2184	Merge pull request #6153 from matcabral/remove_psm2_lower_p MTL/PSM2: Do not lower the priority when all processes are local.	2018-12-07 10:19:53 -08:00
KAWASHIMA Takahiro	4be5a6cdc8	Merge pull request #6159 from kawashima-fj/pr/fix-type-create-f90 mpi/c: Fix MPI_TYPE_CREATE_F90_{REAL,COMPLEX}	2018-12-08 01:41:20 +09:00
KAWASHIMA Takahiro	6fb01f64fe	mpi/c: Fix MPI_TYPE_CREATE_F90_{REAL,COMPLEX} This commit fixes edge cases of `r = 38` and `r = 308`. As defined in the MPI standard, `TYPE_CREATE_F90_REAL` and `TYPE_CREATE_F90_COMPLEX` must be consistent with the Fortran `SELECTED_REAL_KIND` function. The `SELECTED_REAL_KIND` function is defined based on the `RANGE` function. The `RANGE` function returns `INT(MIN(LOG10(HUGE(X)), -LOG10(TINY(X))))` for a real value `X`. The old code considers only `INT(LOG10(HUGE(X)))` using `_MAX_10_EXP`. This commit adds `INT(-LOG10(TINY(X)))` part using `_MIN_10_EXP`. This bug affected the following `p`-`r` combinations. \| p \| r \| expected \| returned \| expected \| returned \| \| :------------ \| --: \| :-------- \| :-------- \| :------- \| :-------- \| \| MPI_UNDEFINED \| 38 \| REAL8 \| REAL4 \| COMPLEX16 \| COMPLEX8 \| \| 0 <= p <= 6 \| 38 \| REAL8 \| REAL4 \| COMPLEX16 \| COMPLEX8 \| \| MPI_UNDEFINED \| 308 \| REAL16 \| REAL8 \| COMPLEX32 \| COMPLEX16 \| \| 0 <= p <= 15 \| 308 \| REAL16 \| REAL8 \| COMPLEX32 \| COMPLEX16 \| MPICH returns the same result as Open MPI with this fix. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-12-06 16:48:23 +09:00
Gilles Gouaillardet	975e3cd0c9	mpiext: updates for header file locations Per discussion on https://github.com/open-mpi/ompi/pull/6030 and https://github.com/open-mpi/ompi/pull/6145, move around where MPI extension header files are installed (specifically: the installation tree path does not need to match the source tree path). For reference, header files were installed like this : - <prefix>/include/openmpi/ompi/mpiext/pcollreq/mpif-h/mpiext_pcollreq_mpifh.h - <prefix>/include/openmpi/ompi/mpiext/pcollreq/c/mpiext_pcollreq_c.h and they are now installed like this : - <prefix>/include/openmpi/mpiext/mpiext_pcollreq_mpifh.h - <prefix>/include/openmpi/mpiext/mpiext_pcollreq_c.h Signed-off-by: Jeff Squyres <jsquyres@cisco.com> Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-12-06 15:40:02 +09:00
Gilles Gouaillardet	4918fc4455	Revert "fortran/mpif-h: keep include path for extension short" This reverts commit open-mpi/ompi@848a868f7b. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-12-06 15:39:59 +09:00
Gilles Gouaillardet	ccbdc8fd58	Revert "c: keep include path for extension short" This reverts commit open-mpi/ompi@27c25fa721. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-12-06 15:39:54 +09:00
Gilles Gouaillardet	a152aa215e	cleanup: remove the unused (and unexpanded) {ORTE,OMPI}_WANT_REPO_REV macro Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-12-06 13:13:13 +09:00
George Bosilca	74f2365d6e	Remove most (all) warnings from the new TreeMatch. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-12-05 15:38:39 -05:00
Guillaume Mercier	27aa34e53f	New version based on TM 1.3 Optimize_topology is commented for now until bug resolved in TM Signed-off-by: Guillaume Mercier <guillaume.mercier@bordeaux-inp.fr>	2018-12-05 15:38:39 -05:00
Matias A Cabral	fc8582c560	MTL/PSM2: Do not lower the priority when all processes are local. The intention of lowering the priority when all processes are local was to favor Vader BTL. However, in builds including the OFI MTL it gets selected instead. Reviewed-by: Spruit, Neil R <neil.r.spruit@intel.com> Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com> Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2018-12-04 15:31:09 -08:00
Matias Cabral	abd34620f4	Merge pull request #5972 from aravindksg/ofi_sep_master MTL/OFI: Add OFI Scalable Endpoint support	2018-12-04 13:07:44 -08:00
Aravind Gopalakrishnan	109d0569ff	MTL/OFI: Add OFI Scalable Endpoint support OFI MTL supports OFI Scalable Endpoints feature as means to improve multi-threaded application throughput and message rate. Currently the feature is designed to utilize multiple TX/RX contexts exposed by the OFI provider in conjunction with a multi-communicator MPI application model. For more information, refer to README under mtl/ofi. Reviewed-by: Matias Cabral <matias.a.cabral@intel.com> Reviewed-by: Neil Spruit <neil.r.spruit@intel.com> Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2018-12-03 09:56:52 -08:00
George Bosilca	c6f73e8883	First step of the integration with the new TreeMatch. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-12-02 20:05:03 -05:00
Yossi Itigin	83cca9d52a	ucx: add owner.txt for components Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2018-12-01 17:14:03 +02:00
matcabral	6a15712df5	MTL/OFI: revert PR 6082 Revert to avoid issues with dynamic processes. Signed-off-by: matcabral <matias.a.cabral@intel.com>	2018-11-30 13:44:39 -08:00
Matias Cabral	ef5db1b752	Merge pull request #6082 from matcabral/lower_mtl_ofi_p MTL/OFI: Lower priority when all procs are local	2018-11-30 12:05:40 -08:00
Bert Wesarg	18525ce39b	Fix use of bitwise operation in CPP condition Signed-off-by: Bert Wesarg <bert.wesarg@tu-dresden.de>	2018-11-30 12:44:42 +01:00
Nathan Hjelm	5ebcbe444e	Merge pull request #6083 from devreal/rdma-plug-memleak Plug two memory leaks in rdma osc	2018-11-27 09:56:19 -07:00
Nathan Hjelm	27084c60c9	Merge pull request #6123 from hoopoepg/topic/osc-ucx-max-level-60 OSC/UCX: set max level value to 60	2018-11-27 09:51:09 -07:00
Sergey Oblomov	2d230b3aac	OSC/UCX: set max level value to 60 Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-11-27 14:20:28 +02:00
KAWASHIMA Takahiro	291d7654c5	Merge pull request #6030 from ggouaillardet/topic/mpiext_short_path mpiext: keep include path for extension short	2018-11-27 20:59:01 +09:00
Gilles Gouaillardet	5a968306d6	mpi/c: add back (some more) deprecated subroutines - MPI_NULL_DELETE_FN - MPI_NULL_COPY_FN - MPI_DUP_FN Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-11-27 13:56:03 +09:00
Gilles Gouaillardet	27c25fa721	c: keep include path for extension short move openmpi/ompi/mpiext/FOO/c/mpiext_FOO_c.h to openmpi/ompi/mpiext/FOO_c.h in order to use consistent paths with mpif.h extensions Refs. open-mpi/ompi#6019 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-11-27 11:21:05 +09:00
Gilles Gouaillardet	848a868f7b	fortran/mpif-h: keep include path for extension short in order to cope with the 72 characters per line limit, move openmpi/ompi/mpiext/FOO/mpif-h/mpiext_FOO_mpifh.h to openmpi/ompi/mpiext/FOO_mpifh.h Refs. open-mpi/ompi#6019 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-11-27 09:39:09 +09:00
Jeff Squyres	8459d29738	Merge pull request #5979 from mkurnosov/coll-libnbc-cleanup coll/libnbc: remove debug output	2018-11-26 18:10:10 -05:00
Jeff Squyres	dbe064af97	Merge pull request #5653 from bmwiedemann/userhost Allow to override build user and host	2018-11-26 17:48:37 -05:00
Bert Wesarg	b3f3281290	Re-add removed deprecate-only MPI-2.0 symbols See #6114 Signed-off-by: Bert Wesarg <bert.wesarg@tu-dresden.de>	2018-11-26 14:00:05 +01:00
Yossi Itigin	e98ce2b36b	Merge pull request #6108 from yosefe/topic/pml-ucx-init-req_mpi_object pml_ucx: initialize req_mpi_object.comm for error handler	2018-11-26 11:54:10 +02:00
KAWASHIMA Takahiro	5f0fcf0f45	README & man: Update pcollreq documentation The feature of persistent collectives is approved in the Sept. 2018 MPI Forum meeting and 2018 Draft Specification of the MPI standard is published during SC18. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-11-26 17:27:43 +09:00
Yossi Itigin	f36eeef4c5	pml_ucx: initialize req_mpi_object.comm for error handler without this fix, an error handler invoked on pml_ucx request would segfault while trying to dereference requests[i]->req_mpi_object.comm Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2018-11-25 19:37:54 +02:00
Yossi Itigin	ed967d867b	Merge pull request #6073 from hoopoepg/topic/set-osc-ucx-level-200 OSC: set UCX module used by default	2018-11-22 10:53:37 +02:00
KAWASHIMA Takahiro	303d7842d9	Merge pull request #6074 from kawashima-fj/pr/remove-c99-type-check Remove `#if HAVE_[TYPE]` for types available in C99	2018-11-20 11:42:13 +09:00
Aurelien Bouteiller	20447be744	Someone left a debug printf in NBC Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2018-11-16 10:37:04 -05:00
Aurelien Bouteiller	65660e5999	Manage errors in NBC collective ops Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> Correctly bubble up errors in NBC collective operations Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> The error field of requests needs to be rearmed at start, not at create Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2018-11-15 16:43:56 -05:00
Joseph Schuchart	91885f5876	Plug two memory leaks in rdma osc Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2018-11-14 14:31:54 -05:00
matcabral	5f58453e63	MTL/OFI: Lower priority when all procs are local So far Vader is faster than OFI MTL for doing shared memory. Therefore, let it run by default when all procs are local. Reviewed-by: Spruit, Neil R <neil.r.spruit@intel.com> Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com> Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2018-11-14 11:01:33 -08:00
Sergey Oblomov	e91f214982	OSC/UCX: added UCX version evaluation - added UCX version evaluation to set OSC UCX priority Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-11-14 10:03:13 +02:00
KAWASHIMA Takahiro	cacd6f389c	datatype: Remove `#if HAVE_[TYPE]` for C99 types Now Open MPI requires a C99 compiler. Checking availability of the following types is no more needed. - `long long` (`signed` and `unsigned`) - `long double` - `float _Complex` - `double _Complex` - `long double _Complex` Furthermore, the `#if HAVE_[TYPE]` style checking is not correct. Availability of C types is checked by `AC_CHECK_TYPES` in `configure.ac`. `AC_CHECK_TYPES` defines macro `HAVE_[TYPE]` as `1` in `opal_config.h` if the `[TYPE]` is available. But it does not define `HAVE_[TYPE]` (instead of defining as `0`) if it is not available. So even if we need `HAVE_[TYPE]` checking, it should be `#if defined(HAVE_[TYPE])`. I didn't remove `AC_CHECK_TYPES` for these types in `configure.ac` since someone may use `HAVE_[TYPE]` macros somewhere. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2018-11-14 09:32:52 +09:00
Sergey Oblomov	36934a8bb2	OSC: set UCX module used by default - OSC/UCX module set priority to 200 to be used by default Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-11-12 15:08:22 +02:00
Gilles Gouaillardet	b3ce25af95	mpiext/cuda: fix mpiext_cuda_c.h install path This fixes a regression introduced in commit open-mpi/ompi@f8318f0a8f. Fixes open-mpi/ompi#6069 Thanks Kawashima-san for the heads up ! Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-11-12 00:58:19 -06:00
Matias Cabral	30b6435897	Merge pull request #6015 from aravindksg/proc-threshold-fix MTL/OFI: Check threshold number of peers allowed per rank	2018-11-08 15:47:45 -08:00
Gilles Gouaillardet	f8318f0a8f	mpiext/cuda: do not include automatically generated file into dist tarball ompi/mpiext/cuda/c/mpiext_cuda_c.h is automatically generated from ompi/mpiext/cuda/c/mpiext_cuda_c.h.in at configure time. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-11-06 13:57:31 +09:00
Jeff Squyres	65eb118e08	MPI_Type_get_envelope: remove MPI-1 deleted names Several names are now no longer returned by MPI_Type_get_envelope. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-11-03 16:20:45 -04:00
Aravind Gopalakrishnan	5cf43de445	MTL/OFI: Check threshold number of peers allowed per rank When the provider does not support FI_REMOTE_CQ_DATA, the OFI tag does not have sizeof(int) bits for the rank. Therefore, unexpected behavior will occur when this limit is crossed. Check the max allowed number of ranks during add_procs() and return if there is danger of exceeding this threshold. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2018-11-01 14:03:00 -07:00
Geoffrey Paulsen	b03a39d359	mpi.h: restore some MPI-deprecated items to default builds Commit `89da9651b` inadvertantly #if'ed out both deprecated and removed items from mpi.h. The intent was only to #if out items that have been removed from the MPI specification and leave all items that are merely deprecated. This commit also re-orders the deleted typedef+functions to be in the same order as they are listed in MPI-3.1 chapter 17, just to make verifying/checking the code easier. Note that --enable-mpi1-compatibility can still be used to restore prototypes for the items that have been removed from the MPI specification (e.g., MPI_Address()). Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-11-01 13:36:48 -07:00
Matias Cabral	2da31706bf	Merge pull request #5970 from aravindksg/coll-tuned-fix coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms	2018-10-31 11:20:07 -07:00
Ralph Castain	05ac8fa71c	Remove stale defunct tools Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-10-30 08:48:16 -07:00
Mikhail Kurnosov	64abd0f405	coll/libnbc: remove debug output 1. Remove debug output in iallgather (I have forgotten to remove it). 2. Remove an incorrect comment in description of ibcast Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-26 15:52:02 +07:00
Aravind Gopalakrishnan	88d781056f	coll/tuned: Fix MPI_IN_PLACE processing in tuned algorithms PR #5450 addresses MPI_IN_PLACE processing for basic collective algorithms. But in conjunction with that, we need to check for MPI_IN_PLACE in tuned paths as well before calling ompi_datatype_type_size() as otherwise we segfault. MPI spec also stipulates to ignore sendcount and sendtype for Alltoall and Allgatherv operations. So, extending the check to these algorithms as well. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2018-10-24 15:31:33 -07:00
Joseph Schuchart	a193ae26bf	Fix regression introduced earlier by re-adding a barrier after shared memory has been registered Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2018-10-24 15:54:19 -04:00
Aurelien Bouteiller	96c91e94eb	Manage errors in communicator creations (cid) In order for this to work, error management needs to also be added to NBC, from separate PR Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu> The error field of requests needs to be rearmed at start, not at create Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2018-10-23 23:43:33 -04:00
Yossi Itigin	4c442f2601	Merge pull request #5934 from hoopoepg/topic/suppressed-cov-warn-added-log-msg COMMON/UCX: suppressed coverity warnings	2018-10-22 11:00:47 +03:00
Mikhail Kurnosov	8b511c7889	coll/libnbc/ireduce: silence Coverity warning CID 1440360 Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-10-22 11:20:28 +07:00
Sergey Oblomov	1099d5f023	COMMON/UCX: added error code to log output Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-10-21 11:37:25 +03:00
Nathan Hjelm	a66373454e	Merge pull request #5943 from bosilca/fix/libnbc_warnings Remove few warnings in libnbc identified by clang-1000.11.45.2	2018-10-20 21:24:30 -06:00
Bernhard M. Wiedemann	bc23993dea	Allow to override build user and host using the standard $USER and $HOSTNAME environment variables to make reproducible builds possible. See https://reproducible-builds.org/ for why this is good. This helps improve issue #3759 Signed-off-by: Bernhard M. Wiedemann <bwiedemann@suse.de>	2018-10-20 09:27:00 -04:00
bosilca	c3abedbd2c	Merge pull request #5759 from bosilca/fix/monitoring Fix/monitoring	2018-10-19 07:18:41 -07:00

... 3 4 5 6 7 ...

10695 Коммитов