openmpi

Author	SHA1	Message	Date
Nathan Hjelm	63ded4d083	Merge pull request #5224 from benmenadue/master io/romio314: Replace deprecated MPI-1 functions	2018-06-06 15:41:53 -06:00
Ralph Castain	86d699d42e	Correct typo in name comparison flags Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-06 12:18:52 -07:00
Ralph Castain	840fb42f93	PMIx rte component does support dynamics Minor cleanups Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-05 21:55:19 -07:00
Nathan Hjelm	64a5baaa28	Merge pull request #5193 from hjelmn/osc_sm_location Use /dev/shm for shared memory files in osc components	2018-06-05 09:42:14 -06:00
Sergey Oblomov	0a8261f3b0	PML/UCX: fixed hang on MPI_Finalize. Fixes issue https://github.com/openucx/ucx/issues/2656; added flush for worker object to complete all pending operations. Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-06-05 17:22:03 +03:00
Mikhail Kurnosov	3adf96fdb8	coll/base: add butterfly algorithm for MPI_Reduce_scatter Implements butterfly algorithm for MPI_Reduce_scatter. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-05 15:53:13 +07:00
Ben Menadue	34ec0bd8ab	Replace MPI_Type_extent with MPI_Type_get_extent in ROMIO. Signed-off-by: Ben Menadue <ben.menadue@nci.org.au>	2018-06-05 15:27:58 +10:00
Ben Menadue	756cc67221	Replace MPI_Address with MPI_Get_address in ROMIO. Signed-off-by: Ben Menadue <ben.menadue@nci.org.au>	2018-06-05 15:27:25 +10:00
Ralph Castain	3020b699f3	Merge pull request #5213 from rhc54/topic/rte Enable the PMIx ompi/rte component	2018-06-03 10:23:40 -07:00
Ralph Castain	55ac526a67	Enable the PMIx ompi/rte component Get the OMPI rte/pmix component working. This was tested using PRRTE as the RM, configuring OMPI using: * autogen --no-orte * with external libevent, external hwloc, and external PMIx master * configuring PMIx master with the same libevent and hwloc * execute the application using PRRTE's "prun" launcher, which has the same cmd line as ORTE's mpirun Note that PMIx master appears to have a bug in the event notification system that caches job termination events. Thus, the first execution runs fine, but subsequent executions cause an "abort" when the OMPI default error handler is invoked upon notification of the prior job's termination. Will work that separately. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 134cca9ac0de092d767999357573a31703f72292)	2018-06-03 07:25:12 -07:00
Jeff Squyres	35438ae9b5	mpi/finalized: revamp INITIALIZED/FINALIZED Per MPI-3.1:8.7.1 p361:11-13, it's valid for MPI_FINALIZED to be invoked during an attribute destruction callback (e.g., during the destruction of keyvals on MPI_COMM_SELF during the very beginning of MPI_FINALIZE). In such cases, MPI_FINALIZED must return "false". Prior to this commit, we hung in FINALIZED if it were invoked during a COMM_SELF attribute destruction callback in FINALIZE. See https://github.com/open-mpi/ompi/issues/5084. This commit converts the MPI_INITIALIZED / MPI_FINALIZED infrastructure to use a single enum (ompi_mpi_state, set atomically) to represent the state of MPI: - not initialized - init started - init completed - finalize started - finalize past COMM_SELF destruction - finalize completed The "finalize past COMM_SELF destruction" state is what allows us to return "false" from MPI_FINALIZED before COMM_SELF has been fully destroyed / all attribute callbacks have been invoked. Since this state is checked at nearly every MPI API call (to see if we're outside of the INIT/FINALIZE epoch), care was taken to use atomics to set the ompi_mpi_state value in ompi_mpi_init() and ompi_mpi_finalize(), but performance-critical code paths can simply read the variable without needing to use a slow call to an opal_atomic_*() function. Thanks to @AndrewGaspar for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-06-01 13:36:29 -07:00
Edgar Gabriel	52bd606294	fcoll/dynamic_gen2: make sure that intermediate variables can hold the offset. For very large offsets, some variables used in the fcoll/dynamic_gen2 code base were under certain circumstances not large enough to hold intermediate values. This issue was detected mainly in the vulcan component but could happen in the dynamic_gen2 component as well. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-01 06:53:38 -05:00
Jeff Squyres	25f2d02c61	fcoll/dynamic_gen2: minor compiler warning stomp Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-30 10:08:19 -07:00
Nathan Hjelm	e9de42544e	osc/sm: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_sm_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:44:01 -06:00
Nathan Hjelm	d0d59b1d7d	osc/rdma: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_rdma_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:43:33 -06:00
Howard Pritchard	5b7c866f59	osc/pt2pt: disable when THREAD_MULTIPLE. Per discussion at https://github.com/open-mpi/ompi/issues/2614#issuecomment-392815654, do not allow for selection of the OSC PT2PT when creating an MPI RMA window when THREAD_MULTIPLE is active. Print a helpful message and return a not-supported error. Signed-off-by: Howard Pritchard <howardp@lanl.gov> Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit d0ffd660841623c02d1dfa3151e7f7afd3327698) Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-29 08:59:53 -07:00
Mikhail Kurnosov	28d5837dd9	coll: reduce_scatter_block: add butterfly algorithm Implements butterfly algorithm for MPI_Reduce_scatter_block. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-27 14:17:41 +07:00
Edgar Gabriel	6b03cee7f1	io/ompio: erroneous condition in aggregator selection logic. Fix the logic that decides which aggregator selection algorithm to use. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-05-24 15:52:19 -05:00
John L. Jolly	36b9e15fb7	- Build warning: stringop-overflow in get_dynamic_win_info() at osc_ucx_comm.c In file included from /usr/include/string.h:494:0, from ../../../../ompi/info/info.h:29, from ../../../../ompi/mca/osc/base/base.h:24, from osc_ucx_comm.c:13: In function 'memcpy', inlined from 'get_dynamic_win_info' at osc_ucx_comm.c:359:5, inlined from 'ompi_osc_ucx_put' at osc_ucx_comm.c:401:18: /usr/include/bits/string_fortified.h:34:10: warning: '__builtin___memcpy_chk' writing 8 bytes into a region of size 4 overflows the destination [-Wstringop-overflow=] return __builtin___memcpy_chk (__dest, __src, __len, __bos0 (__dest)); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This is caused by a type size mismatch in a call to memcpy This fix corrects the type definition of the win_count variable. Signed-off-by: John Jolly <jjolly@suse.com>	2018-05-22 10:11:57 -06:00
Brian Barrett	09e4c40ce9	mtl: remove MXM MTL Remove the MXM MTL, which has been deprecated in preference for the Yalla PML. This was discussed at the last developers meeting and somehow I ended up with the action item to do the removal. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-05-21 14:18:30 -07:00
Sergey Oblomov	5ec26914a6	PML/UCX: do not set offset on ordered data recv Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-21 19:40:07 +03:00
Sergey Oblomov	19607daa32	PML/UCX: create convertor clone instead of stack reset Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 16:39:13 +03:00
Sergey Oblomov	7c5de01c57	PML/UCX: reset converter stack on unordered messages Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 13:11:02 +03:00
George Bosilca	7191ea120c	Fix merge conflict related to function renaming. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-05-15 11:34:20 -04:00
bosilca	2ab628b92e	Merge pull request #5074 from bosilca/topic/remove_warnings Remove warnings identified by clang.	2018-05-15 11:15:23 -04:00
bosilca	d13b9a2e25	Merge pull request #5156 from ggouaillardet/topic/reduce_scatter_block coll: reduce_scatter_block: rename and MCA parameter description fix	2018-05-15 11:13:26 -04:00
Mikhail Kurnosov	82299a9c04	coll: reduce_scatter_block: add recursive halving algorithm Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-15 08:20:32 +07:00
Gilles Gouaillardet	ce7b3113f6	coll: reduce_scatter_block: rename and MCA parameter description fix - rename ompi_coll_base_reduce_scatter_block_basic to more self descriptive ompi_coll_base_reduce_scatter_block_basic_linear - fix the description of the coll_tuned_reduce_scatter_block_algorithm MCA param this fixes and documents previous open-mpi/ompi@0e8b35b615 MPI_Reduce_scatter_block used to be implemented by the coll/basic module only. A new algo (recursive doubling) was recently introduced and can be used via the coll/tuned module, but we never intended to make it the default algo. In order to "restore" the previous default, the initial algo was moved from coll/basic to coll/base, and is now used by default by coll/tuned. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-09 08:54:48 +09:00
Nathan Hjelm	cf585d725c	osc/rdma: fix SEGV with null origin in FOP in debug build Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-08 14:10:20 -06:00
Jeff Squyres	b39bbfb3c0	Merge pull request #5142 from mkurnosov/base-reduce-remove-warnings coll/base/reduce: remove warning identified by Coverity Scan	2018-05-07 15:49:56 -04:00
Gilles Gouaillardet	0e8b35b615	coll/tuned: use basic algo for reduce_scatter_block by default Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:44 +09:00
Gilles Gouaillardet	32095be0d6	coll/{base,basic}: move reduce_scatter_block from basic to base Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:38 +09:00
Mikhail Kurnosov	ba968e4490	coll/base/reduce: Remove warning identified by Coverity Scan Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-04 20:48:37 +07:00
Mikhail Kurnosov	8cf8553abd	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-03 07:28:32 +07:00
Nathan Hjelm	c22c485837	Merge pull request #5136 from hjelmn/mtl_fix mtl: reset ompi_mtl_base_selected_component on framework close	2018-05-02 15:48:54 -06:00
Joshua Ladd	32ddc6af7e	Merge pull request #5094 from xinzhao3/topic/osc-win-fix-master OMPI/OSC/UCX: fix issue in impl of MPI_Win_create_dynamic/MPI_Win_attach/MPI_Win_detach	2018-05-02 17:42:34 -04:00
Nathan Hjelm	f432d07844	mtl: reset ompi_mtl_base_selected_component on framework close Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-02 14:53:34 -06:00
Xin Zhao	3f5ac97649	OMPI/OSC/UCX: set priority to 0. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2018-05-02 21:40:06 +03:00
Yossi Itigin	66d931b7c4	Merge pull request #5116 from yosefe/topic/ucx-connect-errs ucx: improve error messages during connection establishment	2018-05-02 14:04:24 +03:00
Nathan Hjelm	ae17908f35	io/romio314: fix two more MPI-3 compliance issues Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-01 15:18:18 -06:00
Nathan Hjelm	e9ef7aa256	Merge pull request #4985 from mkurnosov/spacc-scan-exscan coll/spacc: Add recursive doubling algorithm for Scan and Exscan	2018-05-01 09:21:23 -06:00
Yossi Itigin	385f38ab4e	ucx: improve error messages during connection establishment Also, unite common code calling ucp_ep_create() Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2018-04-30 15:45:05 +03:00
Ninad Prabhukhanolkar	1518d7e003	Updated aggregate_profile.pl The files array was also storing $phase.prof. This was leading to $phase.prof's output getting dumped into itself again and again. Updated code to initialise files array with files other than $phase.prof. Signed-off-by: Ninad Prabhukhanolkar <ninadchess96@gmail.com>	2018-04-26 20:34:24 +05:30
Edgar Gabriel	19b71e4eb6	ompio/fs: add summary of supported file systems. Add the list of supported file systems to the summary output at the end of configure. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-04-25 14:39:18 -05:00
Xin Zhao	53bdfd1dcb	OMPI/OSC/UCX: fix issue in impl of MPI_Win_create_dynamic/MPI_Win_attach/MPI_Win_detach Signed-off-by: Xin Zhao <xinz@mellanox.com>	2018-04-24 23:09:52 +03:00
Mikhail Kurnosov	787ec8929b	Rename function `rounddown` into `ompi_rounddown` FreeBSD 11 `/sys/param.h` has declaration of `rounddown` Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-24 07:59:49 +07:00
Mikhail Kurnosov	4cbcff7fcd	coll/base: add recursive doubling algorithm for MPI_Reduce_scatter_block Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-23 11:02:31 +07:00
raafatfeki	91e028f7fd	fcoll/dynamic_gen2: Reduce number of realloc calls. Keep track of the size of the blocklen_per_process and displs_per_process on the aggregator data structure to minimize the number of realloc function calls required in the shuffle_init operation. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-04-20 10:13:57 -05:00
Nathan Hjelm	84765001aa	io/romio: do not use removed functions This commit attempts to update the romio io component to not use functions removed in MPI-3.0 (2012). This is a first cut and will probably need to be reviewed for correctness. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-04-16 12:06:52 -06:00
Nathan Hjelm	4d876ec6fe	io/romio314: fix minmax datatypes romio assumes that all predefined datatypes are contiguous. Because of the (terribly named) composed datatypes MPI_SHORT_INT, MPI_DOUBLE_INT, MPI_LONG_INT, etc this is an incorrect assumption. The simplest way to fix this is to override the MPI_Type_get_envelope and MPI_Type_get_contents calls with calls that will work on these datatypes. Note that not all calls to these MPI functions are replaced, only the ones used when flattening a non-contiguous datatype. References #5009 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-04-16 10:46:38 -06:00


6782 commits