openmpi

Автор	SHA1	Сообщение	Дата
Thananon Patinyasakdikul	390d72addd	Merge pull request #4885 from davideberius/spc_pr Initial Software-based Performance Counters PR	2018-06-12 14:04:49 -07:00
David Eberius	d377a6b6f4	Added Software-based Performance Counters driver code along with several counters. This code is the implementation of Software-base Performance Counters as described in the paper 'Using Software-Base Performance Counters to Expose Low-Level Open MPI Performance Information' in EuroMPI/USA '17 (http://icl.cs.utk.edu/news_pub/submissions/software-performance-counters.pdf). More practical usage information can be found here: https://github.com/davideberius/ompi/wiki/How-to-Use-Software-Based-Performance-Counters-(SPCs)-in-Open-MPI. All software events functions are put in macros that become no-ops when SOFTWARE_EVENTS_ENABLE is not defined. The internal timer units have been changed to cycles to avoid division operations which was a large source of overhead as discussed in the paper. Added a --with-spc configure option to enable SPCs in the Open MPI build. This defines SOFTWARE_EVENTS_ENABLE. Added an MCA parameter, mpi_spc_enable, for turning on specific counters. Added an MCA parameter, mpi_spc_dump_enabled, for turning on and off dumping SPC counters in MPI_Finalize. Added an SPC test and example. Signed-off-by: David Eberius <deberius@vols.utk.edu>	2018-06-11 22:48:16 -04:00
Edgar Gabriel	2d8a769bfd	fcoll/static: remove component now that we have a shiny new fcoll component, no need to keep the static component around. No use for it anymore. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-08 07:39:46 -05:00
Edgar Gabriel	b27a40cdf9	Merge pull request #5246 from edgargabriel/topic/ibm-testsuite-fixes Topic/ibm testsuite fixes	2018-06-08 06:06:49 -05:00
Yossi Itigin	fd12540751	Merge pull request #5227 from hoopoepg/topic/pml-ucx-hang-on-finalize PML/UCX: fixed hand on MPI_Finalize	2018-06-08 13:19:49 +03:00
Edgar Gabriel	a1484ec69a	io/ompio: check error conditions before executing file_sync check for pending I/O operations and invalid modes and return proper error codes before executing MPI_File_sync makes the e_sync_1 test from the ibm testsuite pass. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-07 19:30:27 -05:00
Edgar Gabriel	14bd114973	common/ompio: return error code from file_delete operation in file_close in case the user opened a file using the DELETE_ON_CLOSE flag, return the error code generated in the delete operation. Note, that this is however just a partial fix to the e_close_1 test from the ibm testsuite, since the object destructor that triggers the file_close function does not have a mechanism right now to recognize and return an error code. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2018-06-07 19:30:14 -05:00
Edgar Gabriel	f7cae7731c	io/ompio: return error code for invalid offset in file_get_byte_offset, return an error code if the offset leads to an invalid position in file. Makes the e_get_byte_offset_1 test from the ibm testsuite pass. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2018-06-07 18:46:17 -05:00
Edgar Gabriel	deaeaa60de	fcoll/vulcan: minor bugfix when creating the groups_per_proc arrays Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-07 17:52:32 -05:00
Edgar Gabriel	8feb497dbe	io/ompio: cleanup the aggregator selection logic and some internal structure elements/components. Along the way, add support for the cb_nodes Info object. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-07 16:47:10 -05:00
Edgar Gabriel	529d882ff0	io/ompio and common/ompio: relocate ompio_request code to common since the request code is now being accessed also from the vulcan fcoll component, the request code was relocated into the common/ompio directory to avoid ld load problems. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-07 16:13:12 -05:00
raafatfeki	5ecb4a56e3	fcoll/vulcan: Support of asynchronous write in collective writeAll We introduced a new mca_vulcan parameter that specify the I/O synchronization type (Async/sync I/O) applied within the collective write operation. The user can explicitly choose to use async or sync write operation or make the choice automatically made. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-06-07 16:13:12 -05:00
raafatfeki	4f7172ddf6	fcoll/vulcan: Support of larger offsets For very large offsets, the data chunk size to be written by each aggregator exceeds the capacity of an integer variable. Besides, some variables were not large enough to hold intermediate values. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-06-07 16:13:12 -05:00
raafatfeki	4670fe50d7	fcoll/vulcan: Remove unnecessary calls to write Identify the index of each aggregator process in order to restrict the call to write_init function by the specific aggregator. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-06-07 16:13:12 -05:00
raafatfeki	bc6431bee9	fcoll/vulcan: use hindexed constructor on the sender side Instead of using a temporary buffer and copy data into the temp buffer before sending, use a derived datatype to describe the data that needs to be sent during a cycle in the collective I/O operation. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-06-07 16:13:12 -05:00
Edgar Gabriel	1c2c110824	fcoll/vulcan: add new fcoll component import of the new vulcan component. It is an enhanced version of the two_phase component, which uses however the ompio internal codes/loops to assemble the data arrays. It is therefore more inline with the dynamic and dynamic_gen2 component, and will be easier to maintain. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-07 16:13:12 -05:00
Nathan Hjelm	63ded4d083	Merge pull request #5224 from benmenadue/master io/romio314: Replace deprecated MPI-1 functions	2018-06-06 15:41:53 -06:00
Ralph Castain	86d699d42e	Correct typo in name comparison flags Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-06 12:18:52 -07:00
Ralph Castain	840fb42f93	PMIx rte component does support dynamics Minor cleanups Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-05 21:55:19 -07:00
Nathan Hjelm	64a5baaa28	Merge pull request #5193 from hjelmn/osc_sm_location Use /dev/shm for shared memory files in osc components	2018-06-05 09:42:14 -06:00
Sergey Oblomov	0a8261f3b0	PML/UCX: fixed hand on MPI_Finalize fixes issue https://github.com/openucx/ucx/issues/2656 added flush for worker object to complete all pending operations Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-06-05 17:22:03 +03:00
Mikhail Kurnosov	3adf96fdb8	coll/base: add butterfly algorithm for MPI_Reduce_scatter Implements butterfly algorithm for MPI_Reduce_scatter. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-06-05 15:53:13 +07:00
Ben Menadue	34ec0bd8ab	Replace MPI_Type_extent with MPI_Type_get_extent in ROMIO. Signed-off-by: Ben Menadue <ben.menadue@nci.org.au>	2018-06-05 15:27:58 +10:00
Ben Menadue	756cc67221	Replace MPI_Address with MPI_Get_address in ROMIO. Signed-off-by: Ben Menadue <ben.menadue@nci.org.au>	2018-06-05 15:27:25 +10:00
Ralph Castain	3020b699f3	Merge pull request #5213 from rhc54/topic/rte Enable the PMIx ompi/rte component	2018-06-03 10:23:40 -07:00
Ralph Castain	55ac526a67	Enable the PMIx ompi/rte component Get the OMPI rte/pmix component working. This was tested using PRRTE as the RM, configuring OMPI using: * autogen --no-orte * with external libevent, external hwloc, and external PMIx master * configuring PMIx master with the same libevent and hwloc * execute the application using PRRTE's "prun" launcher, which has the same cmd line as ORTE's mpirun Note that PMIx master appears to have a bug in the event notification system that caches job termination events. Thus, the first execution runs fine, but subsequent executions cause an "abort" when the OMPI default error handler is invoked upon notification of the prior job's termination. Will work that separately. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 134cca9ac0de092d767999357573a31703f72292)	2018-06-03 07:25:12 -07:00
Jeff Squyres	35438ae9b5	mpi/finalized: revamp INITIALIZED/FINALIZED Per MPI-3.1:8.7.1 p361:11-13, it's valid for MPI_FINALIZED to be invoked during an attribute destruction callback (e.g., during the destruction of keyvals on MPI_COMM_SELF during the very beginning of MPI_FINALIZE). In such cases, MPI_FINALIZED must return "false". Prior to this commit, we hung in FINALIZED if it were invoked during a COMM_SELF attribute destruction callback in FINALIZE. See https://github.com/open-mpi/ompi/issues/5084. This commit converts the MPI_INITIALIZED / MPI_FINALIZED infrastructure to use a single enum (ompi_mpi_state, set atomically) to represent the state of MPI: - not initialized - init started - init completed - finalize started - finalize past COMM_SELF destruction - finalize completed The "finalize past COMM_SELF destruction" state is what allows us to return "false" from MPI_FINALIZED before COMM_SELF has been fully destroyed / all attribute callbacks have been invoked. Since this state is checked at nearly every MPI API call (to see if we're outside of the INIT/FINALIZE epoch), care was taken to use atomics to set the ompi_mpi_state value in ompi_mpi_init() and ompi_mpi_finalize(), but performance-critical code paths can simply read the variable without needing to use a slow call to an opal_atomic_*() function. Thanks to @AndrewGaspar for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-06-01 13:36:29 -07:00
Edgar Gabriel	52bd606294	fcoll/dynamic_gen2: make sure that intermediate variables can hold the offset for very large offsets, ome ariables used in the fcoll/dynamic_gen2 code base were under certain circumstances not large enough to hold intermediate values. This issue was more detected in the vulcan component but could happen in the dynamic_gen2 component as well. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-01 06:53:38 -05:00
Jeff Squyres	25f2d02c61	fcoll/dynamic_gen2: minor compiler warning stomp Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-30 10:08:19 -07:00
Nathan Hjelm	e9de42544e	osc/sm: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_sm_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:44:01 -06:00
Nathan Hjelm	d0d59b1d7d	osc/rdma: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_rdma_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:43:33 -06:00
Howard Pritchard	5b7c866f59	osc/pt2pt: disable when THREAD_MULTIPLE. Per discussion at https://github.com/open-mpi/ompi/issues/2614#issuecomment-392815654, do not allow for selection of the OSC PT2PT when creating an MPI RMA window when THREAD_MULTIPLE is active. Print a helpful message and return a not-supported error. Signed-off-by: Howard Pritchard <howardp@lanl.gov> Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit d0ffd660841623c02d1dfa3151e7f7afd3327698) Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-29 08:59:53 -07:00
Mikhail Kurnosov	28d5837dd9	coll: reduce_scatter_block: add butterfly algorithm Implements butterfly algorithm for MPI_Reduce_scatter_block. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-27 14:17:41 +07:00
Edgar Gabriel	6b03cee7f1	io/ompio: erroneous condition in selecting aggregator selection logic fix the logic in the decision which aggregator selection algorithm to use. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-05-24 15:52:19 -05:00
Brian Barrett	09e4c40ce9	mtl: remove MXM MTL Remove the MXM MTL, which has been deprecated in preference for the Yalla PML. This was discussed at the last developers meeting and somehow I ended up with the action item to do the removal. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-05-21 14:18:30 -07:00
Sergey Oblomov	5ec26914a6	PML/UCX: do not set offset on ordered data recv Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-21 19:40:07 +03:00
Sergey Oblomov	19607daa32	PML/UCX: create convertor clone instead of stack reset Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 16:39:13 +03:00
Sergey Oblomov	7c5de01c57	PML/UCX: reset converter stack on unordered messages Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 13:11:02 +03:00
George Bosilca	7191ea120c	Fix merge conflict related to function renaming. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-05-15 11:34:20 -04:00
bosilca	2ab628b92e	Merge pull request #5074 from bosilca/topic/remove_warnings Remove warnings identified by clang.	2018-05-15 11:15:23 -04:00
bosilca	d13b9a2e25	Merge pull request #5156 from ggouaillardet/topic/reduce_scatter_block coll: reduce_scatter_block: rename and MCA parameter description fix	2018-05-15 11:13:26 -04:00
Mikhail Kurnosov	82299a9c04	coll: reduce_scatter_block: add recursive halving algorithm Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-15 08:20:32 +07:00
Gilles Gouaillardet	ce7b3113f6	coll: reduce_scatter_block: rename and MCA parameter description fix - rename ompi_coll_base_reduce_scatter_block_basic to more self descriptive ompi_coll_base_reduce_scatter_block_basic_linear - fix the description of the coll_tuned_reduce_scatter_block_algorithm MCA param this fixes and documents previous open-mpi/ompi@0e8b35b615 MPI_Reduce_scatter_block used to be implemented by the coll/basic module only. A new algo (recursive doubling) was recently introduced and can be used via the coll/tuned module, but we never intended to make it the default algo. In order to "restore" the previous default, the initial algo was moved from coll/basic to coll/base, and is now used by default by coll/tuned. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-09 08:54:48 +09:00
Nathan Hjelm	cf585d725c	osc/rdma: fix SEGV will null origin in FOP in debug build Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-08 14:10:20 -06:00
Jeff Squyres	b39bbfb3c0	Merge pull request #5142 from mkurnosov/base-reduce-remove-warnings coll/base/reduce: remove warning identified by Coverity Scan	2018-05-07 15:49:56 -04:00
Gilles Gouaillardet	0e8b35b615	coll/tuned: use basic algo for reduce_scatter_block by default Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:44 +09:00
Gilles Gouaillardet	32095be0d6	coll/{base,basic}: move reduce_scatter_block from basic to base Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:38 +09:00
Mikhail Kurnosov	ba968e4490	coll/base/reduce: Remove warning identified by Coverity Scan Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-04 20:48:37 +07:00
Mikhail Kurnosov	8cf8553abd	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-03 07:28:32 +07:00
Nathan Hjelm	c22c485837	Merge pull request #5136 from hjelmn/mtl_fix mtl: reset ompi_mtl_base_selected_component on framework close	2018-05-02 15:48:54 -06:00

1 2 3 4 5 ...

6597 Коммитов