openmpi

Author	SHA1	Message	Date
Gilles Gouaillardet	7cae36f5ab	ompi: accept MPI_IN_PLACE in MPI_Ialltoall*	2016-10-08 19:47:25 +09:00
Edgar Gabriel	bc042259bc	make initialization of the io framework thread safe. Also, remove the lock/unlock in the file_open ompi-interface routines of romio314. The global lock in the romio component probably does not work: it is easy to construct a testcase where two threads perform collective I/O operations on different file handles, and with a global lock it is easy to deadlock. The lock has to be at least on a per-file-handle basis. Move the mutex to file/file.c to avoid a duplicate symbol problem in file_open.c and pfile_open.c	2016-08-21 16:09:00 -05:00
Nathan Hjelm	7589a25377	osc/pt2pt: do not repost receive from request callback This commit fixes an issue that can occur if a target gets overwhelmed with requests. This can cause osc/pt2pt to go into deep recursion with a stack like req_complete_cb -> ompi_osc_pt2pt_callback -> start -> req_complete_cb -> ... . At small scale this is fine as the recursion depth stays small but at larger scale we can quickly exhaust the stack processing frag requests. To fix the issue the request callback now simply puts the request on a list and returns. The osc/pt2pt progress function then handles the processing and reposting of the request. As part of this change osc/pt2pt can now post multiple fragment receive requests per window. This should help prevent a target from being overwhelmed. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-08-11 15:33:07 -06:00
Nathan Hjelm	aac611237b	opal/thread: clean up and add additional OPAL_THREAD macros This commit expands the OPAL_THREAD macros to include 32- and 64-bit atomic swap. Additionally, macro declarations have been updated to include both OPAL_THREAD_* and OPAL_ATOMIC_*. Before this commit the former was used with add and the latter with cmpset. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-07-28 09:23:14 -06:00
Nathan Hjelm	035c2e2e2a	ompi/comm: refactor communicator cid code This commit simplifies the communicator context ID generation by removing the blocking code. The high level calls: ompi_comm_nextcid and ompi_comm_activate remain but now call the non-blocking variants and wait on the resulting request. This was done to remove the parallel paths for context ID generation in preparation for further improvements of the CID generation code. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-07-18 12:47:05 -06:00
Joshua Hursey	96779f68e8	mpi/c: Add each check for count==0 in nonblocking reduce interface * Matches the blocking versions of these interfaces - `iallreduce.c` to match `allreduce.c` - `ireduce.c` to match `reduce.c` - `ireduce_scatter.c` to match `reduce_scatter.c` * Workaround for IMB-NBC benchmark, similar to the workaround in place for the IMB-MPI1 benchmark for the blocking collectives.	2016-07-01 13:45:30 -05:00
Ralph Castain	5d330d5220	Enable the PMIx event notification capability and use that for all error notifications, including debugger release. This capability requires use of PMIx 2.0 or above as the features are not available with earlier PMIx releases. When OMPI master is built against an earlier external version, it will fallback to the prior behavior - i.e., debugger will be released via RML and all notifications will go strictly to the default error handler. Add PMIx 2.0 Remove PMIx 1.1.4 Cleanup copying of component Add missing file Touchup a typo in the Makefile.am Update the pmix ext114 component Minor cleanups and resync to master Update to latest PMIx 2.x Update to the PMIx event notification branch latest changes	2016-06-14 13:08:41 -07:00
Nathan Hjelm	e968ddfe64	start bug fixes (#1729 ) * mpi/start: fix bugs in cm and ob1 start functions There were several problems with the implementation of start in Open MPI: - There are no checks whatsoever on the state of the request(s) provided to MPI_Start/MPI_Start_all. It is erroneous to provide an active request to either of these calls. Since we are already looping over the provided requests there is little overhead in verifying that the request can be started. - Both ob1 and cm were always throwing away the request on the initial call to start and start_all with a particular request. Subsequent calls would see that the request was pml_complete and reuse it. This introduced a leak as the initial request was never freed. Since the only pml request that can be mpi complete but not pml complete is a buffered send the code to reallocate the request has been moved. To detect that a request is indeed mpi complete but not pml complete isend_init in both cm and ob1 now marks the new request as pml complete. - If a new request was needed the callbacks on the original request were not copied over to the new request. This can cause osc/pt2pt to hang as the incoming message callback is never called. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov> * osc/pt2pt: add request for gc after starting a new request Starting a new receive may cause a recursive call into the pt2pt frag receive function. If this happens and the prior request is on the garbage collection list it could cause problems. This commit moves the gc insert until after the new request has been posted. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-06-02 20:22:40 -04:00
bosilca	b90c83840f	Refactor the request completion (#1422 ) * Remodel the request. Added the wait sync primitive and integrate it into the PML and MTL infrastructure. The multi-threaded requests are now significantly less heavy and less noisy (only the threads associated with completed requests are signaled). * Fix the condition to release the request.	2016-05-24 18:20:51 -05:00
George Bosilca	0641005dab	Only check the parameters on valid dimensions.	2016-05-21 15:54:04 -04:00
Jeff Squyres	265e5b9795	Merge pull request #1552 from kmroz/wip-hostname-len-cleanup-1 ompi/opal/orte/oshmem/test: max hostname length cleanup	2016-05-02 09:44:18 -04:00
George Bosilca	6e6ed62a3c	Allow NULL arrays for empty datatypes. When building an empty datatype (aka. size = 0) because the count of included datatypes is 0, be less strict on what the arguments are (allow NULL pointers).	2016-05-01 12:37:02 -04:00
Karol Mroz	3322347da9	ompi: fixup hostname max length usage Signed-off-by: Karol Mroz <mroz.karol@gmail.com>	2016-04-25 07:08:23 +02:00
Nathan Hjelm	ae0ffbb67f	Merge pull request #1397 from hjelmn/enable_thread_multiple ompi: always enable MPI_THREAD_MULTIPLE support	2016-04-23 08:40:22 -06:00
KAWASHIMA Takahiro	eb5c31521b	mpi/c: Fix `MPI_IALLTOALLW` memchecker	2016-04-11 18:47:30 +09:00
KAWASHIMA Takahiro	1ced7f213c	mpi/c: Fix `IALLTOALL{V\|W}` + `MPI_IN_PLACE` param check `sendcounts`, `sdispls`, and `sendtype(s)` must be ignored if `MPI_IN_PLACE` is specified for `sendbuf`. This commit makes the param check code same as the blocking `ALLTOALL{V\|W}` function.	2016-04-11 18:34:11 +09:00
Gilles Gouaillardet	7b803ac557	MPI_Unpack: fix error code when insize <= 0 this fixes a regression from open-mpi/ompi@f2e33c725f	2016-04-06 09:47:21 +09:00
Gilles Gouaillardet	f2e33c725f	MPI_Unpack: fix return status this regression was previously introduced in open-mpi/ompi@221e6e2eab	2016-03-31 09:56:54 +09:00
Gilles Gouaillardet	5932287cef	datatype/[un]pack_external[_size]: move subroutines down to ompi/datatype so it can be directly used by test/datatype/external32	2016-03-30 13:01:33 +09:00
Gilles Gouaillardet	221e6e2eab	Add the datatype checks to the pack/unpack functions. The datatype must satisfy the same constraints as for the corresponding communication function (send for pack and recv for unpack).	2016-03-30 11:40:08 +09:00
Gilles Gouaillardet	a89f113507	mpi/c: add missing OPAL_CR_EXIT_LIBRARY() in [un]pack[_external]	2016-03-30 11:25:21 +09:00
Nathan Hjelm	d4afb16f5a	opal: rework mpool and rcache frameworks This commit rewrites both the mpool and rcache frameworks. Summary of changes: - Before this change a significant portion of the rcache functionality lived in mpool components. This meant that it was impossible to add a new memory pool to use with rdma networks (ugni, openib, etc) without duplicating the functionality of an existing mpool component. All the registration functionality has been removed from the mpool and placed in the rcache framework. - All registration cache mpools components (udreg, grdma, gpusm, rgpusm) have been changed to rcache components. rcaches are allocated and released in the same way mpool components were. - It is now valid to pass NULL as the resources argument when creating an rcache. At this time the gpusm and rgpusm components support this. All other rcache components require non-NULL resources. - A new mpool component has been added: hugepage. This component supports huge page allocations on linux. - Memory pools are now allocated using "hints". Each mpool component is queried with the hints and returns a priority. The current hints supported are NULL (uses posix_memalign/malloc), page_size=x (huge page mpool), and mpool=x. - The sm mpool has been moved to common/sm. This reflects that the sm mpool is specialized and not meant for any general allocations. This mpool may be moved back into the mpool framework if there is any objection. - The opal_free_list_init arguments have been updated. The unused0 argument is not used to pass in the registration cache module. The mpool registration flags are now rcache registration flags. - All components have been updated to make use of the new framework interfaces. As this commit makes significant changes to both the mpool and rcache frameworks both versions have been bumped to 3.0.0. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-03-14 10:50:41 -06:00
Nathan Hjelm	230d04327e	ompi: always enable MPI_THREAD_MULTIPLE support This commit removes the --with-mpi-thread-multiple option and forces MPI_THREAD_MULTIPLE support. This cleans up an abstraction violation in opal where OMPI_ENABLE_THREAD_MULTIPLE determines whether the opal_using_threads is meaningful. To reduce the performance hit on MPI_THREAD_SINGLE programs an OPAL_UNLIKELY is used for the check on opal_using_threads in OPAL_THREAD_* macros. This commit does not clean up the arguments to the various functions that take whether multi-threading support is enabled. That should be done at a later time. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-02-23 10:02:14 -07:00
George Bosilca	56425a5d48	Fix issue identified by Lisandro Dalcin regarding the lack of support for NULL value in MPI_Type_set_attr. Provides a fix for issue #1359.	2016-02-14 00:07:08 -05:00
Nathan Hjelm	064a67f5b9	Fix MPI_Get_address (MPI_BOTTOM, ...) Nowhere in the standard does it say that it is invalid to pass MPI_BOTTOM to MPI_Get_address yet we were returning an error. This commit removes the error check on NULL == location. Fixes open-mpi/ompi#1355. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-02-12 16:34:21 -07:00
Gilles Gouaillardet	2adbe273d6	mpi: have MPI_Wtick() return the period (and not the frequency) if OPAL_TIMER_CYCLE_NATIVE	2016-01-20 14:14:47 +09:00
Artem Polyakov	2abb2972ac	Fix Mellanox copyrights with respect to the following PRs: * https://github.com/open-mpi/ompi/pull/1184 * https://github.com/open-mpi/ompi/pull/1188 * https://github.com/open-mpi/ompi/pull/1197 * https://github.com/open-mpi/ompi/pull/1202 * https://github.com/open-mpi/ompi/pull/1210 * https://github.com/open-mpi/ompi/pull/1216 * https://github.com/open-mpi/ompi/pull/1236 * https://github.com/open-mpi/ompi/pull/1237 * https://github.com/open-mpi/ompi/pull/1248 * https://github.com/open-mpi/ompi/pull/1260 * https://github.com/open-mpi/ompi/pull/1264	2015-12-30 00:12:19 +06:00
George Bosilca	6e6fd14a19	Fix indentation.	2015-12-20 03:15:19 -05:00
Gilles Gouaillardet	f0df2a7b2b	ompi: silence CID 1343322	2015-12-15 13:33:43 +09:00
Nathan Hjelm	139799f3c4	Merge pull request #1202 from artpol84/alltoall_fix Fix MPI_Alltoall to support inter-communicators.	2015-12-14 14:33:23 -08:00
Nathan Hjelm	b7ba301310	Merge pull request #1165 from hjelmn/add_procs_group ompi/group: release ompi_proc_t's at group destruction	2015-12-14 13:53:42 -08:00
Ralph Castain	5e5adebf8e	Port the changes from #782 to the master. Not everything applies here as the code in the 1.10 series is a little different. In addition, we asked for a few changes (e.g., using MPI_ERR_ARG instead of "13") that are incorporated here. Thanks to @jsharpe for the PR	2015-12-12 12:40:34 -08:00
Artem Polyakov	25077fc5d9	Fix MPI_Alltoall to support inter-communicators. Remove excessive parameter check to avoid premature exit from the collective. MPI standard says: The type signature associated with sendcount, sendtype, at a process must be equal to the type signature associated with recvcount, recvtype at any other process. This implies that the amount of data sent must be equal to the amount of data received, pairwise between every pair of processes. In case of inter-communicator we have 2 groups of processes and the "left" group may call MPI_Alltoall(NULL, 0, MPI_INT, buf, 10, MPI_INT, comm, ...); and the right one: MPI_Alltoall(buf,10,MPI_INT, NULL, 0, MPI_INT, comm, ...); And it would be legal even though one of the groups will receive 0 bytes from the others. This was triggered by MPICH/coll test called icalltoall.	2015-12-11 08:50:34 +06:00
Gilles Gouaillardet	ef03bc726c	ompi: fix comment in ompi/mpi/c/Makefile.am Thanks Jeff for the review	2015-12-07 11:34:01 +09:00
Nathan Hjelm	5334d22a37	ompi/group: release ompi_proc_t's at group destruction This commit changes the way ompi_proc_t's are retained/released by ompi_group_t's. Before this change ompi_proc_t's were retained once for the group and then once for each retain of a group. This method adds unnecessary overhead (need to traverse the group list each time the group is retained) and causes problems when using an async add_procs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-11-30 23:03:47 -07:00
George Bosilca	4ac247b1da	Minor update to the validity checks for the alltoall collectives.	2015-10-24 15:25:28 -04:00
Nathan Hjelm	6ae57647ab	win: fix erroneous argument check When using dynamic memory windows the displacement becomes a pointer. Since the high bit may be set on valid pointers on some platforms the check for disp > 0 is invalid. This commit adds the window flavor to ompi_win_t and disables the displacement check when operating on dynamic memory windows. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-10-22 09:33:26 -06:00
Gilles Gouaillardet	a0782e1c7e	mpi: MPI_Neighbor_all* and MPI_Ineighbor_all* do not work with inter communicators (fail with MPI_ERR_COMM) or non process topologies communicators (fail with MPI_ERR_TOPOLOGY)	2015-10-21 16:21:19 +09:00
Gilles Gouaillardet	3b0b929883	ompi: MPI_IN_PLACE is not a valid argument of MPI_Neighbor_all* and MPI_Ineighbor_all*	2015-10-21 14:46:35 +09:00
Gilles Gouaillardet	256976a108	mpi: MPI_IN_PLACE is not a valid argument of MPI_All* and MPI_Iall* with an inter communicator	2015-10-21 14:46:28 +09:00
Gilles Gouaillardet	2bd77ed4f9	mpi: fail with MPI_ERR_INTERN if MPI_IN_PLACE is used with MPI_Ialltoall currently, MPI fails with MPI_ERR_ARG. This is counter intuitive since MPI_IN_PLACE is a legitimate parameter. MPI_IN_PLACE might not be correctly implemented by all the non blocking modules (libnbc, ...) so fail with MPI_ERR_INTERN for the time being.	2015-10-20 14:12:33 +09:00
Jeff Squyres	f5ad90c920	init/finalize: extensions Proposed extensions for Open MPI: - If MPI_INITIALIZED is invoked and MPI is only partially initialized, wait until MPI is fully initialized before returning. - If MPI_FINALIZED is invoked and MPI is only partially finalized, wait until MPI is fully finalized before returning. - If the ompi_mpix_allow_multi_init MCA param is true, allow MPI_INIT and MPI_INIT_THREAD to be invoked multiple times without error (MPI will be safely initialized only the first time it is invoked).	2015-10-15 12:39:15 -04:00
Jeff Squyres	ac25505e03	mpi: infrastructure to gracefully disable MPI dyn procs Add ompi_mpi_dynamics_disable() function to disable MPI dynamic process functionality (i.e., such that if MPI_COMM_SPAWN/etc. are invoked, you'll get a show_help error explaining that MPI dynamic process functionality is disabled in this environment -- instead of a potentially-cryptic network or hardware error). Fixes #984	2015-10-14 13:42:56 -07:00
Jeff Squyres	a4adee5329	dynamics: fix OPAL_CR_EXIT_LIBRARY() Noticed that these were wrong while working on a different pull request. Submit these fixes independent of other changes, just to keep things separated.	2015-10-13 10:57:33 -07:00
Gilles Gouaillardet	291a464efb	configury: remove the --enable-mpi-profiling option and directly call the PMPI_* symbols from C and Fortran bindings	2015-10-13 08:52:35 +09:00
Gilles Gouaillardet	53b952dc2b	oshmem: invoke the C PMPI_* subroutines instead of the MPI_* ones when profiling is built. This prevents oshmem subroutines from being wrapped twice by third party tools (e.g. once in oshmem and once in MPI) see discussion starting at http://www.open-mpi.org/community/lists/devel/2015/08/17842.php Thanks to Bert Wesarg for bringing this to our attention	2015-10-13 08:52:03 +09:00
Gilles Gouaillardet	16d65a2762	fortran/mpif-h: invoke the C PMPI_* subroutines instead of the MPI_* ones when profiling is built. This prevents Fortran subroutines from being wrapped twice by third party tools (e.g. once in Fortran and once in C) see discussion starting at http://www.open-mpi.org/community/lists/devel/2015/08/17842.php	2015-10-13 08:52:02 +09:00
Nathan Hjelm	6751409c32	ompi/win: save value of accumulate_ops info key on window Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-09-21 16:37:29 -06:00
Gilles Gouaillardet	fe351f6801	io: do not cast away the const modifier when this is not necessary update the io framework and mpi c bindings	2015-09-09 09:18:58 +09:00
Gilles Gouaillardet	e01bac962f	coll: do not cast away the const modifier when this is not necessary update the coll framework and mpi c bindings	2015-09-09 09:18:57 +09:00


500 Commits
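The osc/pt2pt entry above (7589a25377) describes replacing a recursive chain, req_complete_cb -> ompi_osc_pt2pt_callback -> start -> req_complete_cb, with a callback that only queues the request and a progress function that processes and reposts it. A minimal sketch of that general pattern follows. It is not the Open MPI code: `request_t`, `window_t`, `post_receive`, and `progress` are hypothetical names chosen for illustration, and the "network" is simulated by a counter that makes a posted receive complete immediately.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical request type; not the real osc/pt2pt structure. */
typedef struct request {
    struct request *next; /* link in the pending list */
    int remaining;        /* simulated load: completions still to come */
} request_t;

/* Hypothetical per-window state. */
typedef struct {
    request_t *head, *tail; /* FIFO of completed requests awaiting processing */
    int depth;              /* current nesting depth of the callback */
    int max_depth;          /* deepest callback nesting observed */
    int processed;          /* number of completions handled so far */
} window_t;

/* Completion callback: queue the request and return.
 * It does no processing and never reposts, so it cannot recurse. */
static void req_complete_cb(window_t *win, request_t *req)
{
    win->depth++;
    if (win->depth > win->max_depth) {
        win->max_depth = win->depth;
    }
    req->next = NULL;
    if (win->tail != NULL) {
        win->tail->next = req;
    } else {
        win->head = req;
    }
    win->tail = req;
    win->depth--;
}

/* Stand-in for posting a receive. While simulated load remains, the
 * receive completes at once and fires the completion callback. */
static void post_receive(window_t *win, request_t *req)
{
    assert(req != NULL);
    if (req->remaining > 0) {
        req->remaining--;
        req_complete_cb(win, req);
    }
}

/* Progress function: drain the pending list in a loop, handling each
 * completion and then reposting the receive. Returns the number handled. */
static int progress(window_t *win)
{
    int count = 0;
    while (win->head != NULL) {
        request_t *req = win->head;
        win->head = req->next;
        if (win->head == NULL) {
            win->tail = NULL;
        }
        win->processed++;        /* "process" the fragment */
        count++;
        post_receive(win, req);  /* repost outside the callback */
    }
    return count;
}
```

In this sketch the stack depth stays constant however many completions arrive, because reposting happens in the `progress` loop rather than inside the callback; a recursive variant would add one set of frames per completion.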