openmpi

Автор	SHA1	Сообщение	Дата
Pascal Deveze	025201b459	osc/portals4: set the initial value of req_status.MPI_ERROR to MPI_SUCCESS	2016-07-18 09:52:56 +02:00
Pascal Deveze	aa0d687a0a	osc/portals4: Display an ouput message if ompi_osc_portals4_get_dt() or ompi_osc_portals4_get_op() returns an error	2016-07-18 09:52:56 +02:00
Pascal Deveze	c4181909a4	osc/portals4: Be sure that the ME are operationnal (wait for the PTL_EVENT_LINK)	2016-07-18 09:52:56 +02:00
Pascal Deveze	e99e7d08ed	osc/portals4: For the ME, use the uid from PtlGetUid instead of PTL_UID_ANY	2016-07-18 09:52:56 +02:00
Pascal Deveze	56b36eeb7e	osc/portals4: Format of "target_disp" is OPAL_PTRDIFF_TYPE and %lu is the appropriate format to display it.	2016-07-18 09:52:55 +02:00
Pascal Deveze	a76566c754	osc/portals4: To allocate a PT, use REQ_OSC_TABLE_ID and test that the right ID is allocated	2016-07-18 09:52:55 +02:00
Edgar Gabriel	195ec89732	fcoll/base: mv coll_array functionis to fcoll base the coll_array functions are truly only used by the fcoll modules, so move them to fcoll/base. There is currently one exception to that rule (number of aggreagtors logic), but that function will be moved in a long term also to fcoll/base.	2016-07-14 08:41:14 -05:00
Edgar Gabriel	1f1504ebbb	remove some unused code	2016-07-14 08:41:14 -05:00
Joshua Ladd	06930a0423	Merge pull request #1840 from artpol84/yalla_perf_fix pml/yalla: fix yalla performance regression	2016-07-14 10:55:30 +03:00
Gilles Gouaillardet	c3c262b3a8	ompi/group: get rid of malloc(0) in ompi_group_intersection(...) Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#1866	2016-07-14 11:19:46 +09:00
Jeff Squyres	1bea2b2575	mpi.h: fix types of MPI_UNWEIGHTED and MPI_WEIGHTS_EMPTY Thanks to Lisandro Dalcin for reporting. Fixes open-mpi/ompi#1865. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-07-13 09:36:24 -04:00
Pascal Deveze	b87ed1ad4a	mtl/portals4: Display actual limits given by the portals4 PtlNIInit function	2016-07-12 15:07:31 +02:00
Pascal Deveze	f666b0d9aa	mtl/portals4: Allocate a PT with the PTL_PT_FLOWCTRL flag only if OMPI_MTL_PORTALS4_FLOW_CONTROL is set	2016-07-12 15:07:31 +02:00
Pascal Deveze	bed572cd6c	mtl/portals4: Unlink the ME first, then free the CT and at the end free the PT	2016-07-12 15:07:30 +02:00
Ralph Castain	0e433eaa78	Silence warning	2016-07-11 19:43:02 -07:00
Nathan Hjelm	b47208e909	osc/rdma: fix bug in CAS This commit fixes a bug in the RDMA compare-and-swap implementation that caused the origin value to always be written even if the compare should have failed. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-07-11 09:54:23 -06:00
Edgar Gabriel	c8b1c6cae1	Merge pull request #1856 from edgargabriel/pr/zero-size-iread-iwrite io/ompio: fix the request in case of a zero size write/read operation	2016-07-11 08:19:02 -05:00
Gilles Gouaillardet	14624506df	coll/libnbc: do not exchange data between roots in ompi_coll_libnbc_ireduce_scatter_inter() this is now useless since the scatter is done via the local communicator	2016-07-11 17:18:30 +09:00
Edgar Gabriel	3dd81e9e09	io/ompio: fix the request in case of a zero size write/read operation	2016-07-08 14:11:22 -05:00
Gilles Gouaillardet	a55d57406b	coll/base: fix non zero lower bound datatype handling in mca_coll_base_alltoallv_intra_basic_inplace()	2016-07-08 16:55:26 +09:00
Gilles Gouaillardet	7b8094aac1	coll/base: silence misc warning as reported by Coverity with CIDs 1363349-1363362 Offset temporary buffer when a non zero lower bound datatype is used. Thanks Hristo Iliev for the report (cherry picked from commit 0e393195d9f2373ffa9d59a240092f643117cd39)	2016-07-08 13:06:26 +09:00
Gilles Gouaillardet	678d08647b	coll/libnbc: various fixes - correctly handle non commutative operators - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent - revamp NBC_Sched_op so it takes two buffers and matches ompi_op_reduce semantic - various fix for inter communicators Thanks Yuki Matsumoto for the report	2016-07-07 15:55:49 +09:00
Gilles Gouaillardet	3e559a14a9	coll/inter: fix non standard ddt handling - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent Thanks Yuki Matsumoto for the report	2016-07-07 15:49:59 +09:00
Gilles Gouaillardet	488d037d51	coll/basic: fix non standard ddt handling - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent Thanks Yuki Matsumoto for the report	2016-07-07 15:49:53 +09:00
Gilles Gouaillardet	c06fb04a9a	coll/base: fix non zero lower bound ddt handling in ompi_coll_base_reduce_intra_basic_linear() Thanks Yuki Matsumoto for the report	2016-07-07 15:49:48 +09:00
Ralph Castain	ee56d9dc1a	Shorten the session directory name as some OS's are now providing unusually long temp directory names, causing us to overflow the sockaddr field	2016-07-05 14:59:50 -07:00
George Bosilca	eac5b3c668	Various cleanups in the monitoring PML.	2016-07-05 18:31:25 +02:00
George Bosilca	73972768f8	Remove an apparently useless function.	2016-07-05 18:30:11 +02:00
Artem Polyakov	a4ff9bef6d	fix #2	2016-07-05 14:38:35 +03:00
Artem Polyakov	bc973cad30	fix	2016-07-05 14:33:31 +03:00
Artem Polyakov	7d96f12fec	pml/yalla: fix yalla performance regression It was introduced in PR https://github.com/open-mpi/ompi/pull/1228 in particular in commit 041a6a9f53033a12d1cbf5c1af36cb16c7cdcc36. Original solution was using "flexible array member" called "mxm_base" to "fall-through" to the "mxm" send/recv member that located in the outer structure. After changing number of elements in "mxm_base" from 0 to 1 we actually allocating 2 mxm_req_base_t elements which leads to increased overal size and harms cache performance. It also brakes "mca_pml_yalla_check_request_state" function.	2016-07-05 10:52:48 +03:00
Josh Hursey	59bf1f0c41	Merge pull request #1836 from jjhursey/topic/coll-nbc-0-count-ireduce mpi/c: Add each check for count==0 in nonblocking reduce interface	2016-07-01 15:22:37 -05:00
Josh Hursey	9b4ed968a4	Merge pull request #1833 from jjhursey/topic/op-init-fix op: Add a default value for MPI_OP o_name	2016-07-01 15:22:15 -05:00
Joshua Hursey	0671e45de0	op: Add a default value for MPI_OP o_name	2016-07-01 13:46:01 -05:00
Joshua Hursey	96779f68e8	mpi/c: Add each check for count==0 in nonblocking reduce interface * Matches the blocking versions of these interfaces - `iallreduce.c` to match `allreduce.c` - `ireduce.c` to match `reduce.c` - `ireduce_scatter.c` to match `reduce_scatter.c` * Workaround for IMB-NBC benchmark, similar to the workaround in place for the IMB-MPI1 benchmark for the blocking collectives.	2016-07-01 13:45:30 -05:00
Joshua Hursey	0a09f8bc51	coll/hcoll: Protect module destruct when not fully initialized * If hcoll is given a negative priority, but not enabled=0 then the module is constructed, but then destructed before calling it's query(). So the previous pointers are not initialized. If we try to OBJ_RELEASE them in a debug build an assert will fire. This commit adds some protection against that and initializes the _module pointers to NULL.	2016-07-01 13:41:27 -05:00
Joshua Hursey	59f304b9e9	coll/base: neg. priority cleanup, verbose output improvements * Print a verbose message if the component was disqualified because of a negative priority. * If a disqualified component provided a module, release it. * Display list of selected components in priority order - During the process of volunteering collective functions for a communicator, print the component name and priority. This will cause the verbose messages to be displayed in reverse priority order (lowest priority first, up to highest). This is helpful when determining which collective components are active in which order for a given communicator. To see the messages you need the following MCA parameter set to 9 or higher: `-mca coll_base_verbose 9` * Adjust verbose for commonly needed verbose output from 10 to 9 to make it easier to access this information.	2016-07-01 13:41:27 -05:00
Nathan Hjelm	f38cc00df9	Merge pull request #1835 from hjelmn/thread_fix ompi/request: fix hang in ompi_request_wait_completion	2016-06-30 18:50:32 -06:00
Nathan Hjelm	445b79bba8	ompi/request: fix hang in ompi_request_wait_completion This commit fixes a hang reported by @nysal which happens when a request is completed after a sync object is created but before the sync object can be assigned to the request. In this case we need to set the sync signaling field to false to ensure WAIT_SYNC_RELEASE does not hang. Fixes open-mpi/ompi#1828 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-30 09:54:00 -06:00
Nathan Hjelm	5f390b5f5a	bml/r2: be more restrictive on rdma endpoints This commit makes bml/r2 more restrictive on which endpoints end up in the rdma endpoint list. Before this commit an endpoint was added if it supported either put or get. This was done to ensure that endpoints are available for RMA. Thought it is possible to support put or get endpoints we only currently support endpoints that have put, get, and amos. bml/r2 now reflects this support. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-06-29 18:54:58 -06:00
Artem Polyakov	732d89095b	MPI_Waitsome performance improvement by avoiding extra atomic exchanges. Use indices array to mark already completed connections in the pre-wait loop to avoid extra atomic exchanges in the after-wait loop.	2016-06-29 20:40:41 +06:00
Artem Polyakov	541715572f	Fix MPI_Waitany and MPI_Waitsome (request handling related)	2016-06-28 16:40:00 +03:00
Nathaniel Graham	bb9485bcd9	Fix Java Coverity issue Fixing a possible error that Coverity pointed out in ompi_java_exceptionCheck. Signed-off-by: Nathaniel Graham <nrgraham23@gmail.com>	2016-06-23 15:09:07 +02:00
Nathan Hjelm	143a93f379	opal/sync: remove usage of OPAL_ENABLE_MULTI_THREADS The OPAL_ENABLE_MULTI_THREADS macro is always defined as 1. This was causing us to always use the multi-thread path for synchronization objects. The code has been updated to use the opal_using_threads() function. When MPI_THREAD_MULTIPLE support is disabled at build time (2.x only) this function is a macro evaluating to false so the compiler will optimize out the MT-path in this case. The OPAL_ATOMIC_ADD_32 macro has been removed and replaced by the existing OPAL_THREAD_ADD32 macro. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-22 09:52:37 -06:00
Nathaniel Graham	679a66ccc8	Merge pull request #1803 from nrgraham23/jni_error_handling Java bindings exception handling fix	2016-06-22 04:14:00 -07:00
Nathan Hjelm	7bd7c0578b	Merge pull request #1807 from hjelmn/request_perfm_regression ompi/request: fix performance regression	2016-06-21 14:47:56 -06:00
Nathan Hjelm	544adb9aed	ompi/request: fix performance regression This commit fixes a performance regression introduced by the request rework. We were always using the multi-thread path because OPAL_ENABLE_MULTI_THREADS is either not defined or always defined to 1 depending on the Open MPI version. To fix this I removed the conditional and added a conditional on opal_using_threads(). This path will be optimized out in 2.0.0 in a non-thread-multiple build as opal_using_threads is #defined to false in that case. Fixes open-mpi/ompi#1806 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-21 11:45:32 -06:00
Nathan Hjelm	2409024c17	osc/rdma: fix typo Need to increment the total size after checking the local offset not before. This typo causes large allocations with MPI_Win_allocate() to fail. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-21 09:50:29 -06:00
Nathaniel Graham	88dea4e4de	Java bindings exception handling fix Fixed an error where if there were no MPI exceptions, a JNI error could still exist and not get handled. Signed-off-by: Nathaniel Graham <nrgraham23@gmail.com>	2016-06-21 12:40:31 +02:00
George Bosilca	9c4f56be4b	Fix the coll_base_sendrecv function.	2016-06-18 18:23:51 +02:00

... 2 3 4 5 6 ...

9218 Коммитов