openmpi

Author	SHA1	Message	Date
Edgar Gabriel	aa7e852e44	common/ompio: files are only compiled in case MPI I/O is requested fixes: open-mpi/ompi#1932	2016-08-02 15:01:38 -05:00
George Bosilca	087761c2dc	Fix a warning and other small cleanups.	2016-08-02 17:33:53 +02:00
Edgar Gabriel	19fe5cac50	io/ompio: next step in code-reorganization - move the sort_iovec operations to fcoll/base - move set_view_internal to common/ompio - move set_file_default to common/ompio - remove io_ompio_sort, not used anymore.	2016-08-02 09:18:29 -05:00
KAWASHIMA Takahiro	722e898e8e	fortran: Correct the name of `MPI_INTEGER16`. The name of `MPI_INTEGER16` obtained using `MPI_TYPE_GET_NAME` from a Fortran program was incorrect (`MPI_INTEGER8` was obtained) when `INTEGER*16` is not supported by a compiler. This bug affects only the Fortran binding because `MPI_INTEGER16` is not defined in `mpi.h` if a compiler does not support it.	2016-08-02 22:37:58 +09:00
KAWASHIMA Takahiro	5383003eab	fortran: Add missing predefined datatype named constants. This commit adds the following Fortran named constants which are defined in the MPI standard but are missing in Open MPI. - `MPI_LONG_LONG` (defined as a synonym of `MPI_LONG_LONG_INT`) - `MPI_CXX_FLOAT_COMPLEX` - `MPI_C_BOOL` And this commit also changes the value of the following Fortran named constant for consistency. - `MPI_C_COMPLEX` (`MPI_C_FLOAT_COMPLEX` is defined as a synonym of this) Each needs a different solution described below. For `MPI_LONG_LONG`: The value of `MPI_LONG_LONG` is defined to have the same value as `MPI_LONG_LONG_INT` for the following reasons. 1. It is defined as a synonym of `MPI_LONG_LONG_INT` in the MPI standard. 2. `MPI_LONG_LONG_INT` and `MPI_LONG_LONG` have the same value for C in `mpi.h`. 3. `ompi_mpi_long_long` is not defined in `ompi/datatype/ompi_datatype_module.c`. For `MPI_CXX_FLOAT_COMPLEX`: Existing `MPI_CXX_COMPLEX` is replaced with `MPI_CXX_FLOAT_COMPLEX` because `MPI_CXX_FLOAT_COMPLEX` is the right name defined in MPI-3.1 and `MPI_CXX_COMPLEX` is not defined in MPI-3.1 (nor older). But for compatibility, `MPI_CXX_COMPLEX` is treated as a synonym of `MPI_CXX_FLOAT_COMPLEX` on Open MPI. For `MPI_C_BOOL`: `MPI_C_BOOL` is newly added. The value which `MPI_C_COMPLEX` had used (68) is assigned to it because the value is no longer in use (described later) and it is a suitable position for a datatype added in MPI-2.2. For `MPI_C_COMPLEX`: Existing `MPI_C_FLOAT_COMPLEX` is replaced with `MPI_C_COMPLEX` and `MPI_C_FLOAT_COMPLEX` is changed to have the same value. In other words, make `MPI_C_COMPLEX` the canonical name and make `MPI_C_FLOAT_COMPLEX` an alias of it. This is because the relation of these datatypes is the same as the relation of `MPI_LONG_LONG_INT` and `MPI_LONG_LONG`, and `MPI_LONG_LONG_INT` and `MPI_LONG_LONG` are implemented like that. 
But in the datatype engine, we use `ompi_mpi_c_float_complex` instead of `ompi_mpi_c_complex` as a variable name to keep the consistency with the other similar types such as `ompi_mpi_c_double_complex` (see George's comment in open-mpi/ompi#1927). We don't delete `ompi_mpi_c_complex` now because it is used in some other places in Open MPI code. It may be cleaned up in the future. In addition, `MPI_CXX_COMPLEX`, which was defined only in the Open MPI Fortran binding, is added to `mpi.h` for the C binding. This commit breaks binary compatibility of Fortran `MPI_C_COMPLEX`. When this commit is merged into the v2.x branch, the change of `MPI_C_COMPLEX` should be excluded.	2016-08-02 22:36:41 +09:00
Gilles Gouaillardet	917d96ba50	coll/libnbc: cleanup handling of the second temporary buffer in ireduce	2016-08-02 16:32:15 +09:00
Gilles Gouaillardet	ed9139ca13	coll/libnbc: correctly handle datatype alignment when allocating two buffers at once	2016-08-02 15:44:12 +09:00
KAWASHIMA Takahiro	0cb5dfe18d	fortran: Correct predefined datatype named constants.	2016-08-02 13:12:22 +09:00
KAWASHIMA Takahiro	ad3b590172	fortran: Add missing `MPI_NO_OP` and `MPI_WIN_*` named constants.	2016-08-02 13:10:44 +09:00
Edgar Gabriel	c0bd8728fd	io/ompio: move aggregator selection code to a separate file - move all functions related to aggregator selection to a single file - perform code cleanup fixing many Coverity complaints along the way.	2016-08-01 14:04:27 -05:00
Jeff Squyres	50952c3a31	Merge pull request #1912 from rivis/pr/mpisync mpisync: Fix a compilation error.	2016-07-29 14:53:44 -04:00
Edgar Gabriel	160d9a78c1	Merge pull request #1886 from edgargabriel/pr/ompio-reorg io/ompio: move io/ompio functionality to common/ompio	2016-07-29 12:24:21 -05:00
Joshua Ladd	4a03a657c6	Merge pull request #1913 from vspetrov/hcoll_derived_datatypes coll/hcoll mpi datatypes support	2016-07-29 10:08:23 -04:00
Nathan Hjelm	1da558407c	Merge pull request #1911 from hjelmn/threads opal/thread: clean up and add additional OPAL_THREAD macros	2016-07-29 06:44:11 -06:00
Valentin Petrov	3582bba6b7	coll/hcoll mpi datatypes support	2016-07-29 10:06:39 +03:00
Gilles Gouaillardet	9f3e1a0620	Merge pull request #1898 from ggouaillardet/topic/poc_configury_cli configury: capture configury command line	2016-07-29 11:55:21 +09:00
Howard Pritchard	5ff6b81eee	Merge pull request #1871 from hppritcha/topic/ofi_mtl_params mtl/ofi: add some more mca parameters	2016-07-28 18:21:23 -06:00
Gilles Gouaillardet	273e56096b	configury: capture configury command line configury command line is quoted and made available via the OPAL_CONFIGURE_CLI macro. it can be retrieved via {orte-info,ompi_info,oshmem_info} -c, or {orte-info,ompi_info,oshmem_info} --all --parseable | grep ^config:cli:	2016-07-29 09:14:09 +09:00
Ralph Castain	cacb582ecd	Support timeout values when performing connect/accept operations. Bump default timeout to 10 minutes so folks have time to start the partnering application	2016-07-28 14:09:06 -07:00
KAWASHIMA Takahiro	2a932f48ad	mpisync: Fix a compilation error. This commit fixes the undefined `OPAL_MAXHOSTNAMELEN` error which arises only when `--enable-timing` is specified for `configure`. This bug exists only in master branch because the commit `3322347` is not merged into other branches.	2016-07-29 02:38:25 +09:00
Nathan Hjelm	aac611237b	opal/thread: clean up and add additional OPAL_THREAD macros This commit expands the OPAL_THREAD macros to include 32- and 64-bit atomic swap. Additionally, macro declarations have been updated to include both OPAL_THREAD_* and OPAL_ATOMIC_*. Before this commit the former was used with add and the latter with cmpset. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-07-28 09:23:14 -06:00
Howard Pritchard	22c8743557	mtl/ofi: add some more mca parameters allow for toggling of both control/data progress models. allow for using FI_AV_TABLE or FI_AV_MAP for av type. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2016-07-28 02:35:09 -06:00
Gilles Gouaillardet	a0a999e63d	coll/base: fix ompi_coll_base_allgatherv_intra_basic_default() with MPI_IN_PLACE	2016-07-28 13:57:18 +09:00
Gilles Gouaillardet	b8a1ffb87e	coll/base: fix ompi_coll_base_allgatherv_intra_basic_default() Fixes open-mpi/ompi#1907	2016-07-28 13:50:04 +09:00
Jeff Squyres	2e0c3c7d77	libompitrace: explicitly set the .so version Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-07-27 07:05:58 -04:00
Pascal Deveze	10763f5abc	mtl/portals4: Take into account the limitation of portals4 (max_msg_size) and split messages if necessary	2016-07-26 08:44:07 +02:00
Pascal Deveze	724801b018	mtl-portals4: Introduce a "short_limit" for the short message size. "eager_limit" will only be used for the limit of the eager part of the messages sent with the rndv protocol	2016-07-26 08:43:24 +02:00
Pascal Deveze	9e58b4842f	mtl-portals4: Correct how the request_status._ucount is set	2016-07-26 08:42:48 +02:00
Pascal Deveze	3ca194f10a	mtl-portals4: Store ptl_process_id (from PtlGetPhysId) and display it.	2016-07-26 08:42:08 +02:00
Pascal Deveze	bd3b1cf7be	mtl-portals4: Check that flowctl_idx is equal to REQ_FLOWCTL_TABLE_ID and use OPAL_ATOMIC_CMPSET_32 to test and set flowctl_active flag to true	2016-07-26 08:41:31 +02:00
Ralph Castain	9ab20cafe3	Pass the nodeid for each proc in the job. Fix a mistaken error output message	2016-07-25 15:41:15 -07:00
Gilles Gouaillardet	bbc6d4b3d4	ompi/communicator: remove another debug print statement in ompi_comm_allreduce_intra_pmix_nb()	2016-07-22 15:42:56 +09:00
Edgar Gabriel	b0fa1fd2a1	move the internal file_open/close functions to common/ompio	2016-07-21 13:08:32 -05:00
Edgar Gabriel	ccf76b7791	moving the internal read/write functions to common/ompio and update all fs/fcoll/sharedfp components to use these functions.	2016-07-21 13:08:32 -05:00
Edgar Gabriel	688710d408	make common/ompio compile	2016-07-21 13:08:32 -05:00
Edgar Gabriel	39ae93b87b	modify the fcoll components to use the common/ompio print queues	2016-07-21 13:08:32 -05:00
Edgar Gabriel	fe17410943	next step in making the print_queue functionality move to common/ompio	2016-07-21 13:08:32 -05:00
Edgar Gabriel	af67c8f239	first cut on moving some ompio functionality to common/ompio	2016-07-21 13:08:32 -05:00
Edgar Gabriel	a899c0fb38	fcoll/static: fix Coverity warnings (CID 72144, CID 710677, CID 1364164)	2016-07-21 13:08:15 -05:00
Pascal Deveze	a7e3de6c4f	coll-portals4: No more messages passed to Portals4 bigger than the limit given by PtlNIInit	2016-07-21 15:58:20 +02:00
Pascal Deveze	175e6aa385	coll-portals4: Before calling PtlCTWait, call PtlTriggeredInc twice to be sure all pending PtlTriggeredPut are triggered	2016-07-21 15:58:20 +02:00
Pascal Deveze	df59d6cdd4	coll-portals4: Correct and simplify how the data are cut in segment_nb segments (bcast)	2016-07-21 15:58:09 +02:00
Pascal Deveze	274f8d608c	coll-portals4: Change output format and change variable names (minor changes).	2016-07-21 11:06:45 +02:00
Todd Kordenbrock	37ad6aa711	Merge pull request #1853 from PDeveze/Patchs-on-osc-portals4 Patches on osc portals4	2016-07-20 09:22:19 -05:00
Todd Kordenbrock	210534adb3	Merge pull request #1850 from PDeveze/Patchs-on-mtl-portals4 Patches on mtl portals4	2016-07-20 08:21:03 -05:00
rhc54	4bc5048608	Merge pull request #1888 from rhc54/topic/pmixup Update pmix2 component	2016-07-20 06:14:05 -07:00
Ralph Castain	01a653d50a	Remove a debug print in comm_cid.c. Update PMIx2 to include the revised PMIx_Get logic for higher performance by reducing the number of hash table lookups. Fix a bug where requests for data from a proc in another nspace could hang, or result in "not found". Remove stale file reference Restore autogen pass thru pmix Remove generated file	2016-07-20 00:58:19 -07:00
Gilles Gouaillardet	252fadf099	ompi: fix #if vs #ifdef HAVE___MALLOC_INITIALIZE_HOOK usage	2016-07-20 13:18:11 +09:00
Ralph Castain	36a9063466	Silence warnings	2016-07-19 17:36:13 -07:00
Nathan Hjelm	40f71f2d7a	Merge pull request #1873 from hjelmn/comm_split_update Improve MPI_Comm_split_type scalability	2016-07-19 14:36:44 -06:00
Nathan Hjelm	5edab9cb22	Merge pull request #1855 from hjelmn/comm_rework ompi/comm: refactor communicator cid code	2016-07-19 10:04:17 -06:00
Pascal Deveze	9cac32ba6a	mtl/portals4: Modifications concerning the short message management	2016-07-19 11:21:50 +02:00
Pascal Deveze	49e9936914	mtl/portals4: Some little patches	2016-07-19 11:18:55 +02:00
Nathan Hjelm	ced853476f	Merge pull request #1878 from hjelmn/f_rops ompi/fortran: fix typos in request RMA bindings	2016-07-18 13:48:41 -06:00
Nathan Hjelm	8bdcb40dc4	ompi/fortran: fix typos in request RMA bindings This commit fixes typos on the C side of the request-based RMA binding. We were not returning the request on success but on failure. Thanks to @alazzaro for reporting and @ggouaillardet, and @vondele for tracking this down. Fixes part of open-mpi/ompi#1869 Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-07-18 13:46:28 -06:00
Nathan Hjelm	4c49c42dd0	ompi/comm: improve comm_split_type scalability This commit introduces a new algorithm for MPI_Comm_split_type. The old algorithm performed an allgather on the communicator to decide which processes were part of the new communicators. This does not scale well in either time or memory. The new algorithm performs a couple of all reductions to determine the global parameters of the MPI_Comm_split_type call. If any rank gives an inconsistent split_type (as defined by the standard) an error is returned without proceeding further. The algorithm then creates a communicator with all the ranks that match the split_type (no communication required) in the same order as the original communicator. It then does an allgather on the new communicator (which should be much smaller) to determine 1) if the new communicator is in the correct order, and 2) if any ranks in the new communicator supplied MPI_UNDEFINED as the split_type. If either of these conditions is detected the new communicator is split using ompi_comm_split and the intermediate communicator is freed. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-07-18 12:47:05 -06:00
Nathan Hjelm	035c2e2e2a	ompi/comm: refactor communicator cid code This commit simplifies the communicator context ID generation by removing the blocking code. The high level calls: ompi_comm_nextcid and ompi_comm_activate remain but now call the non-blocking variants and wait on the resulting request. This was done to remove the parallel paths for context ID generation in preparation for further improvements of the CID generation code. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-07-18 12:47:05 -06:00
Pascal Deveze	f19a2b961c	osc/portals4: Correct an error in an if statement	2016-07-18 13:16:12 +02:00
Pascal Deveze	81823d7a63	osc/portals4: Store the no_locks parameter in osc_portals4_component.no_locks	2016-07-18 11:51:52 +02:00
Pascal Deveze	76b38651da	osc/portals4: For the contiguous datatype, take into account the lower bound before calling portals4	2016-07-18 11:20:50 +02:00
Pascal Deveze	7aaf16e7fe	osc/portals4: Put/Get splitting because Portals4 may restrict sizes	2016-07-18 10:49:28 +02:00
Pascal Deveze	025201b459	osc/portals4: set the initial value of req_status.MPI_ERROR to MPI_SUCCESS	2016-07-18 09:52:56 +02:00
Pascal Deveze	aa0d687a0a	osc/portals4: Display an output message if ompi_osc_portals4_get_dt() or ompi_osc_portals4_get_op() returns an error	2016-07-18 09:52:56 +02:00
Pascal Deveze	c4181909a4	osc/portals4: Be sure that the ME are operational (wait for the PTL_EVENT_LINK)	2016-07-18 09:52:56 +02:00
Pascal Deveze	e99e7d08ed	osc/portals4: For the ME, use the uid from PtlGetUid instead of PTL_UID_ANY	2016-07-18 09:52:56 +02:00
Pascal Deveze	56b36eeb7e	osc/portals4: Format of "target_disp" is OPAL_PTRDIFF_TYPE and %lu is the appropriate format to display it.	2016-07-18 09:52:55 +02:00
Pascal Deveze	a76566c754	osc/portals4: To allocate a PT, use REQ_OSC_TABLE_ID and test that the right ID is allocated	2016-07-18 09:52:55 +02:00
Edgar Gabriel	195ec89732	fcoll/base: mv coll_array functions to fcoll base the coll_array functions are truly only used by the fcoll modules, so move them to fcoll/base. There is currently one exception to that rule (number of aggregators logic), but that function will also be moved to fcoll/base in the long term.	2016-07-14 08:41:14 -05:00
Edgar Gabriel	1f1504ebbb	remove some unused code	2016-07-14 08:41:14 -05:00
Joshua Ladd	06930a0423	Merge pull request #1840 from artpol84/yalla_perf_fix pml/yalla: fix yalla performance regression	2016-07-14 10:55:30 +03:00
Gilles Gouaillardet	c3c262b3a8	ompi/group: get rid of malloc(0) in ompi_group_intersection(...) Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#1866	2016-07-14 11:19:46 +09:00
Jeff Squyres	1bea2b2575	mpi.h: fix types of MPI_UNWEIGHTED and MPI_WEIGHTS_EMPTY Thanks to Lisandro Dalcin for reporting. Fixes open-mpi/ompi#1865. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-07-13 09:36:24 -04:00
Pascal Deveze	b87ed1ad4a	mtl/portals4: Display actual limits given by the portals4 PtlNIInit function	2016-07-12 15:07:31 +02:00
Pascal Deveze	f666b0d9aa	mtl/portals4: Allocate a PT with the PTL_PT_FLOWCTRL flag only if OMPI_MTL_PORTALS4_FLOW_CONTROL is set	2016-07-12 15:07:31 +02:00
Pascal Deveze	bed572cd6c	mtl/portals4: Unlink the ME first, then free the CT and at the end free the PT	2016-07-12 15:07:30 +02:00
Ralph Castain	0e433eaa78	Silence warning	2016-07-11 19:43:02 -07:00
Nathan Hjelm	b47208e909	osc/rdma: fix bug in CAS This commit fixes a bug in the RDMA compare-and-swap implementation that caused the origin value to always be written even if the compare should have failed. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-07-11 09:54:23 -06:00
Edgar Gabriel	c8b1c6cae1	Merge pull request #1856 from edgargabriel/pr/zero-size-iread-iwrite io/ompio: fix the request in case of a zero size write/read operation	2016-07-11 08:19:02 -05:00
Gilles Gouaillardet	14624506df	coll/libnbc: do not exchange data between roots in ompi_coll_libnbc_ireduce_scatter_inter() this is now useless since the scatter is done via the local communicator	2016-07-11 17:18:30 +09:00
Edgar Gabriel	3dd81e9e09	io/ompio: fix the request in case of a zero size write/read operation	2016-07-08 14:11:22 -05:00
Gilles Gouaillardet	a55d57406b	coll/base: fix non zero lower bound datatype handling in mca_coll_base_alltoallv_intra_basic_inplace()	2016-07-08 16:55:26 +09:00
Gilles Gouaillardet	7b8094aac1	coll/base: silence misc warning as reported by Coverity with CIDs 1363349-1363362 Offset temporary buffer when a non zero lower bound datatype is used. Thanks Hristo Iliev for the report (cherry picked from commit `0e393195d9`)	2016-07-08 13:06:26 +09:00
Gilles Gouaillardet	678d08647b	coll/libnbc: various fixes - correctly handle non commutative operators - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent - revamp NBC_Sched_op so it takes two buffers and matches ompi_op_reduce semantic - various fixes for inter communicators Thanks Yuki Matsumoto for the report	2016-07-07 15:55:49 +09:00
Gilles Gouaillardet	3e559a14a9	coll/inter: fix non standard ddt handling - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent Thanks Yuki Matsumoto for the report	2016-07-07 15:49:59 +09:00
Gilles Gouaillardet	488d037d51	coll/basic: fix non standard ddt handling - correctly handle non zero lower bound ddt - correctly handle ddt with size > extent Thanks Yuki Matsumoto for the report	2016-07-07 15:49:53 +09:00
Gilles Gouaillardet	c06fb04a9a	coll/base: fix non zero lower bound ddt handling in ompi_coll_base_reduce_intra_basic_linear() Thanks Yuki Matsumoto for the report	2016-07-07 15:49:48 +09:00
Ralph Castain	ee56d9dc1a	Shorten the session directory name as some OS's are now providing unusually long temp directory names, causing us to overflow the sockaddr field	2016-07-05 14:59:50 -07:00
George Bosilca	eac5b3c668	Various cleanups in the monitoring PML.	2016-07-05 18:31:25 +02:00
George Bosilca	73972768f8	Remove an apparently useless function.	2016-07-05 18:30:11 +02:00
Artem Polyakov	a4ff9bef6d	fix #2	2016-07-05 14:38:35 +03:00
Artem Polyakov	bc973cad30	fix	2016-07-05 14:33:31 +03:00
Artem Polyakov	7d96f12fec	pml/yalla: fix yalla performance regression It was introduced in PR https://github.com/open-mpi/ompi/pull/1228 in particular in commit `041a6a9f53`. Original solution was using a "flexible array member" called "mxm_base" to "fall-through" to the "mxm" send/recv member that is located in the outer structure. After changing the number of elements in "mxm_base" from 0 to 1 we are actually allocating 2 mxm_req_base_t elements, which leads to increased overall size and harms cache performance. It also breaks the "mca_pml_yalla_check_request_state" function.	2016-07-05 10:52:48 +03:00
Josh Hursey	59bf1f0c41	Merge pull request #1836 from jjhursey/topic/coll-nbc-0-count-ireduce mpi/c: Add each check for count==0 in nonblocking reduce interface	2016-07-01 15:22:37 -05:00
Josh Hursey	9b4ed968a4	Merge pull request #1833 from jjhursey/topic/op-init-fix op: Add a default value for MPI_OP o_name	2016-07-01 15:22:15 -05:00
Joshua Hursey	0671e45de0	op: Add a default value for MPI_OP o_name	2016-07-01 13:46:01 -05:00
Joshua Hursey	96779f68e8	mpi/c: Add each check for count==0 in nonblocking reduce interface * Matches the blocking versions of these interfaces - `iallreduce.c` to match `allreduce.c` - `ireduce.c` to match `reduce.c` - `ireduce_scatter.c` to match `reduce_scatter.c` * Workaround for IMB-NBC benchmark, similar to the workaround in place for the IMB-MPI1 benchmark for the blocking collectives.	2016-07-01 13:45:30 -05:00
Joshua Hursey	0a09f8bc51	coll/hcoll: Protect module destruct when not fully initialized * If hcoll is given a negative priority, but not enabled=0 then the module is constructed, but then destructed before calling its query(). So the previous pointers are not initialized. If we try to OBJ_RELEASE them in a debug build an assert will fire. This commit adds some protection against that and initializes the _module pointers to NULL.	2016-07-01 13:41:27 -05:00
Joshua Hursey	59f304b9e9	coll/base: neg. priority cleanup, verbose output improvements * Print a verbose message if the component was disqualified because of a negative priority. * If a disqualified component provided a module, release it. * Display list of selected components in priority order - During the process of volunteering collective functions for a communicator, print the component name and priority. This will cause the verbose messages to be displayed in reverse priority order (lowest priority first, up to highest). This is helpful when determining which collective components are active in which order for a given communicator. To see the messages you need the following MCA parameter set to 9 or higher: `-mca coll_base_verbose 9` * Adjust verbose for commonly needed verbose output from 10 to 9 to make it easier to access this information.	2016-07-01 13:41:27 -05:00
Nathan Hjelm	f38cc00df9	Merge pull request #1835 from hjelmn/thread_fix ompi/request: fix hang in ompi_request_wait_completion	2016-06-30 18:50:32 -06:00
Nathan Hjelm	445b79bba8	ompi/request: fix hang in ompi_request_wait_completion This commit fixes a hang reported by @nysal which happens when a request is completed after a sync object is created but before the sync object can be assigned to the request. In this case we need to set the sync signaling field to false to ensure WAIT_SYNC_RELEASE does not hang. Fixes open-mpi/ompi#1828 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-30 09:54:00 -06:00
Nathan Hjelm	5f390b5f5a	bml/r2: be more restrictive on rdma endpoints This commit makes bml/r2 more restrictive on which endpoints end up in the rdma endpoint list. Before this commit an endpoint was added if it supported either put or get. This was done to ensure that endpoints are available for RMA. Though it is possible to support put or get endpoints we only currently support endpoints that have put, get, and amos. bml/r2 now reflects this support. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-06-29 18:54:58 -06:00
Artem Polyakov	732d89095b	MPI_Waitsome performance improvement by avoiding extra atomic exchanges. Use indices array to mark already completed connections in the pre-wait loop to avoid extra atomic exchanges in the after-wait loop.	2016-06-29 20:40:41 +06:00
Artem Polyakov	541715572f	Fix MPI_Waitany and MPI_Waitsome (request handling related)	2016-06-28 16:40:00 +03:00
Nathaniel Graham	bb9485bcd9	Fix Java Coverity issue Fixing a possible error that Coverity pointed out in ompi_java_exceptionCheck. Signed-off-by: Nathaniel Graham <nrgraham23@gmail.com>	2016-06-23 15:09:07 +02:00
Nathan Hjelm	143a93f379	opal/sync: remove usage of OPAL_ENABLE_MULTI_THREADS The OPAL_ENABLE_MULTI_THREADS macro is always defined as 1. This was causing us to always use the multi-thread path for synchronization objects. The code has been updated to use the opal_using_threads() function. When MPI_THREAD_MULTIPLE support is disabled at build time (2.x only) this function is a macro evaluating to false so the compiler will optimize out the MT-path in this case. The OPAL_ATOMIC_ADD_32 macro has been removed and replaced by the existing OPAL_THREAD_ADD32 macro. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-22 09:52:37 -06:00
Nathaniel Graham	679a66ccc8	Merge pull request #1803 from nrgraham23/jni_error_handling Java bindings exception handling fix	2016-06-22 04:14:00 -07:00
Nathan Hjelm	7bd7c0578b	Merge pull request #1807 from hjelmn/request_perfm_regression ompi/request: fix performance regression	2016-06-21 14:47:56 -06:00
Nathan Hjelm	544adb9aed	ompi/request: fix performance regression This commit fixes a performance regression introduced by the request rework. We were always using the multi-thread path because OPAL_ENABLE_MULTI_THREADS is either not defined or always defined to 1 depending on the Open MPI version. To fix this I removed the conditional and added a conditional on opal_using_threads(). This path will be optimized out in 2.0.0 in a non-thread-multiple build as opal_using_threads is #defined to false in that case. Fixes open-mpi/ompi#1806 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-21 11:45:32 -06:00
Nathan Hjelm	2409024c17	osc/rdma: fix typo Need to increment the total size after checking the local offset not before. This typo causes large allocations with MPI_Win_allocate() to fail. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-21 09:50:29 -06:00
Nathaniel Graham	88dea4e4de	Java bindings exception handling fix Fixed an error where if there were no MPI exceptions, a JNI error could still exist and not get handled. Signed-off-by: Nathaniel Graham <nrgraham23@gmail.com>	2016-06-21 12:40:31 +02:00
George Bosilca	9c4f56be4b	Fix the coll_base_sendrecv function.	2016-06-18 18:23:51 +02:00
Nathan Hjelm	3a69b727a6	Merge pull request #1788 from hjelmn/split_type comm/split_type: allow MPI_UNDEFINED for split_type	2016-06-16 21:12:25 -06:00
Nathan Hjelm	65be935676	comm/split_type: allow MPI_UNDEFINED for split_type It is valid for any rank to deviate on the split_type argument if they specify MPI_UNDEFINED. The code was incorrectly not allowing this condition. Changed the split_type uniformity check and allow local_size to be 0 if the local split_type is MPI_UNDEFINED. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-16 17:42:28 -06:00
rhc54	702a982271	Merge pull request #1767 from rhc54/topic/pmix2 Enable the PMIx event notification capability	2016-06-16 15:27:43 -07:00
Nathan Hjelm	e135543cb0	Merge pull request #1785 from hjelmn/malloc_hook_fix opal/memory: disable __malloc_initialize_hook if poisoned	2016-06-15 14:55:44 -06:00
Nathan Hjelm	7018aeda2b	opal/memory: disable __malloc_initialize_hook if poisoned Newer versions of gcc have "poisoned" the __malloc_initialize_hook name and it can no longer be used. Added a configure check and protection around its usage. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-15 12:00:49 -06:00
KAWASHIMA Takahiro	dff6accec6	ompi/datatype: Fix args of DARRAY According to MPI-3.1 P.122, `ni` for `MPI_COMBINER_DARRAY` should be `4ndims+4`, not `4size+4`. This bug may cause SEGV if `size` is smaller than `ndims` when the darray is used for one-sided communication (pt2pt OSC). This bug was introduced in open-mpi/ompi@79b13f36 (when darray became a first class citizen and the `a_i` index of darray was shifted by 2). The corresponding `MPI_Type_create_darray()` function sets the right value so we don't need to update the function.	2016-06-15 11:24:22 +09:00
Ralph Castain	5d330d5220	Enable the PMIx event notification capability and use that for all error notifications, including debugger release. This capability requires use of PMIx 2.0 or above as the features are not available with earlier PMIx releases. When OMPI master is built against an earlier external version, it will fallback to the prior behavior - i.e., debugger will be released via RML and all notifications will go strictly to the default error handler. Add PMIx 2.0 Remove PMIx 1.1.4 Cleanup copying of component Add missing file Touchup a typo in the Makefile.am Update the pmix ext114 component Minor cleanups and resync to master Update to latest PMIx 2.x Update to the PMIx event notification branch latest changes	2016-06-14 13:08:41 -07:00
Jeff Squyres	c2185bb4b8	Merge pull request #1781 from jsquyres/pr/disable-psm-psm2-signal-hijacking PSM/PSM2: Disable signal handler hijacking by default	2016-06-14 15:33:24 -04:00
Jeff Squyres	5071602c59	PSM/PSM2: Disable signal handler hijacking by default Per discussion on https://github.com/open-mpi/ompi/pull/1767 (and some subsequent phone calls and off-issue email discussions), the PSM library is hijacking signal handlers by default. Specifically: unless the environment variable `IPATH_NO_BACKTRACE=1` (for PSM / Intel TrueScale) is set, the library constructor for this library will hijack various signal handlers for the purpose of invoking its own error reporting mechanisms. This may be a bit surprising, but is not a problem, per se. The real problem is that older versions of at least the PSM library do not unregister these signal handlers upon being unloaded from memory. Hence, a segv can actually result in a double segv (i.e., the original segv and then another segv when the now-non-existent signal handler is invoked). This PSM signal hijacking subverts Open MPI's own signal reporting mechanism, which may be a bit surprising for some users (particularly those who do not have Intel TrueScale). As such, we disable it by default so that Open MPI's own error-reporting mechanisms are used. Additionally, there is a typo in the library destructor for the PSM2 library that may cause problems in the unloading of its signal handlers. This problem can be avoided by setting `HFI_NO_BACKTRACE=1` (for PSM2 / Intel OmniPath). This is further compounded by the fact that the PSM / PSM2 libraries can be loaded by the OFI MTL and the usNIC BTL (because they are loaded by libfabric), even when there is no Intel networking hardware present. Having the PSM/PSM2 libraries behave this way when no Intel hardware is present is clearly undesirable (and is likely to be fixed in future releases of the PSM/PSM2 libraries). 
This commit sets the following two environment variables to disable this behavior from the PSM/PSM2 libraries (if they are not already set): * IPATH_NO_BACKTRACE=1 * HFI_NO_BACKTRACE=1 If the user has set these variables before invoking Open MPI, we will not override their values (i.e., their preferences will be honored). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-06-14 11:45:23 -07:00
Edgar Gabriel	1ddfd6cdca	io/ompio: fix the preallocate function handle preallocating sizes less than the current file size correctly.	2016-06-14 10:50:32 -05:00
KAWASHIMA Takahiro	84b110a1f2	ompi/datatype: Fix args of HINDEXED_BLOCK According to MPI-3.1 P.121, `ni` for `MPI_COMBINER_HINDEXED_BLOCK` should be `2`, not `2 + count`. This bug was introduced in `113b45b4` (when `MPI_Type_create_hindexed_block` support is added in Open MPI) and fixed partially in `7f5314ee` and `8de93982`. This commit fixes the remaining part. Probably this bug has no user impact. It only consumes a bit more memory.	2016-06-10 17:32:33 +09:00
Gilles Gouaillardet	80e362de52	coll/base: fix memory free in ompi_coll_base_allreduce_intra_recursivedoubling err handler Fix CID 1362630 Fixes open-mpi/ompi@0e393195d9	2016-06-09 13:12:25 +09:00
Gilles Gouaillardet	ead7efef3f	coll/basic: silence CID 1362614 in mca_coll_basic_allreduce_inter()	2016-06-09 09:40:19 +09:00
Gilles Gouaillardet	ad2e1a5ae9	coll/base: silence CID 1362613 in ompi_coll_base_alltoall_intra_basic_linear()	2016-06-09 09:40:05 +09:00
Gilles Gouaillardet	80b267af1c	coll/base: silence CID 1362601 in ompi_coll_base_sendrecv_zero()	2016-06-09 09:37:31 +09:00
Gilles Gouaillardet	0e393195d9	coll/base: fix [all]reduce with non zero lower bound datatypes Offset temporary buffer when a non zero lower bound datatype is used. Thanks Hristo Iliev for the report	2016-06-08 16:48:00 +09:00
Nathan Hjelm	97c1643216	Merge pull request #1766 from hjelmn/req_fix ompi/request: fix loop conditional	2016-06-07 12:11:56 -06:00
Nathan Hjelm	3ddf3ccbf3	Merge pull request #1758 from hjelmn/ob1_fixes pml/ob1: bug fixes	2016-06-07 11:18:55 -06:00
Nathan Hjelm	5a4adb866d	ompi/request: fix loop conditional This commit fixes a bug in waitany that causes the code to go past the beginning of the request array. The loop conditional i >= 0 is invalid since i is unsigned. Changed the loop to check (i+1) > 0. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-07 10:28:46 -06:00
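The unsigned-index pitfall described in this commit can be shown with a small standalone function. This is a sketch of the general pattern only, not the waitany code itself; `last_set_index` and its arguments are hypothetical.

```c
#include <stddef.h>

/* Return the index of the last nonzero entry in flags[0..count-1],
 * or count if no entry is set. Scans from the end toward index 0. */
static size_t last_set_index(const int *flags, size_t count)
{
    /* Buggy form: for (size_t i = count - 1; i >= 0; --i)
     * For an unsigned i the test i >= 0 is always true, so after
     * i == 0 the decrement wraps around and the loop reads before
     * the start of the array.
     *
     * Fixed form: (i + 1) > 0 becomes false exactly when i has
     * wrapped to the maximum value, which also covers count == 0. */
    for (size_t i = count - 1; (i + 1) > 0; --i) {
        if (flags[i]) {
            return i;
        }
    }
    return count;
}
```

Unsigned wraparound is well defined in C, so the `(i + 1) > 0` test is portable; the loop terminates after visiting index 0.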
Todd Kordenbrock	9671d6af47	Merge pull request #1689 from francois-wellenreiter/remove_trig_rdv_portals4 MTL portals4 : remove the triggered rendez-vous protocol	2016-06-06 21:55:01 -05:00
Nathan Hjelm	5d0b4679ea	pml/ob1: bug fixes This commit fixes two bugs in pml/ob1: - Do not call MCA_PML_OB1_PROGRESS_PENDING from mca_pml_ob1_send_request_start_copy as this may lead to a recursive call to mca_pml_ob1_send_request_process_pending. - In mca_pml_ob1_send_request_start_rdma return the rdma frag object if a btl fragment cannot be allocated. This fixes a leak identified by @abouteiller and @bosilca. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-06 17:54:55 -06:00
Gilles Gouaillardet	544a2f1631	configury: fix mpifort and oshmemfort wrapper data The NAG compiler uses gcc (and not ld) as a linker, so in order to pass an option to the linker, the flag is -Wl,-Wl,,<option> and not -Wl,<option> Thanks Paul Hargrove for the report	2016-06-06 11:54:12 +09:00
Gilles Gouaillardet	c976559877	coll/basic: fix log basic bcast The log basic bcast was completely broken. The rank 0 gets the hibit set to -1, so it always returned an error.	2016-06-06 11:01:51 +09:00
Gilles Gouaillardet	99fedcb7a3	fs/base: silence a memory leak in mca_fs_base_get_fstype() Fixes CID 1351211	2016-06-06 09:20:14 +09:00
George Bosilca	9376b0340b	Fix the basic barrier. The log basic barrier was completely broken. The rank 0 gets the hibit set to 0, so it always returned an error.	2016-06-03 23:46:25 -04:00
Edgar Gabriel	d6af5444a6	fix the get_byte_offset code	2016-06-03 11:36:53 -05:00
Josh Hursey	9f9f70ee50	Merge pull request #1746 from jjhursey/topic/op-init ompi/op: Provide a default value for type/flags	2016-06-03 07:56:29 -05:00
Nathan Hjelm	e968ddfe64	start bug fixes (#1729 ) * mpi/start: fix bugs in cm and ob1 start functions There were several problems with the implementation of start in Open MPI: - There are no checks whatsoever on the state of the request(s) provided to MPI_Start/MPI_Start_all. It is erroneous to provide an active request to either of these calls. Since we are already looping over the provided requests there is little overhead in verifying that the request can be started. - Both ob1 and cm were always throwing away the request on the initial call to start and start_all with a particular request. Subsequent calls would see that the request was pml_complete and reuse it. This introduced a leak as the initial request was never freed. Since the only pml request that can be mpi complete but not pml complete is a buffered send the code to reallocate the request has been moved. To detect that a request is indeed mpi complete but not pml complete isend_init in both cm and ob1 now marks the new request as pml complete. - If a new request was needed the callbacks on the original request were not copied over to the new request. This can cause osc/pt2pt to hang as the incoming message callback is never called. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov> * osc/pt2pt: add request for gc after starting a new request Starting a new receive may cause a recursive call into the pt2pt frag receive function. If this happens and the prior request is on the garbage collection list it could cause problems. This commit moves the gc insert until after the new request has been posted. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-06-02 20:22:40 -04:00
Matias A Cabral	29ab28f4f6	Adding owner.txt file for PSM2 MTL.	2016-06-02 16:26:16 -07:00
Joshua Hursey	a776d78f2d	ompi/op: Provide a default value for type/flags * User defined ops leave the op_type unset which can confuse logic in a collective component that is trying to convert the op to the appropriate local function.	2016-06-02 13:59:04 -05:00
George Bosilca	d577e12dd0	Fix comment.	2016-06-03 00:57:31 +09:00
George Bosilca	fc5d458249	Consistency in handling OPAL_ENABLE_FT_CR. I am not sure if we should continue to maintain the request support for FT_CR, but I tried here to simplify the code while maintaining the same meaning.	2016-06-03 00:54:24 +09:00
Nathan Hjelm	b001184e63	request: fix warnings (#1742 ) Fix warnings introduced by request rework. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-06-02 04:53:16 -04:00
George Bosilca	bfcf145613	Refactor the request test and wait functions.	2016-06-02 11:58:25 +09:00
George Bosilca	2e1b1d34c6	Safety first !	2016-06-02 11:52:43 +09:00
George Bosilca	50cec456fb	ompi_request_complete with signal Rewrite the ompi_request_complete function to take into account the with_signal argument. Change the comment to explain the expected behavior. Alter all the ompi_request_complete uses to make sure the status of the request is set before calling ompi_request_complete. bot:label:enhancement	2016-06-02 11:49:12 +09:00
George Bosilca	223d75595d	Give a boost to MPI_Barrier. Based on current implementation it is faster to use a blocking send than the non-blocking version. Switch the exchange function used in the barrier to use the blocking version combined with the non-blocking version of the receive.	2016-06-02 11:45:25 +09:00
Ralph Castain	2c086e56be	Add an experimental ability to skip the RTE barriers at the end of MPI_Init and the beginning of MPI_Finalize	2016-06-01 17:01:15 -07:00
Nathan Hjelm	086ffc1838	pml/ob1: fix race on pml completion of send requests The request code was setting the request as pml_complete before calling MCA_PML_OB1_SEND_REQUEST_MPI_COMPLETE. This was causing MCA_PML_OB1_SEND_REQUEST_RETURN to be called twice in some cases. The code now mirrors the recvreq code and only sets the request as pml complete if the request has not already been freed. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-06-01 13:36:06 -06:00
Gilles Gouaillardet	5f565dfec3	configury: clean the flex generated .c files	2016-06-01 11:13:31 +09:00
Gilles Gouaillardet	1bbc5fadee	ompi/win: silence another warning	2016-05-31 13:18:39 +09:00
Gilles Gouaillardet	c41321b9e5	ompi/win: silence warning	2016-05-31 13:03:20 +09:00
Jeff Squyres	59f4a765b3	Merge pull request #1656 from hpcraink/pr/make_manpage In case, we do not build Fortran, Fortran 2008 or CXX, the regexp in …	2016-05-28 11:02:12 -04:00
Nathan Hjelm	d8fd3a411a	Merge pull request #1725 from hjelmn/request_fixes ompi/request: fix bugs in MPI_Wait_some and MPI_Wait_any	2016-05-27 13:47:49 -06:00
Nathan Hjelm	0591139f49	ompi/request: fix bugs in MPI_Wait_some and MPI_Wait_any This commit fixes two bugs in MPI_Wait_any: - If all requests are inactive then the sync wait would hang forever because no requests are attached to the sync. - The request pointer was pointing to the request before the completed request which caused the wrong request to be freed or marked inactive. MPI_Wait_some had a similar issue if all the requests were pending. These issues were identified by MTT. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-27 12:36:10 -06:00
Nathan Hjelm	0adfb328e1	win: fix warnings Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-05-27 10:14:02 -06:00
Thananon Patinyasakdikul	60d0fbf683	Removal of ompi_request_lock from pml/ucx.	2016-05-26 12:36:58 -04:00
George Bosilca	90f294096e	Remove more references to the request mutex. Regarding BFO it should be mentioned that this component is currently unmaintained, and that despite my efforts I could not make it compile (it would not compile before this patch either).	2016-05-25 23:27:06 -04:00
Nathan Hjelm	9d439664f0	pml/yalla: update for request changes This commit brings the pml/yalla component up to date with the request rework changes. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-25 15:42:53 -06:00
Nathan Hjelm	8445c885ce	pml/cm: update for request changes This fixes a hang caused by the request refactor work. The cm pml was not updated and was hanging in most cases. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-25 15:35:32 -06:00
Nathan Hjelm	ef11ba9394	request: fix compilation error The request.h header is unfortunately included by files in the C++ bindings. C++ does not allow assigning from void * to another pointer without a cast. This commit adds the cast. We can clean this up when the C++ bindings are deleted. Fixes open-mpi/ompi#1707 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-25 09:52:23 -06:00
Valentin Petrov	5ff6372886	coll/hcoll: bugfix: initialize req_type field If left uninitialized then segfault is possible in MPI_Waitall in the case the field by chance equals OMPI_REQUEST_GEN.	2016-05-25 15:38:01 +03:00
George Bosilca	2b868c4952	Fix MPI datatype args. Compensate for the datatype ID that we add to the array.	2016-05-24 23:36:54 -04:00
bosilca	b90c83840f	Refactor the request completion (#1422 ) * Remodel the request. Added the wait sync primitive and integrate it into the PML and MTL infrastructure. The multi-threaded requests are now significantly less heavy and less noisy (only the threads associated with completed requests are signaled). * Fix the condition to release the request.	2016-05-24 18:20:51 -05:00
Nathan Hjelm	5126da5377	win: add support for accumulate_ordering info key This commit adds support for the MPI-3.1 accumulate_ordering info key. The default value is rar,war,raw,waw and is supported using an MCA variable flag enumerator. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-24 11:13:30 -06:00
Jeff Squyres	e7d46b96a3	Merge pull request #1680 from yburette/topic/fix_provider_selection mtl/ofi: Change default provider selection behavior.	2016-05-23 15:06:02 -04:00
Francois WELLENREITER	b2b0fc63e2	MTL portals4 : remove the triggered rendez-vous protocol	2016-05-23 15:50:00 +02:00
Gilles Gouaillardet	bca44592af	Merge pull request #1643 from ggouaillardet/topic/romio_openbsd57 io/romio: fix filesystem type check on OpenBSD	2016-05-23 16:33:56 +09:00
George Bosilca	16d9f71d01	Correctly compute the space needed for the args. Add checks to bail out if our precomputed value is less than needed (we are already at fault). bot:milestone:v1.10.3 bot:milestone:v2.0 bot:label:bug bot:assign: @ggouaillardet	2016-05-21 16:01:16 -04:00
George Bosilca	0641005dab	Only check the parameters on valid dimensions.	2016-05-21 15:54:04 -04:00
George Bosilca	6aac0d9c22	Remove useless output stream.	2016-05-21 15:54:04 -04:00
Nathan Hjelm	31bfeede82	bml/r2: always add btl progress function This commit changes the behavior of bml/r2 from conditionally registering btl progress functions to always registering progress functions. Any progress function belonging to a btl that is not yet in use is registered as low-priority. As soon as a proc is added that will make use of the btl it is re-registered normally. This works around an issue with some btls. In order to progress a first message from an unknown peer both ugni and openib need to have their progress functions called. If either btl is not in use after the first call to add_procs the callback was never happening. This commit ensures the btl progress function is called at some point but the number of progress callbacks is reduced from normal to ensure lower overhead when a btl is not used. The current ratio is 1 low priority progress callback for every 8 calls to opal_progress(). Fixes open-mpi/ompi#1676 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-05-21 15:54:04 -04:00
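The 1-in-8 ratio mentioned in this commit can be illustrated with a simple call counter. This is a rough sketch of the idea only; the `progress_state` structure, the `progress` function, and the counting callbacks are hypothetical and do not reflect the real `opal_progress()` registration API.

```c
#include <stddef.h>

#define LOW_PRIORITY_RATIO 8 /* ratio stated in the commit message */

typedef int (*progress_fn)(void);

/* Hypothetical registry holding one normal and one low-priority callback. */
struct progress_state {
    progress_fn normal;       /* invoked on every progress() call */
    progress_fn low_priority; /* invoked on every 8th progress() call */
    unsigned calls;           /* number of progress() calls so far */
};

/* Run the normal callback every time and the low-priority callback
 * once per LOW_PRIORITY_RATIO calls. Returns the number of events
 * reported by the callbacks that ran. */
static int progress(struct progress_state *s)
{
    int events = 0;

    s->calls++;
    if (s->normal != NULL) {
        events += s->normal();
    }
    if (s->low_priority != NULL && s->calls % LOW_PRIORITY_RATIO == 0) {
        events += s->low_priority();
    }
    return events;
}

/* Example callbacks that only count how often they run. */
static int normal_hits = 0;
static int low_hits = 0;
static int count_normal(void) { normal_hits++; return 0; }
static int count_low(void) { low_hits++; return 0; }
```

With both example callbacks registered, 16 calls to `progress()` run the normal callback 16 times and the low-priority one twice, so an unused transport is still polled occasionally at a fraction of the cost.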
yohann	2f0cde791a	mtl/ofi: Change default provider selection behavior. As more providers get added to libfabric, the default exclude list would need to be updated. Instead, we choose to include only the providers known to work by default. New default: - include: psm,psm2,gni - exclude: none	2016-05-19 10:59:25 -07:00
Ralph Castain	a35bb8453a	Unlock the mutex prior to destructing it. Thanks to Nicolas Joly for the report	2016-05-19 10:36:58 -07:00
Rainer Keller	0fb0913cd4	If we do not build Fortran, Fortran 2008, or CXX, the regexp in make_manpage.pl deletes all lines up to the next ".fi", which eliminates everything for functions that do not implement the corresponding interface as code. Change it to delete the man page's content up to the next section header ".SH". Also, with make V=1, we'd like to see the command line, too. Amend OMPI_Affinity_str to match the other man pages' definitions.	2016-05-17 14:21:35 +02:00
rhc54	8b534e9897	Merge pull request #1668 from rhc54/topic/slurm When direct launching applications, we must allow the MPI layer to pr…	2016-05-16 12:23:19 -07:00
Jeff Squyres	5275e5e2a1	bml_r2: use __func__ to identify function names There were some old/stale function names in some debugging/verbose opal_output calls. Use __func__ instead, so that they won't become stale in the future. Thanks to Durga Choudhury for pointing out the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-05-16 11:06:47 -04:00
Ralph Castain	01ba861f2a	When direct launching applications, we must allow the MPI layer to progress during RTE-level barriers. Neither SLURM nor Cray provide non-blocking fence functions, so push those calls into a separate event thread (use the OPAL async thread for this purpose so we don't create another one) and let the MPI thread spin in wait_for_completion. This also restores the "lazy" completion during MPI_Finalize to minimize cpu utilization. Update external as well Revise the change: we still need the MPI_Barrier in MPI_Finalize when we use a blocking fence, but do use the "lazy" wait for completion. Replace the direct logic in MPI_Init with a cleaner macro	2016-05-14 16:37:00 -07:00
Aurélien Bouteiller	7f65c2b18e	forgot to update copyright in commits `627a89b` `4899c89`	2016-05-13 11:34:59 -04:00
George Bosilca	37e03e3e5b	Don't update req_bytes_received if no bytes were received.	2016-05-12 23:39:32 -04:00
rhc54	4d026e223c	Merge pull request #1661 from matcabral/master PSM and PSM2 MTLs to detect drivers and link	2016-05-11 17:43:17 -07:00
George Bosilca	f8facb177d	atomically update the refcount on the datatype args.	2016-05-11 12:40:18 -04:00
Matias A Cabral	528abff6ae	Merge remote-tracking branch 'upstream/master'	2016-05-10 15:42:08 -07:00
Matias A Cabral	d28ee62a96	Update in PSM and PSM2 MTLs to detect entries created by drivers for Intel TrueScale and Intel OmniPath, and detect a link in ACTIVE state. This fix addresses the scenario reported in the below OMPI users email, including formerly named Qlogic IB, now Intel TrueScale. Given the nature of the PSM/PSM2 MTLs this fix applies to OmniPath: https://www.open-mpi.org/community/lists/users/2016/04/29018.php	2016-05-09 12:08:44 -07:00
Gilles Gouaillardet	0a19337371	coll/base: return MPI_ERR_UNSUPPORTED_OPERATION when coll_base_*_two_procs algo is used on a communicator that does not have two tasks Thanks Dave Love for the report	2016-05-09 14:18:40 +09:00
Gilles Gouaillardet	b159587325	io/romio: fix filesystem type check on OpenBSD 5.7 check the existence of the f_type field in struct statfs Thanks Paul Hargrove for the report	2016-05-09 13:54:46 +09:00
Ralph Castain	6b24e2779b	Remove stale component - I'm not going to get to it	2016-05-07 04:13:34 -07:00
Edgar Gabriel	def1b95fd7	Merge pull request #1646 from edgargabriel/getview-preallocate-fixes io/ompio: file_getview and file_preallocate fixes	2016-05-06 11:46:00 -05:00
Edgar Gabriel	e65e189671	io/ompio: fix file size after file_preallocate Thanks to @dalcini for reporting Fixes open-mpi/ompi#1633	2016-05-06 08:20:59 -05:00
Edgar Gabriel	d358965134	io/ompio: fix envelope of datatype returned by getview Thanks to @dalcini for reporting Fixes open-mpi/ompi#1632	2016-05-06 08:19:48 -05:00
Edgar Gabriel	7c92acaa78	Merge pull request #1637 from edgargabriel/pr/netbsd-compilation-problems fs/lustre and fs/pvfs2: fix netbsd compilation problems	2016-05-06 08:05:36 -05:00
Jeff Squyres	810db734c4	Merge pull request #1640 from jsquyres/pr/mpir-cleanup debuggers: remove some useless code	2016-05-05 21:23:30 -04:00
Gilles Gouaillardet	6c9d65c0ca	coll/libnbc: fix MPI_Ireduce_scatter_block for one task communicator Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#248	2016-05-06 09:43:29 +09:00
Ralph Castain	08022d7af1	Some minor cleanups of warnings from gcc 6.0.0. Update s1/s2 pmix to get max_procs as required.	2016-05-05 15:28:13 -07:00
Jeff Squyres	83c2d04aa3	debuggers: remove some useless code MPIR-1.0 specifies that the following symbols are only relevant in the starter process: - MPIR_Breakpoint - MPIR_being_debugged - MPIR_debug_state - MPIR_debug_abort_string I.e., the code filling in values in these various symbols was useless / never used. MPIR-1.1 will define that MPIR_being_debugged is relevant in MPI processes. That symbol is currently defined in libopen-rte (which is currently causing a duplicate symbol error for static builds -- this commit fixes that error), and is therefore still available for MPI processes. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-05-05 14:22:55 -07:00
Jeff Squyres	f167be1c91	ompio: always return valid info from FILE_GET_INFO MPI-3.1 says that even if no info keys are set on the file, we need to return a new, empty info. Thanks to Lisandro Dalcin for identifying the issue. Fixes open-mpi/ompi#1630 Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-05-05 12:03:29 -07:00
Aurélien Bouteiller	4899c89731	Fix a race condition when multiple threads try to create a bml endpoint simultaneously.	2016-05-05 10:49:30 -04:00
Aurélien Bouteiller	627a89bf71	Fix a race condition when multiple threads do the "first send" to an endpoint simultaneously.	2016-05-05 09:04:10 -04:00
Joshua Ladd	4771c9ece6	Merge pull request #1617 from jladd-mlnx/topic/disable-hcoll-barrier-in-finalize-ompi-trunk HCOLL: fix hang in hcoll barrier called from finalize for MXM/yalla	2016-05-04 10:12:34 -04:00


9279 commits