openmpi

Автор	SHA1	Сообщение	Дата
Edgar Gabriel	1cee83cc1b	use the common/ interfaces in file_preallocate instead of the io_ompio_ interfaces. Necessar for avoiding potential deadlock situations in multi-threaded scenarios.	2016-08-25 08:55:12 -05:00
Edgar Gabriel	41ed4a28d2	add the protective lock around read and write operations in ompio	2016-08-23 11:07:58 -05:00
Howard Pritchard	696121cc4a	Merge pull request #1988 from hppritcha/topic/another_ofi_fix mtl/ofi: fix a botched assignment of av_type	2016-08-22 17:59:59 -06:00
Ralph Castain	6549c878a9	Silence the warnings	2016-08-22 15:35:27 -07:00
Ralph Castain	871bedb103	Add missing "const" qualifiers	2016-08-22 12:54:24 -07:00
Edgar Gabriel	a76f4d7c69	Merge pull request #1990 from edgargabriel/topic/mt-io steps towards making file I/O operations thread safe	2016-08-22 08:19:33 -05:00
Joshua Ladd	deae1ab375	Merge pull request #1985 from vspetrov/master coll/hcoll: Fixes predifined types mapping	2016-08-22 09:18:59 -04:00
Edgar Gabriel	bc042259bc	make initialization of the io framework thread safe. Also, remove the lock/unlock in the file_open ompi-interface routines of romio314. The global lock in the romio component does probably not work, it is easy to construct a testcase where two threads perform collective I/O operations on different file handles. With a global lock it is easy to deadlock. THe lock has to be at least on the file handle basis. move the mutex to file/file.c to avoid duplicate symbol problem in file_open.c pfile_open.c	2016-08-21 16:09:00 -05:00
George Bosilca	b96ec77e40	This variable belongs to the tuned modules and not to base.	2016-08-20 15:37:55 -04:00
George Bosilca	e8425eb1f5	Rename an OMPI internal variable (ticket #1955 ).	2016-08-20 15:37:55 -04:00
rhc54	102d3afe2c	Merge pull request #1992 from rhc54/topic/sync Restore the coll/sync module and provide a test to verify its operation	2016-08-20 13:33:28 -05:00
George Bosilca	fd57f5bccd	Remove some of the clang warnings.	2016-08-20 14:21:42 -04:00
Ralph Castain	9888615e75	Restore the coll/sync module and provide a test to verify its operation	2016-08-20 10:14:52 -07:00
Howard Pritchard	61d62b6821	mtl/ofi: fix a botched assignment of av_type Well now the av_type is being assigned correctly Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2016-08-19 17:01:02 -05:00
Valentin Petrov	9790373fc6	coll/hcoll: Fixes predifined types mapping	2016-08-19 11:19:12 +03:00
Nathan Hjelm	e5c7512692	Merge pull request #1983 from hjelmn/request_cb ompi/request: change semantics of ompi request callbacks	2016-08-18 08:31:56 -06:00
Nathan Hjelm	6aa658ae33	ompi/request: change semantics of ompi request callbacks This commit changes the sematics of ompi request callbacks. If a request's callback has freed or re-posted (using start) a request the callback must return 1 instead of OMPI_SUCCESS. This indicates to ompi_request_complete that the request should not be modified further. This fixes a race condition in osc/pt2pt that could lead to the req_state being inconsistent if a request is freed between the callback and setting the request as complete. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-08-17 20:14:01 -06:00
Edgar Gabriel	e14c23ba79	Merge pull request #1980 from edgargabriel/topic/coverty-cleanup io/ompio: Topic/coverty cleanup	2016-08-17 17:27:51 -05:00
Edgar Gabriel	2c8437ce62	fs/pvfs2: fix a common symbol	2016-08-17 13:10:32 -05:00
Edgar Gabriel	eba5293586	fix coverty warning CID 1369021	2016-08-17 13:02:45 -05:00
Nathan Hjelm	40b70889e5	osc/pt2pt: make receive count an unsigned int This receive_count MCA variable should never be negative. Change it to an unsigned int. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-08-17 08:14:24 -06:00
Gilles Gouaillardet	8faa1edafa	osc/pt2pt: silence misc warnings	2016-08-17 14:24:14 +09:00
LANL OMPI Bot	96c7762050	Merge pull request #1942 from hppritcha/topic/minor_ofi_fix mtl/ofi: use mca param to set av type	2016-08-16 14:14:12 -06:00
Nathan Hjelm	9444df1eb7	osc/pt2pt: make lock_all locking on-demand The original lock_all algorithm in osc/pt2pt sent a lock message to each peer in the communicator even if the peer is never the target of an operation. Since this scales very poorly the implementation has been replaced by one that locks the remote peer on first communication after a call to MPI_Win_lock_all. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-08-11 15:33:07 -06:00
Nathan Hjelm	7589a25377	osc/pt2pt: do not repost receive from request callback This commit fixes an issue that can occur if a target gets overwhelmed with requests. This can cause osc/pt2pt to go into deep recursion with a stack like req_complete_cb -> ompi_osc_pt2pt_callback -> start -> req_complete_cb -> ... . At small scale this is fine as the recursion depth stays small but at larger scale we can quickly exhaust the stack processing frag requests. To fix the issue the request callback now simply puts the request on a list and returns. The osc/pt2pt progress function then handles the processing and reposting of the request. As part of this change osc/pt2pt can now post multiple fragment receive requests per window. This should help prevent a target from being overwhelmed. Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2016-08-11 15:33:07 -06:00
George Bosilca	8d0baf140f	If the RTE fails to deliver the daemon information, gracefully fallback to a non-reordered communicator. Optimize the loops building the process hierarchy.	2016-08-11 13:04:27 -04:00
Howard Pritchard	e46eee3fcb	mtl/ofi: use mca param to set av type Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2016-08-10 16:10:17 -06:00
Gilles Gouaillardet	dfbf2b7be4	opal/threads: add OPAL_THREAD_SUB_SIZE_T macro -1 is not a valid size_t, so instead of OPAL_THREAD_ADD_SIZE_T(..., -1), simply OPAL_THREAD_SUB_SIZE_T(..., 1) and keep picky compilers happy	2016-08-10 13:37:36 +09:00
Nathan Hjelm	799104f688	Merge pull request #1947 from hjelmn/perf pml/ob1: be more selective when using rdma capable btls	2016-08-09 22:15:09 -06:00
Nathan Hjelm	4079eec974	pml/ob1: be more selective when using rdma capable btls This commit updates the btl selection logic for the RDMA and RDMA pipeline protocols to use a btl iff: 1) the btl is also used for eager messages (high exclusivity), or 2) no other RDMA btl is available on an endpoint and the pml_ob1_use_all_rdma MCA variable is true. This fixes a performance regression with shared memory when an RDMA capable network is available. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-08-09 20:54:42 -06:00
Nathan Hjelm	2788083b98	Merge pull request #1936 from hjelmn/osc_pt2pt_fix osc/pt2pt: do not set rdma_frag after start	2016-08-08 14:17:40 -06:00
Nathan Hjelm	e4d7ea75a9	Merge pull request #1935 from hjelmn/persistent_fix pml/ob1: reset req_bytes_packed on start	2016-08-08 14:17:13 -06:00
Todd Kordenbrock	3be6052523	Merge pull request #1896 from PDeveze/Patchs-on-coll-portals4 Patchs on coll portals4	2016-08-08 14:57:02 -05:00
Edgar Gabriel	fb9fa4fbc4	Merge pull request #1938 from edgargabriel/pr/barrier-on-close io/ompio: Add barrier to file_close and to file_set_size	2016-08-08 09:22:08 -05:00
Edgar Gabriel	4709f4229b	Merge pull request #1929 from edgargabriel/pr/ompio-code-reorg io/ompio: next step in code-reorganization	2016-08-08 09:20:54 -05:00
Thananon Patinyasakdikul	23b27c510c	romio: make romio use internal opal_random instead of rand(3). This fixes issue #1877	2016-08-05 09:04:52 -07:00
Howard Pritchard	ff669e7b15	code cleanup: clang is now a happier panda Clang 5.1 on my mac was a sad panda compiling a couple of files, complaining about uninitialized stack variables. This commit makes clang a happier panda (or at least not so sad). Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2016-08-04 19:34:44 -06:00
Edgar Gabriel	9c3180160c	io/ompio: Add barrier to file_close and to file_set_size This fixes a bug reported on the mailing for ompio. https://www.open-mpi.org/community/lists/users/2016/05/29333.php	2016-08-04 11:20:31 -05:00
Gilles Gouaillardet	60e91e890a	coll/base: give a boost to ompi_coll_base_sendrecv_nonzero_actual() Based on current implementation it is faster to use a blocking send than the non-blocking version. Switch the exchange function used in the barrier to use the blocking version combined with the non-blocking version of the receive. This is similar to open-mpi/ompi@223d75595d	2016-08-04 13:31:07 +09:00
Nathan Hjelm	11c853d05e	osc/pt2pt: do not set rdma_frag after start It is possible for the start call to complete the requests. For this reason the module rdma_frag field should be filled in before start is called. If the request completes the completion callback will reset the rdma_frag field to NULL. Fixes a bug discovered by @tkordenbrock. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-08-03 15:20:36 -06:00
Nathan Hjelm	889dd32806	pml/ob1: reset req_bytes_packed on start On start we were not correctly resetting all request fields. This was leading to a double-completion on persistent receives. This commit updates the base start code to reset the receive req_bytes_packed and the send request convertor. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-08-03 11:29:30 -06:00
Edgar Gabriel	aa7e852e44	common/ompio: files are only compiled in case MPI I/O is requested fixes: open-mpi/ompi#1932	2016-08-02 15:01:38 -05:00
Edgar Gabriel	19fe5cac50	io/ompio: next step in code-reorganization - move the sort_iovec operations to fcoll/base - move set_view_internal to common/ompio - move set_file_default to common/ompio - remove io_ompio_sort, not used anymore.	2016-08-02 09:18:29 -05:00
Gilles Gouaillardet	917d96ba50	coll/libnbc: cleanup handling of the second temporary buffer in ireduce	2016-08-02 16:32:15 +09:00
Gilles Gouaillardet	ed9139ca13	coll/libnbc: correctly handle datatype alignment when allocating two buffers at once	2016-08-02 15:44:12 +09:00
Edgar Gabriel	c0bd8728fd	io/ompio: move aggregator selection code to a separate file - move all functions related to aggregator selection to a single file - perform code cleanup fixing many Coverty complains along the way.	2016-08-01 14:04:27 -05:00
Edgar Gabriel	160d9a78c1	Merge pull request #1886 from edgargabriel/pr/ompio-reorg io/ompio: move io/ompio functionality to common/ompio	2016-07-29 12:24:21 -05:00
Joshua Ladd	4a03a657c6	Merge pull request #1913 from vspetrov/hcoll_derived_datatypes coll/hcoll mpi datatypes support	2016-07-29 10:08:23 -04:00
Nathan Hjelm	1da558407c	Merge pull request #1911 from hjelmn/threads opal/thread: clean up and add additional OPAL_THREAD macros	2016-07-29 06:44:11 -06:00
Valentin Petrov	3582bba6b7	coll/hcoll mpi datatypes support	2016-07-29 10:06:39 +03:00

1 2 3 4 5 ...

6065 Коммитов