openmpi

Автор	SHA1	Сообщение	Дата
Nathan Hjelm	ec9712050b	Merge pull request #1118 from hjelmn/mpool_rewrite mpool/rcache rewrite	2016-03-15 10:46:24 -06:00
Nathan Hjelm	deae9e52bf	Merge pull request #1259 from kawashima-fj/pr/osc-sm-align osc/sm: Fix a bus error on MPI_WIN_{POST,START}.	2016-03-15 09:13:38 -06:00
Nathan Hjelm	d4afb16f5a	opal: rework mpool and rcache frameworks This commit rewrites both the mpool and rcache frameworks. Summary of changes: - Before this change a significant portion of the rcache functionality lived in mpool components. This meant that it was impossible to add a new memory pool to use with rdma networks (ugni, openib, etc) without duplicating the functionality of an existing mpool component. All the registration functionality has been removed from the mpool and placed in the rcache framework. - All registration cache mpools components (udreg, grdma, gpusm, rgpusm) have been changed to rcache components. rcaches are allocated and released in the same way mpool components were. - It is now valid to pass NULL as the resources argument when creating an rcache. At this time the gpusm and rgpusm components support this. All other rcache components require non-NULL resources. - A new mpool component has been added: hugepage. This component supports huge page allocations on linux. - Memory pools are now allocated using "hints". Each mpool component is queried with the hints and returns a priority. The current hints supported are NULL (uses posix_memalign/malloc), page_size=x (huge page mpool), and mpool=x. - The sm mpool has been moved to common/sm. This reflects that the sm mpool is specialized and not meant for any general allocations. This mpool may be moved back into the mpool framework if there is any objection. - The opal_free_list_init arguments have been updated. The unused0 argument is not used to pass in the registration cache module. The mpool registration flags are now rcache registration flags. - All components have been updated to make use of the new framework interfaces. As this commit makes significant changes to both the mpool and rcache frameworks both versions have been bumped to 3.0.0. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-03-14 10:50:41 -06:00
Gilles Gouaillardet	fbed6df4a3	coll/base: fix a typo typo was introduced in open-mpi/ompi@c98e97a46e	2016-03-11 14:18:03 +09:00
Aurélien Bouteiller	c98e97a46e	Do not return MPI_ERR_PENDING from collectives.	2016-03-09 16:13:34 -05:00
Joshua Ladd	4dffae2f88	Fixing MXM Yalla and MTL add procs behavior. MXM cannot support dynamic add procs, so propaget this info to the MTL and PML layers.	2016-03-08 01:46:24 +02:00
Aurélien Bouteiller	892e1ed57e	Fix a potential race condition in which a progress matching thread could match a request while we are cancelling it.	2016-03-01 16:43:45 -05:00
Gilles Gouaillardet	8aff67c399	topo/base: correctly support MPI_UNWEIGHTED in mca_topo_base_dist_graph_neighbors() Thanks Jun Kudo for the bug report.	2016-03-01 10:28:28 +09:00
George Bosilca	dbe93b0b19	Use mca_bml_base_get_endpoint Correctly use mca_bml_base_get_endpoint instead of accessing the endpoint directly.	2016-02-25 11:00:30 -06:00
Sylvain Jeaugey	5f32f49eb8	pml/ob1: Fix segmentation fault on CUDA path. Fix segfault due to mca_pml_ob1_cuda_need_buffers not handling the case of the endpoint not being there. Calling mca_bml_get_endpoint() seems to fix the problem. Fixes open-mpi/ompi#1402	2016-02-24 21:32:25 -08:00
Edgar Gabriel	45003ef78d	fix the data size counter for large ops for the static fcoll component	2016-02-23 08:33:50 -06:00
yohann	59b6d041f8	mtl/ofi: Check allocated pointer.	2016-02-19 16:59:47 -08:00
yohann	bd47062764	mtl/ofi: Fix error handling.	2016-02-19 16:58:41 -08:00
yohann	404987e9b3	mtl/ofi: Fix mismatching types.	2016-02-19 16:57:26 -08:00
yohann	3ad59435ce	mtl/ofi: Prevent possible memory leak.	2016-02-19 16:57:02 -08:00
Edgar Gabriel	92d1b99468	optimize the shuffle step: 1. use communicator collectives if possible for performance reasons 2. combined multiple allgathers into a single one	2016-02-19 11:04:04 -06:00
Edgar Gabriel	e63836c653	clean up the mca parameter handling of the component. Add new parameters for number of sub groups and write chunk size. This will allow to perform a systematic parameter study.	2016-02-19 10:15:28 -06:00
Edgar Gabriel	4f400314e0	add the dynamic_gen2 component into the fcoll selection table.	2016-02-19 09:32:54 -06:00
Edgar Gabriel	268d525053	change the tag to be a positive value. handle 0-byte situations correctly.	2016-02-19 08:28:50 -06:00
Edgar Gabriel	ad79012059	first cut on the version which overlaps the communication/computation of 2 iterations.	2016-02-19 08:28:50 -06:00
Ralph Castain	60a7bc2e50	Enable the PMIx notification callback system. This currently is only supported by the pmix120 component, which is not selected by default. All other components will ignore error registration requests, and thus do not support debugger attach when launched via mpirun. Note that direct launched applications will support such attachment, but may not do so in a scalable fashion. Fixes ##1225	2016-02-18 09:29:12 -08:00
yohann	7fe395c82a	mtl/ofi: cleanup	2016-02-16 09:57:57 -08:00
yohann	22eddfee10	mtl/ofi: update copyright dates.	2016-02-16 09:56:09 -08:00
George Bosilca	68c36ea9dc	Fix two annoying warnings in our UCX support.	2016-02-14 00:02:16 -05:00
yohann	67ce4a080a	mtl/ofi: FI_AV_MAP support only.	2016-02-12 10:06:52 -08:00
yohann	b3d8ead76e	mtl/ofi: Fix dynamic add_procs.	2016-02-12 10:05:52 -08:00
Gilles Gouaillardet	b55b9e6aee	sentinel: fix sentinel to proc_name conversion converting an opal_process_name_t means the loss of one bit, it was decided to restrict the local job id to 15 bits, so the useful information of an opal_process_name_t can fit in 63 bits.	2016-02-10 15:44:07 +09:00
Gilles Gouaillardet	030a5f2054	sentinel: use type uintptr_t for sentinel MSB is now automatically cleared when right shifting Thanks George for pointing this	2016-02-10 11:28:56 +09:00
George Bosilca	7c574a3530	Typo.	2016-02-07 07:22:22 +02:00
Nathan Hjelm	5b9c82a964	osc/pt2pt: bug fixes This commit fixes several bugs identified by @ggouaillardet and MTT: - Fix SEGV in long send completion caused by missing update to the request callback data. - Add an MPI_Barrier to the fence short-cut. This fixes potential semantic issues where messages may be received before fence is reached. - Ensure fragments are flushed when using request-based RMA. This allows MPI_Test/MPI_Wait/etc to work as expected. - Restore the tag space back to 16-bits. It was intended that the space be expanded to 32-bits but the required change to the fragment headers was not committed. The tag space may be expanded in a later commit. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-02-04 16:59:39 -07:00
Gilles Gouaillardet	6eac6a8b00	osc/sm: create datafile into the per proc directory in order to make it unique per communicator Thanks Peter Wind for the report	2016-02-03 10:12:37 +09:00
Nathan Hjelm	519fffb65e	osc/pt2pt: eager sends are always active if MPI_MODE_NOCHECK is used This commit fixes open-mpi/ompi#1299. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-02-02 12:44:17 -07:00
Nathan Hjelm	d7264aa613	osc/pt2pt: various threading fixes This commit fixes several bugs identified by a new multi-threaded RMA benchmarking suite. The following bugs have been identified and fixed: - The code that signaled the actual start of an access epoch changed the eager_send_active flag on a synchronization object without holding the object's lock. This could cause another thread waiting on eager sends to block indefinitely because the entirety of ompi_osc_pt2pt_sync_expected could exectute between the check of eager_send_active and the conditon wait of ompi_osc_pt2pt_sync_wait. - The bookkeeping of fragments could get screwed up when performing long put/accumulate operations from different threads. This was caused by the fragment flush code at the end of both put and accumulate. This code was put in place to avoid sending a large number of unexpected messages to a peer. To fix the bookkeeping issue we now 1) wait for eager sends to be active before stating any large isend's, and 2) keep track of the number of large isends associated with a fragment. If the number of large isends reaches 32 the active fragment is flushed. - Use atomics to update the large receive/send tag counters. This prevents duplicate tags from being used. The tag space has also been updated to use the entire 16-bits of the tag space. These changes should also fix open-mpi/ompi#1299. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-02-02 12:33:33 -07:00
Edgar Gabriel	3f7fff5780	Merge pull request #1331 from edgargabriel/solaris-statfs-fix Solaris statfs fix	2016-01-28 20:16:33 -06:00
Nathan Hjelm	a19c265ab5	osc/rdma: fix typo in ompi_osc_rdma_complete_atomic The typo caused SEGVs on systems with only fetching atomic support. Fixes open-mpi/ompi#1329 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-01-26 15:44:07 -07:00
Edgar Gabriel	b4a725c26a	need to check for the parent dir as well, since the file might not exist yet.	2016-01-26 13:49:21 -06:00
Edgar Gabriel	722aab92e6	- extend opal_path_nfs to retrieve the file system type - use opal_path_nfs in the fs_base function to avoid code duplication.	2016-01-26 13:36:21 -06:00
Joshua Ladd	69e3c6f289	Merge pull request #1321 from jladd-mlnx/topic/add-allgatherv-reduce Adding entry points for Allgatherv, iAllgatherv, Reduce, and iReduce.	2016-01-25 20:46:52 -05:00
Nathan Hjelm	500e90422d	Merge pull request #1320 from hjelmn/osc_rdma_fix osc/rdma: fix hang when performing large unaligned gets	2016-01-25 09:36:13 -07:00
Nathan Hjelm	45da311473	osc/rdma: fix hang when performing large unaligned gets This commit adds code to handle large unaligned gets. There are two possible code paths for these transactions: 1) The remote region and local region have the same alignment. In this case the get will be broken down into at most three get transactions: 1 transaction to get the unaligned start of the region (buffered), 1 transaction to get the aligned portion of the region, and 1 transaction to get the end of the region. 2) The remote and local regions do not have the same alignment. This should be an uncommon case and is not optimized. In this case a buffer is allocated and registered locally to hold the aligned data from the remote region. There may be cases where this fails (low memory, can't register memory). Those conditions are unlikely and will be handled later. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-01-22 21:06:46 -07:00
Valentin Petrov	5e2a2c0755	BufFix for coll/hcoll: coll_request must be set to ACTIVE when alloced If the state of the request is not set to OMPI_REQUEST_ACTIVE then MPI_Test would immediately signal such request completed while hcoll may still be working on it. Signed-off-by: Joshua Ladd <jladd.mlnx@gmail.com>	2016-01-23 03:23:59 +02:00
Joshua Ladd	e398bf6f3a	Adding entry points for Allgatherv, iAllgatherv, Reduce, and iReduce. Signed-off-by: Joshua Ladd <jladd.mlnx@gmail.com>	2016-01-23 03:09:29 +02:00
Nathan Hjelm	49d2f44b97	osc/rdma: use correct endpoint for local state If atomics are not globally visible (cpu and nic atomics do not mix) then a btl endpoint must be used to access local ranks. To avoid issues that are caused by having the same region registered with multiple handles osc/rdma was updated to always use the handle for rank 0. There was a bug in the update that caused osc/rdma to continue using the local endpoint for accessing the state even though the pointer/handle are not valid for that endpoint. This commit fixes the bug. Fixes open-mpi/ompi#1241. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-01-22 10:41:27 -07:00
Nathan Hjelm	6180386bea	osc/rdma: disable put aggregation when using threads Optimizing put aggregation in the presence of threads will require a redesign of the code. For now just ensure that put aggregation is turned off when MPI_THREAD_MULTIPLE is enabled. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-01-21 15:50:35 -07:00
Edgar Gabriel	b253d4e887	fix CID 1349739, CID 1349738, CID 1349736 and (probably) CID 1349740 (not entirely sure about the last one, since I don't understand why block[i] is a problem but max_len[i] allocated and treated exactly the same way 1 line later is not).	2016-01-21 08:32:23 -06:00
Edgar Gabriel	9b8d769e41	will rivist the addproc component later in spring, right now it is constantly in the way of doing my tests.	2016-01-20 15:05:51 -06:00
Edgar Gabriel	a9ca37059a	improve the communicaton abstraction. This commit also allows all aggregators to work simultaniously, instead of the slightly staggered way of the previous version.	2016-01-17 09:48:49 -06:00
Edgar Gabriel	56e11bfc97	initialize the stripe_size variable as well.	2016-01-17 09:48:49 -06:00
Edgar Gabriel	26c57ef374	separate the size of the buffer used for the shuffle step and the size of the buffer used for a pwritev operation.	2016-01-17 09:48:49 -06:00
Edgar Gabriel	39d5c8c281	further bug fixes silencing a compiler warning and fixing a memory overrun	2016-01-17 09:48:49 -06:00

1 2 3 4 5 ...

5854 Коммитов