openmpi

Автор	SHA1	Сообщение	Дата
Nathan Hjelm	e03956e099	Update the scif and openib btls for the new btl interface Other changes: - Remove the registration argument from prepare_src since it no longer is meant for RDMA buffers. - Additional cleanup and bugfixes.	2014-11-19 11:33:02 -07:00
Nathan Hjelm	c61e017177	pml: updates to reflect member changes in mca_btl_base_descriptor_t and mca_btl_base_module_t structures	2014-11-19 11:33:02 -07:00
Nathan Hjelm	ec33374339	btl: remove des_remote/des_remote_count from the mca_btl_base_descriptor_t structure This structure member was originally used to specify the remote segment for an RDMA operation. Since the new btl interface no longer uses desriptors for RDMA this member no longer has a purpose. In addition to removing these members the local segment information has been renamed to des_segments/des_segment_count.	2014-11-19 11:33:02 -07:00
Nathan Hjelm	5936411a07	pml/ob1: when using btl_get try to register the entire region before attempting to break the get into multiple rdma fragments A little background. Historically ob1 always registered the entire memory region when the RGET protocol was in use. This changed when Mellanox added support to fragment RGET using the btl_prepare_dst function. Now that the BTL layer has changed to split out the limits of get/put there is explicit fragmentation code in ob1. Before this commit the registration was still done per RGET fragment. This commit will attempt to register the entire region before creating RGET fragments. If the registration is successfull then all RGET fragments will use this registration otherwise they will each attempt to register their own segment of the receive buffer. If that fails enough times each fragment will give up and fall back on send/recv.	2014-11-19 11:33:02 -07:00
Nathan Hjelm	b75bb8aea7	Update pml for btl changes	2014-11-19 11:33:02 -07:00
Nathan Hjelm	66bd698eaf	Update BML for BTL interface changes	2014-11-19 11:33:02 -07:00
Nathan Hjelm	2d381f800f	Update the interface to provide a cleaner interface for RDMA operations. The old BTL interface provided support for RDMA through the use of the btl_prepare_src and btl_prepare_dst functions. These functions were expected to prepare as much of the user buffer as possible for the RDMA operation and return a descriptor. The descriptor contained segment information on the prepared region. The btl user could then pass the RDMA segment information to a remote peer. Once the peer received that information it then packed it into a similar descriptor on the other side that could then be passed into a single btl_put or btl_get operation. Changes: - Removed the btl_prepare_dst function. This reflects the fact that RDMA operations no longer depend on "prepared" descriptors. - Removed the btl_seg_size member. There is no need to btl's to subclass the mca_btl_base_segment_t class anymore. ... Add more	2014-11-19 11:33:02 -07:00
Howard Pritchard	9425ebefae	Be more selective about closing fd's for alps/odls Be more selective about closing fd's for the alps odls component. Don't close fd's of pipes set up by the apshepherd for providing RDMA credentials, etc. Add an entry to the help file in case alps_app_lli_pipes returns an error.	2014-11-19 11:21:30 -07:00
Ralph Castain	0338bc80b7	Add the LSF libraries as required for static build	2014-11-19 09:38:49 -08:00
Ralph Castain	bb91517349	All other layers to register their own print-attribute functions so we can maintain pretty-print capabilities as the attributes are extended.	2014-11-19 09:37:59 -08:00
Ralph Castain	37593b232d	Add a marker for the max attr value being used by ORTE so that other, higher-levels can also use the attribute system	2014-11-19 09:37:59 -08:00
Ralph Castain	da462c4c25	Add *.dwarf to the ignores as Mac Yosemite now generates these files	2014-11-19 09:37:58 -08:00
Howard Pritchard	34c156759e	fix some compiler warnings in ras/alps	2014-11-18 11:32:37 -07:00
Howard Pritchard	4df3447d96	fix compare_nodes bug in alps ras component There was an obvious bug in the alps/ras component compare_nodes method which resulted in the function always evaluating the nodes as being equivalent.	2014-11-18 11:15:02 -07:00
Howard Pritchard	ff362c16ce	add/update copyrights for alps odls component	2014-11-18 10:16:11 -07:00
Howard Pritchard	dc98b62070	add initial support for an alps odls component It turns out that the support for Open MPI apps on Cray was hanging on a thin thread of support when using the mpirun job launcher. It just happened that with a certain set of configuration options things would work. This is bound to backfire at some point. To fix this weakness, as well as to allow for mpirun launched jobs to benefit from many of the advanced placement features provided by the Cray Linux Environment (as opposed to the hwloc only default env of orte), a new odls alps component is introduced.	2014-11-17 14:00:09 -07:00
Todd Kordenbrock	5ac1a94f3f	Merge pull request #274 from jsquyres/topic/delete-opal-check-portals.m4 opal_check_portals.m4: remove this unused file	2014-11-17 10:12:40 -06:00
Jeff Squyres	225c684ccc	opal_check_portals.m4: remove this unused file It looks like OPAL_CHECK_PORTALS is not used anywhere; it looks like it has been completely replaced by opal_check_portals4.m4 (OPAL_CHECK_PORTALS4).	2014-11-16 12:44:14 -08:00
Nathan Hjelm	d495d49b1c	Merge pull request #273 from open-mpi/topic/yoda_rdma_flags OSHMEM: spml yoda: use flags to check if btl is RDMA capable	2014-11-16 12:04:04 -07:00
Alex Mikheev	fbb9dc5b1e	OSHMEM: spml ikrit valgrind fix always initialize request flags	2014-11-16 17:24:16 +02:00
Alex Mikheev	3443c1d5e5	OSHMEM: spml yoda: use flags to check if btl is RDMA capable	2014-11-16 17:20:20 +02:00
Ralph Castain	d9ceb5aea4	Fix C++ builds by removing no-longer-needed type declaration	2014-11-14 11:44:24 -08:00
Gilles Gouaillardet	f3b36fdf6e	orted/pmix: fix pmix_server_release when several jobids are running on the same node	2014-11-14 16:17:28 +09:00
Gilles Gouaillardet	84b21d726e	orte/util: add OPAL_{VPID,JOBID} types to orte_attr_{load,unload}	2014-11-14 15:55:25 +09:00
Gilles Gouaillardet	4de7021acc	configury: fix a typo	2014-11-14 13:39:41 +09:00
Gilles Gouaillardet	2177f9ec3e	fix missing copyright, no code change	2014-11-13 14:56:09 +09:00
Gilles Gouaillardet	cd6e3ecb07	oshmem/yoda: fix a typo in mca_spml_yoda_get_completion	2014-11-13 14:53:32 +09:00
Andrew Friedley	b97cda7fd9	PSM MTL: Don't connect procs already connected PSM has issues when trying calling psm_ep_connect() more than once for a specific peer. Use the psm_ep_connect mask argument to avoid connecting to processes that are already connected. OMPI ticket #268.	2014-11-12 15:52:02 -08:00
Howard Pritchard	a632b632ca	better way to tell if a process is in a Cray PAGG Use a more reliable way to tell if a process is 1) in a Cray PAGG 2) is actually considered an application process on a compute node (not for example, a process in a PAGG on a mom node).	2014-11-12 12:56:15 -07:00
Howard Pritchard	72bb4a2eee	make cray pmi compile again Commit @80f07b65 resulted in changes that caused cray pmi component to no longer compile. This commit fixes that issue.	2014-11-12 12:33:30 -07:00
Nathan Hjelm	cfbb9cba16	btl/vader: don't assume the address in the put/get segment is unmodified when using knem It is valid to modify the remote segment that will be used with the btl put/get operations as long as the resulting address range falls in the originally prepared segment. Vader should have been calculating the offset of the remote address in the registered region. This commit fixes this issue.	2014-11-12 10:12:52 -07:00
rhc54	1fdb6a62d3	Merge pull request #265 from miked-mellanox/topic/undeprecate_env_x ORTE: undeprecate -x var=val in mpirun Looks okay to me - thanks!	2014-11-12 08:46:09 -08:00
rhc54	87fa1061d4	Merge pull request #267 from artpol84/s2_fix Fix SLURM PMI2 component. set s2_nrank to the relative position of a pro... Good catch!	2014-11-12 08:43:03 -08:00
Jeff Squyres	f39b294afe	mca base: fix trivial typos in help message	2014-11-12 08:40:17 -08:00
Artem Polyakov	fce08a3db3	Fix SLURM PMI2 component. set s2_nrank to the relative position of a process inside the node (not relative position of a node inside the allocation).	2014-11-12 16:26:35 +06:00
Gilles Gouaillardet	b088175705	btl/vader: fix a typo in mca_btl_vader_put_knem	2014-11-12 19:00:00 +09:00
Mike Dubman	f83d6045aa	ORTE: undeprecate -x var=val in mpirun mpirun -x var=val is back, actually it is useful alias for -mca mca_base_env_list "var=val"	2014-11-12 10:51:15 +02:00
Ralph Castain	780c93ee57	Per the PR and discussion on today's telecon, extend the process name definition as a two-field struct of uint32_t's down to the OPAL layer. This resolves issues created by prior commits that impacted both heterogeneous and SPARC support. This also simplifies the OMPI code base by removing the need for frequent memcpy's when transitioning between the OMPI/ORTE layers and OPAL. We recognize that this means other users of OPAL will need to "wrap" the opal_process_name_t if they desire to abstract it in some fashion. This is regrettable, and we are looking at possible alternatives that might mitigate that requirement. Meantime, however, we have to put the needs of the OMPI community first, and are taking this step to restore hetero and SPARC support.	2014-11-11 17:00:42 -08:00
Gilles Gouaillardet	40942c62ce	dstore: remove unused variables	2014-11-11 18:14:59 +09:00
Gilles Gouaillardet	43af1e27ca	Merge pull request #245 from ggouaillardet/poc/pmi_slurm pmix: correctly split pmi messages	2014-11-11 03:18:40 -05:00
Gilles Gouaillardet	80f07b65f1	pmix: correctly split pmi messages Thanks to @elenash for all the reviews	2014-11-11 17:16:00 +09:00
Ralph Castain	d0704ef118	Restore handling of physical processors in rankfiles. Note that the prior implementation was likely incorrect as it falsely assumed that physical core indices were unique, which isn't always true. Stipulate that physical rankfiles can only include PU numbers, and bind the result to the core that contains that physical PU. Update the mpirun man page to cover the new use-case.	2014-11-10 14:00:40 -08:00
Ralph Castain	2a90788724	Support physical processor ids in rankfile	2014-11-10 14:00:40 -08:00
Howard Pritchard	6c8c9cb4a3	another fix for --enable-dlopen for ugni btl missed a change to create libmca_common_ugni.la file correctly.	2014-11-10 13:40:59 -07:00
Jeff Squyres	a904a2deeb	OpenMPI.3in: remove trailing blank lines; no content changes	2014-11-10 08:38:24 -08:00
Jeff Squyres	31e79f469a	NEWS: whitespace cleanup; no content changes	2014-11-10 08:38:24 -08:00
Jeff Squyres	5271ed9ec6	NEWS: sync with 1.8.4 bullets	2014-11-10 08:38:24 -08:00
Alex Mikheev	097b469f61	OSHMEM: sshmem verbs: fix shared_mr detection It seems that `5ce2f10067` changed default flag values but it did not modify detection code.	2014-11-10 13:34:04 +02:00
Gilles Gouaillardet	d2d7f39a4b	btl/vader: use FRAG_ALLOC_USER when single_copy_mechanism is VADER_NONE	2014-11-10 17:02:45 +09:00
Gilles Gouaillardet	df6115aac4	topo/base: fix uninitialized variable this commit fixes a bug introduced by commit open-mpi/ompi@e7c59e3adb	2014-11-10 13:06:50 +09:00

1 2 3 4 5 ...

21387 Коммитов