openmpi

Author	SHA1	Message	Date
Alina Sklarevich	28586caecf	MTL_MXM/PML_YALLA: fix coverity issues.	2015-03-12 11:49:22 +02:00
Nathan Hjelm	ce6caab2a7	Merge pull request #463 from hjelmn/cuda_async btl/openib: cuda: fix CUDA-aware support with async copy	2015-03-11 09:52:48 -06:00
Alina Sklarevich	f9a9b936a1	PML_YALLA: fix compilation warnings.	2015-03-11 10:58:54 +02:00
Nathan Hjelm	3d32dbd793	btl/openib: cuda: fix CUDA-aware support with async copy This commit should resolve an issue seen with CUDA-aware support. The problem came in with BTL 3.0. Before 3.0 the size of the copy was stored in the incoming segment's des_remote_count field. This field does not exist in BTL 3.0 so I stored the value in the des_segment_count field. This caused problems with the cuda support code. To fix the issue the endpoint pointer is now stored in the in fragment's endpoint pointer, which frees up the segment's des_cbdata pointer for storing the transfer size. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-03-10 14:38:12 -06:00
Mike Dubman	6f91a007e1	Merge pull request #458 from yosefe/topic/pml-yalla-fix-segv keep mxm context alive as long as pml_yalla component is open.	2015-03-10 13:38:14 +02:00
yosefe	976144dca7	keep mxm context alive as long as pml_yalla component is open. pml_yalla_del_comm may be called after yalla module is finalized, which leads to invalid memory access if mxm context is already destroyed in this point.	2015-03-10 11:52:44 +02:00
George Bosilca	420ae98dfe	Remove all unnecessary whitespaces and make sure we close the module correctly.	2015-03-05 13:00:13 -05:00
Alex Mikheev	168c83ed95	OMPI/MXM: add out of band barrier at the end of del_procs mxm shutdown requires out of band barrier	2015-03-02 12:56:02 +02:00
Rolf vandeVaart	30e9dd5066	Look in extra rdma array to find bml. This is needed with recent BML changes. Only affects CUDA-aware code.	2015-02-27 09:02:21 -05:00
George Bosilca	3fd8dc099d	Revert "This function is now useless." This reverts commit `0871c5c489`.	2015-02-26 17:54:46 -05:00
George Bosilca	7f90cedf23	Revert "Fix the logic for computing the different weights for each BTLs. This" This reverts commit `de118609ec`.	2015-02-26 17:54:31 -05:00
George Bosilca	d4c2fc9d41	Merge branch 'master' of github.com:open-mpi/ompi	2015-02-25 12:01:57 -05:00
Mike Dubman	a0afb7d96e	Merge pull request #424 from miked-mellanox/topic/master_fix_yalla fixes issue #414	2015-02-25 19:01:47 +02:00
George Bosilca	f3b58006c8	Merge branch 'master' of github.com:open-mpi/ompi	2015-02-25 12:01:35 -05:00
Jeff Squyres	c3381150de	ob1: fix another PERUSE compile error	2015-02-25 05:53:12 -08:00
yosefe	0332ab4d8b	Initialize pml_yalla bsend request status.	2015-02-25 15:33:26 +02:00
Nathan Hjelm	0ac2f08460	pml/ob1: fix peruse compile error Fixes #416	2015-02-24 15:39:46 -07:00
Nathan Hjelm	5ef24000c7	pml/yalla: fix typo in PML_YALLA_FREELIST_INIT	2015-02-24 10:08:54 -07:00
Nathan Hjelm	5f1254d710	Update code base to use the new opal_free_list_t Use of the old ompi_free_list_t and ompi_free_list_item_t is deprecated. These classes will be removed in a future commit. This commit updates the entire code base to use opal_free_list_t and opal_free_list_item_t. Notes: OMPI_FREE_LIST_*_MT -> opal_free_list_* (uses opal_using_threads ()) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-24 10:05:45 -07:00
Nathan Hjelm	ed78553512	Update opal_free_list_t usage to reflect new class interface. Please verify your components have been updated correctly. Keep in mind that in terms of threading: OPAL_FREE_LIST_GET -> opal_free_list_get_st OPAL_FREE_LIST_RETURN -> opal_free_list_return_st I used the opal_using_threads() variant anytime it appeared multiple threads could be operating on the free list. If this is not the case update to _st. If multiple threads are always in use change to _mt.	2015-02-24 10:05:44 -07:00
Howard Pritchard	c9e81b54fb	Merge pull request #412 from hppritcha/topic/owner_files add owner files to opal/ompi/orte mca directories	2015-02-23 09:48:20 -07:00
Howard Pritchard	bf89131f9e	add owner files to opal/ompi/orte mca directories This commit adds an owner file in each of the component directories for each framework. This allows for a simple script to parse the contents of the files and generate, among other things, tables to be used on the project's wiki page. Currently there are two "fields" in the file, an owner and a status. A tool to parse the files and generate tables for the wiki page will be added in a subsequent commit.	2015-02-22 15:10:23 -07:00
Mike Dubman	00d416ba9d	yalla: fix coverity errors dead code fix	2015-02-22 13:57:45 +02:00
George Bosilca	0871c5c489	This function is now useless.	2015-02-21 16:38:17 -05:00
George Bosilca	de118609ec	Fix the logic for computing the different weights for each BTLs. This removes the call to qsort, as the BTLs are already sorted based on their respective bandwidth.	2015-02-21 16:37:18 -05:00
Rolf vandeVaart	dbd0064713	Fix bug in CUDA-aware and GDR introduced by refactoring	2015-02-18 17:44:28 -05:00
Nathan Hjelm	3847025540	pml/ob1: when using btl_get try to register the entire region before attempting to break the get into multiple rdma fragments A little background. Historically ob1 always registered the entire memory region when the RGET protocol was in use. This changed when Mellanox added support to fragment RGET using the btl_prepare_dst function. Now that the BTL layer has changed to split out the limits of get/put there is explicit fragmentation code in ob1. Before this commit the registration was still done per RGET fragment. This commit will attempt to register the entire region before creating RGET fragments. If the registration is successful then all RGET fragments will use this registration; otherwise they will each attempt to register their own segment of the receive buffer. If that fails enough times each fragment will give up and fall back on send/recv. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-13 11:46:37 -07:00
Nathan Hjelm	868e10caf2	pml/bfo: ompi ignore until updated for BTL 3.0 interface Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-13 11:46:37 -07:00
Nathan Hjelm	c4a0e02261	pml/ob1: update for BTL 3.0 interface Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-13 11:46:37 -07:00
Jeff Squyres	f38f2a159b	pml_base: whitespace cleanup; no code changes	2015-02-06 11:27:50 -08:00
Jeff Squyres	46a1722dfc	pml_base: fix errant show_help message	2015-02-06 11:27:50 -08:00
Yohann Burette	1ad188206b	Add OFI MTL to CM PML. This allows the CM PML to be picked when the OFI MTL is selected.	2015-01-20 10:50:14 -08:00
George Bosilca	df0512550e	The extent of the datatype is irrelevant for deciding to do an immediate send as long as we have to pack.	2015-01-19 02:23:12 -05:00
Gilles Gouaillardet	d14daf40d0	ob1: correctly handle types in which size > extent do not send inline if extent*count OR size*count are greater than 256	2015-01-19 14:07:23 +09:00
Howard Pritchard	3fc7b389ff	initial async progress changes for gni	2014-12-24 11:50:23 -07:00
yosefe	3f152733bf	Add yalla to the list of default PMLs	2014-12-01 13:11:28 +02:00
Nathan Hjelm	1b564f62bd	Revert "Merge pull request #275 from hjelmn/btlmod" This reverts commit `ccaecf0fd6`, reversing changes made to `6a19bf85dd`.	2014-11-19 23:22:43 -07:00
Nathan Hjelm	1a5349ec79	ompi ignore bfo until it is updated for new btl interface	2014-11-19 11:33:04 -07:00
Nathan Hjelm	0110603782	ob1 warning fix	2014-11-19 11:33:04 -07:00
Nathan Hjelm	24427639b6	Fix ob1 warnings	2014-11-19 11:33:03 -07:00
Nathan Hjelm	271818f887	pml/ob1: bug fixes and adjustments for changes in btl_sendi behavior	2014-11-19 11:33:03 -07:00
Nathan Hjelm	ee2b111011	Update PML for latest BTL update	2014-11-19 11:33:02 -07:00
Nathan Hjelm	c61e017177	pml: updates to reflect member changes in mca_btl_base_descriptor_t and mca_btl_base_module_t structures	2014-11-19 11:33:02 -07:00
Nathan Hjelm	5936411a07	pml/ob1: when using btl_get try to register the entire region before attempting to break the get into multiple rdma fragments A little background. Historically ob1 always registered the entire memory region when the RGET protocol was in use. This changed when Mellanox added support to fragment RGET using the btl_prepare_dst function. Now that the BTL layer has changed to split out the limits of get/put there is explicit fragmentation code in ob1. Before this commit the registration was still done per RGET fragment. This commit will attempt to register the entire region before creating RGET fragments. If the registration is successful then all RGET fragments will use this registration; otherwise they will each attempt to register their own segment of the receive buffer. If that fails enough times each fragment will give up and fall back on send/recv.	2014-11-19 11:33:02 -07:00
Nathan Hjelm	b75bb8aea7	Update pml for btl changes	2014-11-19 11:33:02 -07:00
Jeff Squyres	7a5b2e9b13	ob1: change an OPAL_UNLIKELY to OPAL_LIKELY Per `924d39e415 (commitcomment-8378266)`, this OPAL_UNLIKELY should really be OPAL_LIKELY.	2014-10-31 03:22:55 -07:00
George Bosilca	924d39e415	Always OBJ_DESTRUCT the send request.	2014-10-30 01:28:50 -04:00
Gilles Gouaillardet	ed93c8787d	ob1: add a destructor to mca_pml_ob1_recv_request_t opal_mutex_t must be OBJ_DESTRUCTed in order to avoid a memory leak (pthread_mutex_init allocates memory under Cygwin, so pthread_mutex_destroy is mandatory) Thanks to Marco Atzeri for reporting this issue	2014-10-29 13:30:29 +09:00
Jeff Squyres	c22e1ae33b	configury: new OPAL_SET_LIB_PREFIX/ORTE_SET_LIB_PREFIX macros These two macros set the prefix for the OPAL and ORTE libraries, respectively. Specifically, the OPAL library will be named libPREFIXopen-pal.la and the ORTE library will be named libPREFIXopen-rte.la. These macros must be called, even if the prefix argument is empty. The intent is that Open MPI will call these macros with an empty prefix, but other projects (such as ORCM) will call these macros with a non-empty prefix. For example, ORCM libraries can be named liborcm-open-pal.la and liborcm-open-rte.la. This scheme is necessary to allow running Open MPI applications under systems that use their own versions of ORTE and OPAL. For example, when running MPI applications under ORTE, if the ORTE and OPAL libraries between OMPI and ORCM are not identical (which, because they are released at different times, are likely to be different), we need to ensure that the OMPI applications link against their ORTE and OPAL libraries, but the ORCM executables link against their ORTE and OPAL libraries.	2014-10-22 10:32:19 -07:00
yosefe	b4f569b4d4	yalla: address comments on #246 by @jsquires	2014-10-22 10:42:56 +03:00

1096 commits