openmpi

Author	SHA1	Message	Date
Gleb Natapov	e2e211f23b	Add flags parameter to btl_alloc() and btl_prepare_src() functions. If BTL knows at the time of allocation priority of a descriptor it may do some optimizations. This commit was SVN r16901.	2007-12-09 14:08:01 +00:00
Gleb Natapov	7364b7cf47	Add endpoint parameter to btl_alloc() function. Enables various optimizations inside BTL. This commit was SVN r16898.	2007-12-09 14:00:42 +00:00
Gleb Natapov	2d784752dd	Remove descriptor caching from BML. With descriptor caching some optimizations are impossible. This commit was SVN r16897.	2007-12-09 13:58:17 +00:00
Jeff Squyres	3bc940ac27	Fix three things from r15474 (thanks to Brian for noticing): * bml.h had a change that introduced a variable named "_order" to avoid a conflict with a local variable. The namespace starting with _ belongs to the os/compiler/kernel/not us. So we can't start symbols with _. So I replaced it with arg_order, and also updated the threaded equivalent of the macro that was modified. * in btl_openib_proc.c, one opal_output accidentally had its string reverted from "ompi_modex_recv..." to "mca_pml_base_modex_recv....". This was fixed. * The change to ompi/runtime/ompi_preconnect.c was entirely reverted; it was an artifact of debugging. This commit was SVN r15475. The following SVN revision numbers were found above: r15474 --> open-mpi/ompi@8ace07efed	2007-07-18 11:38:06 +00:00
Jeff Squyres	8ace07efed	This commit brings in two major things: 1. Galen's fine-grain control of queue pair resources in the openib BTL. 2. Pasha's new implementation of asynchronous HCA event handling. Pasha's new implementation doesn't take much explanation, but the new "multifrag" stuff does. Note that "svn merge" was not used to bring this new code from the /tmp/ib_multifrag branch -- something Bad happened in the periodic trunk pulls on that branch making an actual merge back to the trunk effectively impossible (i.e., lots and lots of arbitrary conflicts and artificial changes). :-( == Fine-grain control of queue pair resources == Galen's fine-grain control of queue pair resources to the OpenIB BTL (thanks to Gleb for fixing broken code and providing additional functionality, Pasha for finding broken code, and Jeff for doing all the svn work and regression testing). Prior to this commit, the OpenIB BTL created two queue pairs: one for eager size fragments and one for max send size fragments. When the use of the shared receive queue (SRQ) was specified (via "-mca btl_openib_use_srq 1"), these QPs would use a shared receive queue for receive buffers instead of the default per-peer (PP) receive queues and buffers. One consequence of this design is that receive buffer utilization (the size of the data received as a percentage of the receive buffer used for the data) was quite poor for a number of applications. The new design allows multiple QPs to be specified at runtime. Each QP can be set up to use PP or SRQ receive buffers as well as giving fine-grained control over receive buffer size, number of receive buffers to post, when to replenish the receive queue (low water mark) and for SRQ QPs, the number of outstanding sends can also be specified. 
The following is an example of the syntax to describe QPs to the OpenIB BTL using the new MCA parameter btl_openib_receive_queues: {{{ -mca btl_openib_receive_queues \ "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32" }}} Each QP description is delimited by ";" (semicolon) with individual fields of the QP description delimited by "," (comma). The above example therefore describes 4 QPs. The first QP is: P,128,16,4 Meaning: per-peer receive buffer QPs are indicated by a starting field of "P"; the first QP (shown above) is therefore a per-peer based QP. The second field indicates the size of the receive buffer in bytes (128 bytes). The third field indicates the number of receive buffers to allocate to the QP (16). The fourth field indicates the low watermark for receive buffers at which time the BTL will repost receive buffers to the QP (4). The second QP is: S,1024,256,128,32 Shared receive queue based QPs are indicated by a starting field of "S"; the second QP (shown above) is therefore a shared receive queue based QP. The second, third and fourth fields are the same as in the per-peer based QP. The fifth field is the number of outstanding sends that are allowed at a given time on the QP (32). This provides a "good enough" mechanism of flow control for some regular communication patterns. QPs MUST be specified in ascending receive buffer size order. This requirement may be removed prior to 1.3 release. This commit was SVN r15474.	2007-07-18 01:15:59 +00:00
George Bosilca	752909c628	These are supposed to have a high probability of success. This commit was SVN r15377.	2007-07-11 23:02:47 +00:00
Gleb Natapov	b88b7dedfe	Rename btl_rdma_offset to btl_pipeline_send_length. This commit was SVN r15153.	2007-06-21 07:12:40 +00:00
Galen Shipman	3401bd2b07	Add optional ordering to the BTL interface. This is required to tighten up the BTL semantics. Ordering is not guaranteed, but, if the BTL returns an order tag in a descriptor (other than MCA_BTL_NO_ORDER) then we may request another descriptor that will obey ordering w.r.t. the other descriptor. This will allow sane behavior for RDMA networks, where local completion of an RDMA operation on the active side does not imply remote completion on the passive side. If we send a FIN message after local completion and the FIN is not ordered w.r.t. the RDMA operation then badness may occur as the passive side may now try to deregister the memory and the RDMA operation may still be pending on the passive side. Note that this has no impact on networks that don't suffer from this limitation as the ORDER tag can simply always be specified as MCA_BTL_NO_ORDER. This commit was SVN r14768.	2007-05-24 19:51:26 +00:00
Gleb Natapov	3ebaff8dfe	Implement new BTL parameters: We eagerly send data up to btl_*_eager_limit with the match. Upon ACK of the MATCH we start using send/receives of size btl_*_max_send_size up to the btl_*_rdma_pipeline_offset. After the btl_*_rdma_pipeline_offset we begin using RDMA writes of size btl_*_rdma_pipeline_frag_size. Now, on a per message basis we only use the above protocol if the message is larger than btl_*_min_rdma_pipeline_size. btl_*_eager_limit -> same; btl_*_max_send_size -> same; btl_*_rdma_pipeline_offset -> btl_*_min_rdma_size; btl_*_rdma_pipeline_frag_size -> btl_*_max_rdma_size; btl_*_min_rdma_pipeline_size is new. This patch also moves all BTL common parameters initialisation into btl_base_mca.c file. This commit was SVN r14681.	2007-05-17 07:54:27 +00:00
Rich Graham	ce35761683	Make sure not to go out of bounds. Element i+1 of bml_btls is referenced, which for i = arr_size-1 is beyond the array dimensions. This commit was SVN r14464.	2007-04-22 21:43:34 +00:00
Josh Hursey	dadca7da88	Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD). This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158	2007-03-16 23:11:45 +00:00
George Bosilca	a02d1c7c8d	No more warnings. This commit was SVN r13382.	2007-01-31 04:27:41 +00:00
Rainer Keller	125ba1acfa	- Reduce the amount of warnings with -Wshadow -- mainly due to usage of index and abs in inline-fcts in header files. This commit was SVN r13217.	2007-01-19 19:48:06 +00:00
Gleb Natapov	4c7dbd36c7	Balance RDMA operations in round robin fashion between all available RDMA BTLs. OB1 always uses the first element from the array of BTLs available for RDMA. The patch changes the array creation algorithm: it puts a different BTL in the first element in round robin fashion. This commit was SVN r13174.	2007-01-18 09:15:18 +00:00
Gleb Natapov	190e7a27cd	Merge with gleb-mpool branch. All RDMA components use the same mpool now (rdma). udapl/openib/vapi/gm mpools are deprecated. rdma mpool has a parameter that allows limiting its size: mpool_rdma_rcache_size_limit (default is 0 - unlimited). This commit was SVN r12878.	2006-12-17 12:26:41 +00:00
George Bosilca	06563b5dec	Last set of explicit conversions. We are now close to zero warnings on all platforms. The only exceptions (and I will not deal with them anytime soon) are on Windows: - the write functions which require the length to be an int when it's a size_t on all UNIX variants. - all iovec manipulation functions where the iov_len is again an int when it's a size_t on most of the UNIXes. As these only happen on Windows, I think we're set for now :) This commit was SVN r12215.	2006-10-20 03:57:44 +00:00
George Bosilca	6afa4c6c64	Windows friendly version. We have to split the OMPI_DECLSPEC in at least 3 different macros, one for each project. Therefore, now we have OPAL_DECLSPEC, ORTE_DECLSPEC and OMPI_DECLSPEC. Please use them based on the sub-project. This commit was SVN r11270.	2006-08-20 15:54:04 +00:00
Galen Shipman	3b49953ce2	Add error callback to the btl interface; this allows errors to be delivered to the upper layer asynchronously, although there are some issues with this.. such as there are multiple consumers of the btl's.. who gets the This commit was SVN r11232.	2006-08-16 20:21:38 +00:00
Brian Barrett	dd6fa1da2a	* Fix for ticket #242 , print a friendly error message if we can't reach a particular peer. Will now fail during MPI_INIT. Printing of the error messages about no endpoints can be turned off. This commit was SVN r11181.	2006-08-14 19:17:36 +00:00
Brian Barrett	24de93e2d4	* fix some compile errors with having OB1 direct called This commit was SVN r11170.	2006-08-14 15:43:03 +00:00
Gleb Natapov	91f48f9a79	Merge with gleb-pml branch. Add out of resource handling support to PML layer. If resource is not available request is added to one of the pending list and retried later. This commit was SVN r10900.	2006-07-20 14:44:35 +00:00
Brian Barrett	47725c9b02	* Add new PML (CM) and network drivers (MTL) for high speed interconnects that provide matching logic in the library. Currently includes support for MX and some support for Portals * Fix overuse of proc_pml pointer on the ompi_proc structure, splitting into proc_pml for pml data and proc_bml for the BML endpoint data * bug fixes in bsend init code, which wasn't being used by the OB1 or DR PMLs... This commit was SVN r10642.	2006-07-04 01:20:20 +00:00
Galen Shipman	e6cd8db0e5	DR will now checksum on a per btl basis (see MCA_BTL_FLAGS_NEED_CSUM). We still always send ACK's, teasing apart completion for ACK/no ACK looks like a pain in the .. This commit was SVN r10530.	2006-06-27 20:23:47 +00:00
George Bosilca	837221831a	Temporary solution for in-bound computation of the next BTL. This commit was SVN r10016.	2006-05-22 23:28:40 +00:00
George Bosilca	e43fbd0082	Remove all useless variables. Minor cleanups. This commit was SVN r10000.	2006-05-21 05:53:22 +00:00
Galen Shipman	9165882c07	fixes for failover... This commit was SVN r9998.	2006-05-20 02:39:05 +00:00
Tim Woodall	fdd622544b	added optional copy routine to allow "derived" class of mca_bml_base_endpoint to copy state if an endpoint is updated (e.g. btl deleted/added) This commit was SVN r9814.	2006-05-04 15:19:12 +00:00
Galen Shipman	5271948ec0	--- opal object changes add object size to opal class no longer need the size when allocating a new object as this is stored in the class structure --- dr changes Previous rev. maintained state on the communicator used for acking duplicate fragments, but the communicator may be destroyed prior to successful delivery of an ack to the peer. We must therefore maintain this state globally on a per peer, not a per peer, per communicator basis. This requires that we use a global rank on the wire and translate this as appropriate to a local rank within the communicator. This commit was SVN r9454.	2006-03-29 16:19:17 +00:00
Tim Woodall	c1bf71b1be	- updated copyrights - removed unused state - starting to add support for btl failover This commit was SVN r9431.	2006-03-27 22:48:12 +00:00
Galen Shipman	f609204cc5	disable reliability checking in bml This commit was SVN r9392.	2006-03-23 17:50:20 +00:00
Tim Woodall	c1bec478c4	updates to reliability debug code This commit was SVN r9390.	2006-03-23 17:00:20 +00:00
Galen Shipman	7ce7baff15	more bml work This commit was SVN r9341.	2006-03-20 21:58:20 +00:00
Galen Shipman	5600932c2f	fix misc warnings This commit was SVN r9339.	2006-03-20 15:41:45 +00:00
Galen Shipman	15bdbd5ca1	add parameter names to cb func This commit was SVN r9338.	2006-03-20 15:29:35 +00:00
Tim Woodall	bd870519fd	- modified convertor copy_and_prepare routines to accept an additional flag, new flags to be included when convertor is initialized - modified pml/btl module defs and added stub functions for diagnostic output routines to dump state of queues / endpoints - updates to data reliability pml This commit was SVN r9329.	2006-03-17 18:46:48 +00:00
George Bosilca	8fb84e90ce	It's already done in the send ... we don't have to initialize this field several times. This commit was SVN r9282.	2006-03-14 21:55:57 +00:00
Tim Woodall	c83b2fce4d	resolve threading issue This commit was SVN r9233.	2006-03-09 17:57:31 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
Galen Shipman	44fe6c3896	allow pml pipeline to cache memory registrations to enable this (off by default) use: -mca pml_ob1_leave_pinned_pipeline 1 !!AND!!! -mca mpool_use_mem_hooks 1 This commit was SVN r8949.	2006-02-09 15:49:51 +00:00
George Bosilca	8d26696c06	Remove the local variable that shadow the real one. This commit was SVN r8763.	2006-01-20 02:39:30 +00:00
George Bosilca	29c80875a8	As requested move back the alloc and return to macros instead of static inlined functions. This commit was SVN r8762.	2006-01-19 23:58:04 +00:00
George Bosilca	be64d98687	We're thread safe again. The atomic compare-and-swap was not used in the correct way, allowing the descriptors to vanish. The PML was thinking that they were in the btl_cache when they weren't ... It led to memory consumption on most environments when compiled with threads enabled. After modification the latency went down by nearly 0.5 microseconds. Simple way to trigger the bug: limit the maximum number of items in the free list and run any communication intensive application (like Netpipe). This commit was SVN r8741.	2006-01-19 06:45:29 +00:00
Tim Woodall	3c170c410c	changes required by dr This commit was SVN r8580.	2005-12-21 15:11:40 +00:00
George Bosilca	bc2e80cbc6	This one can lead to memory corruptions. We modify the content of a buffer after calling a function supposed to free it. It happens right now that the destruct function just put it in the cache but still I don't think we should touch the buffer anymore. This commit was SVN r8147.	2005-11-13 23:07:37 +00:00
Jeff Squyres	42ec26e640	Update the copyright notices for IU and UTK. This commit was SVN r7999.	2005-11-05 19:57:48 +00:00
Tim Woodall	4a71621410	merge in scheduling changes from release branch This commit was SVN r7699.	2005-10-11 20:41:51 +00:00
Galen Shipman	d932cfd342	merge of rcache work into the trunk.. lotsa fun ;-).. I regression tested before the merge, I will regression test tonight and correct issues that might have crept in. This commit was SVN r7329.	2005-09-12 22:28:23 +00:00
George Bosilca	c9fb1f32f2	And more dependencies fixes. The big commit will follow shortly. This commit was SVN r7319.	2005-09-12 20:22:59 +00:00
Tim Woodall	166ecc9544	removed dependency on ob1 This commit was SVN r6927.	2005-08-18 17:06:59 +00:00
Tim Woodall	f274f524ab	- added get based protocol (if supported by btl) for pre-registered memory - removed 8 bytes from the majority of the pml headers This commit was SVN r6916.	2005-08-17 18:23:38 +00:00
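
Note on commit 8ace07efed: the btl_openib_receive_queues syntax described in that message can be illustrated with a small parser. This is only a sketch written from the commit text, not the parsing code in the openib BTL; the names `QueueSpec` and `parse_receive_queues` are hypothetical, and treating equal buffer sizes as acceptable "ascending order" is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QueueSpec:
    """One QP description as laid out in the commit message (hypothetical type)."""
    kind: str                                # "P" = per-peer, "S" = shared receive queue
    buffer_size: int                         # receive buffer size in bytes
    num_buffers: int                         # receive buffers allocated to the QP
    low_watermark: int                       # repost receive buffers at this level
    max_pending_sends: Optional[int] = None  # fifth field, "S" queues only


def parse_receive_queues(value: str) -> List[QueueSpec]:
    """Parse a string such as "P,128,16,4;S,1024,256,128,32"."""
    specs = []
    for entry in value.split(";"):           # QPs are delimited by ";"
        fields = entry.strip().split(",")    # fields are delimited by ","
        kind = fields[0]
        if kind == "P" and len(fields) == 4:
            size, count, low = (int(f) for f in fields[1:])
            specs.append(QueueSpec(kind, size, count, low))
        elif kind == "S" and len(fields) == 5:
            size, count, low, sends = (int(f) for f in fields[1:])
            specs.append(QueueSpec(kind, size, count, low, sends))
        else:
            raise ValueError(f"malformed QP description: {entry!r}")

    # The commit message requires ascending receive buffer size order.
    # Assumption: equal sizes are tolerated here.
    sizes = [s.buffer_size for s in specs]
    if sizes != sorted(sizes):
        raise ValueError("QPs must be listed in ascending receive buffer size order")
    return specs
```

Passing the example string from the commit message yields four entries: one per-peer QP followed by three shared receive queue QPs.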


52 Commits