openmpi

Author	SHA1	Message	Date
Gleb Natapov	c9a1b06771	Remove trailing whitespaces. No code changes in this commit. This commit was SVN r17167.	2008-01-21 12:11:18 +00:00
Gleb Natapov	621fa223c5	Create free lists of fragments per HCA, not per BTL. Saves memory in case of multiple LMCs. This commit was SVN r17082.	2008-01-09 10:26:21 +00:00
Gleb Natapov	493951e09d	Add heterogeneous support to message coalescing. This commit was SVN r16903.	2007-12-09 14:10:25 +00:00
Gleb Natapov	5313a2baa7	Message coalescing for openib BTL. If fragment is waiting to be transmitted in a pending queue pack another message into it if there is enough space there. This commit was SVN r16900.	2007-12-09 14:05:13 +00:00
Gleb Natapov	7302cd24eb	Call btl_alloc() from btl_prepare_src() to have one point of frag allocation. This commit was SVN r16899.	2007-12-09 14:02:32 +00:00
Gleb Natapov	a9f864d15c	If there is an eager rdma credit, but there is no WQE to send a packet we add it to a pending queue of eager rdma QP instead of correct pending list. This patch fixes this by getting rid of "eager rdma qp" notion. Packet is always sent over its order QP. The patch also adds two pending queues for high and low prio packets. Only high prio packets are sent over eager RDMA channel. This commit was SVN r16780.	2007-11-28 07:12:44 +00:00
Gleb Natapov	6a2d210b7d	Use OMPI object system to make fragment hierarchy more object oriented. The main idea (apart from cleanup) is to save on initialisation of unneeded fields and to use C type checking system to catch obvious errors. This commit was SVN r16779.	2007-11-28 07:11:14 +00:00
Gleb Natapov	3a63eb6c17	Cleanup macro definitions. This commit was SVN r16554.	2007-10-23 13:33:19 +00:00
Gleb Natapov	d836f3dbbe	Remove unused macro. This commit was SVN r16552.	2007-10-23 13:18:10 +00:00
Gleb Natapov	9e2d5acf8e	Remove unused field from openib fragment structure. This commit was SVN r16549.	2007-10-23 07:38:29 +00:00
Gleb Natapov	c7105eadc7	Update Voltaire copyright. This commit was SVN r16189.	2007-09-24 10:11:52 +00:00
Gleb Natapov	9c20d67301	1) Return IB header to its previous size by using char for cm_seen field. 2) Allow to specify rd_win/rd_rsv parameters by user, but make them optional. This commit was SVN r15719.	2007-08-01 12:10:56 +00:00
Galen Shipman	438a56e0d7	update copyrights for ib_multifrag commit This commit was SVN r15612.	2007-07-25 15:03:34 +00:00
Gleb Natapov	5b7d3faedc	Implement "credit management for credit messages" protocol. On each message a sender piggybacks a number of credit messages it received from a peer. A number of outstanding credit messages is limited. This is needed to never ever fall back to HW flow control. This commit was SVN r15580.	2007-07-24 15:19:51 +00:00
Jeff Squyres	8ace07efed	This commit brings in two major things: 1. Galen's fine-grain control of queue pair resources in the openib BTL. 2. Pasha's new implementation of asynchronous HCA event handling. Pasha's new implementation doesn't take much explanation, but the new "multifrag" stuff does. Note that "svn merge" was not used to bring this new code from the /tmp/ib_multifrag branch -- something Bad happened in the periodic trunk pulls on that branch making an actual merge back to the trunk effectively impossible (i.e., lots and lots of arbitrary conflicts and artificial changes). :-( == Fine-grain control of queue pair resources == Galen's fine-grain control of queue pair resources to the OpenIB BTL (thanks to Gleb for fixing broken code and providing additional functionality, Pasha for finding broken code, and Jeff for doing all the svn work and regression testing). Prior to this commit, the OpenIB BTL created two queue pairs: one for eager size fragments and one for max send size fragments. When the use of the shared receive queue (SRQ) was specified (via "-mca btl_openib_use_srq 1"), these QPs would use a shared receive queue for receive buffers instead of the default per-peer (PP) receive queues and buffers. One consequence of this design is that receive buffer utilization (the size of the data received as a percentage of the receive buffer used for the data) was quite poor for a number of applications. The new design allows multiple QPs to be specified at runtime. Each QP can be setup to use PP or SRQ receive buffers as well as giving fine-grained control over receive buffer size, number of receive buffers to post, when to replenish the receive queue (low water mark) and for SRQ QPs, the number of outstanding sends can also be specified. 
The following is an example of the syntax to describe QPs to the OpenIB BTL using the new MCA parameter btl_openib_receive_queues: {{{ -mca btl_openib_receive_queues \ "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32" }}} Each QP description is delimited by ";" (semicolon) with individual fields of the QP description delimited by "," (comma). The above example therefore describes 4 QPs. The first QP is: P,128,16,4 Meaning: per-peer receive buffer QPs are indicated by a starting field of "P"; the first QP (shown above) is therefore a per-peer based QP. The second field indicates the size of the receive buffer in bytes (128 bytes). The third field indicates the number of receive buffers to allocate to the QP (16). The fourth field indicates the low watermark for receive buffers at which time the BTL will repost receive buffers to the QP (4). The second QP is: S,1024,256,128,32 Shared receive queue based QPs are indicated by a starting field of "S"; the second QP (shown above) is therefore a shared receive queue based QP. The second, third and fourth fields are the same as in the per-peer based QP. The fifth field is the number of outstanding sends that are allowed at a given time on the QP (32). This provides a "good enough" mechanism of flow control for some regular communication patterns. QPs MUST be specified in ascending receive buffer size order. This requirement may be removed prior to 1.3 release. This commit was SVN r15474.	2007-07-18 01:15:59 +00:00
Pavel Shamis	e2d0e27111	Adding: * openib_finalize flow for openib btl * async event handler for openib btl This commit was SVN r14623.	2007-05-08 21:47:21 +00:00
Rainer Keller	1aceece03f	- Add a few comments for elements for structs, a few spelling fixes. No functional change. This commit was SVN r14534.	2007-04-26 21:03:38 +00:00
Gleb Natapov	1f3ac2d7ae	Hold pointers to free_max/free_eager lists in array indexed by priority. This eliminates couple of ifs from fast path. This commit was SVN r14031.	2007-03-14 14:36:03 +00:00
Gleb Natapov	90fb58de4f	When frags are allocated from mpool by free_list the frag structure is also allocated from mpool memory (which is registered memory for RDMA transports). This is not a problem for small jobs, but for a big number of ranks the amount of wasted memory is big. This commit was SVN r13921.	2007-03-05 14:17:50 +00:00
Gleb Natapov	2b6cbd6299	Separate frag lists for RDMA descriptors to two, one for src descriptors and another for dst descriptors. This provides a partial solution to the OB1 protocol deadlock problem. We can limit number of RDMA descriptors (by setting btl_openib_free_list_max to something different from -1) and if we are lucky enough to hit this limit before we fail to register more memory the protocol will not deadlock. When we had only one list for src/dst descriptors we deadlocked when we reached max limit for the list. This commit was SVN r13844.	2007-02-28 13:43:38 +00:00
Galen Shipman	4a6ad30440	remove unused macro calls.. This commit was SVN r13107.	2007-01-12 23:17:17 +00:00
Galen Shipman	2097d174f6	heterogeneous fixes to the OpenIB BTL. This includes work by nysal, brian and I. This commit was SVN r13106.	2007-01-12 23:14:45 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	190e7a27cd	Merge with gleb-mpool branch. All RDMA components use same mpool now (rdma). udapl/openib/vapi/gm mpools are deprecated. rdma mpool has parameter that allows to limit its size mpool_rdma_rcache_size_limit (default is 0 - unlimited). This commit was SVN r12878.	2006-12-17 12:26:41 +00:00
Brian Barrett	0653dc3f24	Pad headers to eliminate heterogeneous issues. Add conversion functions for switching endianness of headers. Galen is going to add the code to use the endian stuff... This commit was SVN r12876.	2006-12-17 00:50:59 +00:00
Gleb Natapov	d0caffa0aa	Consolidate receive buffers prepost code for HP/LP QPs. This commit was SVN r11552.	2006-09-07 13:05:41 +00:00
Gleb Natapov	c13240a1d1	remove rdma_credits from openib BTL header. Use one field for regular and rdma credits. This commit was SVN r11529.	2006-09-05 16:02:09 +00:00
Gleb Natapov	fe932ca7bf	consolidate part of HP/LP fields. This commit was SVN r11528.	2006-09-05 16:00:18 +00:00
Gleb Natapov	ffe7051488	fix compilation warnings. This commit was SVN r11524.	2006-09-05 09:16:22 +00:00
Gleb Natapov	c70eb43e43	Align eager RDMA buffer so that last byte of the buffer is on the last byte of the CPU cache line. Improves zero byte latency a little bit because of L1 cache miss reduction. This commit was SVN r11465.	2006-08-28 11:03:56 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Gleb Natapov	72575d81d2	Create separate pool for control messages. It is unlimited, but the maximum number of element that are allocated from it is limited by number of connections. This commit was SVN r11028.	2006-07-27 14:09:30 +00:00
Gleb Natapov	3b34dc8df8	remove MCA_BTL_IB_FRAG_ALIGN. Alignment is handled in free_list_t. This commit was SVN r10945.	2006-07-23 12:33:49 +00:00
Gleb Natapov	91f48f9a79	Merge with gleb-pml branch. Add out of resource handling support to PML layer. If resource is not available request is added to one of the pending list and retried later. This commit was SVN r10900.	2006-07-20 14:44:35 +00:00
Gleb Natapov	e58a89ef3e	OMPI_ENABLE_DEBUG is always defined (to 0 or 1). Use #if and not #ifdef. This commit was SVN r10537.	2006-06-28 11:25:09 +00:00
Galen Shipman	218a438509	finished the ompi_free_list_t class nightmare.. This commit was SVN r10314.	2006-06-12 22:09:03 +00:00
Galen Shipman	0344ae4ac5	Fix to allow eager limit and max send size to be any size (within resource limitations). Instead of storing the ompi_free_list_t * in the fragment, we use the frag type enum, this tells us where the frag came from and where it should return.. This could also be done in mvapi but is not a high priority moving forward.. Review by Brian, needs to hit the trunk + 1.1 release.. This commit was SVN r10157.	2006-06-01 02:32:18 +00:00
Gleb Natapov	f590d8a190	fix eager RDMA on PPC64. This commit was SVN r10059.	2006-05-25 11:05:12 +00:00
Gleb Natapov	79bcfb096f	Add type to frag. Sometimes we need to know that a frag is from short rdma area. I used hack for this that doesn't work for mvapi, so changing it to something more sane. This commit was SVN r9477.	2006-03-30 15:26:21 +00:00
Gleb Natapov	a5a78b10cc	Implementation of short message RDMA. Endpoint registers circular buffer and sends its address and rkey to the peer. Peer uses this buffer to eagerly RDMA small message into it. Endpoint polls the buffer for message arrival before checking HP/LP QPs. Set btl_openib_use_eager_rdma to 1 to enable it. This commit was SVN r9425.	2006-03-26 08:30:50 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
Tim Woodall	a584c60dbe	re-worked flow control logic to take into account the return of credits from the peer prior to local completion, so that we don't overrun the number of send wqes available. This commit was SVN r8683.	2006-01-12 23:42:44 +00:00
Tim Woodall	4a06e8463c	port of flow control from mvapi This commit was SVN r8102.	2005-11-10 20:15:02 +00:00
Jeff Squyres	42ec26e640	Update the copyright notices for IU and UTK. This commit was SVN r7999.	2005-11-05 19:57:48 +00:00
Galen Shipman	946402b980	More openib cleanup.. still not ready for public consumption ;-) This commit was SVN r6565.	2005-07-20 15:17:18 +00:00
Galen Shipman	d7bdc46ac9	compile error and warning fixes for openib.. This commit was SVN r6449.	2005-07-12 21:49:30 +00:00
Galen Shipman	454fdff824	Initial commit of changes to the mvapi btl to the openib btl. Still need to work on the configure.stub to correctly locate the ib libraries. This commit was SVN r6435.	2005-07-12 13:38:54 +00:00
Brian Barrett	761402f95f	* rename ompi_list to opal_list This commit was SVN r6322.	2005-07-03 16:22:16 +00:00
Jeff Squyres	4ab17f019b	Rename src -> ompi This commit was SVN r6269.	2005-07-02 13:43:57 +00:00

50 Commits
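
The btl_openib_receive_queues syntax described in commit 8ace07efed (SVN r15474) can be illustrated with a small parser. This is only a sketch in Python based on that commit message, not the Open MPI implementation: the names QPSpec and parse_receive_queues and all field names are hypothetical, and it ignores any optional fields added by later commits (for example the rd_win/rd_rsv parameters mentioned in r15719).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QPSpec:
    """One queue pair description (hypothetical field names)."""
    kind: str                 # "P" = per-peer, "S" = shared receive queue
    buffer_size: int          # receive buffer size in bytes
    num_buffers: int          # number of receive buffers to allocate
    low_watermark: int        # repost receive buffers at this level
    max_pending_sends: Optional[int] = None  # fifth field, "S" QPs only


def parse_receive_queues(spec: str) -> List[QPSpec]:
    """Parse a string such as "P,128,16,4;S,1024,256,128,32"."""
    qps: List[QPSpec] = []
    for chunk in spec.split(";"):          # QPs are delimited by ";"
        fields = [f.strip() for f in chunk.split(",")]  # fields by ","
        kind = fields[0]
        expected = {"P": 4, "S": 5}.get(kind)
        if expected is None:
            raise ValueError(f"unknown QP type {kind!r} in {chunk!r}")
        if len(fields) != expected:
            raise ValueError(
                f"{kind} QP needs {expected} fields, got {len(fields)}: {chunk!r}"
            )
        numbers = [int(f) for f in fields[1:]]
        qps.append(QPSpec(kind, *numbers))

    # The commit message requires ascending receive buffer size order.
    # Assumption: equal sizes are tolerated; only a decrease is rejected.
    for prev, cur in zip(qps, qps[1:]):
        if cur.buffer_size < prev.buffer_size:
            raise ValueError(
                "QPs must be listed in ascending receive buffer size order"
            )
    return qps


if __name__ == "__main__":
    example = "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32"
    for qp in parse_receive_queues(example):
        print(qp)
```

Run against the example string from the commit message, the sketch yields four QP descriptions: one per-peer QP followed by three shared-receive-queue QPs with increasing buffer sizes.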