openmpi

Author	SHA1	Message	Date
George Bosilca	e361bcb64c	Send optimizations. 1. The send path gets shorter. The BTL is allowed to return > 0 to specify that the descriptor was pushed to the networks, and that the memory attached to it is available again for the upper layer. The MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag can be used by the PML to force the BTL to always trigger the callback. Unmodified BTLs will continue to work as expected, as they will return OMPI_SUCCESS, which forces the PML to have exactly the same behavior as before. Some BTLs have been modified: self, sm, tcp, mx. 2. Add send immediate interface to BTL. The idea is to have a mechanism of allowing the BTL to take advantage of send optimizations such as the ability to deliver data "inline". Some network APIs such as Portals allow data to be sent using a "thin" event without packing data into a memory descriptor. This interface change allows the BTL to use such capabilities and allows for other optimizations in the future. All existing BTLs except for Portals and sm have this interface set to NULL. This commit was SVN r18551.	2008-05-30 03:58:39 +00:00
Gleb Natapov	6e4155d111	Initialize local variable before use. This commit was SVN r17170.	2008-01-21 15:17:49 +00:00
Gleb Natapov	e2e211f23b	Add flags parameter to btl_alloc() and btl_prepare_src() functions. If BTL knows at the time of allocation priority of a descriptor it may do some optimizations. This commit was SVN r16901.	2007-12-09 14:08:01 +00:00
Gleb Natapov	52c6160252	MCA_PML_BASE_REQUEST_MPI_COMPLETE() macro does nothing except call to ompi_request_complete(). Remove the macro and call the function directly. This commit was SVN r16498.	2007-10-18 14:20:24 +00:00
George Bosilca	433f8a7694	This patch brings full support for message queues in Open MPI. Now the send and receive queues are shared among all PMLs, they are declared in the base PML, and the selected PML is in charge of initializing and releasing them. The CM PML is slightly different compared with OB1 or DR. Internally it uses 2 different types of requests: light and heavy. However, now with this patch both types of requests are stored in the same queue, and cast appropriately on the allocation macro. This means we might use less memory than we allocate, but in exchange we get full support for most of the parallel debuggers. Another thing with this patch is that now for all PMLs (CM included) the basic PML requests start with the same fields, and they are declared in the same order in the request structure. Moreover, the fields have been moved in such a way that only one volatile/atomic will exist per line of cache (hopefully). This commit was SVN r15346.	2007-07-10 22:16:38 +00:00
Galen Shipman	3401bd2b07	Add optional ordering to the BTL interface. This is required to tighten up the BTL semantics. Ordering is not guaranteed, but, if the BTL returns an order tag in a descriptor (other than MCA_BTL_NO_ORDER) then we may request another descriptor that will obey ordering w.r.t. the other descriptor. This will allow sane behavior for RDMA networks, where local completion of an RDMA operation on the active side does not imply remote completion on the passive side. If we send a FIN message after local completion and the FIN is not ordered w.r.t. the RDMA operation then badness may occur as the passive side may now try to deregister the memory and the RDMA operation may still be pending on the passive side. Note that this has no impact on networks that don't suffer from this limitation as the ORDER tag can simply always be specified as MCA_BTL_NO_ORDER. This commit was SVN r14768.	2007-05-24 19:51:26 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data. We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Galen Shipman	813e7faea8	more fixes for failover.. and yet still more to come.. This commit was SVN r12450.	2006-11-06 21:27:17 +00:00
Galen Shipman	f7c554df65	Try to failover when we get an async error from the lower layer (BTL).. This commit was SVN r12420.	2006-11-03 15:40:26 +00:00
Andrew Friedley	1177844d7a	Fixes trac:183. Don't try to acquire ompi_request_lock here, which in all cases is already held. Avoids deadlock that occurs even when threads are enabled and we're running a THREAD_SINGLE app. Reviewed by Galen. This commit was SVN r11957. The following Trac tickets were found above: Ticket 183 --> https://svn.open-mpi.org/trac/ompi/ticket/183	2006-10-03 18:08:48 +00:00
George Bosilca	688a16ea78	A long time waiting patch. Get rid of the comm->c_pml_procs. It was (and that was long ago) supposed to be used as a cache for accessing the PML procs. But in all of the PMLs the PML proc contain only one field i.e. a pointer to the ompi_proc. This pointer can be accessed using the c_remote_group easily. Therefore, there is no meaning of keeping the PML procs around. Slim fast commit ... This commit was SVN r11730.	2006-09-20 22:14:46 +00:00
Andrew Friedley	e776b01811	This assert fails if -mca pml_dr_enable_csum 0 is set, which isn't what we want.. This commit was SVN r11719.	2006-09-19 19:57:33 +00:00
George Bosilca	e33c35112b	Correct the conversion between int and bool. Apply it on all files except the one that will be modified by Ralph for the ORTE 2.0. The missing ones are in the rsh PLS. This commit was SVN r11476.	2006-08-28 18:59:16 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Galen Shipman	3b49953ce2	Add error callback to the BTL interface; this allows errors to be delivered to the upper layer asynchronously, although there are some issues with this.. such as there are multiple consumers of the BTLs.. who gets the This commit was SVN r11232.	2006-08-16 20:21:38 +00:00
Galen Shipman	84e7b90a19	Fix DR PML after the great MTL crusade.. Added a bit of debugging while I was in there trying to track things down.. This commit was SVN r11208.	2006-08-15 21:44:55 +00:00
George Bosilca	476c9e64df	Don't keep multiples copies of the datatype and count. The only one we really need is the one provided by the user. For the buffered send the real datatype used for the communication is always MPI_BYTE and the count can be retrieved from the req_bytes_packed field. This will decrease the size of the request by one pointer and one size_t (8 bytes or 16 bytes depending on the architecture). This commit was SVN r10680.	2006-07-06 17:58:25 +00:00
Brian Barrett	47725c9b02	* Add new PML (CM) and network drivers (MTL) for high speed interconnects that provide matching logic in the library. Currently includes support for MX and some support for Portals * Fix overuse of proc_pml pointer on the ompi_proc structure, splitting into proc_pml for pml data and proc_bml for the BML endpoint data * bug fixes in bsend init code, which wasn't being used by the OB1 or DR PMLs... This commit was SVN r10642.	2006-07-04 01:20:20 +00:00
Galen Shipman	e6cd8db0e5	DR will now checksum on a per btl basis (see MCA_BTL_FLAGS_NEED_CSUM). We still always send ACK's, teasing apart completion for ACK/no ACK looks like a pain in the .. This commit was SVN r10530.	2006-06-27 20:23:47 +00:00
Galen Shipman	8855e5b73a	Fixes for DR as well as better diagnostic.. Successfully passing the intel test suite with/without induced errors/drops. This commit was SVN r10518.	2006-06-26 22:29:29 +00:00
George Bosilca	1f96768b76	For zero length persistent request do not reposition the convertor as it is not initialized. This commit was SVN r10386.	2006-06-16 03:04:41 +00:00
Galen Shipman	218a438509	finished the ompi_free_list_t class nightmare.. This commit was SVN r10314.	2006-06-12 22:09:03 +00:00
Brian Barrett	c70fff6ed0	* Fix for bug #44 for the trunk -- remove a bunch of warnings from the DR PML when compiling on Solaris. Patch won't apply cleanly to the v1.1 branch, so a diff for that is coming up soon. This commit was SVN r10173.	2006-06-01 18:58:38 +00:00
Tim Woodall	d8ff8010f3	track whether the vfrag is being retransmitted This commit was SVN r9817.	2006-05-04 17:30:58 +00:00
Tim Woodall	1b26caa95b	first cut at btl failover - seems to be working for simple test case This commit was SVN r9816.	2006-05-04 16:16:26 +00:00
Galen Shipman	ba0aa46220	make csum's optional in pml dr, on by default, see mca param pml_dr_enable_csum This commit was SVN r9608.	2006-04-10 21:54:46 +00:00
Galen Shipman	641fa6c0d2	more fixes, reset state on completion.. This commit was SVN r9469.	2006-03-29 22:21:35 +00:00
Galen Shipman	5271948ec0	--- opal object changes add object size to opal class no longer need the size when allocating a new object as this is stored in the class structure --- dr changes Previous rev. maintained state on the communicator used for acking duplicate fragments, but the communicator may be destroyed prior to successful delivery of an ack to the peer. We must therefore maintain this state globally on a per peer, not a per peer, per communicator basis. This requires that we use a global rank on the wire and translate this as appropriate to a local rank within the communicator. This commit was SVN r9454.	2006-03-29 16:19:17 +00:00
Tim Woodall	c724e4c804	- removed unused flags - updated copyrights This commit was SVN r9430.	2006-03-27 22:44:26 +00:00
Galen Shipman	1677ca1cd4	continue to debug retransmission of incorrect offset, only occurs on vfrag timeout.. This commit was SVN r9421.	2006-03-24 22:28:43 +00:00
Tim Woodall	2e376e0ee8	misc cleanup This commit was SVN r9410.	2006-03-24 06:49:45 +00:00
Tim Woodall	996a1b56df	more tweaking This commit was SVN r9399.	2006-03-23 22:08:59 +00:00
Galen Shipman	754b424266	set vf_mask_pending when retransmitting so completion will occur before the request is completed.. This commit was SVN r9394.	2006-03-23 20:28:52 +00:00
Tim Woodall	dc125cf7d5	misc corrections This commit was SVN r9380.	2006-03-23 15:11:06 +00:00
Galen Shipman	70cf1ce562	more work in progress.. This commit was SVN r9369.	2006-03-22 23:06:18 +00:00
Tim Woodall	0f6161c6da	reorg This commit was SVN r9366.	2006-03-22 15:02:36 +00:00
Galen Shipman	bcb23dc762	rework rndv and eager data timeout/retrans This commit was SVN r9358.	2006-03-21 21:23:33 +00:00
Tim Woodall	7a1ad5b6fb	corrections to scheduling logic This commit was SVN r9354.	2006-03-21 14:30:54 +00:00
Galen Shipman	fc42320ea6	check retry counts on NAK retrans as well as timeouts This commit was SVN r9342.	2006-03-20 22:11:23 +00:00
Galen Shipman	ca13833e95	more dr work This commit was SVN r9340.	2006-03-20 21:57:30 +00:00
Tim Woodall	bd870519fd	- modified convertor copy_and_prepare routines to accept an addition flag, new flags to be included when convertor is initialized - modified pml/btl module defs and added stub functions for diagnostic output routines to dump state of queues / endpoints - updates to data reliability pml This commit was SVN r9329.	2006-03-17 18:46:48 +00:00
George Bosilca	612570134f	The request management framework has been redesigned. The main idea is to let the PML (or io, more generally the low level request manager) have its own release function (what was before the req_fini). This function will only be called from the low level while the req_free will be called from the upper level (MPI layer) in order to mark the request as not used by the user anymore. From the request point of view the requests will be marked as inactive every time we read their status (true for persistent as well). As MPI_REQUEST_NULL is already marked as inactive, the test and wait functions are simpler. The drawback is that now we have to change in the ompi_request_{test|wait} the req_status of the request once we get its status. This commit was SVN r9290.	2006-03-15 22:53:41 +00:00
Tim Woodall	92c5e26758	correct scheduling This commit was SVN r9277.	2006-03-14 18:25:25 +00:00
Tim Woodall	d350232c04	work in progress This commit was SVN r9209.	2006-03-06 19:30:37 +00:00
Tim Woodall	274ee03df6	work in progress This commit was SVN r9192.	2006-03-04 00:36:16 +00:00
Galen Shipman	4e430b0428	fix warnings, other misc This commit was SVN r9190.	2006-03-03 04:01:10 +00:00
Galen Shipman	84d3055db5	Make sure everything is immediately acked, even if not matched. Buffer first descriptor on the sendreq until positive ACK. Set bytes delivered only after positive ACK, removed num_acks, etc, in general trying to remove as much state as possible so that rolling things back isn't such a nightmare This commit was SVN r9187.	2006-03-01 22:37:10 +00:00
Galen Shipman	05140c5f8f	Rework the data reliability PML, still needs quite a bit of work, working on creating a uniform retransmission mechanism otherwise each type of send ends up needing a special case for retransmission. Removed NACK for individual transmissions, we just aggregate these and send them at the end of a vfrag This commit was SVN r9141.	2006-02-24 17:08:14 +00:00
Galen Shipman	0bc3cbf0db	Corrections to pml_dr, now passes intel test suite (p2p_c). Note, the checksums are not enabled currently, setting to zero as the convertor is not ready for checksums yet. Also, we can't call unpack/pack on convertor with 0 bytes, otherwise it crashes. This commit was SVN r9062.	2006-02-16 16:15:16 +00:00

52 commits