Galen Shipman
84e7b90a19
Fix DR PML after the great MTL crusade. Added a bit of debugging while I was
in there trying to track things down.
This commit was SVN r11208.
2006-08-15 21:44:55 +00:00
Brian Barrett
f6e7e11ee6
Fixes truncation error (ticket #172) for the DR PML and therefore closes trac:172.
We now set a truncation error if we received more than we delivered for both
the OB1 and DR PMLs (the CM PML doesn't need such a fix, as the condition
is set at the MTL level).
This commit was SVN r10812.
The following Trac tickets were found above:
Ticket 172 --> https://svn.open-mpi.org/trac/ompi/ticket/172
2006-07-14 19:45:51 +00:00
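The truncation rule from the r10812 entry can be modelled in a few lines. This is only an illustrative Python sketch, not the Open MPI C code: the function name `complete_receive` and the string status values are hypothetical stand-ins, and the assumption is that "delivered" means the number of bytes that fit in the user's receive buffer.

```python
# Illustrative model only: names and status values are hypothetical,
# not taken from the Open MPI source.
MPI_SUCCESS = "MPI_SUCCESS"
MPI_ERR_TRUNCATE = "MPI_ERR_TRUNCATE"


def complete_receive(bytes_received: int, buffer_capacity: int):
    """Return (bytes_delivered, status) for a finished receive."""
    # Only as much data as fits in the user's buffer is delivered.
    bytes_delivered = min(bytes_received, buffer_capacity)
    # Receiving more than was delivered means the message was truncated.
    if bytes_received > bytes_delivered:
        status = MPI_ERR_TRUNCATE
    else:
        status = MPI_SUCCESS
    return bytes_delivered, status
```

For example, `complete_receive(100, 64)` reports 64 delivered bytes together with the truncation status, while a message that fits exactly completes successfully.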
George Bosilca
476c9e64df
Don't keep multiple copies of the datatype and count. The only one we really need
is the one provided by the user. For the buffered send the real datatype used
for the communication is always MPI_BYTE and the count can be retrieved from
the req_bytes_packed field. This will decrease the size of the request by
one pointer and one size_t (8 bytes or 16 bytes depending on the architecture).
This commit was SVN r10680.
2006-07-06 17:58:25 +00:00
Brian Barrett
47725c9b02
* Add new PML (CM) and network drivers (MTL) for high-speed
  interconnects that provide matching logic in the library.
  Currently includes support for MX and some support for
  Portals
* Fix overuse of proc_pml pointer on the ompi_proc structure,
  splitting into proc_pml for pml data and proc_bml for
  the BML endpoint data
* Bug fixes in bsend init code, which wasn't being used by
  the OB1 or DR PMLs...
This commit was SVN r10642.
2006-07-04 01:20:20 +00:00
Galen Shipman
e6cd8db0e5
DR will now checksum on a per-BTL basis (see MCA_BTL_FLAGS_NEED_CSUM). We
still always send ACKs; teasing apart completion for ACK/no ACK looks like a
pain in the ..
This commit was SVN r10530.
2006-06-27 20:23:47 +00:00
Galen Shipman
8855e5b73a
Fixes for DR as well as better diagnostics.
Successfully passing the Intel test suite with/without induced errors/drops.
This commit was SVN r10518.
2006-06-26 22:29:29 +00:00
George Bosilca
27000ef7d6
More compact and readable code. Otherwise, no big difference with the
previous version.
This commit was SVN r10389.
2006-06-16 03:07:42 +00:00
George Bosilca
3f96f39e46
If the goal of this code was to copy the iovec and skip the first offset
bytes then it was not correct.
This commit was SVN r10388.
2006-06-16 03:06:30 +00:00
George Bosilca
93afe59226
It is not required to initialize the csum.
This commit was SVN r10387.
2006-06-16 03:05:20 +00:00
George Bosilca
1f96768b76
For a zero-length persistent request, do not reposition the convertor, as
it is not initialized.
This commit was SVN r10386.
2006-06-16 03:04:41 +00:00
George Bosilca
3727fa2ae6
Nothing relevant. I added some more output for the case where we have a checksum error,
just to have more information about the failure.
This commit was SVN r10337.
2006-06-13 19:36:38 +00:00
Galen Shipman
218a438509
finished the ompi_free_list_t class nightmare..
This commit was SVN r10314.
2006-06-12 22:09:03 +00:00
Galen Shipman
18dda70fd0
make ompi_free_list_item_t a class..
This will go to the 1.1 branch but will probably require a few changes as
ompi_free_list_t is different in the branch..
This commit was SVN r10306.
2006-06-12 16:44:00 +00:00
Galen Shipman
84479d0b5a
potential fix for iprobe test, tested with openib; will have Andy try ud.
This commit was SVN r10232.
2006-06-06 22:10:41 +00:00
Brian Barrett
c70fff6ed0
* Fix for bug #44 for the trunk -- remove a bunch of warnings from the DR
PML when compiling on Solaris. Patch won't apply cleanly to the v1.1
branch, so a diff for that is coming up soon.
This commit was SVN r10173.
2006-06-01 18:58:38 +00:00
George Bosilca
b8ef0cc749
Minor cleanups.
This commit was SVN r10001.
2006-05-21 05:55:21 +00:00
Galen Shipman
9165882c07
fixes for failover...
This commit was SVN r9998.
2006-05-20 02:39:05 +00:00
Tim Woodall
d8ff8010f3
track whether the vfrag is being retransmitted
This commit was SVN r9817.
2006-05-04 17:30:58 +00:00
Tim Woodall
1b26caa95b
first cut at btl failover - seems to be working for simple test case
This commit was SVN r9816.
2006-05-04 16:16:26 +00:00
Galen Shipman
ba0aa46220
make checksums optional in pml dr, on by default; see mca param
pml_dr_enable_csum
This commit was SVN r9608.
2006-04-10 21:54:46 +00:00
Galen Shipman
c29db49198
return if we ack a duplicate matched rendezvous from the matched receives
sequence tracker and the communicator is null.
This commit was SVN r9521.
2006-04-03 21:04:51 +00:00
Galen Shipman
1d67917b69
must handle header validation correctly for each case, not enough in common
for the MACRO
This commit was SVN r9486.
2006-03-30 21:27:21 +00:00
Tim Woodall
9a73fe8beb
check for valid sequence number before attempting to use communicator
This commit was SVN r9482.
2006-03-30 19:36:15 +00:00
Galen Shipman
641fa6c0d2
more fixes, reset state on completion..
This commit was SVN r9469.
2006-03-29 22:21:35 +00:00
Galen Shipman
5271948ec0
--- opal object changes
add object size to opal class
no longer need the size when allocating a new object as this is stored in
the class structure
--- dr changes
The previous revision maintained state on the communicator used for acking duplicate
fragments, but the communicator may be destroyed prior to successful
delivery of an ack to the peer. We must therefore maintain this state
globally on a per-peer basis, not a per-peer, per-communicator basis.
This requires that we use a global rank on the wire and translate this as
appropriate to a local rank within the communicator.
This commit was SVN r9454.
2006-03-29 16:19:17 +00:00
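The design change in the r9454 entry (per-peer state keyed by global rank, with translation to a communicator-local rank only when needed) can be sketched as follows. This is an illustrative Python model under stated assumptions, not the DR PML implementation: `PeerAckState`, `record`, and `local_rank` are hypothetical names, and a destroyed communicator is modelled simply as `None`.

```python
# Illustrative model only; class and function names are hypothetical.
class PeerAckState:
    """Duplicate-tracking state kept per peer, outliving any communicator."""

    def __init__(self):
        # global rank -> set of fragment ids already seen from that peer
        self.seen = {}

    def record(self, global_rank, frag_id):
        """Record a fragment; return True if it is a duplicate from that peer."""
        frags = self.seen.setdefault(global_rank, set())
        duplicate = frag_id in frags
        frags.add(frag_id)
        return duplicate


def local_rank(comm_members, global_rank):
    """Translate a global rank to its rank within a communicator.

    comm_members lists the global ranks in communicator order.
    Returns None if the communicator is gone (modelled as None)
    or does not contain that peer.
    """
    if comm_members is None or global_rank not in comm_members:
        return None
    return comm_members.index(global_rank)
```

Because the duplicate check depends only on the global rank, an ack for a retransmitted fragment can still be produced after the communicator has been destroyed; the local rank is looked up only while the communicator exists.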
George Bosilca
5d465cf118
Call the constructor on the DR lock.
This commit was SVN r9438.
2006-03-28 07:34:02 +00:00
Graham Fagg
19906e66dc
missing lock?
This commit was SVN r9436.
2006-03-28 06:15:48 +00:00
Tim Woodall
c724e4c804
- removed unused flags
- updated copyrights
This commit was SVN r9430.
2006-03-27 22:44:26 +00:00
Galen Shipman
1677ca1cd4
continue to debug retransmission of incorrect offset,
only occurs on vfrag timeout..
This commit was SVN r9421.
2006-03-24 22:28:43 +00:00
Tim Woodall
2e376e0ee8
misc cleanup
This commit was SVN r9410.
2006-03-24 06:49:45 +00:00
Tim Woodall
1aaad721e8
clear state on rndv ack
This commit was SVN r9404.
2006-03-23 23:36:07 +00:00
Galen Shipman
19732d4c7c
add length to frag_ack
This commit was SVN r9403.
2006-03-23 23:06:19 +00:00
Tim Woodall
0fa49f1297
set request's vfrag id when matched
This commit was SVN r9402.
2006-03-23 23:04:20 +00:00
Galen Shipman
3595cd8956
use hdr_match..
This commit was SVN r9401.
2006-03-23 22:21:15 +00:00
Galen Shipman
bec2ee346c
use correct ack for rendezvous from seq tracker
This commit was SVN r9400.
2006-03-23 22:18:09 +00:00
Tim Woodall
996a1b56df
more tweaking
This commit was SVN r9399.
2006-03-23 22:08:59 +00:00
Galen Shipman
c38fd90e63
need state to ack sync send retransmits, even after the recvreq is gone..
This commit was SVN r9397.
2006-03-23 22:02:59 +00:00
Galen Shipman
754b424266
set vf_mask_pending when retransmitting so completion will occur before
the request is completed..
This commit was SVN r9394.
2006-03-23 20:28:52 +00:00
Galen Shipman
e01cf0a166
Separate out sequence tracking list as a standalone class.
This commit was SVN r9391.
2006-03-23 17:02:17 +00:00
Tim Woodall
d9dc534c08
fix bogus comment
This commit was SVN r9388.
2006-03-23 16:41:37 +00:00
Tim Woodall
28fa260404
for frag case don't use retrans flag, simply
retransmit all segments of vfrag that have not been acked
This commit was SVN r9387.
2006-03-23 16:36:13 +00:00
Tim Woodall
dc125cf7d5
misc corrections
This commit was SVN r9380.
2006-03-23 15:11:06 +00:00
Galen Shipman
70cf1ce562
more work in progress..
This commit was SVN r9369.
2006-03-22 23:06:18 +00:00
Tim Woodall
0f6161c6da
reorg
This commit was SVN r9366.
2006-03-22 15:02:36 +00:00
Galen Shipman
bcb23dc762
rework rndv and eager data timeout/retrans
This commit was SVN r9358.
2006-03-21 21:23:33 +00:00
Tim Woodall
7a1ad5b6fb
corrections to scheduling logic
This commit was SVN r9354.
2006-03-21 14:30:54 +00:00
Tim Woodall
797a6b2887
don't compute checksum over header - data only
This commit was SVN r9343.
2006-03-20 23:08:14 +00:00
Galen Shipman
fc42320ea6
check retry counts on NAK retrans as well as timeouts
This commit was SVN r9342.
2006-03-20 22:11:23 +00:00
Galen Shipman
ca13833e95
more dr work
This commit was SVN r9340.
2006-03-20 21:57:30 +00:00
Galen Shipman
5600932c2f
fix misc warnings
This commit was SVN r9339.
2006-03-20 15:41:45 +00:00