Galen Shipman
1d67917b69
must handle header validation correctly for each case, not enough in common
...
for the MACRO
This commit was SVN r9486.
2006-03-30 21:27:21 +00:00
Tim Woodall
9a73fe8beb
check for valid sequence number before attempting to use communicator
...
This commit was SVN r9482.
2006-03-30 19:36:15 +00:00
Gleb Natapov
256bf70530
Forgot to add file to previous commit
...
This commit was SVN r9480.
2006-03-30 17:37:52 +00:00
Gleb Natapov
79bcfb096f
Add type to frag. Sometimes we need to know that a frag is from short rdma area.
...
I used hack for this that doesn't work for mvapi, so changing it to something more sane.
This commit was SVN r9477.
2006-03-30 15:26:21 +00:00
Gleb Natapov
ea11582191
Porting of short message RDMA from openib BTL. Endpoint registers circular buffer and sends its address and rkey to the peer. Peer uses this buffer to eagerly RDMA small message into it. Endpoint polls the buffer for message arrival before checking HP/LP QPs. Set btl_mvapi_use_eager_rdma to 1 to enable it.
...
This commit was SVN r9474.
2006-03-30 12:55:31 +00:00
Galen Shipman
641fa6c0d2
more fixes, reset state on completion..
...
This commit was SVN r9469.
2006-03-29 22:21:35 +00:00
Galen Shipman
2945f77f9e
randomly drop fragments without local completion, currently commented out as
...
we must handle the other cases first..
This commit was SVN r9468.
2006-03-29 22:19:58 +00:00
Andrew Friedley
0eba366b07
Various pieces all over to make basic small message send/recv work. Next step
...
is clean up the code.. it is in need of refactoring and testing.
Thanks to Brian for help in troubleshooting!
This commit was SVN r9466.
2006-03-29 21:55:41 +00:00
Galen Shipman
5271948ec0
--- opal object changes
...
add object size to opal class
no longer need the size when allocating a new object as this is stored in
the class structure
--- dr changes
Previous rev. maintained state on the communicator used for acking duplicate
fragments, but the communicator may be destroyed prior to successfull
delivery of an ack to the peer. We must therefore maintain this state
globally on a per peer, not a per peer, per communicator basis.
This requires that we use a global rank on the wire and translate this as
appropriate to a local rank within the communicator.
This commit was SVN r9454.
2006-03-29 16:19:17 +00:00
George Bosilca
5d465cf118
Call the constructor on the DR lock.
...
This commit was SVN r9438.
2006-03-28 07:34:02 +00:00
Graham Fagg
19906e66dc
missing lock?
...
This commit was SVN r9436.
2006-03-28 06:15:48 +00:00
George Bosilca
46c442fe0d
We do not have direct access to the module. Grab the one attached to the
...
window instead.
This commit was SVN r9434.
2006-03-28 05:06:40 +00:00
Tim Woodall
c1bf71b1be
- updated copyrights
...
- removed unused state
- starting to add support for btl failover
This commit was SVN r9431.
2006-03-27 22:48:12 +00:00
Tim Woodall
c724e4c804
- removed unused flags
...
- updated copyrights
This commit was SVN r9430.
2006-03-27 22:44:26 +00:00
Gleb Natapov
590c992a7e
fix recursive lock of openib_btl->ib_lock.
...
This commit was SVN r9427.
2006-03-26 15:02:43 +00:00
Gleb Natapov
01a119c3c5
fix compilation bug with --enable-mpi-threads
...
This commit was SVN r9426.
2006-03-26 13:24:10 +00:00
Gleb Natapov
a5a78b10cc
Implementation of short message RDMA. Endpoint registers circular buffer and sends its address and rkey to the peer. Peer uses this buffer to eagerly RDMA small message into it. Endpoint polls the buffer for message arrival before checking HP/LP QPs. Set btl_openib_use_eager_rdma to 1 to enable it.
...
This commit was SVN r9425.
2006-03-26 08:30:50 +00:00
Galen Shipman
1677ca1cd4
continue to debug retransmission of incorrect offset,
...
only occurs on vfrag timeout..
This commit was SVN r9421.
2006-03-24 22:28:43 +00:00
Brian Barrett
01671f2991
* allow user to set "no_locks" info argument as MCA parameter to override the
...
default
* Add ability to start Put and Get requests immediately instead of queuing
until synchronizaion when using Fence. Not entirely sure this is
completely safe, so it must be explicitly enabled by the user, either with
an MCA parameter or info argument to Win_create.
This commit was SVN r9418.
2006-03-24 18:56:59 +00:00
Tim Woodall
2e376e0ee8
misc cleanup
...
This commit was SVN r9410.
2006-03-24 06:49:45 +00:00
George Bosilca
dec87e2cea
Remove a warning by protecting one of the variables around #if/#endif.
...
This commit was SVN r9409.
2006-03-24 04:43:53 +00:00
George Bosilca
dabe47ca3d
A function declared as static inline and who's not used directly, but
...
only as a pointer reference completely confuse some compilers (gcc 4.1
included). Removing the inline (it was there before when the function
was used in the same file) seems to solve the problem. However, the most
strange thing is that the bug only appear when we compile directly in
the trunk directory. It just don't happens when we're using the VPATH
build.
This commit was SVN r9408.
2006-03-24 04:21:30 +00:00
Brian Barrett
6cc582b20e
* Fix "make dist" for peruse
...
* Install peruse.h in $includedir, since applications need to be able
to include it as <peruse.h>
* Fix issue with onesided code always installing it's headers
This commit was SVN r9405.
2006-03-23 23:41:49 +00:00
Tim Woodall
1aaad721e8
clear state on rndv ack
...
This commit was SVN r9404.
2006-03-23 23:36:07 +00:00
Galen Shipman
19732d4c7c
add length to frag_ack
...
This commit was SVN r9403.
2006-03-23 23:06:19 +00:00
Tim Woodall
0fa49f1297
set requests vfrag id when matched
...
This commit was SVN r9402.
2006-03-23 23:04:20 +00:00
Galen Shipman
3595cd8956
use hdr_match..
...
This commit was SVN r9401.
2006-03-23 22:21:15 +00:00
Galen Shipman
bec2ee346c
use correct ack for rendezvous from seq tracker
...
This commit was SVN r9400.
2006-03-23 22:18:09 +00:00
Tim Woodall
996a1b56df
more tweaking
...
This commit was SVN r9399.
2006-03-23 22:08:59 +00:00
Galen Shipman
c38fd90e63
need state to ack sync send retransmits, even after the recvreq is gone..
...
This commit was SVN r9397.
2006-03-23 22:02:59 +00:00
Tim Woodall
d1d8967844
init counters
...
This commit was SVN r9395.
2006-03-23 20:29:18 +00:00
Galen Shipman
754b424266
set vf_mask_pending when retransmitting so completion will occur before
...
the request is completed..
This commit was SVN r9394.
2006-03-23 20:28:52 +00:00
Galen Shipman
f609204cc5
disable reliability checking in bml
...
This commit was SVN r9392.
2006-03-23 17:50:20 +00:00
Galen Shipman
e01cf0a166
Seperate out sequence tracking list as stand alone class.
...
This commit was SVN r9391.
2006-03-23 17:02:17 +00:00
Tim Woodall
c1bec478c4
updates to reliability debug code
...
This commit was SVN r9390.
2006-03-23 17:00:20 +00:00
Tim Woodall
d9dc534c08
fix bogus comment
...
This commit was SVN r9388.
2006-03-23 16:41:37 +00:00
Tim Woodall
28fa260404
for frag case don't use retrans flag, simply
...
retransmit all segments of vfrag that have not been acked
This commit was SVN r9387.
2006-03-23 16:36:13 +00:00
Andrew Friedley
48d61cd99a
Mostly fragment/LMR handling fixes:
...
- Grab the mpool_registration in _frag_common_constructor()
- Save the LMR context in the segment key
- No need for cookie variables - can just cast the frag
- No need to memcpy() data when recv'ing
- Add an LMR triplet to the fragment structure and initialize it
in btl_udapl_alloc().
- Whitespace/typo fixes, remove some opal_output() calls
Looks like I can use triplets describing sub-regions of registered LMR's. So I
do this - prior to this patch I was sending the entire free list memory over,
which isn't correct :)
Back to an earlier problem - when sending address information right after
connection establishment, the receiving end receives a DTO completion event and
appears to have good data. But the sending end never receives a DTO completion
event indicating the send completed, and never completes the client side of the
connection.
This commit was SVN r9386.
2006-03-23 16:21:08 +00:00
Galen Shipman
adf621fcce
enable both mpool_base_use_mem_hooks and mpool_use_mem_hooks, same for
...
disable_sbrk.
This commit was SVN r9385.
2006-03-23 16:15:50 +00:00
Galen Shipman
e548f5f8a8
change pml_ob1_leave_pinned_pipeline param to mpi_leave_pinned_pipeline
...
This commit was SVN r9384.
2006-03-23 15:57:34 +00:00
Tim Woodall
dc125cf7d5
misc corrections
...
This commit was SVN r9380.
2006-03-23 15:11:06 +00:00
Galen Shipman
0dd4af919d
minor fix to special mca_bml_base_send which randomly corrupts and drops
...
packets (used for testing).
This commit was SVN r9378.
2006-03-23 15:04:43 +00:00
Galen Shipman
70cf1ce562
more work in progress..
...
This commit was SVN r9369.
2006-03-22 23:06:18 +00:00
Tim Woodall
078cdcc9a8
cleanup
...
This commit was SVN r9368.
2006-03-22 23:01:37 +00:00
Tim Woodall
0f6161c6da
reorg
...
This commit was SVN r9366.
2006-03-22 15:02:36 +00:00
Galen Shipman
bcb23dc762
rework rndv and eager data timeout/retrans
...
This commit was SVN r9358.
2006-03-21 21:23:33 +00:00
Tim Woodall
c7ee5e13bc
simplification - dont swap src/dst pointers - always leave both
...
src/dst pointing to same segments
This commit was SVN r9357.
2006-03-21 18:20:17 +00:00
Tim Woodall
12e502b10d
use correct loop index
...
This commit was SVN r9356.
2006-03-21 18:18:22 +00:00
George Bosilca
f7a5a582c5
Diagnostic function for mvapi. It print all the credits used for the flow control.
...
This commit was SVN r9355.
2006-03-21 17:02:14 +00:00
Tim Woodall
7a1ad5b6fb
corrections to scheduling logic
...
This commit was SVN r9354.
2006-03-21 14:30:54 +00:00