Brian Barrett
74ade8b181
need to order the pending list before we restart
...
This commit was SVN r26365.
2012-04-30 23:06:00 +00:00
Brian Barrett
5dec52af8d
remove some now unneeded debugging
...
This commit was SVN r26364.
2012-04-30 22:50:52 +00:00
Brian Barrett
c654ee6afc
* Use triggered operations for restart barrier as well
...
This commit was SVN r26363.
2012-04-30 22:48:10 +00:00
Brian Barrett
91a9973bde
* Make flow control on by default
...
* Move alarm code back into a triggered operation
This commit was SVN r26362.
2012-04-30 22:25:40 +00:00
Brian Barrett
e6a0a1cf8a
* Make sure to release all resources on failed send
...
* Avoid triggered ops until we get everything debugged
* Simplify flowctl interface a bit
This commit was SVN r26356.
2012-04-27 21:11:01 +00:00
Brian Barrett
8a70747da2
Fix some naming that doesn't make a ton of sense
...
This commit was SVN r26277.
2012-04-18 01:05:18 +00:00
Brian Barrett
f4d4e87176
add some flow control debugging output
...
This commit was SVN r26276.
2012-04-17 23:14:05 +00:00
Brian Barrett
fe0dfc8e26
First take at flow control protocol
...
This commit was SVN r26274.
2012-04-17 21:46:21 +00:00
Brian Barrett
dde6f094eb
In preparation for flow control changes coming, always utilize ACKs for
...
message completion.
This commit was SVN r26272.
2012-04-16 17:25:27 +00:00
Mike Dubman
34acf769d4
mtl_mxm: support canceling messages
...
This commit was SVN r26256.
2012-04-09 16:02:05 +00:00
Brian Barrett
451af0e832
Ensure async progress for long unexpected messages by waiting for an
...
event on the ME. The events we're likely to see are LINK (the ME was
added to the match list), PUT (weird to see first, but means that the ME
was linked to the match list and then matched), or PUT_OVERFLOW, meaning
the message was unexpected.
This commit was SVN r26199.
2012-03-26 22:54:35 +00:00
Brian Barrett
2a26d0f9a2
Forgot to add new file in the last commit.
...
Mark ME as invalid once we see a completion event, and look for events before
trying to unlink.
This commit was SVN r26198.
2012-03-26 22:39:05 +00:00
Brian Barrett
0e91084385
* Add type field to the request structure to deal with random user requests
...
(ie, cancel)
* Implement cancel for receives. Sends are slightly more complicated...
This commit was SVN r26197.
2012-03-26 22:32:36 +00:00
Brian Barrett
61a090e0d1
Checking for NULL function pointers and direct-call semantics can't work
...
together, so implement all functions in the MTL interface for all
MTLs. The only places NULL was still being set were add_comm/del_comm
and matched probe, both of which are straightforward to implement (or
return ERROR_NOT_IMPLEMENTED, since the PML can't emulate matched probe).
This commit was SVN r26194.
2012-03-26 19:27:03 +00:00
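The point made in r26194 can be sketched in C. This is a minimal illustration, not the actual MTL interface: every name below (sketch_mtl_module, sketch_add_comm, sketch_improbe, the SKETCH_* codes) is hypothetical. The assumption is that a direct-call build binds callers to the function at compile time, so there is no pointer left to compare against NULL; filling every slot, and returning a "not implemented" code where needed, keeps both call styles working.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical return codes, standing in for the real OMPI error codes. */
enum { SKETCH_SUCCESS = 0, SKETCH_ERR_NOT_IMPLEMENTED = -1 };

/* Hypothetical, heavily reduced module table: one slot per interface call. */
struct sketch_mtl_module {
    int (*add_comm)(int comm_id);
    int (*improbe)(int src, int tag);
};

/* Trivial implementation: nothing to set up per communicator. */
static int sketch_add_comm(int comm_id)
{
    (void)comm_id;
    return SKETCH_SUCCESS;
}

/* Stub for a transport without matched probe: report it, never leave NULL. */
static int sketch_improbe(int src, int tag)
{
    (void)src;
    (void)tag;
    return SKETCH_ERR_NOT_IMPLEMENTED;
}

/* Every slot is populated, so callers never need a NULL check. */
static const struct sketch_mtl_module sketch_module = {
    sketch_add_comm,
    sketch_improbe
};

/* Caller side: invoke unconditionally and act on the return code. */
static int caller_probe(const struct sketch_mtl_module *m)
{
    return m->improbe(0, 0);
}

/* Small self-check that exercises the table. */
void sketch_self_check(void)
{
    assert(sketch_module.add_comm != NULL);
    assert(sketch_module.improbe != NULL);
    assert(sketch_module.add_comm(1) == SKETCH_SUCCESS);
    assert(caller_probe(&sketch_module) == SKETCH_ERR_NOT_IMPLEMENTED);
}
```

The caller learns that a feature is missing from the return code rather than from a NULL slot, which is the only signal still available once the call is resolved at compile time.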
Brian Barrett
cdaf110c0f
* Implement mtl_send in addition to mtl_sendi
...
This commit was SVN r26193.
2012-03-26 19:19:11 +00:00
Brian Barrett
27c8f71773
Start of the flow control implementation. #defined out for now.
...
This commit was SVN r26192.
2012-03-26 01:31:58 +00:00
Brian Barrett
cce936b94c
* Implement matched probe for the CM PML. Required adding a peer field to
...
the ompi_message_t structure to properly initialize convertor (the peer
is available in the request in OB1, and wasn't needed when I did the
original implementation).
* Implement matched probe for the Portals4 MTL and add NULL function pointers
for the other MTLs.
* Add add_comm and del_comm functions to portals4 MTL so that direct call
almost works again.
* Add NEWS item that we've implemented matched probe
This commit was SVN r26180.
2012-03-22 22:55:59 +00:00
Brian Barrett
4d12616b64
Frank pointed out that PTL_OK is zero and PtlHandleIsEqual either returns
...
PTL_OK or PTL_FAIL and that I had these backwards.
This commit was SVN r26179.
2012-03-22 15:58:00 +00:00
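The mistake described in r26179 is easy to reproduce. The sketch below uses stand-in definitions rather than the real Portals 4 header: handle_t, handle_is_equal and handles_match are hypothetical names, and the stub only mirrors the behavior stated in the commit message (PTL_OK is zero and is returned when the handles are equal, PTL_FAIL otherwise).

```c
#include <assert.h>

/* Stand-ins for illustration; real code would include <portals4.h>. */
enum { PTL_OK = 0, PTL_FAIL = 1 };
typedef int handle_t; /* hypothetical handle type */

/* Hypothetical stub with the return convention described in the commit:
 * PTL_OK (zero) for equal handles, PTL_FAIL for different ones. */
static int handle_is_equal(handle_t a, handle_t b)
{
    return (a == b) ? PTL_OK : PTL_FAIL;
}

/* Returns 1 when the handles are equal, 0 otherwise. */
static int handles_match(handle_t a, handle_t b)
{
    /* Compare against PTL_OK explicitly. Writing
     * `if (handle_is_equal(a, b))` would be true for UNEQUAL handles,
     * because the "equal" result is zero. */
    return handle_is_equal(a, b) == PTL_OK;
}

/* Small self-check that exercises both outcomes. */
void handle_self_check(void)
{
    assert(handles_match(3, 3) == 1);
    assert(handles_match(3, 4) == 0);
    /* The raw return value is nonzero for unequal handles. */
    assert(handle_is_equal(3, 4) != 0);
}
```

Wrapping the call in a helper with a boolean-style result keeps the inverted convention in one place.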
Brian Barrett
1c6b5a1358
* Set all appropriate flags for portal table entries
...
* split eq into send and receive eqs so that we can control the number
of outstanding events in send eq and ensure we never lose an ack
* Shouldn't ever truncate on short unexpected receive blocks, so don't set
the truncate bit
* Track active vs. waiting for free short unexpected receive blocks to
ensure an active short unexpected receive block is posted coming out
of flow control. Also allow creation of "temporary" blocks which should
be released once FREE event is received.
* Slight reorganization of some code in preparation for more flow control
work.
This commit was SVN r26174.
2012-03-21 22:20:55 +00:00
Mike Dubman
bd7abd72a9
in mca_mtl_mxm, don't allow negative tags for MPI_ANY_TAG
...
This commit was SVN r26128.
2012-03-09 22:11:14 +00:00
Mike Dubman
540b3c0c25
update mxm mtl to changes in mxm api
...
This commit was SVN r26073.
2012-02-29 22:02:34 +00:00
Mike Dubman
81bd5eee8d
in mxm, use sender_len field and not actual_len when returning result from probe
...
This commit was SVN r25993.
2012-02-21 19:55:16 +00:00
Mike Dubman
6ec768f0c6
fix #2971
...
This commit was SVN r25908.
2012-02-12 09:28:42 +00:00
Mike Dubman
b18a1611c3
- if everything is ok set return value to OMPI_SUCCESS in mtl/mxm
...
This commit was SVN r25879.
2012-02-08 14:19:58 +00:00
Mike Dubman
6188ab7317
* ep init refactoring
...
* split ep_info into fragments to fit PMI limit
This commit was SVN r25857.
2012-02-02 15:00:47 +00:00
Mike Dubman
92873872f5
revert r25813
...
This commit was SVN r25816.
The following SVN revision numbers were found above:
r25813 --> open-mpi/ompi@8ed781d7e9
2012-01-30 13:22:38 +00:00
Mike Dubman
8ed781d7e9
add mca param to enable/disable mxm
...
This commit was SVN r25813.
2012-01-30 11:14:20 +00:00
Mike Dubman
9f0ca9dfc0
fix: extract source from imm request fields instead of from deprecated api
...
This commit was SVN r25812.
2012-01-30 10:37:37 +00:00
Mike Dubman
6c954ad43f
set mxm to call opal_progress in tight loops
...
This commit was SVN r25788.
2012-01-26 18:33:43 +00:00
Mike Dubman
37dc53bbc9
mxm: return the MXM_REQ_SEND_SYNC flag to mxm_req_send
...
This commit was SVN r25694.
2012-01-06 18:56:28 +00:00
Mike Dubman
3b97d609a8
mtl_mxm: fix double free
...
This commit was SVN r25693.
2012-01-06 16:22:58 +00:00
Brian Barrett
45a27e4f9f
For now, ignore LINK event
...
This commit was SVN r25467.
2011-11-11 02:49:03 +00:00
Mike Dubman
00c27afd52
fix pid
...
This commit was SVN r25463.
2011-11-09 17:53:59 +00:00
Mike Dubman
71398b658e
fix: OMPI_ERR_CONNECTION_FAILED available in v1.5, unavailable in trunk
...
This commit was SVN r25459.
2011-11-08 12:34:01 +00:00
Mike Dubman
4cf9e1323d
fix: return correct error on connection failure
...
This commit was SVN r25452.
2011-11-07 06:13:17 +00:00
Mike Dubman
7595a80a63
fix self pid
...
This commit was SVN r25424.
2011-11-03 06:46:20 +00:00
Mike Dubman
3edd77ea25
update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag
...
This commit was SVN r25403.
2011-10-31 22:36:15 +00:00
Mike Dubman
6b50ba22a6
select mxm ptl based on user preferences
...
This commit was SVN r25401.
2011-10-31 10:17:43 +00:00
Mike Dubman
f96ae43e23
pass jobid to mxm/sm module
...
This commit was SVN r25375.
2011-10-27 13:14:52 +00:00
Mike Dubman
9ffeeb69d9
fix help message
...
This commit was SVN r25364.
2011-10-25 14:02:43 +00:00
Samuel Gutierrez
663f4546f5
fix define typo in psm mtl.
...
This commit was SVN r25362.
2011-10-24 18:38:12 +00:00
Brian Barrett
d8b5b544ad
Update list name to match change in spec
...
This commit was SVN r25273.
2011-10-12 20:09:39 +00:00
Mike Dubman
7a9ae43276
added support for shared memory transport in mxm
...
This commit was SVN r25220.
2011-10-03 12:59:55 +00:00
Brian Barrett
fc29ffebdb
* remove two aborts that aren't necessary
...
This commit was SVN r25214.
2011-09-29 22:27:23 +00:00
Brian Barrett
14f32a1a54
* Clean up progress function
...
* Only print returnable errors when verbose=1. Still print errors when
we're going to abort, since those obviously aren't returnable
This commit was SVN r25213.
2011-09-29 22:26:33 +00:00
Brian Barrett
758f8a4d87
* More debugging output
...
* Make recv short block events use the callback mechanism so that we can
add overflow debugging
This commit was SVN r25212.
2011-09-29 21:59:48 +00:00
Brian Barrett
c08ea5c0f5
Set options correctly for the two pts
...
This commit was SVN r25211.
2011-09-29 21:56:37 +00:00
Brian Barrett
05f800abae
Properly unpack data for long unexpected
...
This commit was SVN r25210.
2011-09-29 17:25:45 +00:00
Brian Barrett
bb9e73232a
* Leverage hdr_data and opcount to improve debugging
...
* Clean up handling of short synchronous messages
This commit was SVN r25208.
2011-09-28 21:18:47 +00:00
Brian Barrett
71d8300607
* Fix name clash with macros in mtl_portals4.h
...
* hdr_data now includes opcount and length for all messages, which is the match
bits for long and rndv messages
* Re-add probe implementation
This commit was SVN r25207.
2011-09-28 16:53:01 +00:00