Brian Barrett
defaefd59e
Clean up resources from flowcontrol on shutdown
...
This commit was SVN r26605.
2012-06-14 22:38:35 +00:00
Brian Barrett
946ec4cd97
* Update usage of PtlHandleIsEqual to match new semantic
...
* Properly set message to MPI_MESSAGE_NULL in the right places
* Fix double free of buffer for non-contiguous blocking sends
* Remove useless debugging output
This commit was SVN r26604.
2012-06-14 22:24:23 +00:00
Brian Barrett
31279eb641
Fix segfault with long expected messages when using the rndv protocol. We were
...
freeing the ME before the get to grab the long part of the message.
This commit was SVN r26589.
2012-06-11 16:37:01 +00:00
Mike Dubman
10831e111a
detect num of local procs
...
This commit was SVN r26555.
2012-06-05 09:13:16 +00:00
Yevgeny Kliteynik
1cbce83ece
Fixed wording of MXM parameters as suggested By Jeff.
...
This commit was SVN r26545.
2012-06-03 21:48:42 +00:00
Yevgeny Kliteynik
f02bf707a4
Added MXM parameter "np" that controls the minimal number of processes that allow MXM to run
...
Default: 128
MXM advantages kick in with large number of processes.
This commit was SVN r26544.
2012-06-02 11:07:20 +00:00
Brian Barrett
2effbb1ba6
fix copy/paste typo
...
This commit was SVN r26492.
2012-05-24 16:06:20 +00:00
Ralph Castain
c0304eb23a
Fix copy/paste typo
...
This commit was SVN r26491.
2012-05-24 15:47:20 +00:00
Brian Barrett
25693363e9
* Fix internal accounting error regarding number of available credits
...
* Use a single MD covering all of address space for put transfers, rather
than a per-send MD.
This commit was SVN r26458.
2012-05-20 23:42:26 +00:00
Brian Barrett
2e52374847
* Split send and receive eq sizes
...
* Need to look at slot count before flowcontrol for sending to prevent
race in restart
* Need to free pending request fragments when done with the request
* A number of branch prediction optimizations for error conditions
This commit was SVN r26430.
2012-05-10 21:43:48 +00:00
Brian Barrett
0ae2277796
Add a backoff mechanism for re-establishing communication
...
This commit was SVN r26366.
2012-05-01 15:53:00 +00:00
Brian Barrett
74ade8b181
need to order the pending list before we restart
...
This commit was SVN r26365.
2012-04-30 23:06:00 +00:00
Brian Barrett
5dec52af8d
remove some now unneeded debugging
...
This commit was SVN r26364.
2012-04-30 22:50:52 +00:00
Brian Barrett
c654ee6afc
* Use triggered operations for restart barrier as well
...
This commit was SVN r26363.
2012-04-30 22:48:10 +00:00
Brian Barrett
91a9973bde
* Make flow control on by default
...
* Move alarm code back into a triggered operation
This commit was SVN r26362.
2012-04-30 22:25:40 +00:00
Brian Barrett
e6a0a1cf8a
* Make sure to release all resources on failed send
...
* Avoid triggered ops until we get everything debugged
* Simplify flowctl interface a bit
This commit was SVN r26356.
2012-04-27 21:11:01 +00:00
Brian Barrett
8a70747da2
Fix some naming that doesn't make a ton of sense
...
This commit was SVN r26277.
2012-04-18 01:05:18 +00:00
Brian Barrett
f4d4e87176
add some flow control debugging output
...
This commit was SVN r26276.
2012-04-17 23:14:05 +00:00
Brian Barrett
fe0dfc8e26
First take at flow control protocol
...
This commit was SVN r26274.
2012-04-17 21:46:21 +00:00
Brian Barrett
dde6f094eb
In preperation for flow control changes coming, always utilize ACKs for
...
message completion.
This commit was SVN r26272.
2012-04-16 17:25:27 +00:00
Mike Dubman
34acf769d4
mtl_mxm: support canceling messages
...
This commit was SVN r26256.
2012-04-09 16:02:05 +00:00
Brian Barrett
451af0e832
Ensure async progress for long unexpected messages by waiting for an
...
event on the ME. The events we're likely to see are LINK (the ME was
added to the match list), PUT (weird to see first, but means that the ME
was linked to the match list and then matched), or PUT_OVERFLOW, meaning
the message was unexpected.
This commit was SVN r26199.
2012-03-26 22:54:35 +00:00
Brian Barrett
2a26d0f9a2
Forgot to add new file in the last commit.
...
Mark ME as invalid once we see a completion event, and look for events before
trying to unlink.
This commit was SVN r26198.
2012-03-26 22:39:05 +00:00
Brian Barrett
0e91084385
* Add type field to the request structure to deal with random user requests
...
(ie, cancel)
* Implement cancel for receives. Sends are slightly more complicated...
This commit was SVN r26197.
2012-03-26 22:32:36 +00:00
Brian Barrett
61a090e0d1
Checking for NULL function pointers and direct-call semantics can't work
...
together, so implement all functions in the MTL interface for all
MTLs. The only places NULL was still being set was for add_comm/del_comm,
and matched probe, both of which are straight forward to implement (or
return ERROR_NOT_IMPLEMENTED, since the PML can't emulate matched probe).
This commit was SVN r26194.
2012-03-26 19:27:03 +00:00
Brian Barrett
cdaf110c0f
* Implement mtl_send in addition to mtl_sendi
...
This commit was SVN r26193.
2012-03-26 19:19:11 +00:00
Brian Barrett
27c8f71773
Start of the flow control implementation. #defined out for now.
...
This commit was SVN r26192.
2012-03-26 01:31:58 +00:00
Brian Barrett
cce936b94c
* Implement matched probe for the CM PML. Required adding a peer field to
...
the ompi_message_t structure to properly initialize convertor (the peer
is available in the request in OB1, and wasn't needed when I did the
original implementation).
* Implement matched probe for the Portals4 MTL and add NULL function pointers
for the other MTLs.
* Add add_comm and del_comm functions to portals4 MTL so that direct call
almost works again.
* Add NEWS item that we've implemented matched probe
This commit was SVN r26180.
2012-03-22 22:55:59 +00:00
Brian Barrett
4d12616b64
Frank pointed out that PTL_OK is zero and PtlHandleIsEqual either returns
...
PTL_OK or PTL_FAIL and that I had these backwards.
This commit was SVN r26179.
2012-03-22 15:58:00 +00:00
Brian Barrett
1c6b5a1358
* Set all appropriate flags for portal table entries
...
* split eq into send and receive eqs so that we can control the number
of outstanding events in send eq and ensure we never lose an ack
* Shouldn't ever truncate on short unexpected receive bocks, so don't set
the truncate bit
* Track active vs. waiting for free short unexpected receive blocks so
to ensure an active short unexpected receive block is posted coming out
of flow control. Also allow creation of "temporary" blocks which should
be released once FREE event is received.
* Slight reorganization of some code in preparation for more flow control
work.
This commit was SVN r26174.
2012-03-21 22:20:55 +00:00
Mike Dubman
bd7abd72a9
in mca_mtl_mxm, don't allow negative tags for MPI_ANY_TAG
...
This commit was SVN r26128.
2012-03-09 22:11:14 +00:00
Mike Dubman
540b3c0c25
update mxm mtl to changes in mxm api
...
This commit was SVN r26073.
2012-02-29 22:02:34 +00:00
Mike Dubman
81bd5eee8d
in mxm, use sender_len field and not actual_len when returning result from probe
...
This commit was SVN r25993.
2012-02-21 19:55:16 +00:00
Mike Dubman
6ec768f0c6
fix #2971
...
This commit was SVN r25908.
2012-02-12 09:28:42 +00:00
Mike Dubman
b18a1611c3
- if everything is ok set return value to OMPI_SUCCESS in mtl/mxm
...
This commit was SVN r25879.
2012-02-08 14:19:58 +00:00
Mike Dubman
6188ab7317
* ep init refactoring
...
* split ep_info into fragments to fit PMI limit
This commit was SVN r25857.
2012-02-02 15:00:47 +00:00
Mike Dubman
92873872f5
revert r25813
...
This commit was SVN r25816.
The following SVN revision numbers were found above:
r25813 --> open-mpi/ompi@8ed781d7e9
2012-01-30 13:22:38 +00:00
Mike Dubman
8ed781d7e9
add mca param to enable/disable mxm
...
This commit was SVN r25813.
2012-01-30 11:14:20 +00:00
Mike Dubman
9f0ca9dfc0
fix: extract source from imm request fields instead from depricated api
...
This commit was SVN r25812.
2012-01-30 10:37:37 +00:00
Mike Dubman
6c954ad43f
set mxm to call opal_progress in tight loops
...
This commit was SVN r25788.
2012-01-26 18:33:43 +00:00
Mike Dubman
37dc53bbc9
mxm: return the MXM_REQ_SEND_SYNC flag to mxm_req_send
...
This commit was SVN r25694.
2012-01-06 18:56:28 +00:00
Mike Dubman
3b97d609a8
mtl_mxm: fix double free
...
This commit was SVN r25693.
2012-01-06 16:22:58 +00:00
Brian Barrett
45a27e4f9f
For now, ignore LINK event
...
This commit was SVN r25467.
2011-11-11 02:49:03 +00:00
Mike Dubman
00c27afd52
fix pid
...
This commit was SVN r25463.
2011-11-09 17:53:59 +00:00
Mike Dubman
71398b658e
fix: OMPI_ERR_CONNECTION_FAILED available in v1.5, unavailable in trunk
...
This commit was SVN r25459.
2011-11-08 12:34:01 +00:00
Mike Dubman
4cf9e1323d
fix: return correct error on connection failure
...
This commit was SVN r25452.
2011-11-07 06:13:17 +00:00
Mike Dubman
7595a80a63
fix self pid
...
This commit was SVN r25424.
2011-11-03 06:46:20 +00:00
Mike Dubman
3edd77ea25
update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag
...
This commit was SVN r25403.
2011-10-31 22:36:15 +00:00
Mike Dubman
6b50ba22a6
select mxm ptl based on user preferences
...
This commit was SVN r25401.
2011-10-31 10:17:43 +00:00
Mike Dubman
f96ae43e23
pass jobid to mxm/sm module
...
This commit was SVN r25375.
2011-10-27 13:14:52 +00:00