Brian Barrett
f42783ae1a
Move the RTE framework change into the trunk. With this change, all non-CR
...
runtime code goes through one of the rte, dpm, or pubsub frameworks.
This commit was SVN r27934.
2013-01-27 23:25:10 +00:00
Mike Dubman
a454341e2b
add support for mxm 2.0
...
This commit was SVN r27661.
2012-12-09 22:58:37 +00:00
Brian Barrett
702451111b
Remove Portals 3.3 support
...
This commit was SVN r27656.
2012-12-06 20:11:27 +00:00
Aleksey Senin
ae92f64842
Check that MXM runtime version match compiled.
...
Reviewed by Mike Dubman.
This commit was SVN r27575.
2012-11-07 14:44:33 +00:00
Nathan Hjelm
bdedd8b0d3
Per RFC modify the behavior of mca_base_components_close to NOT close the output. Modify frameworks to always close their output and set to -1.
...
Reasoning: The old behavior was a little confusing. mca_base_components_open does not open an output stream so it is a little unexpected that mca_base_components_close does. To add to this several frameworks (that don't use mca_base_components_close) failed to close their output in the framework close function and others closed their output a second time. This change is an improvement to the symantics of mca_base_components_open/close as they are now symetric in their functionality.
This commit was SVN r27570.
2012-11-06 19:09:26 +00:00
Eugene Loh
25ad84b925
Ensure that MPI_Status objects have proper alignment:
...
- fix the Fortran layer to use new macros to convert Fortran-to-C status
- change the C internals to pull out old OMPI_SET_STATUS* macros
Also, change name of "status" argument in topo_test_f.c to "topo_type".
This commit was SVN r27403.
2012-10-04 14:39:51 +00:00
Nathan Hjelm
9200473a59
mtl/psm: do not let psm set processor affinity
...
This commit was SVN r27389.
2012-10-02 15:56:51 +00:00
Brian Barrett
fa4c2af9ed
THe Portals 4 reference implementation will sometimes return a NI_FLOWCTL for both a
...
send and an ack. I'm not sure whether this violates the spec, so work around until
we decide...
This commit was SVN r27244.
2012-09-05 19:36:19 +00:00
Aleksey Senin
33ae1fe6c7
Fix untitialized return code in ompi_mtl_mxm_add_procs function.
...
This commit was SVN r27216.
2012-09-02 13:17:49 +00:00
Aleksey Senin
68e0894a58
MXM send/recv request changes.
...
Adapt OMPI to the latest MXM changes in send/recv request.
Use memory handle structure instead of memory key.
This commit was SVN r27155.
2012-08-28 05:57:36 +00:00
Samuel Gutierrez
7867330dcc
Fix the PSM MTL in trunk by gathering node locality information differently.
...
This commit was SVN r27063.
2012-08-16 00:50:24 +00:00
Yael Dayan
79e6b9c91d
Adapt OMPI to use newer version of MXM.
...
This commit was SVN r26974.
2012-08-08 15:29:38 +00:00
Yael Dayan
954bcdc0a5
adapt the way to find amount of local processes to OMPI trunk.
...
This commit was SVN r26973.
2012-08-08 15:26:28 +00:00
Vasily Filipov
fc712182db
MTL MXM: make MXM use MXM_VERSION macro for MXM version checking.
...
This commit was SVN r26952.
2012-08-06 06:35:57 +00:00
Vasily Filipov
c386847d9a
MTL MXM: Adding MXM version protect for Mprobe, Mrecv resources.
...
This commit was SVN r26922.
2012-07-31 07:57:25 +00:00
Vasily Filipov
4e66ff030b
MTL MXM Mrecv: adding missed return message to a free list.
...
This commit was SVN r26870.
2012-07-26 11:22:22 +00:00
Vasily Filipov
ef9bd8e4cb
MTL MXM: MPI_Mprobe, MPI_Mrecv implementation for MXM adding.
...
This commit was SVN r26866.
2012-07-25 13:26:40 +00:00
Mike Dubman
4784253f5c
revert commit, breaks backwards compatability, will be revised
...
This commit was SVN r26852.
2012-07-24 11:48:18 +00:00
Vasily Filipov
99bd5977bd
MTL MXM: small fix in the mxm_req_probe func interface.
...
This commit was SVN r26850.
2012-07-24 08:46:38 +00:00
Vasily Filipov
597a422272
MTL: make MXM work with read (in blocking send case) call-backs.
...
This commit was SVN r26807.
2012-07-19 13:28:06 +00:00
George Bosilca
4326537fe9
Remove compiler warning about uninitialized variable.
...
This commit was SVN r26760.
2012-07-08 00:07:52 +00:00
Yevgeny Kliteynik
0e28fa984b
Remove dead code that was related to ticket #2971
...
This commit was SVN r26701.
2012-07-02 11:19:09 +00:00
Jeff Squyres
5d030278e1
Refs trac:3130: Per comment 8 on the ticket, this MX patch fixes the cases
...
where the MX BTL and MTL are stepping on each other regarding the
mpool. Thanks to Yong Qin for assistance in tracking this down.
This commit was SVN r26698.
The following Trac tickets were found above:
Ticket 3130 --> https://svn.open-mpi.org/trac/ompi/ticket/3130
2012-06-29 13:52:40 +00:00
Ralph Castain
0dfe29b1a6
Roll in the rest of the modex change. Eliminate all non-modex API access of RTE info from the MPI layer - in some cases, the info was already present (either in the ompi_proc_t or in the orte_process_info struct) and no call was necessary. This removes all calls to orte_ess from the MPI layer. Calls to orte_grpcomm remain required.
...
Update all the orte ess components to remove their associated APIs for retrieving proc data. Update the grpcomm API to reflect transfer of set/get modex info to the db framework.
Note that this doesn't recreate the old GPR. This is strictly a local db storage that may (at some point) obtain any missing data from the local daemon as part of an async methodology. The framework allows us to experiment with such methods without perturbing the default one.
This commit was SVN r26678.
2012-06-27 14:53:55 +00:00
Josh Hursey
28681deffa
Backout the ORCA commit. :(
...
There is a linking issue on Mac OSX that needs to be addressed before this is able to come back into the trunk.
This commit was SVN r26676.
2012-06-27 01:28:28 +00:00
Josh Hursey
542330e3a7
Commit of ORCA: Open MPI Runtime Collaborative Abstraction
...
This is a runtime interposition project that sits between the OMPI and ORTE layers in Open MPI.
The project is described on the wiki:
https://svn.open-mpi.org/trac/ompi/wiki/Runtime_Interposition
And on this email thread:
http://www.open-mpi.org/community/lists/devel/2012/06/11109.php
This commit was SVN r26670.
2012-06-26 21:42:16 +00:00
Brian Barrett
defaefd59e
Clean up resources from flowcontrol on shutdown
...
This commit was SVN r26605.
2012-06-14 22:38:35 +00:00
Brian Barrett
946ec4cd97
* Update usage of PtlHandleIsEqual to match new semantic
...
* Properly set message to MPI_MESSAGE_NULL in the right places
* Fix double free of buffer for non-contiguous blocking sends
* Remove useless debugging output
This commit was SVN r26604.
2012-06-14 22:24:23 +00:00
Brian Barrett
31279eb641
Fix segfault with long expected messages when using the rndv protocol. We were
...
freeing the ME before the get to grab the long part of the message.
This commit was SVN r26589.
2012-06-11 16:37:01 +00:00
Mike Dubman
10831e111a
detect num of local procs
...
This commit was SVN r26555.
2012-06-05 09:13:16 +00:00
Yevgeny Kliteynik
1cbce83ece
Fixed wording of MXM parameters as suggested By Jeff.
...
This commit was SVN r26545.
2012-06-03 21:48:42 +00:00
Yevgeny Kliteynik
f02bf707a4
Added MXM parameter "np" that controls the minimal number of processes that allow MXM to run
...
Default: 128
MXM advantages kick in with large number of processes.
This commit was SVN r26544.
2012-06-02 11:07:20 +00:00
Brian Barrett
2effbb1ba6
fix copy/paste typo
...
This commit was SVN r26492.
2012-05-24 16:06:20 +00:00
Ralph Castain
c0304eb23a
Fix copy/paste typo
...
This commit was SVN r26491.
2012-05-24 15:47:20 +00:00
Brian Barrett
25693363e9
* Fix internal accounting error regarding number of available credits
...
* Use a single MD covering all of address space for put transfers, rather
than a per-send MD.
This commit was SVN r26458.
2012-05-20 23:42:26 +00:00
Brian Barrett
2e52374847
* Split send and receive eq sizes
...
* Need to look at slot count before flowcontrol for sending to prevent
race in restart
* Need to free pending request fragments when done with the request
* A number of branch prediction optimizations for error conditions
This commit was SVN r26430.
2012-05-10 21:43:48 +00:00
Brian Barrett
0ae2277796
Add a backoff mechanism for re-establishing communication
...
This commit was SVN r26366.
2012-05-01 15:53:00 +00:00
Brian Barrett
74ade8b181
need to order the pending list before we restart
...
This commit was SVN r26365.
2012-04-30 23:06:00 +00:00
Brian Barrett
5dec52af8d
remove some now unneeded debugging
...
This commit was SVN r26364.
2012-04-30 22:50:52 +00:00
Brian Barrett
c654ee6afc
* Use triggered operations for restart barrier as well
...
This commit was SVN r26363.
2012-04-30 22:48:10 +00:00
Brian Barrett
91a9973bde
* Make flow control on by default
...
* Move alarm code back into a triggered operation
This commit was SVN r26362.
2012-04-30 22:25:40 +00:00
Brian Barrett
e6a0a1cf8a
* Make sure to release all resources on failed send
...
* Avoid triggered ops until we get everything debugged
* Simplify flowctl interface a bit
This commit was SVN r26356.
2012-04-27 21:11:01 +00:00
Brian Barrett
8a70747da2
Fix some naming that doesn't make a ton of sense
...
This commit was SVN r26277.
2012-04-18 01:05:18 +00:00
Brian Barrett
f4d4e87176
add some flow control debugging output
...
This commit was SVN r26276.
2012-04-17 23:14:05 +00:00
Brian Barrett
fe0dfc8e26
First take at flow control protocol
...
This commit was SVN r26274.
2012-04-17 21:46:21 +00:00
Brian Barrett
dde6f094eb
In preperation for flow control changes coming, always utilize ACKs for
...
message completion.
This commit was SVN r26272.
2012-04-16 17:25:27 +00:00
Mike Dubman
34acf769d4
mtl_mxm: support canceling messages
...
This commit was SVN r26256.
2012-04-09 16:02:05 +00:00
Brian Barrett
451af0e832
Ensure async progress for long unexpected messages by waiting for an
...
event on the ME. The events we're likely to see are LINK (the ME was
added to the match list), PUT (weird to see first, but means that the ME
was linked to the match list and then matched), or PUT_OVERFLOW, meaning
the message was unexpected.
This commit was SVN r26199.
2012-03-26 22:54:35 +00:00
Brian Barrett
2a26d0f9a2
Forgot to add new file in the last commit.
...
Mark ME as invalid once we see a completion event, and look for events before
trying to unlink.
This commit was SVN r26198.
2012-03-26 22:39:05 +00:00
Brian Barrett
0e91084385
* Add type field to the request structure to deal with random user requests
...
(ie, cancel)
* Implement cancel for receives. Sends are slightly more complicated...
This commit was SVN r26197.
2012-03-26 22:32:36 +00:00