1
1

3968 Коммитов

Автор SHA1 Сообщение Дата
Ralph Castain
bd8b4f7f1e Sorry for mid-day commit, but I had promised on the call to do this upon my return.
Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code.

Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch.

This commit was SVN r26242.
2012-04-06 14:23:13 +00:00
Brian Barrett
d46d55ee9b If we're locking the local window, need to wait until the lock returns.
This commit was SVN r26234.
2012-04-04 16:27:24 +00:00
Josh Hursey
d1571b027a Fix a few error return paths
This commit was SVN r26233.
2012-04-04 15:11:03 +00:00
Nathan Hjelm
b0c3c18e02 Initial upload of grdma mpool
This commit was SVN r26232.
2012-04-03 23:03:03 +00:00
Mike Dubman
ff1c84c53f revert previous commit
This commit was SVN r26206.
2012-03-29 14:07:13 +00:00
Mike Dubman
43a5775e8a performance fix: set alignment for openib internal buffers
This commit was SVN r26205.
2012-03-29 14:00:08 +00:00
Nathan Hjelm
d62c0f1872 ugni: handle smsg failure in mca_btl_ugni_ep_connect_finish
This commit was SVN r26202.
2012-03-28 05:40:16 +00:00
Brian Barrett
451af0e832 Ensure async progress for long unexpected messages by waiting for an
event on the ME.  The events we're likely to see are LINK (the ME was
added to the match list), PUT (weird to see first, but means that the ME
was linked to the match list and then matched), or PUT_OVERFLOW, meaning
the message was unexpected.

This commit was SVN r26199.
2012-03-26 22:54:35 +00:00
Brian Barrett
2a26d0f9a2 Forgot to add new file in the last commit.
Mark ME as invalid once we see a completion event, and look for events before
trying to unlink.

This commit was SVN r26198.
2012-03-26 22:39:05 +00:00
Brian Barrett
0e91084385 * Add type field to the request structure to deal with random user requests
(ie, cancel)
* Implement cancel for receives.  Sends are slightly more complicated...

This commit was SVN r26197.
2012-03-26 22:32:36 +00:00
Brian Barrett
61a090e0d1 Checking for NULL function pointers and direct-call semantics can't work
together, so implement all functions in the MTL interface for all
MTLs.  The only places NULL was still being set was for add_comm/del_comm,
and matched probe, both of which are straight forward to implement (or
return ERROR_NOT_IMPLEMENTED, since the PML can't emulate matched probe).

This commit was SVN r26194.
2012-03-26 19:27:03 +00:00
Brian Barrett
cdaf110c0f * Implement mtl_send in addition to mtl_sendi
This commit was SVN r26193.
2012-03-26 19:19:11 +00:00
Brian Barrett
27c8f71773 Start of the flow control implementation. #defined out for now.
This commit was SVN r26192.
2012-03-26 01:31:58 +00:00
Jeff Squyres
fa8980157a Fix typo.
This commit was SVN r26183.
2012-03-23 00:12:32 +00:00
Brian Barrett
cce936b94c * Implement matched probe for the CM PML. Required adding a peer field to
the ompi_message_t structure to properly initialize convertor (the peer
  is available in the request in OB1, and wasn't needed when I did the
  original implementation).
* Implement matched probe for the Portals4 MTL and add NULL function pointers
  for the other MTLs.
* Add add_comm and del_comm functions to portals4 MTL so that direct call
  almost works again.
* Add NEWS item that we've implemented matched probe

This commit was SVN r26180.
2012-03-22 22:55:59 +00:00
Brian Barrett
4d12616b64 Frank pointed out that PTL_OK is zero and PtlHandleIsEqual either returns
PTL_OK or PTL_FAIL and that I had these backwards.

This commit was SVN r26179.
2012-03-22 15:58:00 +00:00
Brian Barrett
1c6b5a1358 * Set all appropriate flags for portal table entries
* split eq into send and receive eqs so that we can control the number
  of outstanding events in send eq and ensure we never lose an ack
* Shouldn't ever truncate on short unexpected receive bocks, so don't set
  the truncate bit
* Track active vs. waiting for free short unexpected receive blocks so
  to ensure an active short unexpected receive block is posted coming out
  of flow control.  Also allow creation of "temporary" blocks which should
  be released once FREE event is received.
* Slight reorganization of some code in preparation for more flow control
  work.

This commit was SVN r26174.
2012-03-21 22:20:55 +00:00
Mike Dubman
a45898ea9c fix support for fca 2.2, warning fixes on rhel 6.x
This commit was SVN r26166.
2012-03-20 10:00:52 +00:00
Nathan Hjelm
135ac32b64 ugni: use hash table to keep track of smsg frag completion
This commit was SVN r26154.
2012-03-15 20:15:59 +00:00
Nathan Hjelm
fca42347e3 ugni: use hash table to keep track of smsg frag completion
This commit was SVN r26153.
2012-03-15 20:13:32 +00:00
Nathan Hjelm
deddf0b33e ugni: fix frag leak in sendi
This commit was SVN r26152.
2012-03-15 20:13:20 +00:00
Nathan Hjelm
99f05d56e3 ugni: updated parameters and code cleanup
This commit was SVN r26151.
2012-03-15 20:13:11 +00:00
Nathan Hjelm
921176745d vader: remove lock based fifos
This commit was SVN r26150.
2012-03-15 20:12:59 +00:00
Nathan Hjelm
4e01440b05 vader: clean frag alloc/return
This commit was SVN r26149.
2012-03-15 20:12:46 +00:00
Terry Dontje
e73df369e4 Update bfo pml with code from ob1 to support mprobe, improbe, mrecv, imrecv and cuda.
This commit was SVN r26145.
2012-03-15 10:20:46 +00:00
Christopher Yeoh
524de80eaa Adds support for Cross Memory Attach in the sm btl.
This feature can be enabled at compile time with --with-cma passed
to configure.

At runtime it is also necessary to add "--mca btl btl_sm_use_cma 1"
to the mpirun command.

If both CMA and KNEM are compiled in and enabled at runtime then 
KNEM will take precedence and CMA will disable itself

This commit was SVN r26134.
2012-03-14 06:29:09 +00:00
Mike Dubman
bd7abd72a9 in mca_mtl_mxm, don't allow negative tags for MPI_ANY_TAG
This commit was SVN r26128.
2012-03-09 22:11:14 +00:00
Rolf vandeVaart
41870ce6ee Mostly fix some of the verbose output. Also fix issue
where memory handle was blocking other registration.

This commit was SVN r26124.
2012-03-09 21:28:56 +00:00
George Bosilca
de1078a71b Thanks to Alex Margolin for pointing out this relique.
This commit was SVN r26121.
2012-03-09 14:01:45 +00:00
Pavel Shamis
102da281c4 OPENIB BTL - use orte_show_help instead of BTL_ERROR print in case ibv_reg_mr failed.
This commit was SVN r26111.
2012-03-08 09:04:03 +00:00
Mike Dubman
4e7e7d7c3f print error which is ignored on upper layer
This commit was SVN r26106.
2012-03-06 14:25:56 +00:00
George Bosilca
a78a7bd8e8 The tuned collectives can now deal with more than 2Gb of data.
This commit was SVN r26103.
2012-03-05 22:23:44 +00:00
George Bosilca
762b3e13a9 Use the correct name for the datatype destruction function.
This commit was SVN r26100.
2012-03-05 15:54:53 +00:00
George Bosilca
7d523a8852 Avoid calling the bcast with counts larger than INT_MAX.
This commit was SVN r26098.
2012-03-05 14:30:30 +00:00
George Bosilca
e8c358c188 Allow Open MPI to deal with size_t internally.
This commit was SVN r26097.
2012-03-05 14:10:26 +00:00
Abhishek Kulkarni
08ca0f80bc Fix a C/R bug where the restart hung due to
dangling fds in the openib btl.

This commit was SVN r26094.
2012-03-04 06:57:33 +00:00
George Bosilca
f83670211e Allow the user to define dynamic rules for messages larger than 2GB.
This commit was SVN r26084.
2012-03-02 21:16:23 +00:00
George Bosilca
8791ade293 Help he selection of the right algorithm for large data (> 2Gb). Thanks
to Fujitsu for the patch.

This commit was SVN r26080.
2012-03-02 19:12:22 +00:00
Terry Dontje
3e70cad203 Correct a few alignment problems to address the issue brought up in ticket #2964
This commit was SVN r26078.
2012-03-01 17:29:40 +00:00
Nathan Hjelm
f1525bdbff ob1: fix two fragment leaks
- MAJOR! get src descriptor leaks if mca_bml_base_send fails
 - minor. descriptor leaked in mca_pml_send_request_start_copy if the btl returns OMPI_ERR_RESOURCE_BUSY.

This commit was SVN r26077.
2012-03-01 15:53:39 +00:00
Mike Dubman
540b3c0c25 update mxm mtl to changes in mxm api
This commit was SVN r26073.
2012-02-29 22:02:34 +00:00
Nathan Hjelm
a7209e309a ugni: opps, sendi was missing from Makefile.am
This commit was SVN r26067.
2012-02-28 16:10:35 +00:00
Edgar Gabriel
636cf786be local_heap_sort should really be a static function.
This commit was SVN r26065.
2012-02-28 14:42:56 +00:00
Vishwanath Venkatesan
7c9c3ede61 Modified implementation for the static segmentation read algorithm with improved
performance and bug fixes.

This commit was SVN r26056.
2012-02-24 20:55:33 +00:00
Vishwanath Venkatesan
d5a9223a9a Removed a variable which was allocated but never used.
This commit was SVN r26055.
2012-02-24 20:48:52 +00:00
Vishwanath Venkatesan
326bc69df4 Modified implementation for static file write all algorithm which fixes all the previous
bugs and provides improved performance.

This commit was SVN r26054.
2012-02-24 20:47:09 +00:00
Rolf vandeVaart
b0a84b0a7d New btl that extends sm btl to support GPU transfers within a node.
Uses new CUDA IPC support.  Also, a few minor changes in PML to take
advantage of it.

This code has no effect unless user asks for it explicitly via 
configure arguments.  Otherwise, it is either #ifdef'ed out or
not compiled.

This commit was SVN r26039.
2012-02-24 02:13:33 +00:00
Nathan Hjelm
8217c46666 ompi_free_list: allocate payload if payload size > 0 in the fl_mpool = NULL case
This commit was SVN r26027.
2012-02-23 16:47:28 +00:00
Nathan Hjelm
9843cd0466 ugni: missed one more merge typo
This commit was SVN r26026.
2012-02-23 16:39:15 +00:00
Nathan Hjelm
d7cd95c802 vader: fix typo
This commit was SVN r26025.
2012-02-23 16:29:45 +00:00