Ralph Castain
66fe57f746
Revert r26804 so openib can build again
...
This commit was SVN r26810.
The following SVN revision numbers were found above:
r26804 --> open-mpi/ompi@610be870f9
2012-07-19 16:16:38 +00:00
Nathan Hjelm
610be870f9
btl/openib: limit each process to a ppn fraction of the available registered memory when using mellanox hardware (mlx4 and mthca)
...
This commit was SVN r26804.
2012-07-18 17:29:48 +00:00
Nathan Hjelm
4a97ecbdd2
btl/openib: remove tab characters
...
This commit was SVN r26803.
2012-07-18 17:29:37 +00:00
Nathan Hjelm
771b427027
udcm: unmonitor the fd BEFORE tearing down the listen qp
...
This commit was SVN r26800.
2012-07-18 14:22:45 +00:00
Nathan Hjelm
35de50b823
remove the elan btl
...
This commit was SVN r26798.
2012-07-17 14:51:41 +00:00
Nathan Hjelm
fc1b295606
udcm: evict from the lru of the openib device's grdma mpool if a qp can not be created. Note: there doesn't appear to be a standard way to differentiate between ibv_create_qp failing because the node is out of registered memory and failing because no more qps are available
...
This commit was SVN r26797.
2012-07-14 01:58:29 +00:00
Nathan Hjelm
3798f38386
do not print out an error message if ibv_reg_mr fails
...
This commit was SVN r26796.
2012-07-14 01:35:45 +00:00
Nathan Hjelm
4d1920ee87
Fix a bug on 32-bit systems introduced by r26626. This fix ensures that all supported btls (with exception of wv-- shiqing will need to help bring that one up to date with r26626) set the lval in prepare_src/dst when preparing a put or get segment. This fix also ensures a consistent use of lval in put and get for both local and remote segments.
...
This commit was SVN r26793.
The following SVN revision numbers were found above:
r26626 --> open-mpi/ompi@249066e06d
2012-07-13 21:19:16 +00:00
Nathan Hjelm
344fe61616
remove assertion in udcm
...
This commit was SVN r26790.
2012-07-13 15:14:48 +00:00
Jeff Squyres
e719d6ab78
It turns out that "sppp" on the Oracle Mx000 series of servers (where x =
...
{3, 4, 5, 9}, SPARC VI-based machines) is not a 127.x.y.z interface,
so it needs to stay in the exclude list.
This commit was SVN r26789.
2012-07-13 12:11:41 +00:00
Jeff Squyres
196bc0a53e
Update the TCP BTL MCA param btl_tcp_if_exclude default value to use
...
CIDR notation 127.0.0.1/8 to ignore localhost devices instead of the
imprecise (and not always correct!) "lo,sppp".
This commit was SVN r26788.
2012-07-12 15:13:08 +00:00
Nathan Hjelm
b79a61a360
move btl_vader.c to btl_vader_module.c
...
This commit was SVN r26785.
2012-07-11 20:14:19 +00:00
Terry Dontje
6f3195faca
add some missing casts
...
This commit was SVN r26779.
2012-07-10 18:03:29 +00:00
Nathan Hjelm
05c5c1f412
remove unused i_initiate function from udcm
...
This commit was SVN r26778.
2012-07-10 17:22:19 +00:00
Jeff Squyres
bb13e21538
Roll back r26730, but bump the default CQ length base up to 1500, not
...
1000. Refs trac:3154.
IB/iWarp vendors need to get together to figure out a real fix.
This commit was SVN r26777.
The following SVN revision numbers were found above:
r26730 --> open-mpi/ompi@5315c91baf
The following Trac tickets were found above:
Ticket 3154 --> https://svn.open-mpi.org/trac/ompi/ticket/3154
2012-07-10 16:53:27 +00:00
Nathan Hjelm
4c0c937953
Remove use of ompi_ptr_ltop in BTLs. This fixes a crash seen on big-endian 32-bit platforms with MPI one-sided.
...
This commit was SVN r26776.
2012-07-10 16:18:53 +00:00
George Bosilca
7d6006a5a6
Fix various compiler warnings.
...
This commit was SVN r26774.
2012-07-10 15:57:15 +00:00
Abhishek Kulkarni
2ca8292f46
Fix a typo in the sm btl (related to CMA support).
...
This commit was SVN r26772.
2012-07-10 00:12:05 +00:00
Terry Dontje
43314776ae
add cast to correct a type mismatch warning
...
This commit was SVN r26767.
2012-07-09 18:32:39 +00:00
George Bosilca
57f08ec2c8
Make it compile!
...
This commit was SVN r26759.
2012-07-08 00:06:13 +00:00
Terry Dontje
025b42bbb7
corrected the change of pval to lval introduced in r26626
...
This commit was SVN r26751.
The following SVN revision numbers were found above:
r26626 --> open-mpi/ompi@249066e06d
2012-07-05 13:31:24 +00:00
Terry Dontje
95a3b4a423
corrected the change of pval to lval introduced in r26626
...
This commit was SVN r26732.
The following SVN revision numbers were found above:
r26626 --> open-mpi/ompi@249066e06d
2012-07-03 18:52:18 +00:00
Terry Dontje
1895ca2bc4
corrected a typo (lval instead of pval) introduced in r26626
...
This commit was SVN r26731.
The following SVN revision numbers were found above:
r26626 --> open-mpi/ompi@249066e06d
2012-07-03 17:46:43 +00:00
Jeff Squyres
5315c91baf
Fixes trac:3152: slightly more advanced than the patch on the ticket:
...
* If the MCA param btl_openib_cq_size is set to 0 (which is the
default), use the device CQ max size. Otherwise, use the MCA param
value (and never adjust it again).
* Remove the CQ size adjustment code. Since we default to max CQ
size, there really isn't much point in having it any more. I think
people setting an absolute CQ size is going to be rare, so let's
not do anything fancy with it.
* If the MCA param value is larger than what the device supports,
print a warning (only once per process) and default to using the
device max
* Add a BTL_VERBOSE displaying which CQ size we used
This commit was SVN r26730.
The following Trac tickets were found above:
Ticket 3152 --> https://svn.open-mpi.org/trac/ompi/ticket/3152
2012-07-03 16:49:59 +00:00
Nathan Hjelm
9f3717959e
remove sync step from udcm as it really isn't necessary
...
This commit was SVN r26724.
2012-07-02 22:54:44 +00:00
Pavel Shamis
f7664b3814
1. Adding 2 new components:
...
ofacm - generic connection manager for IB interconnects.
ofautils - IB common utilities and compatibility code
2. Updating OpenIB configure code
- ORNL & Mellanox Teams
This commit was SVN r26707.
2012-07-02 15:20:12 +00:00
Nathan Hjelm
a847df9ba5
ugni: fix eager get
...
This commit was SVN r26699.
2012-06-29 15:43:29 +00:00
Jeff Squyres
5d030278e1
Refs trac:3130: Per comment 8 on the ticket, this MX patch fixes the cases
...
where the MX BTL and MTL are stepping on each other regarding the
mpool. Thanks to Yong Qin for assistance in tracking this down.
This commit was SVN r26698.
The following Trac tickets were found above:
Ticket 3130 --> https://svn.open-mpi.org/trac/ompi/ticket/3130
2012-06-29 13:52:40 +00:00
Jeff Squyres
b936229b54
Refs trac:3130: fix the openib BTL to properly set the memalign malloc
...
hook early in the setup, but ''not'' during the component register
function. And then properly unset it if was set.
This commit was SVN r26697.
The following Trac tickets were found above:
Ticket 3130 --> https://svn.open-mpi.org/trac/ompi/ticket/3130
2012-06-29 13:51:36 +00:00
Jeff Squyres
f3a8722360
Fix comment.
...
This commit was SVN r26696.
2012-06-29 01:38:04 +00:00
Ralph Castain
0dfe29b1a6
Roll in the rest of the modex change. Eliminate all non-modex API access of RTE info from the MPI layer - in some cases, the info was already present (either in the ompi_proc_t or in the orte_process_info struct) and no call was necessary. This removes all calls to orte_ess from the MPI layer. Calls to orte_grpcomm remain required.
...
Update all the orte ess components to remove their associated APIs for retrieving proc data. Update the grpcomm API to reflect transfer of set/get modex info to the db framework.
Note that this doesn't recreate the old GPR. This is strictly a local db storage that may (at some point) obtain any missing data from the local daemon as part of an async methodology. The framework allows us to experiment with such methods without perturbing the default one.
This commit was SVN r26678.
2012-06-27 14:53:55 +00:00
Josh Hursey
28681deffa
Backout the ORCA commit. :(
...
There is a linking issue on Mac OSX that needs to be addressed before this is able to come back into the trunk.
This commit was SVN r26676.
2012-06-27 01:28:28 +00:00
Josh Hursey
542330e3a7
Commit of ORCA: Open MPI Runtime Collaborative Abstraction
...
This is a runtime interposition project that sits between the OMPI and ORTE layers in Open MPI.
The project is described on the wiki:
https://svn.open-mpi.org/trac/ompi/wiki/Runtime_Interposition
And on this email thread:
http://www.open-mpi.org/community/lists/devel/2012/06/11109.php
This commit was SVN r26670.
2012-06-26 21:42:16 +00:00
Nathan Hjelm
37c624ee43
prepare to delete mpool/rdma
...
This commit was SVN r26664.
2012-06-26 15:55:23 +00:00
Nathan Hjelm
2dbe630138
fix more udapl warnings/errors
...
This commit was SVN r26648.
2012-06-25 15:18:50 +00:00
Nathan Hjelm
6a0ccf41e6
one more file
...
This commit was SVN r26638.
2012-06-22 18:21:57 +00:00
Ralph Castain
e6f3586415
Remove the orte notifier framework, per discussion at the devel meeting and follow-up with Jeff (who took the action item)
...
This commit was SVN r26637.
2012-06-22 18:09:23 +00:00
Nathan Hjelm
03f00c42b8
fix udapl compile problems from r26626
...
This commit was SVN r26635.
The following SVN revision numbers were found above:
r26626 --> open-mpi/ompi@249066e06d
2012-06-22 14:20:45 +00:00
Nathan Hjelm
249066e06d
Timeout! Per RFC update the BTL interface to hide segment keys. All BTLs (with the exception of wv), all relevant PMLs, and osc/rdma have been updated for the new interface.
...
This commit was SVN r26626.
2012-06-21 17:09:12 +00:00
Nathan Hjelm
e3bc6c0f73
btl/ugni: use grdma mpool to take advantage of shared lru
...
This commit was SVN r26623.
2012-06-20 23:03:59 +00:00
Nathan Hjelm
3d86b5055e
btl/ugni: don't call opal_convertor_pack if there is nothing to pack
...
This commit was SVN r26622.
2012-06-20 23:01:37 +00:00
Yevgeny Kliteynik
df783c0472
Precise speed of FDR and EDR
...
This commit was SVN r26614.
2012-06-17 07:06:37 +00:00
Rolf vandeVaart
d6881f3a4f
Rename one function. Add some new functions that can support asynchronous CUDA copies.
...
This commit was SVN r26611.
2012-06-15 16:56:30 +00:00
Terry Dontje
634fc278d9
Fix issue with sctp config scripts not detecting netinet/in.h dependency. Also removing tabs from sctp m4 file
...
This commit was SVN r26599.
2012-06-13 10:38:28 +00:00
Jeff Squyres
13707ec0af
Remove this comment: it turns out that the benefit was to make
...
multiple SM ''modules'', not multiple SM ''mpools''.
This commit was SVN r26584.
2012-06-08 22:37:26 +00:00
Jeff Squyres
56a537a5f5
This component wasn't even in 1.5.0; no one has had a GM network in
...
forever. There is no point in carrying this component forward.
This commit was SVN r26563.
2012-06-06 21:43:54 +00:00
Nathan Hjelm
71bffa5158
ugni: update to latest btl code. bug fixes and cleanup
...
This commit was SVN r26529.
2012-05-31 20:02:41 +00:00
George Bosilca
e890a8379b
Various minor cleanups.
...
This commit was SVN r26461.
2012-05-21 13:15:24 +00:00
Rolf vandeVaart
c228bd2311
Fix broken compile. Keep in sync with sm btl.
...
This commit was SVN r26440.
2012-05-15 15:32:33 +00:00
Yevgeny Kliteynik
d59b8d5dc4
Fixing malformed error message
...
This commit was SVN r26434.
2012-05-12 21:13:42 +00:00