rhc54
95c40e64b9
Merge pull request #584 from nkogteva/oob_ud_stress_test
...
oob ud: fixed a bug that prevented it from working with the QoS framework
2015-05-20 09:56:08 -06:00
Gilles Gouaillardet
b6c67e051d
io/ompio: fix misc memory leaks
...
as reported by Coverity with CIDs 72147-72149,72187,72188,731274,731275,741356,
1269889,1269893,1271535 and 1269872
2015-05-20 17:19:39 +09:00
Gilles Gouaillardet
dd28b1f680
orted/dfs: fix misc memory leaks
...
as reported by Coverity with CIDs 739887, 747706, 1196707-1196709 and 1269849
2015-05-20 13:09:46 +09:00
Howard Pritchard
62a278d29c
Merge pull request #590 from hppritcha/topic/coverity_133
...
pmix/base: fix coverity error
2015-05-18 06:52:37 -06:00
Gilles Gouaillardet
69f900ab9d
libfabric: check the psm_epconn_t type is available before building the PSM provider
...
embedded libfabric configury does it its own way, so "backport" ofiwg/libfabric#1031
2015-05-18 14:04:41 +09:00
Howard Pritchard
0980423c5f
pmix/base: fix coverity error
...
Remove some obviously dead code and thus fix a coverity
error - CID #133
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-16 13:24:03 -06:00
Howard Pritchard
4d77897d70
Merge pull request #589 from hppritcha/topic/fix_gni_common_symbol
...
btl/ugni: silence common symbol squawk
2015-05-16 12:38:56 -06:00
Howard Pritchard
d9f080b0c7
btl/ugni: silence common symbol squawk
...
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-16 10:23:06 -05:00
Howard Pritchard
00dafb39f6
Merge pull request #586 from hppritcha/topic/pmix_cray_loc_fix
...
pmix/cray: fix locality setting
2015-05-15 16:34:50 -06:00
Howard Pritchard
a1d65cfd8b
pmix/cray: fix locality setting
...
Code for setting proc node locality
was absent after the removal of Cray
PMI KVS usage. This commit puts that
functionality back in place.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-15 12:17:15 -07:00
Ralph Castain
7929387b4c
Merge branch 'master' of https://github.com/open-mpi/ompi
2015-05-15 07:14:27 -06:00
Ralph Castain
d3d3e73099
Per request from George, use defined(__APPLE__) instead of OPAL_HAVE_MAC. Don't try to close a negative socket
2015-05-15 07:13:42 -06:00
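Note on "Don't try to close a negative socket": a minimal sketch of the guard this subject line describes, assuming the usual convention that a negative descriptor means "not open". The helper name `close_if_open` is hypothetical and is not taken from the Open MPI sources.

```c
#include <unistd.h>

/* Hypothetical helper (not from the Open MPI tree): close *fd only when it
 * holds a valid descriptor, then mark it as closed so a second call is a
 * harmless no-op. Returns 0 if there was nothing to close or close()
 * succeeded, and -1 if close() failed. */
int close_if_open(int *fd)
{
    int rc = 0;

    if (*fd >= 0) {
        rc = close(*fd);
        *fd = -1;   /* never reuse a descriptor after close() */
    }
    return rc;
}
```

Resetting the descriptor to -1 after closing makes repeated cleanup paths safe, which is one common way to avoid passing a stale or negative value to `close()`.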
Gilles Gouaillardet
c05b271c68
man: fix a trivial typo in MPI_Neighbor_allgather.3in
2015-05-15 16:02:01 +09:00
George Bosilca
675dccf9d9
Print the port in host byte order.
2015-05-15 00:14:28 -04:00
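Note on "Print the port in host byte order": port numbers stored in a `sockaddr_in` are in network byte order, so they need `ntohs()` before being printed. The sketch below illustrates that conversion only; the helper name `format_port` is hypothetical and does not reflect the code the commit touched.

```c
#include <arpa/inet.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper (not from the Open MPI tree): write a port that is
 * stored in network byte order into buf as a decimal string. ntohs()
 * converts it to host byte order first, so the printed value is the same
 * on little-endian and big-endian machines. Returns snprintf()'s result. */
int format_port(char *buf, size_t len, uint16_t port_net)
{
    return snprintf(buf, len, "%u", (unsigned) ntohs(port_net));
}
```

Without the `ntohs()` call, a little-endian host would print the byte-swapped value instead of the real port.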
Ralph Castain
0a345d34e6
Plug the memory leak identified by George
2015-05-14 21:33:48 -06:00
Howard Pritchard
578430c36d
oob/alps: remove comment with personal reference
...
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-14 20:06:21 -07:00
Ralph Castain
8e30579e6e
The Mac appears to have problems with the keepalive support - once keepalive starts, the memory footprint soars. So disable keepalive on the Mac
2015-05-14 18:09:13 -06:00
Gilles Gouaillardet
1488e82efd
osc/pt2pt: enable heterogeneous support
2015-05-14 16:42:48 +09:00
Gilles Gouaillardet
c4ebdba035
always align words if heterogeneous support is enabled
2015-05-14 15:54:21 +09:00
Gilles Gouaillardet
973a9ec247
configury: fix the error message of the --enable-mpi-fortran option
2015-05-14 15:39:43 +09:00
MPI Team
6dce955e55
update github wiki script: remove debugging comment
2015-05-13 19:05:01 -04:00
Jeff Squyres
bd0a4f0f8b
Script to update Github wiki ComponentOwners page
...
First cut.
2015-05-13 18:58:02 -04:00
Jeff Squyres
ccfee0cd2d
check-owner.pl: fix comments
2015-05-13 18:31:12 -04:00
Todd Kordenbrock
c42e277385
mtl-portals4: thread multiple updates
...
When activating short receive blocks on the overflow list, remove
the PTL_ME_EVENT_LINK_DISABLE flag so the event gets generated.
Without PTL_EVENT_LINK, the block status can't reach the activated
state.
Replace #ifdef with #if for Open MPI configure booleans, because
Open MPI configure booleans are always defined and the value must
be checked.
2015-05-13 17:06:18 -05:00
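Note on the `#ifdef` to `#if` change above: a macro that is always defined, as either 0 or 1, passes an `#ifdef` test regardless of its value. The sketch below shows the difference with a made-up macro, `EXAMPLE_ENABLE_FEATURE`, standing in for a configure-generated boolean; it is not a real Open MPI symbol.

```c
/* Hypothetical stand-in for a configure-generated boolean. Such macros are
 * always defined; here the feature is switched off. */
#define EXAMPLE_ENABLE_FEATURE 0

/* Incorrect check: #ifdef only asks whether the macro exists, so this
 * reports the feature as enabled even though its value is 0. */
int feature_enabled_ifdef(void)
{
#ifdef EXAMPLE_ENABLE_FEATURE
    return 1;
#else
    return 0;
#endif
}

/* Correct check: #if evaluates the macro's value. */
int feature_enabled_if(void)
{
#if EXAMPLE_ENABLE_FEATURE
    return 1;
#else
    return 0;
#endif
}
```

With the macro defined as 0, the `#ifdef` version returns 1 and the `#if` version returns 0, which is the mismatch the commit message describes.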
Nadezhda Kogteva
d9dcf8352e
oob ud: fixed a bug that prevented it from working with the QoS framework (oob_stress_channel test)
2015-05-13 11:40:01 +03:00
Yohann Burette
27f1884cf8
mtl/ofi: Reworked header files. Added compat to ease maintenance.
2015-05-12 15:47:50 -07:00
rhc54
b59fa14004
Merge pull request #583 from rhc54/topic/mallocwarnings
...
Silence malloc(0) warnings reported by Lisandro
2015-05-12 13:37:38 -07:00
Ralph Castain
9a70765f27
Silence malloc(0) warnings reported by Lisandro
2015-05-12 12:38:58 -07:00
Jeff Squyres
8e8d104520
oob ud: ibv_get_device_list()==NULL can mean no devices present
...
...which is not an error. Don't complain about it.
2015-05-12 10:54:39 -07:00
Gilles Gouaillardet
5142194058
oshmem: there's no fortran sentinels in oshmem
...
Turns out that this is just copy-n-pasted code from OMPI. To be
clear: there's no need for the oshmem layer to instantiate sentinels
like mpi_fortran_bottom.
Thanks @jsquyres for pointing this out.
2015-05-12 13:01:16 +09:00
Jeff Squyres
8f941a6613
oob ud: better error msgs, tolerate systems without UD devices
...
It is perfectly ok to be on a system without UD devices.
Also, make some of the error messages better -- so that the user has a
clue about where the error messages are coming from, and what they
should do.
2015-05-11 13:11:51 -07:00
Jeff Squyres
e95010b095
common verbs: only install fake usnic driver when relevant
...
Only install the fake usnic libibverbs driver when there are actually
usnic kernel devices present. This prevents some run-time weirdness
on the Cray verbs emulation environment, where apparently
ibv_register_driver() either is not implemented or does not work
properly.
2015-05-11 12:57:06 -07:00
Ryan Grant
bbeaf41a52
Merge pull request #580 from tkordenbrock/topic/mtl.add.status.to.short.recv.blocks
...
mtl-portals4: add status to short recv blocks to coordinate out of or…
2015-05-11 13:44:45 -06:00
Ryan Grant
265682bdb9
Merge pull request #581 from tkordenbrock/topic/remove.overlapping.multiMD.code
...
portals4: use a single Memory Descriptor to cover all of memory
2015-05-11 13:20:32 -06:00
George Bosilca
78f5f0f8a9
Show the name of the collective that failed to get initialized.
2015-05-11 15:10:37 -04:00
Mike Dubman
894ba28390
Merge pull request #559 from nkogteva/oob_ud
...
oob ud: made component more user adaptive; opal outputs were replaced by...
2015-05-11 21:09:28 +03:00
Todd Kordenbrock
9df163f116
portals4: use a single Memory Descriptor to cover all of memory
...
In days past, some implementations of Portals4 could not cover all
of memory with a single Memory Descriptor so multiple large
overlapping Memory Descriptors were created. Because none of the
current implementations have this limitation (and no future
implementations should either), this commit removes the overlapping
Memory Descriptors code.
2015-05-11 11:49:41 -05:00
Todd Kordenbrock
074583060d
mtl-portals4: add status to short recv blocks to coordinate out of order events
...
If OMPI is initialized as thread multiple, then it is possible for
Portals events to be processed out of order by different threads.
Out of order events could lead to reactivation of the block
(PTL_EVENT_AUTO_FREE) before the block is removed from the active
list (PTL_EVENT_AUTO_UNLINK). This commit adds a status field to
ompi_mtl_portals4_recv_short_block_t that coordinates these events.
2015-05-11 11:48:25 -05:00
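Note on the status field described above: one way to coordinate two events that may arrive in either order is a small state machine that allows reactivation only once both have been seen. The sketch below is an illustration under that assumption, not the actual `ompi_mtl_portals4_recv_short_block_t` code. All type, enum, and function names are hypothetical, and a thread-multiple build would also need a lock or atomic update around the status, which is omitted here.

```c
/* Hypothetical status values for a short receive block. */
typedef enum {
    BLOCK_ACTIVE,          /* linked; neither event seen yet */
    BLOCK_WAITING_FREE,    /* unlink event seen, free event still pending */
    BLOCK_WAITING_UNLINK,  /* free event seen first (out of order) */
    BLOCK_READY            /* both events seen; safe to reactivate */
} block_status_t;

/* Hypothetical stand-ins for PTL_EVENT_AUTO_UNLINK / PTL_EVENT_AUTO_FREE. */
typedef enum { EV_AUTO_UNLINK, EV_AUTO_FREE } block_event_t;

/* Advance the status for one event. Returns 1 exactly when this event
 * completes the pair and the block may be reactivated, 0 otherwise. */
int block_handle_event(block_status_t *status, block_event_t ev)
{
    switch (*status) {
    case BLOCK_ACTIVE:
        *status = (EV_AUTO_UNLINK == ev) ? BLOCK_WAITING_FREE
                                         : BLOCK_WAITING_UNLINK;
        return 0;
    case BLOCK_WAITING_FREE:
        if (EV_AUTO_FREE == ev) {
            *status = BLOCK_READY;
            return 1;
        }
        return 0;
    case BLOCK_WAITING_UNLINK:
        if (EV_AUTO_UNLINK == ev) {
            *status = BLOCK_READY;
            return 1;
        }
        return 0;
    default:
        return 0;   /* already ready; ignore duplicate events */
    }
}
```

In this sketch the block reaches `BLOCK_READY` whether the unlink or the free event is processed first, so reactivation never happens while the block is still on the active list.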
Ralph Castain
3cee4152fc
Fix the intercommunicator issue reported by Gilles. Instead of directly checking the reachability bitmap, ask the component if the proc is reachable when doing a send, as the component is the final arbiter in such cases. Recirculate any messages that a daemon is trying to send to avoid race conditions. Clean up listener sockets so we don't leak them
2015-05-11 09:16:25 -07:00
Howard Pritchard
8497a6a140
Merge pull request #577 from hppritcha/topic/ess_alps_fix
...
ess/alps: remove unnecessary vpid calc
2015-05-10 07:17:13 -06:00
Howard Pritchard
3382d3ce61
ess/alps: remove unnecessary vpid calc
...
There was a redundant computation of the vpid
for orted's happening in ess/alps rte_init
method. Keep the more efficient alps based
method.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-09 20:07:38 -07:00
Ralph Castain
a49d965332
Remove stale file
2015-05-09 07:08:50 -07:00
Ralph Castain
b5382c9bf9
Rework the OOB selection logic to allow a component (e.g., usock) to direct that it be the sole active component. Remove prior disqualifying code in the oob/tcp component as it was too restrictive - if usock wasn't able to run, it left apps with no way to communicate to their daemon. Have the local daemon check the global modex for the RML URI info of the local procs so it can route messages between them when tcp is the primary channel.
...
A few other minor cleanups included.
2015-05-08 11:15:21 -07:00
Howard Pritchard
94576993b0
Merge pull request #574 from hppritcha/topic/ugni_common_symbol
...
common/ugni: fix common symbol problem
2015-05-08 05:55:15 -06:00
Howard Pritchard
341b773724
common/ugni: fix common symbol problem
...
Stop nagging of common symbol detection script for ugni
stuff.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-08 04:27:01 -07:00
Jeff Squyres
7a577c0ed2
libfabric: delete tarball
...
Oops -- the tarball itself should not have been committed to the repo.
2015-05-08 03:24:09 -07:00
Gilles Gouaillardet
650289bc33
romio314: update one more romio->romio314 name
...
Also missed this in open-mpi/ompi@db257cdbc0 .
2015-05-08 18:26:33 +09:00
Ralph Castain
6e95bcd583
Fix typo in oob_tcp.c when IPV6 enabled. Clean up a few other warnings, including a typo in coll_sm that prevented that component from registering its MCA params!
2015-05-07 21:05:08 -07:00
Gilles Gouaillardet
f1258c3b6c
ompi/errhandler: make most ompi_err_* variables static
...
Thanks @hjelmn for pointing this out!
2015-05-08 10:11:59 +09:00
Gilles Gouaillardet
a80fda25d8
orte: rename the global variable component_map into orte_component_map
...
Thanks @goodell for pointing this out!
2015-05-08 10:11:59 +09:00