Jeff Squyres
3ad89f16b8
openmpi-release.sh: haven't used "sowing" in years
2015-06-19 12:26:12 -07:00
Edgar Gabriel
3fcb88a727
Merge pull request #646 from edgargabriel/pr/non-blocking-coll
...
Pr/non blocking coll
2015-06-19 07:46:28 +02:00
rhc54
5d38283c7f
Merge pull request #647 from rhc54/topic/hangs
...
Ensure we properly commit suicide if/when we lose connection to the daemon.
2015-06-18 10:25:07 -07:00
Ralph Castain
cc9b416ab3
Ensure we properly commit suicide if/when we lose connection to the daemon. There are multiple paths by which a lost daemon can be reported, and so a race condition exists in the pmix support. Our MPI layer wants the ability to determine the response to the failure, and so it will call down to the RTE with any abort request. This comes down to the pmix layer as a "pmix_abort" command, which involves communicating the request to the daemon - who is gone. Sadly, the pmix component may not know that just yet, and so we hang.
...
So add a brief timer event to kick us out of the communication. The precise amount of time we should wait is somewhat TBD, but set something short for now and we can adjust.
2015-06-18 09:45:52 -07:00
Edgar Gabriel
dedeee9771
finishing the changes for the non-blocking and split cpllective I/O operations. Everything except for the
...
interface changes to the io framework is done.
2015-06-18 06:22:41 -05:00
Edgar Gabriel
3b11a8b61c
making the current work compile.
2015-06-18 05:56:51 -05:00
Edgar Gabriel
cc219281ba
checkpoint of the current work, since I need to resync wioth master to fix the compilation problems
2015-06-18 05:20:07 -05:00
Edgar Gabriel
100515e321
remove split collective interfaces from fcoll and their fake implemenations. Not required anymore
2015-06-18 05:20:07 -05:00
Edgar Gabriel
19cac73a9b
first part of the changes trequired to support non-blocking colelctive io operations
2015-06-18 05:20:07 -05:00
Ralph Castain
706884652f
Silence Coverity warning about failing to check return code
2015-06-17 19:24:51 -07:00
rhc54
e08ba4860f
Merge pull request #645 from rhc54/topic/binding
...
Correct the way we handle binding to objects during comm_spawn
2015-06-17 10:09:23 -07:00
Ralph Castain
869b2891c4
When doing comm-spawn, track the last object we bound to and ensure that we start the next job on the next object so we avoid overload situations when they aren't necessary
2015-06-17 09:20:08 -07:00
Jeff Squyres
8ab2b11f88
btl_openib.c: fix another compiler warning
...
Remove this unused variable
2015-06-17 09:00:12 -07:00
Jeff Squyres
f688289aaf
btl_openib.c: fix compiler warning
...
This return code is not used; tell the compiler we're not going to
use it.
2015-06-17 08:56:56 -07:00
Jeff Squyres
097b48d521
mca_base_component_respository.c: fix compiler warning
...
This function is only used in the DL case -- it can be #if'ed out if
we're not compiling with DL support to avoid a compiler warning about
defined-but-not-used.
2015-06-17 08:54:59 -07:00
Jeff Squyres
dfa36197ea
usnic/Makefile.am: ensure static builds include -lfabric
2015-06-17 08:15:29 -07:00
Gilles Gouaillardet
2cef2d0fe6
opal/memory: silence a warning
...
as reported by Coverity with CID 71663
2015-06-17 11:17:55 +09:00
Gilles Gouaillardet
ec679b3fc2
orte/orted: fix misc memory leaks
2015-06-17 11:17:55 +09:00
Gilles Gouaillardet
b72e9288bc
rmaps: fix a misc memory leak
...
as reported by Coverity with CID 1269887
2015-06-17 11:17:55 +09:00
Gilles Gouaillardet
27b4727fcf
orte/orted: fix misc memory leak
...
as reported by Coverity with CID 743448
2015-06-17 11:17:55 +09:00
Gilles Gouaillardet
ac5921d7da
orte/util: fix misc memory leak
...
as reported by Coverity with CID 1196738-1196739
2015-06-17 11:17:55 +09:00
Gilles Gouaillardet
e77d3057d6
orte-submit: fix a misc memory leak
...
as reported by Coverity with CID 710651
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
67638690ea
orte/util: fix a misc memory leak
...
as reported by Coverity with CID 710652
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
a43abceb88
fix dfs misc memory leaks
...
as reported by Coverity with CIDs 739887, 747706, 1196707-1196709
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
0f08070a1c
ompio: fix misc memory leaks
...
as identified by Coverity with CIDs 72147-72149, 731275 and 1269872
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
58d1b3f4d0
opal_os_dirpath_create: fix TOCTOU
...
as reported by Coverity with CID 70396
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
de66447ebb
opal_cmd_line_get_usage_msg: silence warning
...
as reported by Coverity with CID 1269967
2015-06-17 11:17:54 +09:00
Gilles Gouaillardet
f2f66e6e63
opal_daemon_init: silence warning
...
as reported by Coverity with CID 710642
2015-06-17 11:17:53 +09:00
Gilles Gouaillardet
8427e87ee9
opal_argv_delete: silence warning
...
as reported by Coverity with CID 71914
2015-06-17 11:17:53 +09:00
Gilles Gouaillardet
d9c490cf9f
refactor opal_bitmap_get_string
...
make it more efficient and fix CID 71992 (dead code)
2015-06-17 11:17:53 +09:00
Gilles Gouaillardet
0f17cdfc57
fcoll: fix misc memory leaks
...
as reported by Coverity with CIDs 72293,72294 and 1269894
2015-06-17 11:17:52 +09:00
Nathan Hjelm
c33b786dd9
Merge pull request #620 from hjelmn/ompi_coverity
...
ompi coverity fixes
2015-06-16 06:10:40 -06:00
rhc54
9a8bda0b72
Merge pull request #637 from jithinjosepkl/pr/pml-cm-opt
...
pml-cm bug fixes
2015-06-15 19:25:09 -07:00
rhc54
ea78f8226d
Merge pull request #631 from jsquyres/pr/remove-embedded-libfabric
...
remove embedded libfabric
2015-06-15 18:14:33 -07:00
Jithin Jose
7ccde09a09
Do opal_convertor_copy_and_prepare_for_send for buffered send mode as
...
MCA_PML_CM_HVY_SEND_REQUEST_BSEND_ALLOC calls opal_convertor_pack
directly.
Signed-off-by: Jithin Jose <jithin.jose@intel.com>
2015-06-15 17:12:50 -07:00
Howard Pritchard
7cce0159dc
Merge pull request #643 from hppritcha/topic/java_common_symbols
...
mpi/java: suppress common symbols warning
2015-06-15 18:12:40 -06:00
Howard Pritchard
224f97b009
mpi/java: suppress common symbols warning
...
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-06-15 17:13:40 -06:00
Jeff Squyres
cc66745e7a
mtl/ofi: convert to use external libfabric
...
Use the new OPAL_CHECK_LIBFABRIC macro.
2015-06-15 15:17:06 -07:00
Jeff Squyres
44e7646de9
usnic/configure.m4: convert to use external libfabric
...
Use the new OPAL_CHECK_LIBFABRIC macro.
2015-06-15 15:17:06 -07:00
Jeff Squyres
0b343fa599
opal_check_libfabric.m4: add libfabric m4 macro
...
Similar to other m4 macros checking for the existence / usability of
external libraries.
2015-06-15 15:17:05 -07:00
Jeff Squyres
3e1b85ceb3
libfabric: remove embedded libfabric
...
OMPI now only builds against external libfabric installations.
2015-06-15 15:17:05 -07:00
Jeff Squyres
c74ab51dd4
opal/mca/dl/dl.h: fix the #ifndef/#define name
...
Thanks to Scott Atchley for noticing the name mismatch.
2015-06-15 13:08:57 -07:00
Gilles Gouaillardet
ee3a1da28a
pml/ob1:mca_pml_ob1_recv_request_put_frag silence a warning
...
proc local variable is used only in heterogeneous mode
2015-06-15 10:00:53 +09:00
rhc54
adbff46a13
Merge pull request #642 from rhc54/topic/hwloc
...
Update hwloc to 1.11.0
2015-06-13 12:09:58 -07:00
Ralph Castain
ff92781ec4
Replace hwloc191 with hwloc1110
...
Fix hwloc compile. Ignore LAMA mapper due to deprecated hwloc functions
2015-06-13 10:11:45 -07:00
Jeff Squyres
4384131e65
openib: minor style and defensive programming fixes
...
Minor comment/whitespace fixes. Also some minor logic changes that
are mainly for defensive programming purposes (i.e., ensure to always
set malloc_hook_set to true or false, and then check it before we try
to actually invoke it).
2015-06-12 20:11:47 -07:00
Jeff Squyres
2f137ff151
openib: reset memalign threshhold properly
...
Now that open-mpi/ompi#638 is fixed, reset the openib BTL memalign
threshhold properly.
This effectively re-instates commit
open-mpi/ompi@ce915b5757 .
2015-06-12 20:11:47 -07:00
Jeff Squyres
88c13adc8c
openib: only set the memory hook if it is enabled
...
Instead of unconditionally setting the memory hook, only set it when
the memory hooks are both available and have been enabled (e.g.,
opal/mca/memory/linux has decided that it *can* be enabled, and when
the mpi_leave_pinned MCA param is set to 1, or is set to -1 and some
component requested the memory hooks be enabled).
If we set the memory hook when memory hooks are not enabled,
__malloc_hook will be NULL, which will cause problems when
btl_openib_malloc_hook() tries to invoke it.
Fixes open-mpi/ompi#638 .
2015-06-12 20:11:47 -07:00
rhc54
2de42c873d
Merge pull request #641 from rhc54/mq
...
Bring over George's message_queue repair
2015-06-12 15:07:38 -07:00
George Bosilca
67b70bb47a
Add multi-threaded support.
2015-06-12 14:22:17 -07:00