1
1

23053 Коммитов

Автор SHA1 Сообщение Дата
Jeff Squyres
ba429dc890 usnic: temporarily disable the BTL put method
The usnic BTL put method is currently broken.  Disable it until we can
fix it properly.
2015-07-10 06:51:03 -07:00
Jeff Squyres
f265358fbe usnic: handle FI_MSG_PREFIX differences libfabric v1.0.0->v1.1.0
In libfabric v1.0.0 (i.e., API v1.0), the usnic provider handled
FI_MSG_PREFIX inconsistently between sends and receives.  This has
been fixed in libfabric v1.1.0 (i.e., API v1.1): FI_MSG_PREFIX is
handled consistently for both sends and receives.

Run-time detect which libfabric we are running with and adapt behavior
appropriately.
2015-07-10 06:51:03 -07:00
Jeff Squyres
ddd0de6cfc usnic: make more OS-bypass memory Valgrind-defined
This helps reduce false positives when running MPI apps through
Valgrind.
2015-07-10 06:51:03 -07:00
Jeff Squyres
9bc7a54e0c usnic: correctly count CRC errors
Handle the differences between libfabric v1.0.0 and v1.1.0 in the
return value of fi_cq_readerr().

Also consolidate CRC and truncation errors into the same handling
block, since truncation errors are typically another symptom of CRC
errors.  This ensures that buffers get reposted properly.
2015-07-10 06:51:03 -07:00
Jeff Squyres
fc686f5538 usnic: make configure complain if libfabric cannot be found
Instead of silently determining that the usnic BTL can't be built,
announce that usnic is checking for libfabric support, and then
AC_MSG_RESULT the result of that check.
2015-07-10 06:45:33 -07:00
Jeff Squyres
4341639a66 Revert "configury: fix (again) XRC detection on OFED < 3.12"
@ggouaillardet is likely offline for the weekend, but master is broken
on RHEL 6.5 systems that do not have MOFED installed.  So I'm taking
the liberty of revering this commit; I'm guessing Gilles will fixup
and re-commit next week.

This reverts commit 77f8282d51d8f40f6ae988ef84c9c852de75c625.
2015-07-10 06:45:33 -07:00
Gilles Gouaillardet
77f8282d51 configury: fix (again) XRC detection on OFED < 3.12
since ibv_create_xrc_rcv_qp is now deprecated, and in order to
be "future-proof", we have to consider the case in which only XRC Domains are supported.

Thanks Paul Hargrove for the detailled report.
2015-07-10 15:31:45 +09:00
Gilles Gouaillardet
9a49d42f81 configury: fix XRC detection on OFED < 3.12
Thanks Paul Hargrove for the report
2015-07-10 10:21:46 +09:00
Rolf vandeVaart
8be2c9759b Merge pull request #702 from rolfv/pr/fix-cuda-mca-register
Make explicit call to initalize MCA parameters in common CUDA code.
2015-07-09 15:37:22 -04:00
Rolf vandeVaart
ae0f3cfee7 Make explicit call to initalize MCA parameters in common CUDA code. This allows us to view them with ompi_info and possibly modify with tools interface 2015-07-09 12:51:55 -04:00
bosilca
c0e0510db4 Merge pull request #698 from rolfv/pr/smcuda-ipc-perf-fix
Force smcuda BTL to use CUDA IPC path for all GPU buffers where possible
2015-07-09 09:54:32 +02:00
Gilles Gouaillardet
8eede3c7f1 configury: fix a typo in XRC support detection
Thanks to Ben Menadue for the report
2015-07-09 13:37:49 +09:00
Rolf vandeVaart
cdffa4724d Force smcuda BTL to use CUDA IPC path for all GPU buffers where possible 2015-07-08 17:11:25 -04:00
Rolf vandeVaart
77950374a5 Merge pull request #695 from rolfv/pr/fix-previous-cuda-macro-change
Fix errors from previous commit
2015-07-08 12:59:10 -04:00
Mike Dubman
099d7411e9 Merge pull request #696 from yosefe/topic/yalla-fix-config
yalla: fix mxm configuration parsing.
2015-07-08 19:56:59 +03:00
yosefe
103cac5bd9 yalla: fix mxm configuration parsing.
Take configuration from MXM_MPI_xx instead of MXM_PML_xx, same as mtl
mxm.
2015-07-08 19:18:23 +03:00
Rolf vandeVaart
1229dd3ed6 Fix errors from previous commit 2015-07-08 11:46:52 -04:00
Rolf vandeVaart
2797f5b8fc Merge pull request #694 from rolfv/pr/update-cuda-macro
Some minor cleanup and renaming of query function.
2015-07-08 11:09:17 -04:00
Rolf vandeVaart
6372baa7b9 Some minor cleanup and renaming of query function. 2015-07-08 10:17:35 -04:00
Jeff Squyres
d5e16839e9 mpi_f08: restore C/Fortran compatibility with F08 BUFFER_DETACH
The C int/Fint interoperability code was erroneously removed when this
function was moved to the use-mpi-f08 directory.
2015-07-08 03:55:50 -07:00
rhc54
c6bb227073 Merge pull request #692 from rhc54/topic/mapper
Fix hetero operations. An error in the hwloc utilities only allocated…
2015-07-07 13:33:42 -07:00
Ralph Castain
ed93154e43 Fix hetero operations. An error in the hwloc utilities only allocated memory for the first display of a binding map, and then assumed that all nodes had the same number of cores in them. This resulted in memory corruption whenever someone displayed a binding pattern for a hetero cluster, and a smaller node was first in line. 2015-07-07 12:52:16 -07:00
rhc54
a4aff5e3d9 Merge pull request #691 from rhc54/topic/mapper
Add a bunch of debug, and correct an error that caused us to use the …
2015-07-07 11:08:01 -07:00
Ralph Castain
7455802a36 Add a bunch of debug, and correct an error that caused us to use the wrong mapping policy when determining the default binding policy 2015-07-07 10:13:10 -07:00
Jeff Squyres
4b8967f532 Merge pull request #690 from jsquyres/pr/really-fix-f08-buffer-detatch-really-really-really
mpi_f08: move the f08 version of buffer_detach to use-mpi-f08
2015-07-07 11:17:27 -04:00
Jeff Squyres
8ff60c47cc mpi_f08: move the f08 version of buffer_detach to use-mpi-f08
@ggouaillardet was right -- we should have put the
ompi_buffer_detach_f08() function in the use-mpi-f08 directory to
begin with.  Putting it in the mpif-h directory made it complicated as
to whether the function would be built or not (e.g., whether weak
symbols were supported or not, whether the profiling layer was
disabled or not, ...etc.).

Just put it in the use-mpi-f08 directory and always build it (when the
mpi_f08 module is built, of course), and keep it simple.
2015-07-07 07:38:09 -07:00
Gilles Gouaillardet
478120ecb4 configury: fix alignment check of bool when cross-compiling 2015-07-07 11:04:07 +09:00
Gilles Gouaillardet
409874eb47 remove trigraph '??)' from comment
Fujitsu compilers issue way too many warnings because of this trigraph
2015-07-07 11:00:13 +09:00
Gilles Gouaillardet
9e89985f3d restore whitespaces into the pdf files 2015-07-07 09:17:00 +09:00
Howard Pritchard
fb9c218e5d Merge pull request #684 from nrgraham23/java_bindings
ompi/java: add MPI_Rget and MPI_Rput java bindings
2015-07-06 15:25:06 -06:00
Jeff Squyres
e28f016793 fortran only compile f08 buffer_detach once
Since there is no profiling version of the f08 buffer_detach function
(or, more specifically, the Fortran compile does the name mangling of
MPI and PMPI to the back-end C function for us), ensure that it is
only compiled once.

Also, per Gilles' observation, the f08-related #pragmas are no longer
relevant.
2015-07-06 09:52:27 -07:00
Ralph Castain
eb582b8276 Minor whitespace cleanups 2015-07-06 09:38:33 -07:00
Gilles Gouaillardet
9f171de412 btl/openib: queue pending fragments once only when running out of credit
Fixes open-mpi/ompi#640
2015-07-06 09:45:01 +09:00
rhc54
2c2945eaac Merge pull request #689 from rhc54/topic/dvm
There is no reason for tools to have an async progress thread
2015-07-05 13:17:50 -05:00
Ralph Castain
836f49597d There is no reason for tools to have an async progress thread as they can loop the event library themselves. This has the added benefit of causing the tool to "block" while waiting for events so they don't use cpu.
Also, fix orte-submit so it appropriately handles --help option
2015-07-05 10:45:28 -07:00
Jeff Squyres
4694b16256 mpi_f08: further fixes for MPI_BUFFER_DETACH
A few minor fixes in addition to
open-mpi/ompi@bd94da7de4 and
open-mpi/ompi@fc69c0be24.
2015-07-04 05:27:22 -07:00
Jeff Squyres
bd94da7de4 mpi_f08: fix MPI_BUFFER_DETACH interfaces
Add an mpi_f08-specific implementation for MPI_BUFFER_DETACH.

Per MPI-3.1:3.6, p45, the buffer argument is ignored in
MPI_BUFFER_DETACH for mpif.h and the mpi module.  But in the mpi_f08
module, the buffer argument is treated like it is in the C binding.
2015-07-03 08:56:34 -07:00
bosilca
77367ca02c Merge pull request #687 from rolfv/pr/fix-smcuda-perfprob
Add the ability use different size buffers for host and CUDA buffers
2015-07-02 18:42:41 -04:00
Jeff Squyres
4e7d979f8d Merge pull request #686 from jsquyres/pr/autogen-no-ompi-bool-fixes
bool: use SIZEOF__BOOL, not SIZEOF_BOOL
2015-07-02 12:19:07 -04:00
Jeff Squyres
cce57da0c4 openmpi-update-hg-svn.h: remove stale file
This file accidentaly got left over when we switched from SVN to git.
2015-07-02 12:17:02 -04:00
Rolf vandeVaart
30a872b478 Add the ability to send host buffers through one sized staging buffers and CUDA buffers through different sized buffers. Fixes performance issues 2015-07-02 11:11:15 -04:00
Jeff Squyres
f1353947ff libfabric: fix wrappers for static builds
Need to set the WRAPPER_EXTRA flags so that the wrappers for static
builds pull in -lfabric.

Also update/fix some comments.
2015-07-02 07:58:16 -07:00
Jeff Squyres
cd5751c217 bool: use SIZEOF__BOOL, not SIZEOF_BOOL
When you "autogen.pl --no-ompi", the AC_SIZEOF(bool) test is not run.
But we *do* run AC_SIZEOF(_Bool), which is the equivalent.  So switch
the uses of SIZEOF_BOOL in the code base to be SIZEOF__BOOL, and it's
all good.
2015-07-02 07:32:02 -07:00
rhc54
71fd76fc7c Merge pull request #685 from rhc54/topic/fence
This is the third time I am fixing this - I have no idea who or why this is being reset.
2015-07-02 09:11:07 -05:00
Ralph Castain
861fe1d9dd This is the third time I am fixing this - I have no idea who or why this is being reset. 2015-07-02 08:39:48 -05:00
Gilles Gouaillardet
6356c4b113 mpif: fix typo in interface name
* MPI_Comm_dup_with_info
 * MPI_Comm_split_type
 * PMPI_Comm_split_type
2015-07-02 17:13:07 +09:00
Gilles Gouaillardet
fa50e0d712 mpi_f08: fix typo in binding name
replace MPI_Win_flush_all_local with MPI_Win_flush_local_all
2015-07-02 15:47:38 +09:00
Jeff Squyres
24747397c8 VERSION: we have a 2.x branch, to bump major to 3 2015-07-01 12:57:27 -07:00
Nathan Hjelm
0325100823 Fix PGI compilation of opal_condition test
PGI was failing to build this test due to the {} initializers of some
of the global variables. The initializers were added to remove common
symbols from the test repository. This commit removes the initializers
and makes the variables static which should fix both the compile issue
and avoid the common symbols.

This commit additionally removes the check for
OPAL_ENABLE_MULTI_THREADS since Open MPI requires thread support.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-07-01 09:10:49 -06:00
Ralph Castain
6829e192ad Okay, that's it - trash it 2015-07-01 05:27:30 -05:00