Commit graph

25464 commits

Author SHA1 Message Date
rhc54
19a2dbb04f Merge pull request #1915 from rhc54/topic/connect
Support timeout values when performing connect/accept operations. Bum…
2016-07-28 15:51:06 -07:00
Jeff Squyres
cc651408dc help-orterun: remove blank line at end of help message
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-28 14:53:34 -07:00
Ralph Castain
cacb582ecd Support timeout values when performing connect/accept operations. Bump default timeout to 10 minutes so folks have time to start the partnering application
2016-07-28 14:09:06 -07:00
Nathan Hjelm
c281bd3c7f Merge pull request #1908 from hjelmn/udreg_fix
rcache/udreg: make reference count thread safe
2016-07-28 09:27:16 -06:00
Nathan Hjelm
6f7e680c57 Merge pull request #1909 from hjelmn/threads
Fix performance regression caused by enabling opal thread support
2016-07-28 08:41:13 -06:00
Nathan Hjelm
a8c3699484 Fix performance regression caused by enabling opal thread support
This commit adds opal_using_threads() protection around the atomic
operation in OBJ_RETAIN/OBJ_RELEASE. This resolves the performance
issues seen when running psm with MPI_THREAD_SINGLE.

To avoid issues with header dependencies opal_using_threads() has been
moved to a new header (thread_usage.h). The OPAL_THREAD_ADD* and
OPAL_THREAD_CMPSET* macros have also been relocated to this header.

This commit is cherry-picked off a fix that was submitted for the v1.8
release series but never applied to master. This fixes part of the
problem reported by @nysal in #1902.

(cherry picked from commit open-mpi/ompi-release@ce91307918)

Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2016-07-28 07:01:27 -06:00
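The guard described in a8c3699484 can be illustrated with a minimal sketch. It assumes C11 atomics, and every name in it (`demo_threads_in_use`, `demo_using_threads`, `demo_obj_retain`, `demo_obj_release`) is a hypothetical stand-in, not the OPAL macros or headers themselves. The idea shown is only that the atomic read-modify-write is taken when threads are in use, and a plain update is used otherwise.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical runtime flag: true once the application uses threads. */
static bool demo_threads_in_use = false;

typedef struct {
    atomic_int ref_count;
} demo_object_t;

static inline bool demo_using_threads(void)
{
    return demo_threads_in_use;
}

/* Adjust the reference count by delta and return the new value. */
static inline int demo_ref_add(demo_object_t *obj, int delta)
{
    if (demo_using_threads()) {
        /* Threads in use: a full atomic read-modify-write is required. */
        return atomic_fetch_add(&obj->ref_count, delta) + delta;
    }
    /* Single-threaded: a relaxed load and store avoid the atomic RMW. */
    int value = atomic_load_explicit(&obj->ref_count, memory_order_relaxed) + delta;
    atomic_store_explicit(&obj->ref_count, value, memory_order_relaxed);
    return value;
}

static inline int demo_obj_retain(demo_object_t *obj)
{
    return demo_ref_add(obj, 1);
}

static inline int demo_obj_release(demo_object_t *obj)
{
    return demo_ref_add(obj, -1);
}
```

The trade-off in this sketch is that the flag must be set before any second thread touches the object; how the real code decides that is not shown here.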
Gilles Gouaillardet
a0a999e63d coll/base: fix ompi_coll_base_allgatherv_intra_basic_default() with MPI_IN_PLACE
2016-07-28 13:57:18 +09:00
Gilles Gouaillardet
b8a1ffb87e coll/base: fix ompi_coll_base_allgatherv_intra_basic_default()
Fixes open-mpi/ompi#1907
2016-07-28 13:50:04 +09:00
Nathan Hjelm
4658b761e4 rcache/udreg: make reference count thread safe
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-27 13:40:35 -06:00
Nathan Hjelm
1eb4ef438e Merge pull request #1903 from hjelmn/openib_fixes
btl/openib: set send flags only after endpoint is connected
2016-07-27 09:01:49 -06:00
Howard Pritchard
1dc7e9ed8f Merge pull request #1904 from hppritcha/topic/fix_cray_srun_native_launch
pmix/cray: switch to using wildcards for some
2016-07-27 07:12:02 -06:00
Jeff Squyres
2e0c3c7d77 libompitrace: explicitly set the .so version
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-27 07:05:58 -04:00
Howard Pritchard
b65bbe017f pmix/cray: switch to using wildcards for some
items so that at least srun native launch on
cray works again.

More issues to fix when using alps.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2016-07-26 17:07:58 -05:00
Nathan Hjelm
5e13e1ab7d btl/openib: set send flags only after endpoint is connected
The max inline send size on a queue pair is not available until after
the endpoint is connected. Before this commit the send flags
(including the inline flag) were set before this value was
initialized. This commit moves setting the send_flags down to
mca_btl_openib_put_internal which is only called after the endpoint is
connected. This fixes a bug when using osc/rdma.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-26 16:01:11 -06:00
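The ordering problem described in 5e13e1ab7d can be sketched as follows. This is an illustration under assumptions, not the btl/openib code: the endpoint type, the flag values, and the functions `demo_connect` and `demo_put_flags` are all hypothetical. It shows only why the inline decision is deferred to the point where a put is issued, after the inline limit is known.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical flag bits for a send descriptor. */
#define DEMO_SEND_SIGNALED 0x1u
#define DEMO_SEND_INLINE   0x2u

typedef struct {
    bool   connected;
    size_t max_inline;   /* only meaningful once connected is true */
} demo_endpoint_t;

/* Connection setup is where the inline limit becomes known. */
static void demo_connect(demo_endpoint_t *ep, size_t negotiated_max_inline)
{
    ep->max_inline = negotiated_max_inline;
    ep->connected  = true;
}

/* Compute send flags when the put is issued, never earlier. */
static unsigned demo_put_flags(const demo_endpoint_t *ep, size_t len)
{
    unsigned flags = DEMO_SEND_SIGNALED;
    if (ep->connected && len <= ep->max_inline) {
        flags |= DEMO_SEND_INLINE;
    }
    return flags;
}
```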
Nathan Hjelm
95dead0151 Merge pull request #1891 from ggouaillardet/topic/openib_exp_atomic_cap
configury + btl/openib: fix a typo
2016-07-26 13:38:55 -06:00
rhc54
292563f94d Merge pull request #1899 from rhc54/topic/nodeid
Pass the nodeid for each proc in the job. Fix a mistaken error output message
2016-07-25 17:24:40 -07:00
Ralph Castain
9ab20cafe3 Pass the nodeid for each proc in the job. Fix a mistaken error output message
2016-07-25 15:41:15 -07:00
Artem Polyakov
9807a6d585 Merge pull request #1883 from karasevb/fix_mca_spml_call
oshmem/fortran: fix warning mesages
2016-07-22 16:59:11 +06:00
Boris Karasev
49b67094e0 oshmem/fortran: fix warning mesages && fix size
2016-07-22 15:54:01 +06:00
Gilles Gouaillardet
bbc6d4b3d4 ompi/communicator: remove an other debug print statement in ompi_comm_allreduce_intra_pmix_nb()
2016-07-22 15:42:56 +09:00
Gilles Gouaillardet
91ccec342c btl/openib: remove some dead code
remove useless call to opal_mem_hooks_support_level() and the value local variable.
2016-07-22 09:26:33 +09:00
Gilles Gouaillardet
1b3be0ac8c configury + btl/openib: fix a typo
test for existence of struct ibv_exp_device_attr.exp_atomic_cap.
That was previously mistyped struct ibv_exp_device_attr.ext_atomic_cap
2016-07-22 09:26:33 +09:00
Edgar Gabriel
2170eee5b7 Merge pull request #1882 from edgargabriel/pr/coverty-fixes
fcoll/static and io/ompio: fix coverty warnings
2016-07-21 16:12:15 -05:00
Edgar Gabriel
a899c0fb38 fcoll/static: fix coverty warnings
fix coverty warnings CID 72144, CID 710677, CID 1364164
2016-07-21 13:08:15 -05:00
rhc54
941d4fdb8b Merge pull request #1897 from rhc54/topic/fixes
Cleanup the new naming requirements to ensure that info is correctly retrieved
2016-07-21 11:05:09 -07:00
Ralph Castain
71de03fc67 Cleanup the new naming requirements to ensure that info is correctly retrieved
Cleanup permissions

Restore singleton operations
2016-07-21 09:46:03 -07:00
rhc54
58e9caca73 Merge pull request #1892 from rhc54/topic/covclean
Cleanup Coverity warnings
2016-07-21 08:20:53 -07:00
Jeff Squyres
af15ec31bb Merge pull request #1889 from ggouaillardet/topic/test_travis
test
2016-07-21 09:31:51 -04:00
Ralph Castain
2b55ee8118 Cleanup Coverity warnings
2016-07-20 20:31:58 -07:00
Todd Kordenbrock
37ad6aa711 Merge pull request #1853 from PDeveze/Patchs-on-osc-portals4
Patchs on osc portals4
2016-07-20 09:22:19 -05:00
Todd Kordenbrock
210534adb3 Merge pull request #1850 from PDeveze/Patchs-on-mtl-portals4
Patchs on mtl portals4
2016-07-20 08:21:03 -05:00
rhc54
4bc5048608 Merge pull request #1888 from rhc54/topic/pmixup
Update pmix2 component
2016-07-20 06:14:05 -07:00
Ralph Castain
01a653d50a Remove a debug print in comm_cid.c. Update PMIx2 to include the revised PMIx_Get logic for higher performance by reducing the number of hash table lookups. Fix a bug where requests for data from a proc in another nspace could hang, or result in "not found".
Remove stale file reference

Restore autogen pass thru pmix

Remove generated file
2016-07-20 00:58:19 -07:00
Gilles Gouaillardet
936967e121 Merge pull request #1887 from ggouaillardet/topic/configury_fix
configury: fix autogen-sh --no-ompi
2016-07-20 16:18:51 +09:00
Gilles Gouaillardet
af23dcc123 travis: use gcc 6 and run brew upgrade
brew update seems to remove /usr/local/Library/ENV, which breaks glibtool,
so manually upgrade a few formulas
2016-07-20 15:48:36 +09:00
Gilles Gouaillardet
252fadf099 ompi: fix #if vs #ifdef HAVE___MALLOC_INITIALIZE_HOOK usage
2016-07-20 13:18:11 +09:00
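The commit above does not say which directive replaced which, so the following is only a general illustration of why `#if` and `#ifdef` are not interchangeable. The macro `HAVE_DEMO_FEATURE` is hypothetical; the point is that a configure-style macro defined to 0 is "defined" for `#ifdef` but false for `#if`.

```c
/* Hypothetical configure result: the macro exists but the feature is absent. */
#define HAVE_DEMO_FEATURE 0

/* #ifdef only asks whether the macro is defined at all. */
static int demo_seen_by_ifdef(void)
{
#ifdef HAVE_DEMO_FEATURE
    return 1;
#else
    return 0;
#endif
}

/* #if evaluates the macro's value, so a definition of 0 is false. */
static int demo_seen_by_if(void)
{
#if HAVE_DEMO_FEATURE
    return 1;
#else
    return 0;
#endif
}
```

Which directive is right depends on how the build system emits the macro: always defined as 0 or 1, or defined only when the feature is present.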
Gilles Gouaillardet
3a32c7d8b6 configury: fix autogen-sh --no-ompi
This fixes a regression from open-mpi/ompi@dc5adc5a91

Fortran is only required by ompi, so m4_ifdef([project_ompi],...) protect Fortran related stuff in opal

Fixes open-mpi/ompi#1884
2016-07-20 12:23:46 +09:00
Ralph Castain
19a1837713 Revert "configury: fix autogen-sh --no-ompi"
This reverts commit f3d3633e74.
2016-07-19 20:09:56 -07:00
Ralph Castain
36a9063466 Silence warnings
2016-07-19 17:36:13 -07:00
Gilles Gouaillardet
f3d3633e74 configury: fix autogen-sh --no-ompi
This fixes a regression from open-mpi/ompi@dc5adc5a91

Fortran is only required by ompi, so m4_ifdef([project_ompi],...) protect Fortran related stuff in opal

Fixes open-mpi/ompi#1884
2016-07-20 09:25:14 +09:00
Nathan Hjelm
40f71f2d7a Merge pull request #1873 from hjelmn/comm_split_update
Improve MPI_Comm_split_type scalability
2016-07-19 14:36:44 -06:00
Nathan Hjelm
5edab9cb22 Merge pull request #1855 from hjelmn/comm_rework
ompi/comm: refactor communicator cid code
2016-07-19 10:04:17 -06:00
Pascal Deveze
9cac32ba6a mtl/portals4: Modifications concerning the short message management
2016-07-19 11:21:50 +02:00
Pascal Deveze
49e9936914 mtl/portals4: Some little patches
2016-07-19 11:18:55 +02:00
Nathan Hjelm
ced853476f Merge pull request #1878 from hjelmn/f_rops
ompi/fortran: fix typos in request RMA bindings
2016-07-18 13:48:41 -06:00
Nathan Hjelm
8bdcb40dc4 ompi/fortran: fix typos in request RMA bindings
This commit fixes typos on the C side of the request-based RMA binding. We
were not returning the request on success but on failure. Thanks to
@alazzaro for reporting and @ggouaillardet, and @vondele for tracking
this down.

Fixes part of open-mpi/ompi#1869

Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2016-07-18 13:46:28 -06:00
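The class of bug described in 8bdcb40dc4 (an output handle written back on failure instead of on success) can be shown with a small sketch. Everything here is hypothetical: `demo_backend` stands in for the underlying C call, `demo_wrapper` for a language-binding wrapper, and the handle value 42 is arbitrary. It is not the Open MPI binding code.

```c
#define DEMO_SUCCESS 0
#define DEMO_ERROR   1

/* Hypothetical backend call: produces a request handle on success only. */
static int demo_backend(int should_fail, int *c_request)
{
    if (should_fail) {
        return DEMO_ERROR;
    }
    *c_request = 42;   /* arbitrary handle value for the sketch */
    return DEMO_SUCCESS;
}

/* Wrapper in the style of a binding: report the error code and hand the
 * request back to the caller only when the call succeeded. */
static void demo_wrapper(int should_fail, int *out_request, int *ierr)
{
    int c_request = -1;
    int rc = demo_backend(should_fail, &c_request);

    *ierr = rc;
    if (DEMO_SUCCESS == rc) {      /* the inverted test (!=) is the bug pattern */
        *out_request = c_request;
    }
}
```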
Nathan Hjelm
4c49c42dd0 ompi/comm: improve comm_split_type scalability
This commit introduces a new algorithm for MPI_Comm_split_type. The
old algorithm performed an allgather on the communicator to decide
which processes were part of the new communicators. This does not
scale well in either time or memory.

The new algorithm performs a couple of all reductions to determine the
global parameters of the MPI_Comm_split_type call. If any rank gives
an inconsistent split_type (as defined by the standard) an error is
returned without proceeding further. The algorithm then creates a
communicator with all the ranks that match the split_type (no
communication required) in the same order as the original
communicator. It then does an allgather on the new communicator (which
should be much smaller) to determine 1) if the new communicator is in
the correct order, and 2) if any ranks in the new communicator
supplied MPI_UNDEFINED as the split_type. If either of these
conditions are detected the new communicator is split using
ompi_comm_split and the intermediate communicator is freed.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-18 12:47:05 -06:00
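The final check in the algorithm described in 4c49c42dd0 can be sketched without MPI. The function below assumes the per-rank values gathered on the intermediate communicator are already available as arrays; `demo_needs_resplit` and `DEMO_UNDEFINED` are hypothetical names, and treating "correct order" as non-decreasing keys is an assumption about how the ordering test works, not something the commit message states.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for MPI_UNDEFINED. */
#define DEMO_UNDEFINED (-1)

/* Given the values gathered from the members of the intermediate
 * communicator (in original rank order), decide whether the fallback
 * split is needed. */
static bool demo_needs_resplit(const int *split_types, const int *keys, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        /* Condition 2: a member asked to be excluded. */
        if (DEMO_UNDEFINED == split_types[i]) {
            return true;
        }
        /* Condition 1: original order disagrees with the requested keys. */
        if (i > 0 && keys[i] < keys[i - 1]) {
            return true;
        }
    }
    return false;
}
```

When the function returns false, the intermediate communicator can be used as the result directly, which is the cheap path the commit message describes.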
Nathan Hjelm
035c2e2e2a ompi/comm: refactor communicator cid code
This commit simplifies the communicator context ID generation by
removing the blocking code. The high level calls: ompi_comm_nextcid
and ompi_comm_activate remain but now call the non-blocking variants
and wait on the resulting request. This was done to remove the
parallel paths for context ID generation in preperation for further
improvements of the CID generation code.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-18 12:47:05 -06:00
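The structure described in 035c2e2e2a, a blocking entry point that starts the non-blocking variant and waits on its request, can be sketched in plain C. The request type, the three-step "protocol", and the names `demo_nextid_nb`, `demo_progress`, and `demo_nextid` are all hypothetical; the sketch shows only the single-code-path shape, not the real context ID logic.

```c
#include <stdbool.h>

/* Hypothetical request tracking a multi-step non-blocking operation. */
typedef struct {
    int  remaining_steps;
    int  result;
    bool complete;
} demo_request_t;

/* Non-blocking variant: start the operation and return immediately. */
static void demo_nextid_nb(int start, demo_request_t *req)
{
    req->remaining_steps = 3;   /* arbitrary number of rounds for the sketch */
    req->result = start;
    req->complete = false;
}

/* Advance the operation by one step; stands in for a progress engine. */
static void demo_progress(demo_request_t *req)
{
    if (req->complete) {
        return;
    }
    req->result += 1;
    if (0 == --req->remaining_steps) {
        req->complete = true;
    }
}

/* Blocking variant: no separate implementation, just start and wait. */
static int demo_nextid(int start)
{
    demo_request_t req;
    demo_nextid_nb(start, &req);
    while (!req.complete) {
        demo_progress(&req);
    }
    return req.result;
}
```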
Nathan Hjelm
8081cf651e Merge pull request #1881 from hjelmn/pmix_bug_fix
pmix/pmix2x: add missing increment in loop
2016-07-18 12:01:59 -06:00
Edgar Gabriel
31b7be6e88 Merge pull request #1851 from edgargabriel/pr/ompio-reorg
Some ompio code cleanup and reoarganization
2016-07-18 12:19:34 -05:00