
27409 commits

Author SHA1 Message Date
Artem Polyakov
79c10c884d orte/pmix/server: Fix direct modex response with error status
`send_error()` is only packing status and peer info in the reply.
While remote counterpart in `pmix_server_dmdx_resp()` expects
the "hotel room number" to proceed correctly.

Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-20 23:50:57 +07:00
Howard Pritchard
0386c26177 Merge pull request #3860 from hppritcha/readme_update_pgi-os-x
README: say we don't support PGI on OS-X
2017-07-20 08:10:21 -06:00
Ralph Castain
7c6bd7d48d Merge pull request #3302 from ggouaillardet/poc/hwloc2
POC: add support for hwloc v2
2017-07-20 07:57:13 -05:00
Gilles Gouaillardet
593e4ce63f hwloc: add hwloc2x
internal hwloc 2x is used with --with-hwloc=future

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:51 +09:00
Gilles Gouaillardet
60aa9cfcb6 hwloc: add support for hwloc v2 API
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:44 +09:00
Gilles Gouaillardet
9f29f3bff4 hwloc: since WHOLE_SYSTEM is no longer used, remove useless
checks related to offline and disallowed elements

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:21 +09:00
Gilles Gouaillardet
1a34224948 hwloc: do not set the HWLOC_TOPOLOGY_FLAG_WHOLE_SYSTEM flag
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:16 +09:00
Ralph Castain
fca68b070b Merge pull request #3934 from rhc54/topic/singleton
Fix the isolated pmix component. Cleanup the ess/singleton component …
2017-07-19 16:02:37 -05:00
Ralph Castain
543c16b28d Fix the isolated pmix component. Cleanup the ess/singleton component - we shouldn't be automatically discovering the local topology as that is now done on-demand.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-19 12:14:29 -07:00
Artem Polyakov
6cbea90209 Merge pull request #3932 from artpol84/rm_yoda_news/master
NEWS: fix removed legacy support section for v3.0.0
2017-07-19 08:48:49 -07:00
Howard Pritchard
503de6ca7d Merge pull request #3933 from hppritcha/topic/fix_s1_bug
pmix/s1: fix problems with ref counting in s1
2017-07-19 07:03:38 -06:00
Geoffrey Paulsen
71333a4b14 Transitioning ownership of rmaps/seq and rmaps/rank_file from Intel to IBM.
2017-07-18 21:31:01 -04:00
Gilles Gouaillardet
da34e2f109 ess/base: silence a warning
by fixing a static initializer

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-19 09:30:53 +09:00
Howard Pritchard
2fa0c4c6ec pmix/s1: fix problems with ref counting in s1
s1 pmix component wasn't doing proper ref counting

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-18 15:59:28 -06:00
Artem Polyakov
e34362de0a NEWS: fix removed legacy support section for v3.0.0
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-19 02:14:15 +07:00
Ralph Castain
83d21e3130 Merge pull request #3930 from rhc54/topic/signal
Fix signal forwarding on ORTE daemons
2017-07-18 13:44:37 -05:00
Ralph Castain
8a98aab6cc Fix signal forwarding on ORTE daemons so that _all_ daemons do it, regardless of environment. Add missing support for SIGTSTP and a few others.
Thanks to Eugene Dedits for reporting the problem.

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-18 09:58:55 -07:00
Josh Hursey
8688219091 Merge pull request #3775 from jjhursey/fix/mca_base_verbose-file
opal/mca: Fix mca_base_verbose file suffix processing
2017-07-18 10:14:42 -05:00
Edgar Gabriel
b363a0f4db Merge pull request #3908 from edgargabriel/pr/lustre-header-update
fs/lustre: update lustre header file used in the component
2017-07-18 09:57:11 -05:00
Gilles Gouaillardet
6d6f5fdfb8 Merge pull request #3894 from ggouaillardet/topic/pml_ob1_progress
pml/ob1: fix mca_pml_ob1_progress_needed usage
2017-07-18 10:43:44 +09:00
Artem Polyakov
bdb698a9b6 Merge pull request #3923 from artpol84/rm_yoda_news/master
Yoda removal (update docs)
2017-07-17 17:51:36 -07:00
Gilles Gouaillardet
1b46fe2d9a pml/ob1: fix mca_pml_ob1_progress_needed usage
correctly use OPAL_ATOMIC_ADD32() that returns the *new* value
and *not* the previous one.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-18 09:30:57 +09:00
Howard Pritchard
771f51af12 Merge pull request #3917 from hppritcha/topic/remove_cr_config_master
configure: remove CR/FT related options
2017-07-17 16:12:07 -06:00
Artem Polyakov
cc156a332b Sync NEWS with v3.0.x
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-18 05:05:40 +07:00
Artem Polyakov
a70d28cbb0 oshmem: Update README file to reflect available SPML options.
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-18 04:39:03 +07:00
Nathan Hjelm
5a5edfce88 Merge pull request #3876 from hppritcha/topic/add_psm2_get_stats
mtl/psm2: add pvar support for PSM2 MQ stats
2017-07-17 16:02:51 -05:00
Nathan Hjelm
2060fcf8bb mca/base: use the project name when registering pvars
References #3918. Close when applied to v2.0.x, v2.x, and v3.0.x.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-17 15:57:50 -05:00
Howard Pritchard
45e2771162 configure: remove CR/FT related options
As part of the process for addressing removal of CR/FT related
code from master (and hence from the 3.0.0 release), it was agreed
at the OMPI devel F2F on 7/13/17 that we'd break this into two
pieces:

1) remove the configure arguments (fewer changes)
2) remove all the CR/FT code, etc. in a subsequent bigger commit
    that may not make it into 3.0.0 in time.

By doing 1), the available configure options would not change
in a subsequent 3.0.x release if we end up not being able to do 2)
before 3.0.0 is released.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-17 13:48:59 -06:00
Nathan Hjelm
e5343c16c0 btl/vader: remove debug code that should not be in a release
References #3902. Close when in master, v3.0.x, and v2.x.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-17 11:58:47 -05:00
Edgar Gabriel
13b14f5efe Merge pull request #3906 from edgargabriel/pr/lazy_open_fix
common/ompio: fix the lazy_open flag
2017-07-17 10:58:13 -05:00
Edgar Gabriel
931c2021c7 Merge pull request #3905 from edgargabriel/pr/fix-grouping-options
io/ompio: fix grouping option
2017-07-17 10:55:57 -05:00
Edgar Gabriel
bc8f642211 fs/lustre: update lustre header file used in the component
liblustreapi.h is at this point deprecated. Switch to lustreapi.h instead

fixes issue #3223

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-07-17 10:33:52 -05:00
Nathan Hjelm
9b702fb9bd ompi: clean up topo helper functions
This commit removes the communicator topo helper functions in favor
of functions in mca/topo/base.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-17 10:30:42 -05:00
Edgar Gabriel
8e17827a13 common/ompio: fix the lazy_open flag
fixes an erroneous error code being returned when activating
the mca_io_ompio_sharedfp_lazy_open flag with MPI_MODE_APPEND.

fixes issue #3904

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-07-17 09:44:34 -05:00
Edgar Gabriel
4bdddfb74b io/ompio: fix grouping option
changing the value of mca_io_ompio_grouping_option led to a segfault due to
a double-free problem. Remove the erroneous free statements that have been introduced
and add a note ensuring that we are not re-adding them back at that spot.

fixes issue #3903

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-07-17 09:38:10 -05:00
Yossi Itigin
b0692c6836 Merge pull request #3833 from yosefe/topic/pml-yalla-dt-size-fix
pml/yalla: fix getting size of a continuous type.
2017-07-16 11:19:16 +03:00
Gilles Gouaillardet
6e35cfc19a btl/sm: fix misc memory leak
as reported by Coverity with CID 1415105

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-16 13:02:55 +09:00
Jeff Squyres
0cbd52b062 Merge pull request #3889 from jsquyres/pr/die-sm-btl-die-die-die
btl/sm: effectively delete the SM BTL
2017-07-15 17:09:31 -04:00
Jeff Squyres
5cf64e6555 btl/sm: effectively delete the SM BTL
If a user explicitly asks for the "sm" BTL, print a show_help message
saying that the SM BTL is dead, and the user should be using "vader".

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-07-15 09:33:08 -07:00
Artem Polyakov
0929c32cd8 Merge pull request #3893 from karasevb/yoda_spml_remove
Remove Yoda SPML
2017-07-15 08:47:31 -07:00
Artem Polyakov
b8ab4c1ad7 Merge pull request #3895 from artpol84/fix_btl_defaults/master
contrib: Fix mellanox platform defaults (btl/sm -> btl/vader)
2017-07-15 02:36:34 -05:00
Artem Polyakov
35f15a0ba5 contrib: Fix mellanox platform defaults (btl/sm -> btl/vader)
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-15 14:32:26 +07:00
Howard Pritchard
701a1d0218 mtl/psm2: add pvar support for PSM2 MQ stats
Add pvars for PSM2 MQ stats to help in analyzing performance
of Omnipath.

Tested (modestly) using modified OSU pt2pt benchmarks.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-14 10:31:35 -06:00
Gilles Gouaillardet
9124afbeae pmix: do not invoke PMIX_INFO_CREATE() with a zero size
Thanks Lisandro Dalcin for the report

Fixes open-mpi/ompi#3854

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-14 15:00:05 +09:00
Boris Karasev
77c50efb95 Yoda SPML is removed
Signed-off-by: Boris Karasev <karasev.b@gmail.com>
2017-07-14 08:47:16 +03:00
Artem Polyakov
4d3e22e815 Merge pull request #3870 from hppritcha/topic/repair_s2_launch
pmix/s2: fix srun native launch for pmi2
2017-07-13 12:45:22 -05:00
Josh Hursey
2726df8d41 Merge pull request #3866 from jjhursey/doc/ppc-xl-pgi-note
README: Note about ld issue for XL and PGI on PPC
2017-07-13 11:30:38 -05:00
Ryan Grant
0ce8590e7c Merge pull request #3837 from tkordenbrock/topic/master/get.retry.timeout
master: mtl-portals4: add timeout to rendezvous get fragments
2017-07-13 09:59:54 -06:00
Nathan Hjelm
6fb81f20e4 mtl/psm2: create mca variables to shadow PSM2 environment variables
This commit enables MCA support for the following PSM2 environment
variables: PSM2_DEVICES, PSM2_MEMORY, PSM2_MQ_SENDREQS_MAX,
PSM2_MQ_RECVREQS_MAX, PSM2_MQ_RNDV_HFI_THRESH,
PSM2_MQ_RNDV_SHM_THRESH, PSM2_RCVTHREAD, PSM2_SHAREDCONTEXTS,
PSM2_SHAREDCONTEXTS_MAX, and PSM2_TRACEMASK. These variables can be set
by MCA if they are not already set in the environment.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-13 09:48:46 -06:00
Howard Pritchard
aefb828bc5 README: say we don't support PGI on OS-X
[skip ci]
fixes #3859

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-13 09:40:16 -06:00
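
Commit 1b46fe2d9a above notes that OPAL_ATOMIC_ADD32() returns the new value and not the previous one. The sketch below illustrates that distinction with standard C11 atomics. It is not Open MPI code: the helper names fetch_and_add32, add_and_fetch32, and is_first_caller are hypothetical, and the actual macro definition does not appear in this log.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Fetch-and-add: returns the value held BEFORE the addition.
 * This is what C11 atomic_fetch_add() gives you directly. */
int32_t fetch_and_add32(_Atomic int32_t *addr, int32_t delta)
{
    return atomic_fetch_add(addr, delta);
}

/* Add-and-fetch: returns the value held AFTER the addition.
 * Per the commit message, this is the convention OPAL_ATOMIC_ADD32()
 * follows (assumption: modeled here on top of atomic_fetch_add). */
int32_t add_and_fetch32(_Atomic int32_t *addr, int32_t delta)
{
    return atomic_fetch_add(addr, delta) + delta;
}

/* Hypothetical "am I the first one in?" check on a counter that
 * starts at 0. With add-and-fetch semantics the first caller sees 1,
 * so the comparison must be against 1. Comparing against 0 would only
 * be right for fetch-and-add, which is the kind of mix-up the commit
 * message describes. */
int is_first_caller(_Atomic int32_t *counter)
{
    return add_and_fetch32(counter, 1) == 1;
}
```

With a counter initialized to 0, the first call to is_first_caller() returns nonzero and every later call returns 0, because only the first increment produces a new value of 1.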