1
1

30610 Коммитов

Автор SHA1 Сообщение Дата
Jeff Squyres
cdd3a9fbcc Update hwloc submodule to fix "make distcheck"
Hwloc upstream has fixed a problem with embedded "make distcheck" that
was breaking that surfaced when you ran autogen in an Open MPI
tarball.

This submodule update takes in the upstream hwloc fixes for this
issue.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-21 13:07:08 -08:00
Ralph Castain
820d9ac65c
Merge pull request #7447 from rhc54/topic/myen
Pickup change in check for deleted vs deprecated options and add deprecated ones
2020-02-21 11:30:00 -08:00
Ralph Castain
f9643b84b9
Merge pull request #7441 from rhc54/topic/hack
Create a hack to protect against non-integer jobids
2020-02-21 11:28:51 -08:00
Jeff Squyres
6d34b064be
Merge pull request #7446 from jsquyres/pr/remove-unused-compress-framework
Remove "compress" OPAL framework
2020-02-21 10:35:13 -08:00
Edgar Gabriel
df6e3e503a sharedfp/individual: defer error when not being able to open datafile
This commit changes the behavior of the individual sharedfp component. If
the component cannot create either the datafile or the metadatafile during File_open,
no error is being raised going forward. This allows applications that do not use shared
file pointer operations to continue execution without any issue.

If the user however subsequently calls MPI_File_write_shared or similar operations, an error
will be raised.

Fixes issue #7429

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2020-02-21 12:13:39 -06:00
Ralph Castain
fae5965506
Merge pull request #7445 from rhc54/topic/tests
Save the old ORTE simple tests
2020-02-21 08:48:30 -08:00
Ralph Castain
8d66045e95
Deprecate the enable-orterun-prefix-by-default options
Mark the --enable-orterun-prefix-by-default and
--enable-mpirun-prefix-by-default options as deprecated, but continue to
honor them by translating them to the new
--enable-prte-prefix-by-default option.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 08:02:13 -08:00
Jeff Squyres
3bfc7a7b62
Tweak: rename "deprecated" --> "deleted"
We're dealing with CLI options that have been deleted, not
deprecated.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 5aea4446288db9207fbc60f23ee903b149217121)
2020-02-21 07:33:05 -08:00
Jeff Squyres
66da0c6361 Remove "compress" OPAL framework
This framework is no longer used.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-21 06:28:16 -08:00
Ralph Castain
7e2874a83d
Save the old ORTE simple tests
Useful when debugging RTE-related issues

Not for inclusion in the tarball - just added to git repo for use by
developers.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 06:15:06 -08:00
Ralph Castain
829fd478b3
Create a hack to protect against non-integer jobids
If someone gives us a namespace that doesn't easily translate to an
integer, we have to create a mechanism for working around the
disconnect. PRRTE has been updated to give us a flag so we know we were
"natively" launched. If we don't see it, then fall back to generating a
hash of the nspace as our jobid. We then have to translate back/forth
between nspace and jobid using a lookup table.

Probably not the right long-term solution, but hopefully helps get us
thru for a bit.

Includes update of PRRTE pointer

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 06:04:55 -08:00
Thomas Jahns
8ecbe1ce97 Fix incorrect argument in manual page.
Signed-off-by: Thomas Jahns <jahns@dkrz.de>
2020-02-21 11:25:38 +01:00
Ralph Castain
3366f3ec40
Merge pull request #7436 from rhc54/topic/pstat
Remove the unused opal/pstat framework
2020-02-20 10:52:15 -08:00
Austen Lauria
dd5991f513
Merge pull request #7204 from devreal/shmwin_contig
Correctly set baseptr in contiguous shared memory window with local size zero
2020-02-20 13:22:40 -05:00
Ralph Castain
254dd2288a
Remove the unused opal/pstat framework
ORTE was the only one who used it, and ORTE is...gone!

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-20 09:23:40 -08:00
Ralph Castain
e8e089c681
Merge pull request #7435 from rhc54/topic/agn
Update PRRTE and PMIx pointers
2020-02-20 02:56:29 -08:00
Ralph Castain
b13c697d53
Update PRRTE and PMIx pointers
- remove stale s390 and MIPS atomics
- ensure envars from spawn are propagated
- fix make tarball
- ensure cleanup of default hostfile

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-19 22:51:19 -08:00
Ralph Castain
18ea9a13d8
Merge pull request #7431 from rhc54/topic/dep
Deprecate the --with-pmi option
2020-02-19 19:48:02 -08:00
Josh Hursey
c4e1b657ba
Merge pull request #7432 from jjhursey/automake-jobs
Add a -j option to autogen.pl to set AUTOMAKE_JOBS
2020-02-19 16:01:28 -08:00
Ralph Castain
7ebaee7437
Deprecate the --with-pmi option
Per the developer's meeting, add detection of the deprecated --with-pmi
(and its associated --with-pmi-libdir) configure option and error out
with a polite note of the change in support

Since "--with-pmi" now shows in the configure help output, mark the help
string with a giant *DEPRECATED* to warn users not to use it

Signed-off-by: Ralph Castain <rhc@pmix.org>

Ma

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-19 14:39:45 -08:00
Nathan Hjelm
c8ed75bba5
Merge pull request #7426 from hjelmn/opal_atomics_clean_out_sparcv9_as_it_is_not_supported
opal/asm: remove MIPS
2020-02-19 12:35:09 -08:00
Ralph Castain
58157f179d
Merge pull request #7430 from rhc54/topic/up
Update PRRTE and PMIx pointers
2020-02-19 12:16:32 -08:00
Joshua Hursey
f01df73853 Add a -j option to autogen.pl to set AUTOMAKE_JOBS
* `AUTOMAKE_JOBS` can improve the performance to `autogen.pl`
 * The user can set this envar in the environment before calling
   `autogen.pl` or use the new `-j #` option to set it.

Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2020-02-19 14:01:47 -05:00
Brice Goglin
a0ea5abec8 hwloc: clarify the error message when infiniband/verbs.h is missing but hwloc's verbs support is requested
Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>
2020-02-19 19:01:36 +01:00
Brice Goglin
b5df92a201 hwloc: remove a stale configure hack for hwloc 1.3.2 vs suse pci issues
opal_hwloc_hwloc132_save_enable_pci doesn't exist anymore

Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>
2020-02-19 19:01:11 +01:00
Ralph Castain
16f0adf7ec
Update PRRTE and PMIx pointers
- fix LSF compile issue
- protect against NULL user home directory
- update reachable component in PRRTE (still unused)

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-19 09:27:48 -08:00
Nathan Hjelm
b2f17241b3
Merge branch 'master' into opal_atomics_clean_out_sparcv9_as_it_is_not_supported 2020-02-19 08:16:46 -08:00
Geoff Paulsen
960c5f736f
Merge pull request #7425 from hjelmn/opal_atomics_clean_out_s390_as_it_is_not_supported
opal/atomic: clean out s390(x)
2020-02-19 07:27:29 -06:00
Nathan Hjelm
843fdc068c README: update tested systems
These days Open MPI is being tested with PPC64 and ARMv8 reguarly. Move
these to the tested list. Also call out the version of GCC that is
reguarly tested.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-18 21:42:54 -08:00
Nathan Hjelm
9eb5ef92da opal/asm: remove MIPS
This commit removes the code specific to MIPS. This architecture
has been unsupported for some time. Open MPI will continue to work
on MIPS with C11 and __atomic but will not longer use CMA for
shared memory.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-18 21:35:40 -08:00
Nathan Hjelm
3d1495510c opal/atomic: clean out s390(x)
This commit removes the CMA support for s390 and s390x. These
architectures have been unsupported for awhile and no one has
verified that CMA actually works with Open MPI on these systems.

s390 and s390x will continue to work with Open MPI without CMA.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-18 21:14:10 -08:00
Nathan Hjelm
a0567b51ba
Merge pull request #7421 from hjelmn/osc_rdma_fix_one_more_issue_with_nondebug_builds_because_assert_is_optd_out
osc/rdma: fix bug in attach for non-debug builds
2020-02-18 20:25:23 -08:00
Brice Goglin
06219648fc hwloc: remove unused xml configure-time check
It's not used, and XML is always enabled anyway

Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>
2020-02-18 22:05:29 +01:00
Nathan Hjelm
8ee80d8855 osc/rdma: fix bug in attach for non-debug builds
This commit fixes an issue with non-debug builds where adding an
attachment to the attachment list doesn't actually happen. This
causes all MPI_Win_detach calls to fail. The call was within an
assert which is optimized out in optimized builds.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-18 12:59:46 -08:00
Ralph Castain
0276679595
Merge pull request #7418 from rhc54/topic/up
Update PRRTE and PMIx pointers
2020-02-18 00:42:49 -08:00
Ralph Castain
82c71fae78
Update PRRTE and PMIx pointers
- Fix VPATH installs
- Protect against NULL home directories

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-17 21:45:22 -08:00
Nathan Hjelm
73469cb847
Merge pull request #7416 from hjelmn/lets_crash_on_debug_builds_if_the_user_home_directory_is_not_available
opal/mca: check if the user home directory is NULL
2020-02-17 17:50:07 -07:00
Nathan Hjelm
8197efa021 opal/mca: check if the user home directory is NULL
This commit fixes an issue in the MCA base variable system. The
code was retrieving the user home directory (from HOME) and
attempting to use it to build a search path for config files.
In this case user-level configuration directories have been
enabled so the appropriate thing to do is to print an error
message and return. This commit makes that change. It does not
ensure that HOME is set correctly.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-17 14:51:46 -08:00
Nathan Hjelm
eeb3d7f845
Merge pull request #7387 from hjelmn/osc_rdma_allow_overlapping_registration_regions_and_return_the_correct_error_code_when_regions_overlap
osc/rdma: modify attach to check for region overlap
2020-02-17 14:26:53 -07:00
Jeff Squyres
3b7fd5ad0f
Merge pull request #7404 from jsquyres/pr/m4-holy-hell
opal_setup_cli.m4: do not escape $
2020-02-17 13:05:55 -08:00
Nathan Hjelm
54c8233f4f osc/rdma: bump the default max dynamic attachments to 64
This commit increaes the osc_rdma_max_attach variable from 32
to 64. The new default is kept low due to the small number
of registration resources on some systems (Cray Aries). A
larger max attachement value can be set by the user on other
systems.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-16 17:09:20 -08:00
Nathan Hjelm
6649aef8bd osc/rdma: modify attach to check for region overlap
This commit addresses two issues in osc/rdma:

 1) It is erroneous to attach regions that overlap. This was being
    allowed but the standard does not allow overlapping attachments.

 2) Overlapping registration regions (4k alignment of attachments)
    appear to be allowed. Add attachment bases to the bookeeping
    structure so we can keep better track of what can be detached.

It is possible that the standard did not intend to allow #2. If that
is the case then #2 should fail in the same way as #1. There should
be no technical reason to disallow #2 at this time.

References #7384

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-16 17:09:06 -08:00
Ralph Castain
731b7e89e3
Merge pull request #7408 from rhc54/topic/myup
Update PRRTE and PMIx
2020-02-16 07:32:57 -08:00
Ralph Castain
274fba3126
Update PRRTE and PMIx
Correct platform file support
Fix configure cli capture to silence warning

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-15 19:38:34 -08:00
Ralph Castain
796886206b
Merge pull request #7406 from rhc54/topic/p2
Enable build against PMIx v2.2 without internal PRRTE
2020-02-15 18:58:25 -08:00
Ralph Castain
edaf9160ae
Enable build against PMIx v2.2 without internal PRRTE
If you autogen.pl --without-prrte, we wouldn't configure or build PRRTE
support. However, configuring with --disable-internal-rte wasn't working
as it was being ignored. This led to some false errors when compiling
with an earlier PMIx v2.2 release.

That said, there were a couple of places that needed protection against
PMIx v2.2.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-15 16:28:16 -08:00
Jeff Squyres
04c50c668e opal_setup_cli.m4: do not escape $
We do not want to escape $, because the resulting quoted string ends
up in C code, and "\$" is not recognized by printf (and some compilers
warn about it).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-15 11:50:16 -08:00
Ralph Castain
8eebeee9bf
Merge pull request #7403 from rhc54/topic/plt
Provide hooks for PRRTE and PMIx platform files
2020-02-15 10:59:19 -08:00
Ralph Castain
344346f27e
Provide hooks for PRRTE and PMIx platform files
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-15 08:40:56 -08:00
Ralph Castain
4bdb5a8103
Merge pull request #7402 from rhc54/topic/up3
Resolve the PMIx v3 incompatibility
2020-02-15 06:05:08 -08:00