1
1
Граф коммитов

30448 Коммитов

Автор SHA1 Сообщение Дата
Howard Pritchard
31d7748afd
Merge pull request #7434 from hppritcha/topic/fix_a_config_with_ext_pmix_prob
fix an issue with configuring with external pmix
2020-02-27 12:19:02 -07:00
Jeff Squyres
af1ec9a594
Merge pull request #7323 from bosilca/fix/7320
Trap wrong parameters to MPI_Init_thread.
2020-02-27 06:28:44 -05:00
Jeff Squyres
19acb32a3f
Merge pull request #7484 from yanagibashi/pr/fix-typos-and-descriptions
man: fix typos and descriptions.
2020-02-27 06:24:50 -05:00
Jeff Squyres
85db54969e
Merge pull request #7422 from bgoglin/hwloc-cleanup
minor hwloc configure fixes
2020-02-27 06:20:29 -05:00
Jeff Squyres
ae7f4f46f6
Merge pull request #7464 from artemry-mlnx/artemry-mlnx/disable-per-commit-ci
Disabled Mellanox Open MPI per-commit CI (as redundant).
2020-02-27 06:19:29 -05:00
Tsubasa Yanagibashi
b604f1f1fe add a description in MPI_WIN_DETACH man page.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:19:55 +09:00
Tsubasa Yanagibashi
070d4c15bc update a description in MPI_Request_free man page.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:17:44 +09:00
Tsubasa Yanagibashi
6c342aef68 fix some typos and spacing in man pages.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:14:26 +09:00
KAWASHIMA Takahiro
a9b16299c9
Merge pull request #7479 from yanagibashi/pr/fix-opal-initialized-ref-counter
opal: Fix opal_initialized reference counter
2020-02-27 08:49:02 +09:00
Jeff Squyres
ba1f016508
Merge pull request #7428 from hjelmn/finally_kill_the_old_cpp_bindings
ompi: remove obsolete c++ bindings
2020-02-26 17:40:25 -05:00
Nathan Hjelm
9cc0f6348d
Merge pull request #7453 from hjelmn/purge_sparc_v9_atomic_support_in_favor_of_just_builtins_for_this_platform
Purge Sparc v9 and sync atomics.
2020-02-26 14:39:46 -08:00
Nathan Hjelm
0b8baa217d ompi: remove obsolete c++ bindings
This commit contains the following changes:

The C++ bindings were removed from the standard in MPI-3.0. This
commit removes the entirety of the C++ bindings as well as the
support configury.

Removes all references to C++ from the man pages. This includes the
bindings themselves, all references to what C++ bindings return,
all not-available comments, and differences between C++ and other
language bindings.

If the user passes --enable-mpi-cxx, --enable-mpi-cxx-seek, or
--enable-cxx-exceptions, print a warning message an abort configure.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:55 -08:00
Nathan Hjelm
2277453737 opal: fix c11 compare-exchange 128-bit check
There were too many arguments being passed to the compare_exchange_strong
macro. This commit removes the extra arguments and restores the
functionality.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
Nathan Hjelm
489c0840d1 asm: cleanup
Remove ASM formats as they have not been used in some time.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
Nathan Hjelm
d2dd27b008 asm: remove support for Sparc v9
This commit removes the specialized support for Sparc v9 as the
architecture is unsupported. The architecture will continue to
work without CMA and using the GCC built-in atomic support.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
Raafat Feki
ceb52126a8
Merge pull request #7476 from raafatfeki/topic/gpfs
Topic/gpfs
2020-02-26 10:11:38 -06:00
Nathan Hjelm
038dcad8b5 asm: remove support for __sync built-in atomics
This commit removes the unsupported __sync built-in atomics in
favor of the GCC built-ins. The priority order (if not modified
by configure flags) is: C11, custom atomics
(opal/include/opal/sys/*), then GCC built-ins.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:30:34 -08:00
Nathan Hjelm
547d6c4380 asm: remove ARMv4 and ARMv5 CMA support
This commit removes the reference to ARMv4 and ARMv5. These are
unsupported architectures and no one is testing if CMA is working
on these systems.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:25:10 -08:00
Nathan Hjelm
65a096116f opal: remove remaining atomic references to IA64
IA64 atomic support was deleted some time ago. Some of the references
to the architecture were not removed when the atomic support was. This
commit removes those lingering references. IA64 will continue to work
unsupported with the built-in atomics.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:22:37 -08:00
Tsubasa Yanagibashi
7d5fbcfd76 opal: Fix opal_initialized reference counter
Before this change, the reference counters `opal_util_initialized`
and `opal_initialized` were incremented at the beginning of the
`opal_init_util` and the `opal_init` functions respectively.
In other words, they were incremented before fully initialized.

This causes the following program to abort by SIGFPE if
`--enable-timing` is enabled on `configure`.

```c
// need -lm option on link

int main(int argc, char *argv[])
{
    // raise SIGFPE on division-by-zero
    feenableexcept(FE_DIVBYZERO);
    MPI_Init(&argc, &argv);
    MPI_Finalize();
    return 0;
}
```

The logic of the SIGFPE is:

1. `MPI_Init` calls `opal_init` through `ompi_rte_init`.
2. `opal_init` changes the value of `opal_initialized` to 1.
3. `opal_init` calls `opal_init_util`.
4. `opal_init_util` calls `opal_timing_ts_func` through
   `OPAL_TIMING_ENV_INIT`, and `opal_timing_ts_func` returns
   `get_ts_cycle` instead of `get_ts_gettimeofday` because
   `opal_initialized` to 1.
   (This is the problem)
5. `opal_init_util` calls `get_ts_cycle` through
   `OPAL_TIMING_ENV_INIT`.
6. `get_ts_cycle` executes
   `opal_timer_base_get_cycles()) / opal_timer_base_get_freq()`
   and it raises SIGFPE (division-by-zero) because the OPAL TIMER
   framework is not initialized yet and `opal_timer_base_get_freq`
   returns 0.

This commit changes the increment timing of `opal_util_initialized`
and `opal_initialized` to the end of `opal_init_util` and the
`opal_init` functions respectively.

Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-26 14:09:19 +09:00
raafatfeki
f46cfc120d fs/gpfs: Solve issues while setting GPFS hints
1- Remove the common symbols issue: global variable not initialized. (#7424)
Move the variables to local scope within the set_info function.
2- Remove GPFS hints using datashipping: not used anymore
3- Redirect output stream to corresponding fs framework.

Signed-off-by: raafatfeki <fekiraafat@gmail.com>
2020-02-25 18:14:20 -05:00
raafatfeki
9ba6ab8209 mca/fs: Check the existence of communicator in file query
The communicator might be not existent yet when mca_fs_gpfs_component_file_query() is called.
Therefore, we need to check it first before calling brodcast function.

Signed-off-by: raafatfeki <fekiraafat@gmail.com>
2020-02-25 16:26:46 -05:00
Jeff Squyres
f496f256cd
Merge pull request #7470 from jsquyres/pr/die-sm-btl-die-die-die
btl/sm: remove the deprecation-notice shell
2020-02-25 15:22:12 -05:00
Jeff Squyres
cdf478e963 btl/sm: remove the deprecation-notice shell
The SM BTL was effectively removed a long time ago.  All that was left
was a shell that warned people if they tried to use the SM BTL.  For
v5.0, we plan to finally remove this ancient shell (and possibly
replace it with vader).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-25 11:48:42 -05:00
Artem Ryabov
c5a5c7e1d3 Disabled Mellanox Open MPI per-commit CI (as redundant).
The CI is triggered only upon a PR creation or by special PR comments.

Signed-off-by: Artem Ryabov <artemry@mellanox.com>
2020-02-25 00:58:01 +03:00
Geoff Paulsen
207b267135
Merge pull request #7455 from rhc54/topic/warn
Silence a bunch of warnings
2020-02-24 08:55:30 -06:00
Geoff Paulsen
6f28a18f4e
Merge pull request #7444 from tjahns/master
Fix incorrect argument in manual page.
2020-02-24 08:40:04 -06:00
Jeff Squyres
3bf1ba527c
Merge pull request #7456 from artemry-mlnx/artemry-mlnx/reduce_mellanox_ci_time
Mellanox Open MPI CI: optimized git checkout step to reduce CI duration
2020-02-24 09:17:27 -05:00
Jeff Squyres
4fb7e01158
Merge pull request #7427 from hjelmn/update_readme_architectures
README: update tested systems
2020-02-24 09:16:36 -05:00
Ralph Castain
89f3418c2c
Merge pull request #7459 from rhc54/topic/cid
Fix comm_spawn
2020-02-23 15:29:44 -08:00
Edgar Gabriel
28776c5d95
Merge pull request #7448 from edgargabriel/topic/individual-as-dummy-module
sharedfp/individual: defer error when not being able to open datafile
2020-02-23 16:35:23 -06:00
Ralph Castain
b35b0f7897
Fix comm_spawn
Use the correct data type in the CID exchange

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-23 13:51:22 -08:00
Howard Pritchard
488f656c11 fix an issue with configuring with external pmix
External pmix installs are frequently in non-standard locations and
the path to their shared libraries are not ldconfig'd in because
there may be multiple pmix installs.

The way the configury was set up prior to this patch, the configuration
would fail soon after the PMIX config stuff was called because it
added some pmix lib stuff to the LDFLAGS, resulting in configury tests
for things that require running the configure test to fail.

This patch avoids this problem by resetting the LDFLAGS and LIBS back
to what they were prior to the run of the external PMIX detection.

The CFLAGS setting is left because there are many places in the ompi
and opal source code where pmix_common.h needs to be included.

Signed-off-by: Howard Pritchard <hppritcha@gmail.com>
2020-02-23 13:04:24 -07:00
Ralph Castain
a01884a6a0
Clean out ORTE references from .gitignore
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:24:18 -08:00
Artem Ryabov
f638f4aa6a Mellanox Open MPI CI: optimized git checkout step to reduce CI duration
Signed-off-by: Artem Ryabov <artemry@mellanox.com>
2020-02-23 00:21:27 +03:00
Ralph Castain
dcf110d432
Add missing Makefile
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:17:34 -08:00
Ralph Castain
86de81baca
Silence a bunch of warnings
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:05:28 -08:00
Ralph Castain
9cfdc381d3
Merge pull request #7454 from rhc54/topic/orte
Remove lingering ORTE references
2020-02-22 12:05:39 -08:00
Ralph Castain
76b9c15825
Remove lingering ORTE references
Wrapper compiler is trying to link in a libopen-rte. Man pages are
setting an ORTE release date.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 07:59:19 -08:00
Jeff Squyres
7c76237e0d
Merge pull request #7449 from jsquyres/pr/update-hwloc-to-fix-make-dist
Update hwloc submodule to fix "make distcheck"
2020-02-22 05:26:04 -08:00
Ralph Castain
ae35a385a0
Merge pull request #7452 from rhc54/topic/dang
Add missing update to PRRTE
2020-02-21 21:28:17 -08:00
Ralph Castain
13fb44cbbe
Add missing update to PRRTE
Needed to fix ninghtly tarball generator

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 20:09:21 -08:00
Ralph Castain
b1fff49f50
Merge pull request #7451 from rhc54/topic/slurm
Fix Slurm process name
2020-02-21 16:55:49 -08:00
Ralph Castain
06c8a30415
Merge pull request #7450 from rhc54/topic/mips
Remove stale reference to MIPS
2020-02-21 16:55:39 -08:00
Ralph Castain
4c56a7744a
Fix Slurm process name
Ensure that we truncate the local jobid to 15-bits

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 15:46:53 -08:00
Ralph Castain
4960b6b76a
Remove stale reference to MIPS
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 15:35:17 -08:00
Jeff Squyres
cdd3a9fbcc Update hwloc submodule to fix "make distcheck"
Hwloc upstream has fixed a problem with embedded "make distcheck" that
was breaking that surfaced when you ran autogen in an Open MPI
tarball.

This submodule update takes in the upstream hwloc fixes for this
issue.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-21 13:07:08 -08:00
Ralph Castain
820d9ac65c
Merge pull request #7447 from rhc54/topic/myen
Pickup change in check for deleted vs deprecated options and add deprecated ones
2020-02-21 11:30:00 -08:00
Ralph Castain
f9643b84b9
Merge pull request #7441 from rhc54/topic/hack
Create a hack to protect against non-integer jobids
2020-02-21 11:28:51 -08:00
Jeff Squyres
6d34b064be
Merge pull request #7446 from jsquyres/pr/remove-unused-compress-framework
Remove "compress" OPAL framework
2020-02-21 10:35:13 -08:00