30437 Commits

Author SHA1 Message Date
Tsubasa Yanagibashi
b604f1f1fe add a description in MPI_WIN_DETACH man page.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:19:55 +09:00
Tsubasa Yanagibashi
070d4c15bc update a description in MPI_Request_free man page.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:17:44 +09:00
Tsubasa Yanagibashi
6c342aef68 fix some typos and spacing in man pages.
Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-27 18:14:26 +09:00
KAWASHIMA Takahiro
a9b16299c9
Merge pull request #7479 from yanagibashi/pr/fix-opal-initialized-ref-counter
opal: Fix opal_initialized reference counter
2020-02-27 08:49:02 +09:00
Jeff Squyres
ba1f016508
Merge pull request #7428 from hjelmn/finally_kill_the_old_cpp_bindings
ompi: remove obsolete c++ bindings
2020-02-26 17:40:25 -05:00
Nathan Hjelm
9cc0f6348d
Merge pull request #7453 from hjelmn/purge_sparc_v9_atomic_support_in_favor_of_just_builtins_for_this_platform
Purge Sparc v9 and sync atomics.
2020-02-26 14:39:46 -08:00
Nathan Hjelm
0b8baa217d ompi: remove obsolete c++ bindings
This commit contains the following changes:

The C++ bindings were removed from the standard in MPI-3.0. This
commit removes the entirety of the C++ bindings as well as the
support configury.

Removes all references to C++ from the man pages. This includes the
bindings themselves, all references to what C++ bindings return,
all not-available comments, and differences between C++ and other
language bindings.

If the user passes --enable-mpi-cxx, --enable-mpi-cxx-seek, or
--enable-cxx-exceptions, print a warning message and abort configure.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:55 -08:00
Nathan Hjelm
2277453737 opal: fix c11 compare-exchange 128-bit check
There were too many arguments being passed to the compare_exchange_strong
macro. This commit removes the extra arguments and restores the
functionality.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
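For context on commit 2277453737 above: the standard C11 `atomic_compare_exchange_strong` takes exactly three arguments (the atomic object, a pointer to the expected value, and the desired value). The following is a minimal sketch using a 64-bit value for portability rather than the 128-bit case the commit addresses; `demo_cas64` is a hypothetical wrapper, not an Open MPI function.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical wrapper (not part of Open MPI): forwards to the C11
 * generic function with its three arguments. On failure, *expected is
 * overwritten with the value currently stored in *obj. */
static bool demo_cas64(_Atomic uint64_t *obj, uint64_t *expected,
                       uint64_t desired)
{
    assert(obj != NULL && expected != NULL);
    return atomic_compare_exchange_strong(obj, expected, desired);
}
```

A caller would typically loop on a failed exchange, reusing the refreshed `*expected` value to recompute `desired`.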
Nathan Hjelm
489c0840d1 asm: cleanup
Remove ASM formats as they have not been used in some time.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
Nathan Hjelm
d2dd27b008 asm: remove support for Sparc v9
This commit removes the specialized support for Sparc v9 as the
architecture is unsupported. The architecture will continue to
work without CMA and using the GCC built-in atomic support.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 13:04:04 -08:00
Raafat Feki
ceb52126a8
Merge pull request #7476 from raafatfeki/topic/gpfs
Topic/gpfs
2020-02-26 10:11:38 -06:00
Nathan Hjelm
038dcad8b5 asm: remove support for __sync built-in atomics
This commit removes the unsupported __sync built-in atomics in
favor of the GCC built-ins. The priority order (if not modified
by configure flags) is: C11, custom atomics
(opal/include/opal/sys/*), then GCC built-ins.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:30:34 -08:00
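As an illustration of the GCC built-ins that commit 038dcad8b5 above prefers over the legacy `__sync` family, here is a minimal sketch of an atomic fetch-and-add using `__atomic_fetch_add`. The wrapper name `demo_fetch_add` is hypothetical and does not reflect how Open MPI wraps these built-ins; the sketch assumes a GCC- or Clang-compatible compiler.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical wrapper (not an Open MPI symbol): atomically adds delta
 * to *addr and returns the value *addr held before the addition.
 * The legacy equivalent would be __sync_fetch_and_add(addr, delta). */
static int64_t demo_fetch_add(int64_t *addr, int64_t delta)
{
    assert(addr != NULL);
    return __atomic_fetch_add(addr, delta, __ATOMIC_SEQ_CST);
}
```

Unlike the `__sync` built-ins, the `__atomic` family takes an explicit memory-order argument, which is why it maps more directly onto the C11 memory model.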
Nathan Hjelm
547d6c4380 asm: remove ARMv4 and ARMv5 CMA support
This commit removes the reference to ARMv4 and ARMv5. These are
unsupported architectures and no one is testing if CMA is working
on these systems.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:25:10 -08:00
Nathan Hjelm
65a096116f opal: remove remaining atomic references to IA64
IA64 atomic support was deleted some time ago. Some of the references
to the architecture were not removed when the atomic support was. This
commit removes those lingering references. IA64 will continue to work
unsupported with the built-in atomics.

Signed-off-by: Nathan Hjelm <hjelmn@google.com>
2020-02-26 06:22:37 -08:00
Tsubasa Yanagibashi
7d5fbcfd76 opal: Fix opal_initialized reference counter
Before this change, the reference counters `opal_util_initialized`
and `opal_initialized` were incremented at the beginning of the
`opal_init_util` and the `opal_init` functions respectively.
In other words, they were incremented before initialization had completed.

This causes the following program to abort with SIGFPE if
`--enable-timing` is passed to `configure`.

```c
// need -lm option on link
#define _GNU_SOURCE  // feenableexcept is a GNU extension
#include <fenv.h>
#include <mpi.h>

int main(int argc, char *argv[])
{
    // raise SIGFPE on division-by-zero
    feenableexcept(FE_DIVBYZERO);
    MPI_Init(&argc, &argv);
    MPI_Finalize();
    return 0;
}
```

The sequence leading to the SIGFPE is:

1. `MPI_Init` calls `opal_init` through `ompi_rte_init`.
2. `opal_init` changes the value of `opal_initialized` to 1.
3. `opal_init` calls `opal_init_util`.
4. `opal_init_util` calls `opal_timing_ts_func` through
   `OPAL_TIMING_ENV_INIT`, and `opal_timing_ts_func` returns
   `get_ts_cycle` instead of `get_ts_gettimeofday` because
   `opal_initialized` is 1.
   (This is the problem)
5. `opal_init_util` calls `get_ts_cycle` through
   `OPAL_TIMING_ENV_INIT`.
6. `get_ts_cycle` executes
   `opal_timer_base_get_cycles() / opal_timer_base_get_freq()`
   and it raises SIGFPE (division-by-zero) because the OPAL TIMER
   framework is not initialized yet and `opal_timer_base_get_freq`
   returns 0.

This commit moves the increments of `opal_util_initialized` and
`opal_initialized` to the end of the `opal_init_util` and `opal_init`
functions respectively.

Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>
2020-02-26 14:09:19 +09:00
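The ordering problem described in commit 7d5fbcfd76 above can be sketched in isolation. This is a simplified model, not the Open MPI code: every `demo_*` name is hypothetical, and the "timer framework" is reduced to a single frequency variable that stays 0 until initialization finishes.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins; these are not the actual Open MPI symbols. */
static int  demo_initialized = 0;  /* reference counter */
static long demo_timer_freq  = 0;  /* 0 until the timer is set up */

/* Mirrors step 4: the cycle-based timestamp source is chosen whenever
 * the counter says initialization already happened. */
static bool demo_use_cycle_timer(void)
{
    return demo_initialized > 0;
}

/* Runs a mock init. Returns true if the cycle timer would have been
 * selected while demo_timer_freq was still 0, which is the case that
 * leads to the division by zero. */
static bool demo_init(bool increment_first)
{
    bool picked_cycle_early;

    if (increment_first) {
        demo_initialized++;            /* old ordering: count up front */
    }
    picked_cycle_early = demo_use_cycle_timer() && demo_timer_freq == 0;
    demo_timer_freq = 1000000;         /* timer is usable from here on */
    if (!increment_first) {
        demo_initialized++;            /* new ordering: count at the end */
    }
    assert(demo_timer_freq != 0);
    return picked_cycle_early;
}

/* Restores the mock state so both orderings can be compared. */
static void demo_reset(void)
{
    demo_initialized = 0;
    demo_timer_freq = 0;
}
```

With `increment_first` set, the mock reports that the unsafe timer path was taken during initialization; with the increment moved to the end, it does not.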
raafatfeki
f46cfc120d fs/gpfs: Solve issues while setting GPFS hints
1- Remove the common symbols issue: global variable not initialized. (#7424)
Move the variables to local scope within the set_info function.
2- Remove GPFS hints using datashipping: not used anymore
3- Redirect output stream to corresponding fs framework.

Signed-off-by: raafatfeki <fekiraafat@gmail.com>
2020-02-25 18:14:20 -05:00
raafatfeki
9ba6ab8209 mca/fs: Check the existence of communicator in file query
The communicator might not exist yet when mca_fs_gpfs_component_file_query() is called.
Therefore, we need to check it first before calling the broadcast function.

Signed-off-by: raafatfeki <fekiraafat@gmail.com>
2020-02-25 16:26:46 -05:00
Jeff Squyres
f496f256cd
Merge pull request #7470 from jsquyres/pr/die-sm-btl-die-die-die
btl/sm: remove the deprecation-notice shell
2020-02-25 15:22:12 -05:00
Jeff Squyres
cdf478e963 btl/sm: remove the deprecation-notice shell
The SM BTL was effectively removed a long time ago.  All that was left
was a shell that warned people if they tried to use the SM BTL.  For
v5.0, we plan to finally remove this ancient shell (and possibly
replace it with vader).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-25 11:48:42 -05:00
Geoff Paulsen
207b267135
Merge pull request #7455 from rhc54/topic/warn
Silence a bunch of warnings
2020-02-24 08:55:30 -06:00
Geoff Paulsen
6f28a18f4e
Merge pull request #7444 from tjahns/master
Fix incorrect argument in manual page.
2020-02-24 08:40:04 -06:00
Jeff Squyres
3bf1ba527c
Merge pull request #7456 from artemry-mlnx/artemry-mlnx/reduce_mellanox_ci_time
Mellanox Open MPI CI: optimized git checkout step to reduce CI duration
2020-02-24 09:17:27 -05:00
Jeff Squyres
4fb7e01158
Merge pull request #7427 from hjelmn/update_readme_architectures
README: update tested systems
2020-02-24 09:16:36 -05:00
Ralph Castain
89f3418c2c
Merge pull request #7459 from rhc54/topic/cid
Fix comm_spawn
2020-02-23 15:29:44 -08:00
Edgar Gabriel
28776c5d95
Merge pull request #7448 from edgargabriel/topic/individual-as-dummy-module
sharedfp/individual: defer error when not being able to open datafile
2020-02-23 16:35:23 -06:00
Ralph Castain
b35b0f7897
Fix comm_spawn
Use the correct data type in the CID exchange

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-23 13:51:22 -08:00
Ralph Castain
a01884a6a0
Clean out ORTE references from .gitignore
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:24:18 -08:00
Artem Ryabov
f638f4aa6a Mellanox Open MPI CI: optimized git checkout step to reduce CI duration
Signed-off-by: Artem Ryabov <artemry@mellanox.com>
2020-02-23 00:21:27 +03:00
Ralph Castain
dcf110d432
Add missing Makefile
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:17:34 -08:00
Ralph Castain
86de81baca
Silence a bunch of warnings
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 13:05:28 -08:00
Ralph Castain
9cfdc381d3
Merge pull request #7454 from rhc54/topic/orte
Remove lingering ORTE references
2020-02-22 12:05:39 -08:00
Ralph Castain
76b9c15825
Remove lingering ORTE references
Wrapper compiler is trying to link in a libopen-rte. Man pages are
setting an ORTE release date.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-22 07:59:19 -08:00
Jeff Squyres
7c76237e0d
Merge pull request #7449 from jsquyres/pr/update-hwloc-to-fix-make-dist
Update hwloc submodule to fix "make distcheck"
2020-02-22 05:26:04 -08:00
Ralph Castain
ae35a385a0
Merge pull request #7452 from rhc54/topic/dang
Add missing update to PRRTE
2020-02-21 21:28:17 -08:00
Ralph Castain
13fb44cbbe
Add missing update to PRRTE
Needed to fix nightly tarball generator

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 20:09:21 -08:00
Ralph Castain
b1fff49f50
Merge pull request #7451 from rhc54/topic/slurm
Fix Slurm process name
2020-02-21 16:55:49 -08:00
Ralph Castain
06c8a30415
Merge pull request #7450 from rhc54/topic/mips
Remove stale reference to MIPS
2020-02-21 16:55:39 -08:00
Ralph Castain
4c56a7744a
Fix Slurm process name
Ensure that we truncate the local jobid to 15 bits

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 15:46:53 -08:00
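Truncating an identifier to 15 bits, as commit 4c56a7744a above describes, amounts to masking off everything above bit 14. The sketch below shows only that arithmetic; `demo_truncate_jobid_15` and `DEMO_JOBID_MASK_15` are hypothetical names, and the actual field layout used for Slurm process names is not shown in the commit message.

```c
#include <assert.h>
#include <stdint.h>

/* Mask with the low 15 bits set: 0x7FFF == 32767. */
#define DEMO_JOBID_MASK_15 0x7FFFu

/* Hypothetical helper (not an Open MPI function): keeps only the low
 * 15 bits of jobid, so the result is always in the range 0..32767. */
static uint32_t demo_truncate_jobid_15(uint32_t jobid)
{
    uint32_t truncated = jobid & DEMO_JOBID_MASK_15;
    assert(truncated <= DEMO_JOBID_MASK_15);
    return truncated;
}
```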
Ralph Castain
4960b6b76a
Remove stale reference to MIPS
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 15:35:17 -08:00
Jeff Squyres
cdd3a9fbcc Update hwloc submodule to fix "make distcheck"
Hwloc upstream has fixed a problem with embedded "make distcheck" that
surfaced when you ran autogen in an Open MPI tarball.

This submodule update takes in the upstream hwloc fixes for this
issue.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-21 13:07:08 -08:00
Ralph Castain
820d9ac65c
Merge pull request #7447 from rhc54/topic/myen
Pickup change in check for deleted vs deprecated options and add deprecated ones
2020-02-21 11:30:00 -08:00
Ralph Castain
f9643b84b9
Merge pull request #7441 from rhc54/topic/hack
Create a hack to protect against non-integer jobids
2020-02-21 11:28:51 -08:00
Jeff Squyres
6d34b064be
Merge pull request #7446 from jsquyres/pr/remove-unused-compress-framework
Remove "compress" OPAL framework
2020-02-21 10:35:13 -08:00
Edgar Gabriel
df6e3e503a sharedfp/individual: defer error when not being able to open datafile
This commit changes the behavior of the individual sharedfp component. If
the component cannot create either the datafile or the metadatafile during File_open,
no error is raised at that point. This allows applications that do not use shared
file pointer operations to continue execution without any issue.

If the user however subsequently calls MPI_File_write_shared or similar operations, an error
will be raised.

Fixes issue #7429

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2020-02-21 12:13:39 -06:00
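The deferred-error behavior described in commit df6e3e503a above follows a common pattern: record the failure at open time and report it only when the affected feature is used. A minimal sketch of that pattern follows, with hypothetical `demo_*` names and return codes that stand in for the real sharedfp component interface.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define DEMO_SUCCESS 0
#define DEMO_ERROR   (-1)

/* Hypothetical per-file state (not the real component structure). */
struct demo_sharedfp {
    bool usable;  /* false if datafile or metadatafile creation failed */
};

/* Open never fails because of the shared file pointer files; it only
 * remembers whether they could be created. */
static int demo_file_open(struct demo_sharedfp *fp,
                          bool datafile_ok, bool metafile_ok)
{
    assert(fp != NULL);
    fp->usable = datafile_ok && metafile_ok;
    return DEMO_SUCCESS;
}

/* Shared file pointer operations surface the deferred error. */
static int demo_write_shared(const struct demo_sharedfp *fp)
{
    assert(fp != NULL);
    return fp->usable ? DEMO_SUCCESS : DEMO_ERROR;
}
```

An application that never calls the shared-pointer operation never sees the error, which matches the intent stated in the commit message.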
Ralph Castain
fae5965506
Merge pull request #7445 from rhc54/topic/tests
Save the old ORTE simple tests
2020-02-21 08:48:30 -08:00
Ralph Castain
8d66045e95
Deprecate the enable-orterun-prefix-by-default options
Mark the --enable-orterun-prefix-by-default and
--enable-mpirun-prefix-by-default options as deprecated, but continue to
honor them by translating them to the new
--enable-prte-prefix-by-default option.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 08:02:13 -08:00
Jeff Squyres
3bfc7a7b62
Tweak: rename "deprecated" --> "deleted"
We're dealing with CLI options that have been deleted, not
deprecated.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
(cherry picked from commit 5aea4446288db9207fbc60f23ee903b149217121)
2020-02-21 07:33:05 -08:00
Jeff Squyres
66da0c6361 Remove "compress" OPAL framework
This framework is no longer used.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-02-21 06:28:16 -08:00
Ralph Castain
7e2874a83d
Save the old ORTE simple tests
Useful when debugging RTE-related issues

Not for inclusion in the tarball - just added to git repo for use by
developers.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 06:15:06 -08:00
Ralph Castain
829fd478b3
Create a hack to protect against non-integer jobids
If someone gives us a namespace that doesn't easily translate to an
integer, we have to create a mechanism for working around the
disconnect. PRRTE has been updated to give us a flag so we know we were
"natively" launched. If we don't see it, then fall back to generating a
hash of the nspace as our jobid. We then have to translate back/forth
between nspace and jobid using a lookup table.

Probably not the right long-term solution, but hopefully helps get us
thru for a bit.

Includes update of PRRTE pointer

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-02-21 06:04:55 -08:00
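The fallback described in commit 829fd478b3 above (hash the nspace to get a jobid, then translate in both directions through a lookup table) can be sketched as follows. This is an illustration under stated assumptions, not the actual implementation: all `demo_*` names are hypothetical, the FNV-1a hash is chosen only for the example, the table is a fixed-size array, and hash collisions are not handled.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define DEMO_MAX_JOBS   64
#define DEMO_NSPACE_LEN 256

/* Hypothetical lookup-table entry pairing an nspace with its jobid. */
struct demo_entry {
    char     nspace[DEMO_NSPACE_LEN];
    uint32_t jobid;
};

static struct demo_entry demo_table[DEMO_MAX_JOBS];
static int demo_count = 0;

/* 32-bit FNV-1a hash of a NUL-terminated string (illustrative choice). */
static uint32_t demo_hash(const char *s)
{
    uint32_t h = 2166136261u;
    for (; *s != '\0'; s++) {
        h ^= (unsigned char)*s;
        h *= 16777619u;
    }
    return h;
}

/* nspace -> jobid. Returns 0 on success and -1 if the table is full or
 * the nspace is too long. A known nspace returns its recorded jobid. */
static int demo_nspace_to_jobid(const char *nspace, uint32_t *jobid)
{
    assert(nspace != NULL && jobid != NULL);
    for (int i = 0; i < demo_count; i++) {
        if (strcmp(demo_table[i].nspace, nspace) == 0) {
            *jobid = demo_table[i].jobid;
            return 0;
        }
    }
    if (demo_count == DEMO_MAX_JOBS || strlen(nspace) >= DEMO_NSPACE_LEN) {
        return -1;
    }
    strcpy(demo_table[demo_count].nspace, nspace);
    demo_table[demo_count].jobid = demo_hash(nspace);
    *jobid = demo_table[demo_count].jobid;
    demo_count++;
    return 0;
}

/* jobid -> nspace. Returns NULL if the jobid was never recorded. */
static const char *demo_jobid_to_nspace(uint32_t jobid)
{
    for (int i = 0; i < demo_count; i++) {
        if (demo_table[i].jobid == jobid) {
            return demo_table[i].nspace;
        }
    }
    return NULL;
}
```

The table is what makes the mapping reversible: a hash alone cannot be turned back into the original nspace string.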