1
1

31216 Коммитов

Автор SHA1 Сообщение Дата
Gilles Gouaillardet
26aeb59f6d
Merge pull request #8280 from ggouaillardet/topic/dist_cleanup
do not include automatically generated file into the dist tarball
2020-12-11 23:08:09 +09:00
Gilles Gouaillardet
6f282bd385 do not include automatically generated file into the dist tarball
- ompi/mpi/fortran/use-mpi-f08/mod/mpi-f08-interfaces.h
 - ompi/mpiext/shortfloat/c/mpiext_shortfloat_c.h
 - ompi/mpiext/shortfloat/mpif-h/mpiext_shortfloat_mpifh.h
 - ompi/mpiext/shortfloat/use-mpi-f08/mpiext_shortfloat_usempif08.h

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2020-12-11 16:01:33 +09:00
Edgar Gabriel
5385e5f85f fbtl/posix/preadv.c: first cut on adding data sieving
the lack of performing data sieving has been identified as a main reason for the poor performance in some instances on the Lustre file system. This commit introduces the fundamental ability to perform data sieving for read operations (which should not be controversial). The code itself is correct, what is still lacking is a) the logic when and how to activate data sieving and b) the logic to limit the size of the temporary buffer when doing data sieving.

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2020-12-10 14:16:07 -06:00
Edgar Gabriel
f70bb4774a dynamic_gen2/file_write_all: fix chunk assignment per stride
the dynamic_gen_file_write_all component distinguishes between the amount of data communicated
to aggregators, and the amount of data written in a cycle by the aggregator (in contrary e.g. to the vulcan component).
There was a bug in calculating which chunks have to be written in a cycle by an aggregator: we added as many elements into the
io_array until we filled one stripe. Unfortuantely, the metric used was the amount of data instead of ensuring that all offsets
fall within a single stripe. This commit fixes this issue. Note, the bug did not create a correctness problem, just a performance
problem in case there were gaps in the file view.

Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2020-12-10 08:53:11 -06:00
Jeff Squyres
2d5e666808
Merge pull request #8267 from hpcraink/man-pages
Fix man-page for MPI_File_set_view
2020-12-07 12:16:58 -05:00
Rainer Keller
3202a16e7d Fix man-page for MPI_File_set_view
The datarep may not be NULL, as check was added in ba955883329.

Signed-off-by: Rainer Keller <rainer.keller@hs-esslingen.de>
2020-12-04 22:05:53 +01:00
Yossi Itigin
47fb05f82a
Merge pull request #8184 from gleon99/master
Replace usage of the deprecated NB API of UCX with NBX
2020-11-25 13:02:48 +02:00
Leonid Genkin
7f9a305a64 Replace usage of the deprecated NB API of UCX with NBX
Signed-off-by: Leonid Genkin <lgenkin@nvidia.com>
2020-11-25 12:21:06 +02:00
Jeff Squyres
a8f883a73a
Merge pull request #8250 from devreal/fix-han-commselect-new
coll/han: fix coll preference selection in mca_coll_han_comm_create_new
2020-11-24 09:21:57 -05:00
Joseph Schuchart
33105b031b coll/han: fix coll preference selection in mca_coll_han_comm_create_new
Exclude HAN, don't include it.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-24 10:09:14 +01:00
Ralph Castain
7dc9815340
Merge pull request #8245 from rhc54/topic/up
Update PMIx/PRRTE pointers
2020-11-23 12:07:08 -08:00
Ralph Castain
3c2dbac64f
Update PMIx/PRRTE pointers
Resolves the PRRTE launch scale limitation

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-11-23 11:13:03 -08:00
Nikola Dancejic
5af8497a05 common/ofi: fixing error message to be a debug output
A path that was being used in oversubscribed cases caused a help message to
output for each process. This replaces the help message with a debug output to
prevent excessive output unless the user enables debug output.

Signed-off-by: Nikola Dancejic <dancejic@amazon.com>
2020-11-23 08:35:33 -08:00
Jeff Squyres
61dc29f0f6
Merge pull request #8239 from ggouaillardet/topic/OPAL_GET_VERSION
configury: fix OPAL_GET_VERSION
2020-11-23 10:56:05 -05:00
Jeff Squyres
3edd62e568
Merge pull request #8203 from jsquyres/pr/fix-warnings
Fix many compiler warnings
2020-11-23 10:15:15 -05:00
Gilles Gouaillardet
c134664c02
Merge pull request #8222 from ggouaillardet/topic/libtool_bigsur
autogen.pl: patch libtool.m4 for OSX Big Sur
2020-11-23 08:10:50 +09:00
Gilles Gouaillardet
3f45ceda1b autogen.pl: patch libtool.m4 for OSX Big Sur
Thanks FX Coudert for reporting this issue and pointing
to a solution.

Refs. open-mpi/ompi#8218

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-11-22 10:21:46 -08:00
Gilles Gouaillardet
930d3c4695 configury: fix OPAL_GET_VERSION
- fix path to getdate.sh
 - do not prepend "date" to the revision
 - support git worktree

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2020-11-22 21:05:32 +09:00
Artem Polyakov
159856f9aa
Merge pull request #8219 from hoopoepg/topic/fixed-coverity-warning-wpool
PML/UCX/WPOOL: fixed coverity issue
2020-11-20 17:31:04 -08:00
Raghu Raja
38d2f12112
Merge pull request #8220 from devreal/fix-coll-base-preference
Fix preference treatment in coll/base
2020-11-20 08:14:37 -08:00
Jeff Squyres
28779321a7
Merge pull request #8233 from Webcretaire/typo
Typo in ob1 comments, and uniform conditions
2020-11-19 17:07:49 -05:00
Julien EMMANUEL
7d493c6bcd Uniform conditions in ob1 recv
In ob1 we have four similar conditions but they are not written
in a uniform way

Signed-off-by: Julien EMMANUEL <julien.emmanuel@inria.fr>
2020-11-19 22:27:49 +01:00
Joseph Schuchart
1cdc85564e coll/han: reduce default segment size for reduce/allreduce to 64k
This has shown to be more effective in achieving overlap
of inter- and intra-node communication and reduces the inital
delay before hitting the network.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-19 19:23:32 +01:00
Joseph Schuchart
971d58c524 coll/han: remove references to experimental solo and shared collective components
Also make coll/tuned the default for shared memory communication
as coll/sm has shown performance issues that need investigation.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-19 19:22:06 +01:00
Joseph Schuchart
09c2f4af94 coll/[sm|han|adapt]: don't disqualify on priority 0
Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-19 19:10:09 +01:00
Joseph Schuchart
dd54af9450 coll/base: Fix collective module selection preference treatment
The selectable list is sorted with lowest to highest priority so the
user-defined preferences should be appended to the list.
The preference treatment should also maintain the order provided by the user
(first item has highest priority) so switch the loop order.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-19 19:06:28 +01:00
Julien EMMANUEL
208c2d270b Typo in ob1 comments
Seems like a copy/pasted typo in ob1 comments

Signed-off-by: Julien EMMANUEL <julien.emmanuel@inria.fr>
2020-11-19 16:50:29 +01:00
Ralph Castain
3f02fafb7d
Merge pull request #8229 from rhc54/topic/dbg
Do not pass --enable-debug to internal hwloc
2020-11-18 09:37:49 -08:00
Ralph Castain
a43d8908c0
Do not pass --enable-debug to internal hwloc
Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-11-18 07:04:11 -08:00
Austen Lauria
a89830b6c7
Merge pull request #8223 from paklui/master
oshmem/tools/oshmem_info: fix fortran keyword issue when compiling param.c
2020-11-17 12:20:44 -05:00
Jeff Squyres
12796a4aad
Merge pull request #8209 from devreal/fix-tuned-allgatherv
COLL TUNED: Use per-rank data size instead of total size for decision in allgatherv
2020-11-17 11:46:35 -05:00
Sergey Oblomov
1aa6e74d1b PML/UCX/WPOOL: fixe potential leak in error processing
- there was potential leak in error handling, fixed

Signed-off-by: Sergey Oblomov <sergeyo@nvidia.com>
2020-11-17 18:08:15 +02:00
Pak Lui
3cdead0d0c oshmem/tools/oshmem_info: fix an issue with fortran keyword when compiling param.c
Signed-off-by: Pak Lui <pak.lui@amd.com>
2020-11-17 09:29:34 -06:00
Sergey Oblomov
a6e00e3d41 PML/UCX/WPOOL: fixed coverity issue
- fixed issue reported by coverity

Signed-off-by: Sergey Oblomov <sergeyo@nvidia.com>
2020-11-16 10:30:32 +02:00
Jeff Squyres
282be20e6f
Merge pull request #8210 from vspetrov/master
coll/hcoll: svatterv inplace fix
2020-11-14 10:25:25 -05:00
Jeff Squyres
14aa5fae3c Fix many compiler warnings:
- Add some missing AC_CHECK_SIZEOF's in configure.ac
- Remove some unused variables
- Initialize some variables
- Fix some parameter types
- Cast where appropriate/safe to fix warnings
- Move ompi/mca/common/monitoring Fortran bindings to a separate .c
  file so that they can use different #define's than the C bindings,
  and therefore compile properly / without warnings.
- Fix signedness discrepancies
- Who knew?  Separated these into multiple #if's, instead:
  ```
  // This is undefined behavior
  #define HAVE_FOO defined(FOO)
  #define YOW (HAVE_FOO && defined(BAR))
  ```
- Fix some typos in OMPI_BUILD_HOST logic
- Don't "2>/dev/null" in OMPI_BUILD_HOST logic; it just hides errors

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2020-11-14 07:20:30 -08:00
Jeff Squyres
67421c5d23
Merge pull request #8207 from ggouaillardet/topic/retain_datatypes_w
coll/base: do not drop const qualifier
2020-11-14 09:21:37 -05:00
Artem Polyakov
f9ef4b4ac0
Merge pull request #7632 from devreal/osc-ucx-progress
UCX osc: make progress on idle worker if none are active
2020-11-13 13:27:31 -08:00
Valentin Petrov
9fa00155f3 coll/hcoll: scatterv inplace fix
Signed-off-by: Valentin Petrov <valentinp@nvidia.com>
2020-11-13 19:15:37 +02:00
Joseph Schuchart
f670364d76 COLL TUNED: Use per-rank data size instead of total size for decision
The total size depends on number of ranks so the usual ranges don't work.
Thus, use the average across all ranks to make a decision.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-13 12:18:42 +01:00
Gilles Gouaillardet
c49e5e5c4a coll/base: do not drop const qualifier
MPI_Ialltoallw() and friends take a const MPI_Datatype types[] argument.
In order to be able to call OBJ_RELEASE(types[0]), we used to simply
drop the const modifier. This change make it right by introducing the
OBJ_RELEASE_NO_NULLIFY(object) macro that no more set object = NULL
if the object is freed.

Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2020-11-13 17:31:04 +09:00
Raghu Raja
b7b9254488
Merge pull request #8178 from rajachan/whack-remote-cq-data-query
mtl/ofi: Check cq_data_size without querying providers again
2020-11-12 13:00:10 -08:00
Joseph Schuchart
d11f625ed5 SPC: allow counters to be attached solely through MPI_T and reduce overhead
- only make MCA parameters available if SPC is enabled

- do not compile SPC code if SPC is disabled

- move includes into ompi_spc.c

- allow counters to be enabled through MPI_T without setting MCA parameter

- inline counter update calls that are likely in the critical path

- fix test to succeed even if encountering invalid pvars

- move timer_[start|stop] to header and move attachment info into ompi_spc_t

There is no need to store the name in the ompi_spc_t struct too, we can use that space
for the attachment info instead to avoid accessing another cache line.

- make timer/watermark flags a property of the spc description

This is meant to making adding counters easier in the future by
centralizing the necessary information. By storing a copy of these flags
in the ompi_spc_t structure (without adding to its size) reduces
cache pollution for timer/watermark events.

- allocate ompi_spc_t objects with cache-alignment

This prevents objects from spanning multiple cache lines and thus
ensures that only one cache line is loaded per update.

- fix handling of timer and timer conversion

- only call opal_timer_base_get_cycles if necesary to reduce overhead

- Remove use of OPAL_UNLIKELY to improve code generated by GCC

It appears that GCC makes less effort in optimizing the unlikely path
and generates bloated code.

- Allocate ompi_spc_events statically to reduce loads in critical path

- duplicate comm_world only when dumping is requested

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-12 21:17:56 +01:00
Raghu Raja
30831fb7f0
Merge pull request #8186 from devreal/fix-tuned-dynamic
Fix some issues with dynamic algorithm selection in coll/tuned
2020-11-12 11:20:57 -08:00
Joseph Schuchart
581478dc91 UCX osc: make progress on default worker if none are active
Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>
2020-11-11 23:23:31 +01:00
Ralph Castain
d489030925
Merge pull request #8199 from rhc54/topic/locality
Fix confusion between cpuset and locality
2020-11-11 10:22:03 -08:00
Joseph Schuchart
a15e5dc7f0 COLL TUNED: remove stray selection of linear algs for alreduce and allgather
These selections seem harmful in my measurements and don't seem to be
motivated by previous measurement data.

Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>
2020-11-11 18:40:24 +01:00
Ralph Castain
2f7f1feca5
Fix confusion between cpuset and locality
Ensure we correctly collect and save the cpuset of the process
separately from its locality string. Ensure we use the correct one when
computing things like relative locality between processes.

Signed-off-by: Ralph Castain <rhc@pmix.org>
2020-11-10 16:41:00 -08:00
Jeff Squyres
57ccb830c2
Merge pull request #8197 from Webcretaire/typo
Typo in error message for ompi_proc_world
2020-11-10 16:28:59 -05:00
Jeff Squyres
fd59b46a0b
Merge pull request #8191 from jsquyres/pr/markdown-ftw
Convert README files to Markdown
2020-11-10 15:09:24 -05:00