1
1

Merge pull request #3923 from artpol84/rm_yoda_news/master

Yoda removal (update docs)
Этот коммит содержится в:
Artem Polyakov 2017-07-17 17:51:36 -07:00 коммит произвёл GitHub
родитель 771f51af12 cc156a332b
Коммит bdb698a9b6
2 изменённых файлов: 228 добавлений и 12 удалений

214
NEWS
Просмотреть файл

@ -66,6 +66,220 @@ Master (not on release branches yet)
- Removed embedded VampirTrace. It is in maintenance mode since 2013.
Please consider Score-P (score-p.org) as an external replacement.
3.0.0 -- July, 2017
-------------------
Major new features:
- Use UCX allocator for OSHMEM symmetric heap allocations to optimize intra-node
data transfers. UCX SPML only.
- Use UCX multi-threaded API in the UCX PML. Requires UCX 1.0 or later.
Changes in behavior compared to prior versions:
Removed legacy support:
- "yoda" SPML component is removed from the Open SHMEM codebase. It was utilizing
Open MPI BTL components for supported networks.
Currently avalable SPML components are:
- "ikrit" based on Mellanox Messaging Accelerator (MXM) communication library;
- "ucx" based on the open-source Unified Communication X (UCX) communication
library (http://www.openucx.org/).
Known issues (to be addressed in v3.0.1):
2.1.1 -- April, 2017
--------------------
Bug fixes/minor improvements:
- Fix a problem with one of Open MPI's fifo data structures which led to
hangs in a make check test. Thanks to Nicolas Morey-Chaisemartin for
reporting.
- Add missing MPI_AINT_ADD/MPI_AINT_DIFF function definitions to mpif.h.
Thanks to Aboorva Devarajan for reporting.
- Fix the error return from MPI_WIN_LOCK when rank argument is invalid.
Thanks to Jeff Hammond for reporting and fixing this issue.
- Fix a problem with mpirun/orterun when started under a debugger. Thanks
to Gregory Leff for reporting.
- Add configury option to disable use of CMA by the vader BTL. Thanks
to Sascha Hunold for reporting.
- Add configury check for MPI_DOUBLE_COMPLEX datatype support.
Thanks to Alexander Klein for reporting.
- Fix memory allocated by MPI_WIN_ALLOCATE_SHARED to
be 64 bit aligned. Thanks to Joseph Schuchart for
reporting.
- Update MPI_WTICK man page to reflect possibly higher
resolution than 10e-6. Thanks to Mark Dixon for
reporting
- Add missing MPI_T_PVAR_SESSION_NULL definition to mpi.h
include file. Thanks to Omri Mor for this contribution.
- Enhance the Open MPI spec file to install modulefile in /opt
if installed in a non-default location. Thanks to Kevin
Buckley for reporting and supplying a fix.
- Fix a problem with conflicting PMI symbols when linking statically.
Thanks to Kilian Cavalotti for reporting.
Known issues (to be addressed in v2.1.2):
- See the list of fixes slated for v2.1.2 here:
https://github.com/open-mpi/ompi/milestone/28
2.1.0 -- March, 2017
--------------------
Major new features:
- The main focus of the Open MPI v2.1.0 release was to update to PMIx
v1.2.1. When using PMIx (e.g., via mpirun-based launches, or via
direct launches with recent versions of popular resource managers),
launch time scalability is improved, and the run time memory
footprint is greatly decreased when launching large numbers of MPI /
OpenSHMEM processes.
- Update OpenSHMEM API conformance to v1.3.
- The usnic BTL now supports MPI_THREAD_MULTIPLE.
- General/overall performance improvements to MPI_THREAD_MULTIPLE.
- Add a summary message at the bottom of configure that tells you many
of the configuration options specified and/or discovered by Open
MPI.
Changes in behavior compared to prior versions:
- None.
Removed legacy support:
- The ptmalloc2 hooks have been removed from the Open MPI code base.
This is not really a user-noticable change; it is only mentioned
here because there was much rejoycing in the Open MPI developer
community.
Bug fixes/minor improvements:
- New MCA parameters:
- iof_base_redirect_app_stderr_to_stdout: as its name implies, it
combines MPI / OpenSHMEM applications' stderr into its stdout
stream.
- opal_event_include: allow the user to specify which FD selection
mechanism is used by the underlying event engine.
- opal_stacktrace_output: indicate where stacktraces should be sent
upon MPI / OpenSHMEM process crashes ("none", "stdout", "stderr",
"file:filename").
- orte_timeout_for_stack_trace: number of seconds to wait for stack
traces to be reported (or <=0 to wait forever).
- mtl_ofi_control_prog_type/mtl_ofi_data_prog_type: specify libfabric
progress model to be used for control and data.
- Fix MPI_WTICK regression where the time reported may be inaccurate
on systems with processor frequency scalaing enabled.
- Fix regression that lowered the memory maximum message bandwidth for
large messages on some BTL network transports, such as openib, sm,
and vader.
- Fix a name collision in the shared file pointer MPI IO file locking
scheme. Thanks to Nicolas Joly for reporting the issue.
- Fix datatype extent/offset errors in MPI_PUT and MPI_RACCUMULATE
when using the Portals 4 one-sided component.
- Add support for non-contiguous datatypes to the Portals 4 one-sided
component.
- Various updates for the UCX PML.
- Updates to the following man pages:
- mpirun(1)
- MPI_COMM_CONNECT(3)
- MPI_WIN_GET_NAME(3). Thanks to Nicolas Joly for reporting the
typo.
- MPI_INFO_GET_[NKEYS|NTHKEY](3). Thanks to Nicolas Joly for
reporting the typo.
- Fixed a problem in the TCP BTL when using MPI_THREAD_MULTIPLE.
Thanks to Evgueni Petrov for reporting.
- Fixed external32 representation in the romio314 module. Note that
for now, external32 representation is not correctly supported by the
ompio module. Thanks to Thomas Gastine for bringing this to our
attention.
- Add note how to disable a warning message about when a high-speed
MPI transport is not found. Thanks to Susan Schwarz for reporting
the issue.
- Ensure that sending SIGINT when using the rsh/ssh launcher does not
orphan children nodes in the launch tree.
- Fix the help message when showing deprecated MCA param names to show
the correct (i.e., deprecated) name.
- Enable support for the openib BTL to use multiple different
InfiniBand subnets.
- Fix a minor error in MPI_AINT_DIFF.
- Fix bugs with MPI_IN_PLACE handling in:
- MPI_ALLGATHER[V]
- MPI_[I][GATHER|SCATTER][V]
- MPI_IREDUCE[_SCATTER]
- Thanks to all the users who helped diagnose these issues.
- Allow qrsh to tree spawn (if the back-end system supports it).
- Fix MPI_T_PVAR_GET_INDEX to return the correct index.
- Correctly position the shared file pointer in append mode in the
OMPIO component.
- Add some deprecated names into shmem.h for backwards compatibility
with legacy codes.
- Fix MPI_MODE_NOCHECK support.
- Fix a regression in PowerPC atomics support. Thanks to Orion
Poplawski for reporting the issue.
- Fixes for assembly code with aggressively-optimized compilers on
x86_64/AMD64 platforms.
- Fix one more place where configure was mangling custom CFLAGS.
Thanks to Phil Tooley (@Telemin) for reporting the issue.
- Better handle builds with external installations of hwloc.
- Fixed a hang with MPI_PUT and MPI_WIN_LOCK_ALL.
- Fixed a bug when using MPI_GET on non-contiguous datatypes and
MPI_LOCK/MPI_UNLOCK.
- Fixed a bug when using POST/START/COMPLETE/WAIT after a fence.
- Fix configure portability by cleaning up a few uses of "==" with
"test". Thanks to Kevin Buckley for pointing out the issue.
- Fix bug when using darrays with lib and extent of darray datatypes.
- Updates to make Open MPI binary builds more bit-for-bit
reproducable. Thanks to Alastair McKinstry for the suggestion.
- Fix issues regarding persistent request handling.
- Ensure that shmemx.h is a standalone OpenSHMEM header file. Thanks
to Nick Park (@nspark) for the report.
- Ensure that we always send SIGTERM prior to SIGKILL. Thanks to Noel
Rycroft for the report.
- Added ConnectX-5 and Chelsio T6 device defaults for the openib BTL.
- OpenSHMEM no longer supports MXM less than v2.0.
- Plug a memory leak in ompi_osc_sm_free. Thanks to Joseph Schuchart
for the report.
- The "self" BTL now uses less memory.
- The vader BTL is now more efficient in terms of memory usage when
using XPMEM.
- Removed the --enable-openib-failover configure option. This is not
considered backwards-incompatible because this option was stale and
had long-since stopped working, anyway.
- Allow jobs launched under Cray aprun to use hyperthreads if
opal_hwloc_base_hwthreads_as_cpus MCA parameter is set.
- Add support for 32-bit and floating point Cray Aries atomic
operations.
- Add support for network AMOs for MPI_ACCUMULATE, MPI_FETCH_AND_OP,
and MPI_COMPARE_AND_SWAP if the "ompi_single_intrinsic" info key is
set on the window or the "acc_single_intrinsic" MCA param is set.
- Automatically disqualify RDMA CM support in the openib BTL if
MPI_THREAD_MULTIPLE is used.
- Make configure smarter/better about auto-detecting Linux CMA
support.
- Improve the scalability of MPI_COMM_SPLIT_TYPE.
- Fix the mixing of C99 and C++ header files with the MPI C++
bindings. Thanks to Alastair McKinstry for the bug report.
- Add support for ARM v8.
- Several MCA parameters now directly support MPI_T enumerator
semantics (i.e., they accept a limited set of values -- e.g., MCA
parameters that accept boolean values).
- Added --with-libmpi-name=STRING configure option for vendor releases
of Open MPI. See the README for more detail.
- Fix a problem with Open MPI's internal memory checker. Thanks to Yvan
Fournier for reporting.
- Fix a multi-threaded issue with MPI_WAIT. Thanks to Pascal Deveze for
reporting.
Known issues (to be addressed in v2.1.1):
- See the list of fixes slated for v2.1.1 here:
https://github.com/open-mpi/ompi/milestone/26
2.0.3 -- June 2017
------------------
Bug fixes/minor improvements:
2.0.2 -- 26 January 2017
-------------------------

26
README
Просмотреть файл

@ -626,19 +626,21 @@ Network Support
or
shell$ mpirun --mca pml cm ...
- Similarly, there are two OpenSHMEM network models available: "yoda",
and "ikrit". "yoda" also uses the BTL components for supported
networks. "ikrit" interfaces directly with Mellanox MXM.
- Similarly, there are two OpenSHMEM network models available: "ucx",
and "ikrit":
- "ucx" interfaces directly with UCX;
- "ikrit" interfaces directly with Mellanox MXM.
- "yoda" supports a variety of networks that can be used:
- OpenFabrics: InfiniBand, iWARP, and RoCE
- Loopback (send-to-self)
- Shared memory
- TCP
- usNIC
- "ikrit" only supports Mellanox MXM.
- UCX is the Unified Communication X (UCX) communication library
(http://www.openucx.org/).
This is an open-source project developed in collaboration between
industry, laboratories, and academia to create an open-source
production grade communication framework for data centric and
high-performance applications.
UCX currently supports:
- OFA Verbs;
- Cray's uGNI;
- NVIDIA CUDA drivers.
- MXM is the Mellanox Messaging Accelerator library utilizing a full
range of IB transports to provide the following messaging services