Merge pull request #3923 from artpol84/rm_yoda_news/master
Yoda removal (update docs)
Этот коммит содержится в:
Коммит
bdb698a9b6
214
NEWS
214
NEWS
@ -66,6 +66,220 @@ Master (not on release branches yet)
|
||||
- Removed embedded VampirTrace. It is in maintenance mode since 2013.
|
||||
Please consider Score-P (score-p.org) as an external replacement.
|
||||
|
||||
3.0.0 -- July, 2017
|
||||
-------------------
|
||||
|
||||
Major new features:
|
||||
|
||||
- Use UCX allocator for OSHMEM symmetric heap allocations to optimize intra-node
|
||||
data transfers. UCX SPML only.
|
||||
- Use UCX multi-threaded API in the UCX PML. Requires UCX 1.0 or later.
|
||||
|
||||
Changes in behavior compared to prior versions:
|
||||
|
||||
Removed legacy support:
|
||||
- "yoda" SPML component is removed from the Open SHMEM codebase. It was utilizing
|
||||
Open MPI BTL components for supported networks.
|
||||
Currently avalable SPML components are:
|
||||
- "ikrit" based on Mellanox Messaging Accelerator (MXM) communication library;
|
||||
- "ucx" based on the open-source Unified Communication X (UCX) communication
|
||||
library (http://www.openucx.org/).
|
||||
|
||||
Known issues (to be addressed in v3.0.1):
|
||||
|
||||
2.1.1 -- April, 2017
|
||||
--------------------
|
||||
|
||||
Bug fixes/minor improvements:
|
||||
|
||||
- Fix a problem with one of Open MPI's fifo data structures which led to
|
||||
hangs in a make check test. Thanks to Nicolas Morey-Chaisemartin for
|
||||
reporting.
|
||||
- Add missing MPI_AINT_ADD/MPI_AINT_DIFF function definitions to mpif.h.
|
||||
Thanks to Aboorva Devarajan for reporting.
|
||||
- Fix the error return from MPI_WIN_LOCK when rank argument is invalid.
|
||||
Thanks to Jeff Hammond for reporting and fixing this issue.
|
||||
- Fix a problem with mpirun/orterun when started under a debugger. Thanks
|
||||
to Gregory Leff for reporting.
|
||||
- Add configury option to disable use of CMA by the vader BTL. Thanks
|
||||
to Sascha Hunold for reporting.
|
||||
- Add configury check for MPI_DOUBLE_COMPLEX datatype support.
|
||||
Thanks to Alexander Klein for reporting.
|
||||
- Fix memory allocated by MPI_WIN_ALLOCATE_SHARED to
|
||||
be 64 bit aligned. Thanks to Joseph Schuchart for
|
||||
reporting.
|
||||
- Update MPI_WTICK man page to reflect possibly higher
|
||||
resolution than 10e-6. Thanks to Mark Dixon for
|
||||
reporting
|
||||
- Add missing MPI_T_PVAR_SESSION_NULL definition to mpi.h
|
||||
include file. Thanks to Omri Mor for this contribution.
|
||||
- Enhance the Open MPI spec file to install modulefile in /opt
|
||||
if installed in a non-default location. Thanks to Kevin
|
||||
Buckley for reporting and supplying a fix.
|
||||
- Fix a problem with conflicting PMI symbols when linking statically.
|
||||
Thanks to Kilian Cavalotti for reporting.
|
||||
|
||||
Known issues (to be addressed in v2.1.2):
|
||||
|
||||
- See the list of fixes slated for v2.1.2 here:
|
||||
https://github.com/open-mpi/ompi/milestone/28
|
||||
|
||||
2.1.0 -- March, 2017
|
||||
--------------------
|
||||
|
||||
Major new features:
|
||||
|
||||
- The main focus of the Open MPI v2.1.0 release was to update to PMIx
|
||||
v1.2.1. When using PMIx (e.g., via mpirun-based launches, or via
|
||||
direct launches with recent versions of popular resource managers),
|
||||
launch time scalability is improved, and the run time memory
|
||||
footprint is greatly decreased when launching large numbers of MPI /
|
||||
OpenSHMEM processes.
|
||||
- Update OpenSHMEM API conformance to v1.3.
|
||||
- The usnic BTL now supports MPI_THREAD_MULTIPLE.
|
||||
- General/overall performance improvements to MPI_THREAD_MULTIPLE.
|
||||
- Add a summary message at the bottom of configure that tells you many
|
||||
of the configuration options specified and/or discovered by Open
|
||||
MPI.
|
||||
|
||||
Changes in behavior compared to prior versions:
|
||||
|
||||
- None.
|
||||
|
||||
Removed legacy support:
|
||||
|
||||
- The ptmalloc2 hooks have been removed from the Open MPI code base.
|
||||
This is not really a user-noticable change; it is only mentioned
|
||||
here because there was much rejoycing in the Open MPI developer
|
||||
community.
|
||||
|
||||
Bug fixes/minor improvements:
|
||||
|
||||
- New MCA parameters:
|
||||
- iof_base_redirect_app_stderr_to_stdout: as its name implies, it
|
||||
combines MPI / OpenSHMEM applications' stderr into its stdout
|
||||
stream.
|
||||
- opal_event_include: allow the user to specify which FD selection
|
||||
mechanism is used by the underlying event engine.
|
||||
- opal_stacktrace_output: indicate where stacktraces should be sent
|
||||
upon MPI / OpenSHMEM process crashes ("none", "stdout", "stderr",
|
||||
"file:filename").
|
||||
- orte_timeout_for_stack_trace: number of seconds to wait for stack
|
||||
traces to be reported (or <=0 to wait forever).
|
||||
- mtl_ofi_control_prog_type/mtl_ofi_data_prog_type: specify libfabric
|
||||
progress model to be used for control and data.
|
||||
- Fix MPI_WTICK regression where the time reported may be inaccurate
|
||||
on systems with processor frequency scalaing enabled.
|
||||
- Fix regression that lowered the memory maximum message bandwidth for
|
||||
large messages on some BTL network transports, such as openib, sm,
|
||||
and vader.
|
||||
- Fix a name collision in the shared file pointer MPI IO file locking
|
||||
scheme. Thanks to Nicolas Joly for reporting the issue.
|
||||
- Fix datatype extent/offset errors in MPI_PUT and MPI_RACCUMULATE
|
||||
when using the Portals 4 one-sided component.
|
||||
- Add support for non-contiguous datatypes to the Portals 4 one-sided
|
||||
component.
|
||||
- Various updates for the UCX PML.
|
||||
- Updates to the following man pages:
|
||||
- mpirun(1)
|
||||
- MPI_COMM_CONNECT(3)
|
||||
- MPI_WIN_GET_NAME(3). Thanks to Nicolas Joly for reporting the
|
||||
typo.
|
||||
- MPI_INFO_GET_[NKEYS|NTHKEY](3). Thanks to Nicolas Joly for
|
||||
reporting the typo.
|
||||
- Fixed a problem in the TCP BTL when using MPI_THREAD_MULTIPLE.
|
||||
Thanks to Evgueni Petrov for reporting.
|
||||
- Fixed external32 representation in the romio314 module. Note that
|
||||
for now, external32 representation is not correctly supported by the
|
||||
ompio module. Thanks to Thomas Gastine for bringing this to our
|
||||
attention.
|
||||
- Add note how to disable a warning message about when a high-speed
|
||||
MPI transport is not found. Thanks to Susan Schwarz for reporting
|
||||
the issue.
|
||||
- Ensure that sending SIGINT when using the rsh/ssh launcher does not
|
||||
orphan children nodes in the launch tree.
|
||||
- Fix the help message when showing deprecated MCA param names to show
|
||||
the correct (i.e., deprecated) name.
|
||||
- Enable support for the openib BTL to use multiple different
|
||||
InfiniBand subnets.
|
||||
- Fix a minor error in MPI_AINT_DIFF.
|
||||
- Fix bugs with MPI_IN_PLACE handling in:
|
||||
- MPI_ALLGATHER[V]
|
||||
- MPI_[I][GATHER|SCATTER][V]
|
||||
- MPI_IREDUCE[_SCATTER]
|
||||
- Thanks to all the users who helped diagnose these issues.
|
||||
- Allow qrsh to tree spawn (if the back-end system supports it).
|
||||
- Fix MPI_T_PVAR_GET_INDEX to return the correct index.
|
||||
- Correctly position the shared file pointer in append mode in the
|
||||
OMPIO component.
|
||||
- Add some deprecated names into shmem.h for backwards compatibility
|
||||
with legacy codes.
|
||||
- Fix MPI_MODE_NOCHECK support.
|
||||
- Fix a regression in PowerPC atomics support. Thanks to Orion
|
||||
Poplawski for reporting the issue.
|
||||
- Fixes for assembly code with aggressively-optimized compilers on
|
||||
x86_64/AMD64 platforms.
|
||||
- Fix one more place where configure was mangling custom CFLAGS.
|
||||
Thanks to Phil Tooley (@Telemin) for reporting the issue.
|
||||
- Better handle builds with external installations of hwloc.
|
||||
- Fixed a hang with MPI_PUT and MPI_WIN_LOCK_ALL.
|
||||
- Fixed a bug when using MPI_GET on non-contiguous datatypes and
|
||||
MPI_LOCK/MPI_UNLOCK.
|
||||
- Fixed a bug when using POST/START/COMPLETE/WAIT after a fence.
|
||||
- Fix configure portability by cleaning up a few uses of "==" with
|
||||
"test". Thanks to Kevin Buckley for pointing out the issue.
|
||||
- Fix bug when using darrays with lib and extent of darray datatypes.
|
||||
- Updates to make Open MPI binary builds more bit-for-bit
|
||||
reproducable. Thanks to Alastair McKinstry for the suggestion.
|
||||
- Fix issues regarding persistent request handling.
|
||||
- Ensure that shmemx.h is a standalone OpenSHMEM header file. Thanks
|
||||
to Nick Park (@nspark) for the report.
|
||||
- Ensure that we always send SIGTERM prior to SIGKILL. Thanks to Noel
|
||||
Rycroft for the report.
|
||||
- Added ConnectX-5 and Chelsio T6 device defaults for the openib BTL.
|
||||
- OpenSHMEM no longer supports MXM less than v2.0.
|
||||
- Plug a memory leak in ompi_osc_sm_free. Thanks to Joseph Schuchart
|
||||
for the report.
|
||||
- The "self" BTL now uses less memory.
|
||||
- The vader BTL is now more efficient in terms of memory usage when
|
||||
using XPMEM.
|
||||
- Removed the --enable-openib-failover configure option. This is not
|
||||
considered backwards-incompatible because this option was stale and
|
||||
had long-since stopped working, anyway.
|
||||
- Allow jobs launched under Cray aprun to use hyperthreads if
|
||||
opal_hwloc_base_hwthreads_as_cpus MCA parameter is set.
|
||||
- Add support for 32-bit and floating point Cray Aries atomic
|
||||
operations.
|
||||
- Add support for network AMOs for MPI_ACCUMULATE, MPI_FETCH_AND_OP,
|
||||
and MPI_COMPARE_AND_SWAP if the "ompi_single_intrinsic" info key is
|
||||
set on the window or the "acc_single_intrinsic" MCA param is set.
|
||||
- Automatically disqualify RDMA CM support in the openib BTL if
|
||||
MPI_THREAD_MULTIPLE is used.
|
||||
- Make configure smarter/better about auto-detecting Linux CMA
|
||||
support.
|
||||
- Improve the scalability of MPI_COMM_SPLIT_TYPE.
|
||||
- Fix the mixing of C99 and C++ header files with the MPI C++
|
||||
bindings. Thanks to Alastair McKinstry for the bug report.
|
||||
- Add support for ARM v8.
|
||||
- Several MCA parameters now directly support MPI_T enumerator
|
||||
semantics (i.e., they accept a limited set of values -- e.g., MCA
|
||||
parameters that accept boolean values).
|
||||
- Added --with-libmpi-name=STRING configure option for vendor releases
|
||||
of Open MPI. See the README for more detail.
|
||||
- Fix a problem with Open MPI's internal memory checker. Thanks to Yvan
|
||||
Fournier for reporting.
|
||||
- Fix a multi-threaded issue with MPI_WAIT. Thanks to Pascal Deveze for
|
||||
reporting.
|
||||
|
||||
Known issues (to be addressed in v2.1.1):
|
||||
|
||||
- See the list of fixes slated for v2.1.1 here:
|
||||
https://github.com/open-mpi/ompi/milestone/26
|
||||
|
||||
2.0.3 -- June 2017
|
||||
------------------
|
||||
|
||||
Bug fixes/minor improvements:
|
||||
|
||||
2.0.2 -- 26 January 2017
|
||||
-------------------------
|
||||
|
26
README
26
README
@ -626,19 +626,21 @@ Network Support
|
||||
or
|
||||
shell$ mpirun --mca pml cm ...
|
||||
|
||||
- Similarly, there are two OpenSHMEM network models available: "yoda",
|
||||
and "ikrit". "yoda" also uses the BTL components for supported
|
||||
networks. "ikrit" interfaces directly with Mellanox MXM.
|
||||
- Similarly, there are two OpenSHMEM network models available: "ucx",
|
||||
and "ikrit":
|
||||
- "ucx" interfaces directly with UCX;
|
||||
- "ikrit" interfaces directly with Mellanox MXM.
|
||||
|
||||
- "yoda" supports a variety of networks that can be used:
|
||||
|
||||
- OpenFabrics: InfiniBand, iWARP, and RoCE
|
||||
- Loopback (send-to-self)
|
||||
- Shared memory
|
||||
- TCP
|
||||
- usNIC
|
||||
|
||||
- "ikrit" only supports Mellanox MXM.
|
||||
- UCX is the Unified Communication X (UCX) communication library
|
||||
(http://www.openucx.org/).
|
||||
This is an open-source project developed in collaboration between
|
||||
industry, laboratories, and academia to create an open-source
|
||||
production grade communication framework for data centric and
|
||||
high-performance applications.
|
||||
UCX currently supports:
|
||||
- OFA Verbs;
|
||||
- Cray's uGNI;
|
||||
- NVIDIA CUDA drivers.
|
||||
|
||||
- MXM is the Mellanox Messaging Accelerator library utilizing a full
|
||||
range of IB transports to provide the following messaging services
|
||||
|
Загрузка…
x
Ссылка в новой задаче
Block a user