diff --git a/NEWS b/NEWS
index 3031b57b8a..86f0a55850 100644
--- a/NEWS
+++ b/NEWS
@@ -66,6 +66,220 @@ Master (not on release branches yet)
 - Removed embedded VampirTrace. It is in maintenance mode since 2013.
   Please consider Score-P (score-p.org) as an external replacement.
 
+3.0.0 -- July, 2017
+-------------------
+
+Major new features:
+
+- Use UCX allocator for OSHMEM symmetric heap allocations to optimize intra-node
+  data transfers. UCX SPML only.
+- Use UCX multi-threaded API in the UCX PML. Requires UCX 1.0 or later.
+
+Changes in behavior compared to prior versions:
+
+Removed legacy support:
+- "yoda" SPML component is removed from the OpenSHMEM codebase. It was utilizing
+  Open MPI BTL components for supported networks.
+  Currently available SPML components are:
+  - "ikrit" based on Mellanox Messaging Accelerator (MXM) communication library;
+  - "ucx" based on the open-source Unified Communication X (UCX) communication
+    library (http://www.openucx.org/).
+
+Known issues (to be addressed in v3.0.1):
+
+2.1.1 -- April, 2017
+--------------------
+
+Bug fixes/minor improvements:
+
+- Fix a problem with one of Open MPI's fifo data structures which led to
+  hangs in a make check test. Thanks to Nicolas Morey-Chaisemartin for
+  reporting.
+- Add missing MPI_AINT_ADD/MPI_AINT_DIFF function definitions to mpif.h.
+  Thanks to Aboorva Devarajan for reporting.
+- Fix the error return from MPI_WIN_LOCK when rank argument is invalid.
+  Thanks to Jeff Hammond for reporting and fixing this issue.
+- Fix a problem with mpirun/orterun when started under a debugger. Thanks
+  to Gregory Leff for reporting.
+- Add configury option to disable use of CMA by the vader BTL. Thanks
+  to Sascha Hunold for reporting.
+- Add configury check for MPI_DOUBLE_COMPLEX datatype support.
+  Thanks to Alexander Klein for reporting.
+- Fix memory allocated by MPI_WIN_ALLOCATE_SHARED to
+  be 64 bit aligned. Thanks to Joseph Schuchart for
+  reporting.
+- Update MPI_WTICK man page to reflect possibly higher
+  resolution than 10e-6. Thanks to Mark Dixon for
+  reporting.
+- Add missing MPI_T_PVAR_SESSION_NULL definition to mpi.h
+  include file. Thanks to Omri Mor for this contribution.
+- Enhance the Open MPI spec file to install modulefile in /opt
+  if installed in a non-default location. Thanks to Kevin
+  Buckley for reporting and supplying a fix.
+- Fix a problem with conflicting PMI symbols when linking statically.
+  Thanks to Kilian Cavalotti for reporting.
+
+Known issues (to be addressed in v2.1.2):
+
+- See the list of fixes slated for v2.1.2 here:
+  https://github.com/open-mpi/ompi/milestone/28
+
+2.1.0 -- March, 2017
+--------------------
+
+Major new features:
+
+- The main focus of the Open MPI v2.1.0 release was to update to PMIx
+  v1.2.1. When using PMIx (e.g., via mpirun-based launches, or via
+  direct launches with recent versions of popular resource managers),
+  launch time scalability is improved, and the run time memory
+  footprint is greatly decreased when launching large numbers of MPI /
+  OpenSHMEM processes.
+- Update OpenSHMEM API conformance to v1.3.
+- The usnic BTL now supports MPI_THREAD_MULTIPLE.
+- General/overall performance improvements to MPI_THREAD_MULTIPLE.
+- Add a summary message at the bottom of configure that tells you many
+  of the configuration options specified and/or discovered by Open
+  MPI.
+
+Changes in behavior compared to prior versions:
+
+- None.
+
+Removed legacy support:
+
+- The ptmalloc2 hooks have been removed from the Open MPI code base.
+  This is not really a user-noticeable change; it is only mentioned
+  here because there was much rejoicing in the Open MPI developer
+  community.
+
+Bug fixes/minor improvements:
+
+- New MCA parameters:
+  - iof_base_redirect_app_stderr_to_stdout: as its name implies, it
+    combines MPI / OpenSHMEM applications' stderr into its stdout
+    stream.
+  - opal_event_include: allow the user to specify which FD selection
+    mechanism is used by the underlying event engine.
+  - opal_stacktrace_output: indicate where stacktraces should be sent
+    upon MPI / OpenSHMEM process crashes ("none", "stdout", "stderr",
+    "file:filename").
+  - orte_timeout_for_stack_trace: number of seconds to wait for stack
+    traces to be reported (or <=0 to wait forever).
+  - mtl_ofi_control_prog_type/mtl_ofi_data_prog_type: specify libfabric
+    progress model to be used for control and data.
+- Fix MPI_WTICK regression where the time reported may be inaccurate
+  on systems with processor frequency scaling enabled.
+- Fix regression that lowered the memory maximum message bandwidth for
+  large messages on some BTL network transports, such as openib, sm,
+  and vader.
+- Fix a name collision in the shared file pointer MPI IO file locking
+  scheme. Thanks to Nicolas Joly for reporting the issue.
+- Fix datatype extent/offset errors in MPI_PUT and MPI_RACCUMULATE
+  when using the Portals 4 one-sided component.
+- Add support for non-contiguous datatypes to the Portals 4 one-sided
+  component.
+- Various updates for the UCX PML.
+- Updates to the following man pages:
+  - mpirun(1)
+  - MPI_COMM_CONNECT(3)
+  - MPI_WIN_GET_NAME(3). Thanks to Nicolas Joly for reporting the
+    typo.
+  - MPI_INFO_GET_[NKEYS|NTHKEY](3). Thanks to Nicolas Joly for
+    reporting the typo.
+- Fixed a problem in the TCP BTL when using MPI_THREAD_MULTIPLE.
+  Thanks to Evgueni Petrov for reporting.
+- Fixed external32 representation in the romio314 module. Note that
+  for now, external32 representation is not correctly supported by the
+  ompio module. Thanks to Thomas Gastine for bringing this to our
+  attention.
+- Add a note on how to disable the warning message shown when a
+  high-speed MPI transport is not found. Thanks to Susan Schwarz for
+  reporting the issue.
+- Ensure that sending SIGINT when using the rsh/ssh launcher does not
+  orphan children nodes in the launch tree.
+- Fix the help message when showing deprecated MCA param names to show
+  the correct (i.e., deprecated) name.
+- Enable support for the openib BTL to use multiple different
+  InfiniBand subnets.
+- Fix a minor error in MPI_AINT_DIFF.
+- Fix bugs with MPI_IN_PLACE handling in:
+  - MPI_ALLGATHER[V]
+  - MPI_[I][GATHER|SCATTER][V]
+  - MPI_IREDUCE[_SCATTER]
+  - Thanks to all the users who helped diagnose these issues.
+- Allow qrsh to tree spawn (if the back-end system supports it).
+- Fix MPI_T_PVAR_GET_INDEX to return the correct index.
+- Correctly position the shared file pointer in append mode in the
+  OMPIO component.
+- Add some deprecated names into shmem.h for backwards compatibility
+  with legacy codes.
+- Fix MPI_MODE_NOCHECK support.
+- Fix a regression in PowerPC atomics support. Thanks to Orion
+  Poplawski for reporting the issue.
+- Fixes for assembly code with aggressively-optimized compilers on
+  x86_64/AMD64 platforms.
+- Fix one more place where configure was mangling custom CFLAGS.
+  Thanks to Phil Tooley (@Telemin) for reporting the issue.
+- Better handle builds with external installations of hwloc.
+- Fixed a hang with MPI_PUT and MPI_WIN_LOCK_ALL.
+- Fixed a bug when using MPI_GET on non-contiguous datatypes and
+  MPI_LOCK/MPI_UNLOCK.
+- Fixed a bug when using POST/START/COMPLETE/WAIT after a fence.
+- Fix configure portability by cleaning up a few uses of "==" with
+  "test". Thanks to Kevin Buckley for pointing out the issue.
+- Fix bug when using darrays with lib and extent of darray datatypes.
+- Updates to make Open MPI binary builds more bit-for-bit
+  reproducible. Thanks to Alastair McKinstry for the suggestion.
+- Fix issues regarding persistent request handling.
+- Ensure that shmemx.h is a standalone OpenSHMEM header file. Thanks
+  to Nick Park (@nspark) for the report.
+- Ensure that we always send SIGTERM prior to SIGKILL. Thanks to Noel
+  Rycroft for the report.
+- Added ConnectX-5 and Chelsio T6 device defaults for the openib BTL.
+- OpenSHMEM no longer supports MXM less than v2.0.
+- Plug a memory leak in ompi_osc_sm_free. Thanks to Joseph Schuchart
+  for the report.
+- The "self" BTL now uses less memory.
+- The vader BTL is now more efficient in terms of memory usage when
+  using XPMEM.
+- Removed the --enable-openib-failover configure option. This is not
+  considered backwards-incompatible because this option was stale and
+  had long-since stopped working, anyway.
+- Allow jobs launched under Cray aprun to use hyperthreads if
+  opal_hwloc_base_hwthreads_as_cpus MCA parameter is set.
+- Add support for 32-bit and floating point Cray Aries atomic
+  operations.
+- Add support for network AMOs for MPI_ACCUMULATE, MPI_FETCH_AND_OP,
+  and MPI_COMPARE_AND_SWAP if the "ompi_single_intrinsic" info key is
+  set on the window or the "acc_single_intrinsic" MCA param is set.
+- Automatically disqualify RDMA CM support in the openib BTL if
+  MPI_THREAD_MULTIPLE is used.
+- Make configure smarter/better about auto-detecting Linux CMA
+  support.
+- Improve the scalability of MPI_COMM_SPLIT_TYPE.
+- Fix the mixing of C99 and C++ header files with the MPI C++
+  bindings. Thanks to Alastair McKinstry for the bug report.
+- Add support for ARM v8.
+- Several MCA parameters now directly support MPI_T enumerator
+  semantics (i.e., they accept a limited set of values -- e.g., MCA
+  parameters that accept boolean values).
+- Added --with-libmpi-name=STRING configure option for vendor releases
+  of Open MPI. See the README for more detail.
+- Fix a problem with Open MPI's internal memory checker. Thanks to Yvan
+  Fournier for reporting.
+- Fix a multi-threaded issue with MPI_WAIT. Thanks to Pascal Deveze for
+  reporting.
+
+Known issues (to be addressed in v2.1.1):
+
+- See the list of fixes slated for v2.1.1 here:
+  https://github.com/open-mpi/ompi/milestone/26
+
+2.0.3 -- June 2017
+------------------
+
+Bug fixes/minor improvements:
 
 2.0.2 -- 26 January 2017
 -------------------------