48e12cf766
[skip ci] Signed-off-by: Howard Pritchard <howardp@lanl.gov>
4156 строки
196 KiB
Plaintext
4156 строки
196 KiB
Plaintext
Copyright (c) 2004-2010 The Trustees of Indiana University and Indiana
|
||
University Research and Technology
|
||
Corporation. All rights reserved.
|
||
Copyright (c) 2004-2006 The University of Tennessee and The University
|
||
of Tennessee Research Foundation. All rights
|
||
reserved.
|
||
Copyright (c) 2004-2008 High Performance Computing Center Stuttgart,
|
||
University of Stuttgart. All rights reserved.
|
||
Copyright (c) 2004-2006 The Regents of the University of California.
|
||
All rights reserved.
|
||
Copyright (c) 2006-2018 Cisco Systems, Inc. All rights reserved.
|
||
Copyright (c) 2006 Voltaire, Inc. All rights reserved.
|
||
Copyright (c) 2006 Sun Microsystems, Inc. All rights reserved.
|
||
Use is subject to license terms.
|
||
Copyright (c) 2006-2018 Los Alamos National Security, LLC. All rights
|
||
reserved.
|
||
Copyright (c) 2010-2018 IBM Corporation. All rights reserved.
|
||
Copyright (c) 2012 Oak Ridge National Labs. All rights reserved.
|
||
Copyright (c) 2012 Sandia National Laboratories. All rights reserved.
|
||
Copyright (c) 2012 University of Houston. All rights reserved.
|
||
Copyright (c) 2013 NVIDIA Corporation. All rights reserved.
|
||
Copyright (c) 2013-2018 Intel, Inc. All rights reserved.
|
||
Copyright (c) 2018 Amazon.com, Inc. or its affiliates. All Rights
|
||
reserved.
|
||
$COPYRIGHT$
|
||
|
||
Additional copyrights may follow
|
||
|
||
$HEADER$
|
||
|
||
===========================================================================
|
||
|
||
This file contains the main features as well as overviews of specific
|
||
bug fixes (and other actions) for each version of Open MPI since
|
||
version 1.0.
|
||
|
||
As more fully described in the "Software Version Number" section in
|
||
the README file, Open MPI typically releases two separate version
|
||
series simultaneously. Since these series have different goals and
|
||
are semi-independent of each other, a single NEWS-worthy item may be
|
||
introduced into different series at different times. For example,
|
||
feature F was introduced in the vA.B series at version vA.B.C, and was
|
||
later introduced into the vX.Y series at vX.Y.Z.
|
||
|
||
The first time feature F is released, the item will be listed in the
|
||
vA.B.C section, denoted as:
|
||
|
||
(** also to appear: X.Y.Z) -- indicating that this item is also
|
||
likely to be included in future release
|
||
version vX.Y.Z.
|
||
|
||
When vX.Y.Z is later released, the same NEWS-worthy item will also be
|
||
included in the vX.Y.Z section and be denoted as:
|
||
|
||
(** also appeared: A.B.C) -- indicating that this item was previously
|
||
included in release version vA.B.C.
|
||
|
||
4.0.0 -- September, 2018
|
||
------------------------
|
||
|
||
- OSHMEM updated to the OpenSHMEM 1.4 API.
|
||
- Do not build OpenSHMEM layer when there are no SPMLs available.
|
||
Currently, this means the OpenSHMEM layer will only build if
|
||
a MXM or UCX library is found.
|
||
- A UCX BTL was added for enhanced MPI RMA support using UCX
|
||
- With this release, OpenIB BTL now only supports iWarp and RoCE by default.
|
||
- Updated internal HWLOC to 2.0.2
|
||
- Updated internal PMIx to 3.0.2
|
||
- Change the priority for selecting external verses internal HWLOC
|
||
and PMIx packages to build. Starting with this release, configure
|
||
by default selects available external HWLOC and PMIx packages over
|
||
the internal ones.
|
||
- Updated internal ROMIO to 3.2.1.
|
||
- Removed support for the MXM MTL.
|
||
- Removed support for SCIF.
|
||
- Improved CUDA support when using UCX.
|
||
- Enable use of CUDA allocated buffers for OMPIO.
|
||
- Improved support for two phase MPI I/O operations when using OMPIO.
|
||
- Added support for Software-based Performance Counters, see
|
||
https://github.com/davideberius/ompi/wiki/How-to-Use-Software-Based-Performance-Counters-(SPCs)-in-Open-MPI
|
||
- Change MTL OFI from opting-IN on "psm,psm2,gni" to opting-OUT on
|
||
"shm,sockets,tcp,udp,rstream"
|
||
- Various improvements to MPI RMA performance when using RDMA
|
||
capable interconnects.
|
||
- Update memkind component to use the memkind 1.6 public API.
|
||
- Fix a problem with javadoc builds using OpenJDK 11. Thanks to
|
||
Siegmar Gross for reporting.
|
||
- Fix a memory leak using UCX. Thanks to Charles Taylor for reporting.
|
||
- Fix hangs in MPI_FINALIZE when using UCX.
|
||
- Fix a problem with building Open MPI using an external PMIx 2.1.2
|
||
library. Thanks to Marcin Krotkiewski for reporting.
|
||
- Fix race conditions in Vader (shared memory) transport.
|
||
- Fix problems with use of newer map-by mpirun options. Thanks to
|
||
Tony Reina for reporting.
|
||
- Fix rank-by algorithms to properly rank by object and span
|
||
- Allow for running as root of two environment variables are set.
|
||
Requested by Axel Huebl.
|
||
- Fix a problem with building the Java bindings when using Java 10.
|
||
Thanks to Bryce Glover for reporting.
|
||
- Fix a problem with ORTE not reporting error messages if an application
|
||
terminated normally but exited with non-zero error code. Thanks to
|
||
Emre Brookes for reporting.
|
||
|
||
3.1.2 -- August, 2018
|
||
------------------------
|
||
|
||
- A subtle race condition bug was discovered in the "vader" BTL
|
||
(shared memory communications) that, in rare instances, can cause
|
||
MPI processes to crash or incorrectly classify (or effectively drop)
|
||
an MPI message sent via shared memory. If you are using the "ob1"
|
||
PML with "vader" for shared memory communication (note that vader is
|
||
the default for shared memory communication with ob1), you need to
|
||
upgrade to v3.1.2 or later to fix this issue. You may also upgrade
|
||
to the following versions to fix this issue:
|
||
- Open MPI v2.1.5 (expected end of August, 2018) or later in the
|
||
v2.1.x series
|
||
- Open MPI v3.0.1 (released March, 2018) or later in the v3.0.x
|
||
series
|
||
- Assorted Portals 4.0 bug fixes.
|
||
- Fix for possible data corruption in MPI_BSEND.
|
||
- Move shared memory file for vader btl into /dev/shm on Linux.
|
||
- Fix for MPI_ISCATTER/MPI_ISCATTERV Fortran interfaces with MPI_IN_PLACE.
|
||
- Upgrade PMIx to v2.1.3.
|
||
- Numerous One-sided bug fixes.
|
||
- Fix for race condition in uGNI BTL.
|
||
- Improve handling of large number of interfaces with TCP BTL.
|
||
- Numerous UCX bug fixes.
|
||
|
||
3.1.1 -- June, 2018
|
||
-------------------
|
||
|
||
- Fix potential hang in UCX PML during MPI_FINALIZE
|
||
- Update internal PMIx to v2.1.2rc2 to fix forward version compatibility.
|
||
- Add new MCA parameter osc_sm_backing_store to allow users to specify
|
||
where in the filesystem the backing file for the shared memory
|
||
one-sided component should live. Defaults to /dev/shm on Linux.
|
||
- Fix potential hang on non-x86 platforms when using builds with
|
||
optimization flags turned off.
|
||
- Disable osc/pt2pt when using MPI_THREAD_MULTIPLE due to numerous
|
||
race conditions in the component.
|
||
- Fix dummy variable names for the mpi and mpi_f08 Fortran bindings to
|
||
match the MPI standard. This may break applications which use
|
||
name-based parameters in Fortran which used our internal names
|
||
rather than those documented in the MPI standard.
|
||
- Revamp Java detection to properly handle new Java versions which do
|
||
not provide a javah wrapper.
|
||
- Fix RMA function signatures for use-mpi-f08 bindings to have the
|
||
asynchonous property on all buffers.
|
||
- Improved configure logic for finding the UCX library.
|
||
|
||
3.1.0 -- May, 2018
|
||
------------------
|
||
|
||
- Various OpenSHMEM bug fixes.
|
||
- Properly handle array_of_commands argument to Fortran version of
|
||
MPI_COMM_SPAWN_MULTIPLE.
|
||
- Fix bug with MODE_SEQUENTIAL and the sharedfp MPI-IO component.
|
||
- Use "javac -h" instead of "javah" when building the Java bindings
|
||
with a recent version of Java.
|
||
- Fix mis-handling of jostepid under SLURM that could cause problems
|
||
with PathScale/OmniPath NICs.
|
||
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
still not a supported platform, but it is no longer automatically
|
||
disabled. See
|
||
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
for more information.
|
||
- The output-filename option for mpirun is now converted to an
|
||
absolute path before being passed to other nodes.
|
||
- Add monitoring component for PML, OSC, and COLL to track data
|
||
movement of MPI applications. See
|
||
ompi/mca/commmon/monitoring/HowTo_pml_monitoring.tex for more
|
||
information about the monitoring framework.
|
||
- Add support for communicator assertions: mpi_assert_no_any_tag,
|
||
mpi_assert_no_any_source, mpi_assert_exact_length, and
|
||
mpi_assert_allow_overtaking.
|
||
- Update PMIx to version 2.1.1.
|
||
- Update hwloc to 1.11.7.
|
||
- Many one-sided behavior fixes.
|
||
- Improved performance for Reduce and Allreduce using Rabenseifner's algorithm.
|
||
- Revamped mpirun --help output to make it a bit more manageable.
|
||
- Portals4 MTL improvements: Fix race condition in rendezvous protocol and
|
||
retry logic.
|
||
- UCX OSC: initial implementation.
|
||
- UCX PML improvements: add multi-threading support.
|
||
- Yalla PML improvements: Fix error with irregular contiguous datatypes.
|
||
- Openib BTL: disable XRC support by default.
|
||
- TCP BTL: Add check to detect and ignore connections from processes
|
||
that aren't MPI (such as IDS probes) and verify that source and
|
||
destination are using the same version of Open MPI, fix issue with very
|
||
large message transfer.
|
||
- ompi_info parsable output now escapes double quotes in values, and
|
||
also quotes values can contains colons. Thanks to Lev Givon for the
|
||
suggestion.
|
||
- CUDA-aware support can now handle GPUs within a node that do not
|
||
support CUDA IPC. Earlier versions would get error and abort.
|
||
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
MPI processes on the same node where mpirun is executing using a separate
|
||
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
- Remove LoadLeveler RAS support.
|
||
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
- Add functionality for IBM s390 platforms. Note that regular
|
||
regression testing does not occur on the s390 and it is not
|
||
considered a supported platform.
|
||
- Remove support for big endian PowerPC.
|
||
- Remove support for XL compilers older than v13.1.
|
||
- Remove support for atomic operations using MacOS atomics library.
|
||
|
||
3.0.2 -- June, 2018
|
||
-------------------
|
||
|
||
- Disable osc/pt2pt when using MPI_THREAD_MULTIPLE due to numerous
|
||
race conditions in the component.
|
||
- Fix dummy variable names for the mpi and mpi_f08 Fortran bindings to
|
||
match the MPI standard. This may break applications which use
|
||
name-based parameters in Fortran which used our internal names
|
||
rather than those documented in the MPI standard.
|
||
- Fixed MPI_SIZEOF in the "mpi" Fortran module for the NAG compiler.
|
||
- Fix RMA function signatures for use-mpi-f08 bindings to have the
|
||
asynchonous property on all buffers.
|
||
- Fix Fortran MPI_COMM_SPAWN_MULTIPLE to properly follow the count
|
||
length argument when parsing the array_of_commands variable.
|
||
- Revamp Java detection to properly handle new Java versions which do
|
||
not provide a javah wrapper.
|
||
- Improved configure logic for finding the UCX library.
|
||
- Add support for HDR InfiniBand link speeds.
|
||
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
still not a supported platform, but it is no longer automatically
|
||
disabled. See
|
||
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
for more information.
|
||
|
||
3.0.1 -- March, 2018
|
||
----------------------
|
||
|
||
- Fix ability to attach parallel debuggers to MPI processes.
|
||
- Fix a number of issues in MPI I/O found by the HDF5 test suite.
|
||
- Fix (extremely) large message transfers with shared memory.
|
||
- Fix out of sequence bug in multi-NIC configurations.
|
||
- Fix stdin redirection bug that could result in lost input.
|
||
- Disable the LSF launcher if CSM is detected.
|
||
- Plug a memory leak in MPI_Mem_free(). Thanks to Philip Blakely for reporting.
|
||
- Fix the tree spawn operation when the number of nodes is larger than the radix.
|
||
Thanks to Carlos Eduardo de Andrade for reporting.
|
||
- Fix Fortran 2008 macro in MPI extensions. Thanks to Nathan T. Weeks for
|
||
reporting.
|
||
- Add UCX to list of interfaces that OpenSHMEM will use by default.
|
||
- Add --{enable|disable}-show-load-errors-by-default to control
|
||
default behavior of the load errors option.
|
||
- OFI MTL improvements: handle empty completion queues properly, fix
|
||
incorrect error message around fi_getinfo(), use default progress
|
||
option for provider by default, Add support for reading multiple
|
||
CQ events in ofi_progress.
|
||
- PSM2 MTL improvements: Allow use of GPU buffers, thread fixes.
|
||
- Numerous corrections to memchecker behavior.
|
||
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
MPI processes on the same node where mpirun is executing using a separate
|
||
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
- Fix a problem reported on the mailing separately by Kevin McGrattan and Stephen
|
||
Guzik about consistency issues on NFS file systems when using OMPIO. This fix
|
||
also introduces a new mca parameter fs_ufs_lock_algorithm which allows to
|
||
control the locking algorithm used by ompio for read/write operations. By
|
||
default, ompio does not perfom locking on local UNIX file systems, locks the
|
||
entire file per operation on NFS file systems, and selective byte-range
|
||
locking on other distributed file systems.
|
||
- Add an mca parameter pmix_server_usock_connections to allow mpirun to
|
||
support applications statically built against the Open MPI v2.x release,
|
||
or installed in a container along with the Open MPI v2.x libraries. It is
|
||
set to false by default.
|
||
|
||
3.0.0 -- September, 2017
|
||
------------------------
|
||
|
||
Major new features:
|
||
|
||
- Use UCX allocator for OSHMEM symmetric heap allocations to optimize intra-node
|
||
data transfers. UCX SPML only.
|
||
- Use UCX multi-threaded API in the UCX PML. Requires UCX 1.0 or later.
|
||
- Added support for Flux PMI
|
||
- Update embedded PMIx to version 2.1.0
|
||
- Update embedded hwloc to version 1.11.7
|
||
|
||
Changes in behavior compared to prior versions:
|
||
|
||
- Per Open MPI's versioning scheme (see the README), increasing the
|
||
major version number to 3 indicates that this version is not
|
||
ABI-compatible with prior versions of Open MPI. In addition, there may
|
||
be differences in MCA parameter names and defaults from previous releases.
|
||
Command line options for mpirun and other commands may also differ from
|
||
previous versions. You will need to recompile MPI and OpenSHMEM applications
|
||
to work with this version of Open MPI.
|
||
- With this release, Open MPI supports MPI_THREAD_MULTIPLE by default.
|
||
- New configure options have been added to specify the locations of libnl
|
||
and zlib.
|
||
- A new configure option has been added to request Flux PMI support.
|
||
- The help menu for mpirun and related commands is now context based.
|
||
"mpirun --help compatibility" generates the help menu in the same format
|
||
as previous releases.
|
||
|
||
Removed legacy support:
|
||
- AIX is no longer supported.
|
||
- Loadlever is no longer supported.
|
||
- OpenSHMEM currently supports the UCX and MXM transports via the ucx and ikrit
|
||
SPMLs respectively.
|
||
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
- Remove support for big endian PowerPC.
|
||
- Remove support for XL compilers older than v13.1
|
||
|
||
Known issues:
|
||
|
||
- MPI_Connect/accept between applications started by different mpirun
|
||
commands will fail, even if ompi-server is running.
|
||
|
||
2.1.5 -- August 2018
|
||
--------------------
|
||
|
||
- A subtle race condition bug was discovered in the "vader" BTL
|
||
(shared memory communications) that, in rare instances, can cause
|
||
MPI processes to crash or incorrectly classify (or effectively drop)
|
||
an MPI message sent via shared memory. If you are using the "ob1"
|
||
PML with "vader" for shared memory communication (note that vader is
|
||
the default for shared memory communication with ob1), you need to
|
||
upgrade to v2.1.5 to fix this issue. You may also upgrade to the
|
||
following versions to fix this issue:
|
||
- Open MPI v3.0.1 (released March, 2018) or later in the v3.0.x
|
||
series
|
||
- Open MPI v3.1.2 (expected end of August, 2018) or later
|
||
- A link issue was fixed when the UCX library was not located in the
|
||
linker-default search paths.
|
||
|
||
2.1.4 -- August, 2018
|
||
---------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
- Disable the POWER 7/BE block in configure. Note that POWER 7/BE is
|
||
still not a supported platform, but it is no longer automatically
|
||
disabled. See
|
||
https://github.com/open-mpi/ompi/issues/4349#issuecomment-374970982
|
||
for more information.
|
||
- Fix bug with request-based one-sided MPI operations when using the
|
||
"rdma" component.
|
||
- Fix issue with large data structure in the TCP BTL causing problems
|
||
in some environments. Thanks to @lgarithm for reporting the issue.
|
||
- Minor Cygwin build fixes.
|
||
- Minor fixes for the openib BTL:
|
||
- Support for the QLogic RoCE HCA
|
||
- Support for the Boradcom Cumulus RoCE HCA
|
||
- Enable support for HDR link speeds
|
||
- Fix MPI_FINALIZED hang if invoked from an attribute destructor
|
||
during the MPI_COMM_SELF destruction in MPI_FINALIZE. Thanks to
|
||
@AndrewGaspar for reporting the issue.
|
||
- Java fixes:
|
||
- Modernize Java framework detection, especially on OS X/MacOS.
|
||
Thanks to Bryce Glover for reporting and submitting the fixes.
|
||
- Prefer "javac -h" to "javah" to support newer Java frameworks.
|
||
- Fortran fixes:
|
||
- Use conformant dummy parameter names for Fortran bindings. Thanks
|
||
to Themos Tsikas for reporting and submitting the fixes.
|
||
- Build the MPI_SIZEOF() interfaces in the "TKR"-style "mpi" module
|
||
whenever possible. Thanks to Themos Tsikas for reporting the
|
||
issue.
|
||
- Fix array of argv handling for the Fortran bindings of
|
||
MPI_COMM_SPAWN_MULTIPLE (and its associated man page).
|
||
- Make NAG Fortran compiler support more robust in configure.
|
||
- Disable the "pt2pt" one-sided MPI component when MPI_THREAD_MULTIPLE
|
||
is used. This component is simply not safe in MPI_THREAD_MULTIPLE
|
||
scenarios, and will not be fixed in the v2.1.x series.
|
||
- Make the "external" hwloc component fail gracefully if it is tries
|
||
to use an hwloc v2.x.y installation. hwloc v2.x.y will not be
|
||
supported in the Open MPI v2.1.x series.
|
||
- Fix "vader" shared memory support for messages larger than 2GB.
|
||
Thanks to Heiko Bauke for the bug report.
|
||
- Configure fixes for external PMI directory detection. Thanks to
|
||
Davide Vanzo for the report.
|
||
|
||
2.1.3 -- March, 2018
|
||
--------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
- Update internal PMIx version to 1.2.5.
|
||
- Fix a problem with ompi_info reporting using param option.
|
||
Thanks to Alexander Pozdneev for reporting.
|
||
- Correct PMPI_Aint_{add|diff} to be functions (not subroutines)
|
||
in the Fortran mpi_f08 module.
|
||
- Fix a problem when doing MPI I/O using data types with large
|
||
extents in conjunction with MPI_TYPE_CREATE_SUBARRAY. Thanks to
|
||
Christopher Brady for reporting.
|
||
- Fix a problem when opening many files using MPI_FILE_OPEN.
|
||
Thanks to William Dawson for reporting.
|
||
- Fix a problem with debuggers failing to attach to a running job.
|
||
Thanks to Dirk Schubert for reporting.
|
||
- Fix a problem when using madvise and the OpenIB BTL. Thanks to
|
||
Timo Bingmann for reporting.
|
||
- Fix a problem in the Vader BTL that resulted in failures of
|
||
IMB under certain circumstances. Thanks to Nicolas Morey-
|
||
Chaisemartin for reporting.
|
||
- Fix a problem preventing Open MPI from working under Cygwin.
|
||
Thanks to Marco Atzeri for reporting.
|
||
- Reduce some verbosity being emitted by the USNIC BTL under certain
|
||
circumstances. Thanks to Peter Forai for reporting.
|
||
- Fix a problem with misdirection of SIGKILL. Thanks to Michael Fern
|
||
for reporting.
|
||
- Replace use of posix_memalign with malloc for small allocations. Thanks
|
||
to Ben Menaude for reporting.
|
||
- Fix a problem with Open MPI's out of band TCP network for file descriptors
|
||
greater than 32767. Thanks to Wojtek Wasko for reporting and fixing.
|
||
- Plug a memory leak in MPI_Mem_free(). Thanks to Philip Blakely for reporting.
|
||
|
||
2.1.2 -- September, 2017
|
||
------------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
- Update internal PMIx version to 1.2.3.
|
||
- Fix some problems when using the NAG Fortran compiler to build Open MPI
|
||
and when using the compiler wrappers. Thanks to Neil Carlson for reporting.
|
||
- Fix a compilation problem with the SM BTL. Thanks to Paul Hargrove for
|
||
reporting.
|
||
- Fix a problem with MPI_IALLTOALLW when using zero-length messages.
|
||
Thanks to Dahai Guo for reporting.
|
||
- Fix a problem with C11 generic type interface for SHMEM_G. Thanks
|
||
to Nick Park for reporting.
|
||
- Switch to using the lustreapi.h include file when building Open MPI
|
||
with Lustre support.
|
||
- Fix a problem in the OB1 PML that led to hangs with OSU collective tests.
|
||
- Fix a progression issue with MPI_WIN_FLUSH_LOCAL. Thanks to
|
||
Joseph Schuchart for reporting.
|
||
- Fix an issue with recent versions of PBSPro requiring libcrypto.
|
||
Thanks to Petr Hanousek for reporting.
|
||
- Fix a problem when using MPI_ANY_SOURCE with MPI_SENDRECV.
|
||
- Fix an issue that prevented signals from being propagated to ORTE
|
||
daemons.
|
||
- Ensure that signals are forwarded from ORTE daemons to all processes
|
||
in the process group created by the daemons. Thanks to Ted Sussman
|
||
for reporting.
|
||
- Fix a problem with launching a job under a debugger. Thanks to
|
||
Greg Lee for reporting.
|
||
- Fix a problem with Open MPI native I/O MPI_FILE_OPEN when using
|
||
a communicator having an associated topology. Thanks to
|
||
Wei-keng Liao for reporting.
|
||
- Fix an issue when using MPI_ACCUMULATE with derived datatypes.
|
||
- Fix a problem with Fortran bindings that led to compilation errors
|
||
for user defined reduction operations. Thanks to Nathan Weeks for
|
||
reporting.
|
||
- Fix ROMIO issues with large writes/reads when using NFS file systems.
|
||
- Fix definition of Fortran MPI_ARGV_NULL and MPI_ARGVS_NULL.
|
||
- Enable use of the head node of a SLURM allocation on Cray XC systems.
|
||
- Fix a problem with synchronous sends when using the UCX PML.
|
||
- Use default socket buffer size to improve TCP BTL performance.
|
||
- Add a mca parameter ras_base_launch_orted_on_hn to allow for launching
|
||
MPI processes on the same node where mpirun is executing using a separate
|
||
orte daemon, rather than the mpirun process. This may be useful to set to
|
||
true when using SLURM, as it improves interoperability with SLURM's signal
|
||
propagation tools. By default it is set to false, except for Cray XC systems.
|
||
- Fix --without-lsf when lsf is installed in the default search path.
|
||
- Remove support for big endian PowerPC.
|
||
- Remove support for XL compilers older than v13.1
|
||
- Remove IB XRC support from the OpenIB BTL due to loss of maintainer.
|
||
|
||
2.1.1 -- April, 2017
|
||
--------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
|
||
- Fix a problem with one of Open MPI's fifo data structures which led to
|
||
hangs in a make check test. Thanks to Nicolas Morey-Chaisemartin for
|
||
reporting.
|
||
- Add missing MPI_AINT_ADD/MPI_AINT_DIFF function definitions to mpif.h.
|
||
Thanks to Aboorva Devarajan for reporting.
|
||
- Fix the error return from MPI_WIN_LOCK when rank argument is invalid.
|
||
Thanks to Jeff Hammond for reporting and fixing this issue.
|
||
- Fix a problem with mpirun/orterun when started under a debugger. Thanks
|
||
to Gregory Leff for reporting.
|
||
- Add configury option to disable use of CMA by the vader BTL. Thanks
|
||
to Sascha Hunold for reporting.
|
||
- Add configury check for MPI_DOUBLE_COMPLEX datatype support.
|
||
Thanks to Alexander Klein for reporting.
|
||
- Fix memory allocated by MPI_WIN_ALLOCATE_SHARED to
|
||
be 64 bit aligned. Thanks to Joseph Schuchart for
|
||
reporting.
|
||
- Update MPI_WTICK man page to reflect possibly higher
|
||
resolution than 10e-6. Thanks to Mark Dixon for
|
||
reporting
|
||
- Add missing MPI_T_PVAR_SESSION_NULL definition to mpi.h
|
||
include file. Thanks to Omri Mor for this contribution.
|
||
- Enhance the Open MPI spec file to install modulefile in /opt
|
||
if installed in a non-default location. Thanks to Kevin
|
||
Buckley for reporting and supplying a fix.
|
||
- Fix a problem with conflicting PMI symbols when linking statically.
|
||
Thanks to Kilian Cavalotti for reporting.
|
||
|
||
Known issues (to be addressed in v2.1.2):
|
||
|
||
- See the list of fixes slated for v2.1.2 here:
|
||
https://github.com/open-mpi/ompi/milestone/28
|
||
|
||
2.1.0 -- March, 2017
|
||
--------------------
|
||
|
||
Major new features:
|
||
|
||
- The main focus of the Open MPI v2.1.0 release was to update to PMIx
|
||
v1.2.1. When using PMIx (e.g., via mpirun-based launches, or via
|
||
direct launches with recent versions of popular resource managers),
|
||
launch time scalability is improved, and the run time memory
|
||
footprint is greatly decreased when launching large numbers of MPI /
|
||
OpenSHMEM processes.
|
||
- Update OpenSHMEM API conformance to v1.3.
|
||
- The usnic BTL now supports MPI_THREAD_MULTIPLE.
|
||
- General/overall performance improvements to MPI_THREAD_MULTIPLE.
|
||
- Add a summary message at the bottom of configure that tells you many
|
||
of the configuration options specified and/or discovered by Open
|
||
MPI.
|
||
|
||
Changes in behavior compared to prior versions:
|
||
|
||
- None.
|
||
|
||
Removed legacy support:
|
||
|
||
- The ptmalloc2 hooks have been removed from the Open MPI code base.
|
||
This is not really a user-noticable change; it is only mentioned
|
||
here because there was much rejoycing in the Open MPI developer
|
||
community.
|
||
|
||
Bug fixes/minor improvements:
|
||
|
||
- New MCA parameters:
|
||
- iof_base_redirect_app_stderr_to_stdout: as its name implies, it
|
||
combines MPI / OpenSHMEM applications' stderr into its stdout
|
||
stream.
|
||
- opal_event_include: allow the user to specify which FD selection
|
||
mechanism is used by the underlying event engine.
|
||
- opal_stacktrace_output: indicate where stacktraces should be sent
|
||
upon MPI / OpenSHMEM process crashes ("none", "stdout", "stderr",
|
||
"file:filename").
|
||
- orte_timeout_for_stack_trace: number of seconds to wait for stack
|
||
traces to be reported (or <=0 to wait forever).
|
||
- mtl_ofi_control_prog_type/mtl_ofi_data_prog_type: specify libfabric
|
||
progress model to be used for control and data.
|
||
- Fix MPI_WTICK regression where the time reported may be inaccurate
|
||
on systems with processor frequency scalaing enabled.
|
||
- Fix regression that lowered the memory maximum message bandwidth for
|
||
large messages on some BTL network transports, such as openib, sm,
|
||
and vader.
|
||
- Fix a name collision in the shared file pointer MPI IO file locking
|
||
scheme. Thanks to Nicolas Joly for reporting the issue.
|
||
- Fix datatype extent/offset errors in MPI_PUT and MPI_RACCUMULATE
|
||
when using the Portals 4 one-sided component.
|
||
- Add support for non-contiguous datatypes to the Portals 4 one-sided
|
||
component.
|
||
- Various updates for the UCX PML.
|
||
- Updates to the following man pages:
|
||
- mpirun(1)
|
||
- MPI_COMM_CONNECT(3)
|
||
- MPI_WIN_GET_NAME(3). Thanks to Nicolas Joly for reporting the
|
||
typo.
|
||
- MPI_INFO_GET_[NKEYS|NTHKEY](3). Thanks to Nicolas Joly for
|
||
reporting the typo.
|
||
- Fixed a problem in the TCP BTL when using MPI_THREAD_MULTIPLE.
|
||
Thanks to Evgueni Petrov for reporting.
|
||
- Fixed external32 representation in the romio314 module. Note that
|
||
for now, external32 representation is not correctly supported by the
|
||
ompio module. Thanks to Thomas Gastine for bringing this to our
|
||
attention.
|
||
- Add note how to disable a warning message about when a high-speed
|
||
MPI transport is not found. Thanks to Susan Schwarz for reporting
|
||
the issue.
|
||
- Ensure that sending SIGINT when using the rsh/ssh launcher does not
|
||
orphan children nodes in the launch tree.
|
||
- Fix the help message when showing deprecated MCA param names to show
|
||
the correct (i.e., deprecated) name.
|
||
- Enable support for the openib BTL to use multiple different
|
||
InfiniBand subnets.
|
||
- Fix a minor error in MPI_AINT_DIFF.
|
||
- Fix bugs with MPI_IN_PLACE handling in:
|
||
- MPI_ALLGATHER[V]
|
||
- MPI_[I][GATHER|SCATTER][V]
|
||
- MPI_IREDUCE[_SCATTER]
|
||
- Thanks to all the users who helped diagnose these issues.
|
||
- Allow qrsh to tree spawn (if the back-end system supports it).
|
||
- Fix MPI_T_PVAR_GET_INDEX to return the correct index.
|
||
- Correctly position the shared file pointer in append mode in the
|
||
OMPIO component.
|
||
- Add some deprecated names into shmem.h for backwards compatibility
|
||
with legacy codes.
|
||
- Fix MPI_MODE_NOCHECK support.
|
||
- Fix a regression in PowerPC atomics support. Thanks to Orion
|
||
Poplawski for reporting the issue.
|
||
- Fixes for assembly code with aggressively-optimized compilers on
|
||
x86_64/AMD64 platforms.
|
||
- Fix one more place where configure was mangling custom CFLAGS.
|
||
Thanks to Phil Tooley (@Telemin) for reporting the issue.
|
||
- Better handle builds with external installations of hwloc.
|
||
- Fixed a hang with MPI_PUT and MPI_WIN_LOCK_ALL.
|
||
- Fixed a bug when using MPI_GET on non-contiguous datatypes and
|
||
MPI_LOCK/MPI_UNLOCK.
|
||
- Fixed a bug when using POST/START/COMPLETE/WAIT after a fence.
|
||
- Fix configure portability by cleaning up a few uses of "==" with
|
||
"test". Thanks to Kevin Buckley for pointing out the issue.
|
||
- Fix bug when using darrays with lib and extent of darray datatypes.
|
||
- Updates to make Open MPI binary builds more bit-for-bit
|
||
reproducable. Thanks to Alastair McKinstry for the suggestion.
|
||
- Fix issues regarding persistent request handling.
|
||
- Ensure that shmemx.h is a standalone OpenSHMEM header file. Thanks
|
||
to Nick Park (@nspark) for the report.
|
||
- Ensure that we always send SIGTERM prior to SIGKILL. Thanks to Noel
|
||
Rycroft for the report.
|
||
- Added ConnectX-5 and Chelsio T6 device defaults for the openib BTL.
|
||
- OpenSHMEM no longer supports MXM less than v2.0.
|
||
- Plug a memory leak in ompi_osc_sm_free. Thanks to Joseph Schuchart
|
||
for the report.
|
||
- The "self" BTL now uses less memory.
|
||
- The vader BTL is now more efficient in terms of memory usage when
|
||
using XPMEM.
|
||
- Removed the --enable-openib-failover configure option. This is not
|
||
considered backwards-incompatible because this option was stale and
|
||
had long-since stopped working, anyway.
|
||
- Allow jobs launched under Cray aprun to use hyperthreads if
|
||
opal_hwloc_base_hwthreads_as_cpus MCA parameter is set.
|
||
- Add support for 32-bit and floating point Cray Aries atomic
|
||
operations.
|
||
- Add support for network AMOs for MPI_ACCUMULATE, MPI_FETCH_AND_OP,
|
||
and MPI_COMPARE_AND_SWAP if the "ompi_single_intrinsic" info key is
|
||
set on the window or the "acc_single_intrinsic" MCA param is set.
|
||
- Automatically disqualify RDMA CM support in the openib BTL if
|
||
MPI_THREAD_MULTIPLE is used.
|
||
- Make configure smarter/better about auto-detecting Linux CMA
|
||
support.
|
||
- Improve the scalability of MPI_COMM_SPLIT_TYPE.
|
||
- Fix the mixing of C99 and C++ header files with the MPI C++
|
||
bindings. Thanks to Alastair McKinstry for the bug report.
|
||
- Add support for ARM v8.
|
||
- Several MCA parameters now directly support MPI_T enumerator
|
||
semantics (i.e., they accept a limited set of values -- e.g., MCA
|
||
parameters that accept boolean values).
|
||
- Added --with-libmpi-name=STRING configure option for vendor releases
|
||
of Open MPI. See the README for more detail.
|
||
- Fix a problem with Open MPI's internal memory checker. Thanks to Yvan
|
||
Fournier for reporting.
|
||
- Fix a multi-threaded issue with MPI_WAIT. Thanks to Pascal Deveze for
|
||
reporting.
|
||
|
||
Known issues (to be addressed in v2.1.1):
|
||
|
||
- See the list of fixes slated for v2.1.1 here:
|
||
https://github.com/open-mpi/ompi/milestone/26
|
||
|
||
2.0.4 -- November, 2017
|
||
-----------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
- Fix an issue with visibility of functions defined in the built-in PMIx.
|
||
Thanks to Siegmar Gross for reporting this issue.
|
||
- Add configure check to prevent trying to build this release of
|
||
Open MPI with an external hwloc 2.0 or newer release.
|
||
- Add ability to specify layered providers for OFI MTL.
|
||
- Fix a correctness issue with Open MPI's memory manager code
|
||
that could result in corrupted message data. Thanks to
|
||
Valentin Petrov for reporting.
|
||
- Fix issues encountered when using newer versions of PBS Pro.
|
||
Thanks to Petr Hanousek for reporting.
|
||
- Fix a problem with MPI_GET when using the vader BTL. Thanks
|
||
to Dahai Guo for reporting.
|
||
- Fix a problem when using MPI_ANY_SOURCE with MPI_SENDRECV_REPLACE.
|
||
Thanks to Dahai Guo for reporting.
|
||
- Fix a problem using MPI_FILE_OPEN with a communicator with an
|
||
attached cartesian topology. Thanks to Wei-keng Liao for reporting.
|
||
- Remove IB XRC support from the OpenIB BTL due to lack of support.
|
||
- Remove support for big endian PowerPC.
|
||
- Remove support for XL compilers older than v13.1
|
||
|
||
2.0.3 -- June 2017
|
||
------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
|
||
- Fix a problem with MPI_IALLTOALLW when zero size messages are present.
|
||
Thanks to @mathbird for reporting.
|
||
- Add missing MPI_USER_FUNCTION definition to the mpi_f08 module.
|
||
Thanks to Nathan Weeks for reporting this issue.
|
||
- Fix a problem with MPI_WIN_LOCK not returning an error code when
|
||
a negative rank is supplied. Thanks to Jeff Hammond for reporting and
|
||
providing a fix.
|
||
- Fix a problem with make check that could lead to hangs. Thanks to
|
||
Nicolas Morey-Chaisemartin for reporting.
|
||
- Resolve a symbol conflict problem with PMI-1 and PMI-2 PMIx components.
|
||
Thanks to Kilian Cavalotti for reporting this issue.
|
||
- Insure that memory allocations returned from MPI_WIN_ALLOCATE_SHARED are
|
||
64 byte aligned. Thanks to Joseph Schuchart for reporting this issue.
|
||
- Make use of DOUBLE_COMPLEX, if available, for Fortran bindings. Thanks
|
||
to Alexander Klein for reporting this issue.
|
||
- Add missing MPI_T_PVAR_SESSION_NULL definition to Open MPI mpi.h include
|
||
file. Thanks to Omri Mor for reporting and fixing.
|
||
- Fix a problem with use of MPI shared file pointers when accessing
|
||
a file from independent jobs. Thanks to Nicolas Joly for reporting
|
||
this issue.
|
||
- Optimize zero size MPI_IALLTOALL{V,W} with MPI_IN_PLACE. Thanks to
|
||
Lisandro Dalcin for the report.
|
||
- Fix a ROMIO buffer overflow problem for large transfers when using NFS
|
||
filesystems.
|
||
- Fix type of MPI_ARGV[S]_NULL which prevented it from being used
|
||
properly with MPI_COMM_SPAWN[_MULTIPLE] in the mpi_f08 module.
|
||
- Ensure to add proper linker flags to the wrapper compilers for
|
||
dynamic libraries on platforms that need it (e.g., RHEL 7.3 and
|
||
later).
|
||
- Get better performance on TCP-based networks 10Gbps and higher by
|
||
using OS defaults for buffer sizing.
|
||
- Fix a bug with MPI_[R][GET_]ACCUMULATE when using DARRAY datatypes.
|
||
- Fix handling of --with-lustre configure command line argument.
|
||
Thanks to Prentice Bisbal and Tim Mattox for reporting the issue.
|
||
- Added MPI_AINT_ADD and MPI_AINT_DIFF declarations to mpif.h. Thanks
|
||
to Aboorva Devarajan (@AboorvaDevarajan) for the bug report.
|
||
- Fix a problem in the TCP BTL when Open MPI is initialized with
|
||
MPI_THREAD_MULTIPLE support. Thanks to Evgueni Petro for analyzing and
|
||
reporting this issue.
|
||
- Fix yalla PML to properly handle underflow errors, and fixed a
|
||
memory leak with blocking non-contiguous sends.
|
||
- Restored ability to run autogen.pl on official distribution tarballs
|
||
(although this is still not recommended for most users!).
|
||
- Fix accuracy problems with MPI_WTIME on some systems by always using
|
||
either clock_gettime(3) or gettimeofday(3).
|
||
- Fix a problem where MPI_WTICK was not returning a higher time resolution
|
||
when available. Thanks to Mark Dixon for reporting this issue.
|
||
- Restore SGE functionality. Thanks to Kevin Buckley for the initial
|
||
report.
|
||
- Fix external hwloc compilation issues, and extend support to allow
|
||
using external hwloc installations as far back as v1.5.0. Thanks to
|
||
Orion Poplawski for raising the issue.
|
||
- Added latest Mellanox Connect-X and Chelsio T-6 adapter part IDs to
|
||
the openib list of default values.
|
||
- Do a better job of cleaning up session directories (e.g., in /tmp).
|
||
- Update a help message to indicate how to suppress a warning about
|
||
no high performance networks being detected by Open MPI. Thanks to
|
||
Susan Schwarz for reporting this issue.
|
||
- Fix a problem with mangling of custom CFLAGS when configuring Open MPI.
|
||
Thanks to Phil Tooley for reporting.
|
||
- Fix some minor memory leaks and remove some unused variables.
|
||
Thanks to Joshua Gerrard for reporting.
|
||
- Fix MPI_ALLGATHERV bug with MPI_IN_PLACE.
|
||
|
||
Known issues (to be addressed in v2.0.4):
|
||
|
||
- See the list of fixes slated for v2.0.4 here:
|
||
https://github.com/open-mpi/ompi/milestone/29
|
||
|
||
2.0.2 -- 26 January 2017
|
||
-------------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
|
||
- Fix a problem with MPI_FILE_WRITE_SHARED when using MPI_MODE_APPEND and
|
||
Open MPI's native MPI-IO implementation. Thanks to Nicolas Joly for
|
||
reporting.
|
||
- Fix a typo in the MPI_WIN_GET_NAME man page. Thanks to Nicolas Joly
|
||
for reporting.
|
||
- Fix a race condition with ORTE's session directory setup. Thanks to
|
||
@tbj900 for reporting this issue.
|
||
- Fix a deadlock issue arising from Open MPI's approach to catching calls to
|
||
munmap. Thanks to Paul Hargrove for reporting and helping to analyze this
|
||
problem.
|
||
- Fix a problem with PPC atomics which caused make check to fail unless builtin
|
||
atomics configure option was enabled. Thanks to Orion Poplawski for reporting.
|
||
- Fix a problem with use of x86_64 cpuid instruction which led to segmentation
|
||
faults when Open MPI was configured with -O3 optimization. Thanks to Mark
|
||
Santcroos for reporting this problem.
|
||
- Fix a problem when using built in atomics configure options on PPC platforms
|
||
when building 32 bit applications. Thanks to Paul Hargrove for reporting.
|
||
- Fix a problem with building Open MPI against an external hwloc installation.
|
||
Thanks to Orion Poplawski for reporting this issue.
|
||
- Remove use of DATE in the message queue version string reported to debuggers to
|
||
insure bit-wise reproducibility of binaries. Thanks to Alastair McKinstry
|
||
for help in fixing this problem.
|
||
- Fix a problem with early exit of a MPI process without calling MPI_FINALIZE
|
||
or MPI_ABORT that could lead to job hangs. Thanks to Christof Koehler for
|
||
reporting.
|
||
- Fix a problem with forwarding of SIGTERM signal from mpirun to MPI processes
|
||
in a job. Thanks to Noel Rycroft for reporting this problem
|
||
- Plug some memory leaks in MPI_WIN_FREE discovered using Valgrind. Thanks
|
||
to Joseph Schuchart for reporting.
|
||
- Fix a problems MPI_NEIGHOR_ALLTOALL when using a communicator with an empty topology
|
||
graph. Thanks to Daniel Ibanez for reporting.
|
||
- Fix a typo in a PMIx component help file. Thanks to @njoly for reporting this.
|
||
- Fix a problem with Valgrind false positives when using Open MPI's internal memchecker.
|
||
Thanks to Yvan Fournier for reporting.
|
||
- Fix a problem with MPI_FILE_DELETE returning MPI_SUCCESS when
|
||
deleting a non-existent file. Thanks to Wei-keng Liao for reporting.
|
||
- Fix a problem with MPI_IMPROBE that could lead to hangs in subsequent MPI
|
||
point to point or collective calls. Thanks to Chris Pattison for reporting.
|
||
- Fix a problem when configure Open MPI for powerpc with --enable-mpi-cxx
|
||
enabled. Thanks to Alastair McKinstry for reporting.
|
||
- Fix a problem using MPI_IALLTOALL with MPI_IN_PLACE argument. Thanks to
|
||
Chris Ward for reporting.
|
||
- Fix a problem using MPI_RACCUMULATE with the Portals4 transport. Thanks to
|
||
@PDeveze for reporting.
|
||
- Fix an issue with static linking and duplicate symbols arising from PMIx
|
||
Slurm components. Thanks to Limin Gu for reporting.
|
||
- Fix a problem when using MPI dynamic memory windows. Thanks to
|
||
Christoph Niethammer for reporting.
|
||
- Fix a problem with Open MPI's pkgconfig files. Thanks to Alastair McKinstry
|
||
for reporting.
|
||
- Fix a problem with MPI_IREDUCE when the same buffer is supplied for the
|
||
send and recv buffer arguments. Thanks to Valentin Petrov for reporting.
|
||
- Fix a problem with atomic operations on PowerPC. Thanks to Paul
|
||
Hargrove for reporting.
|
||
|
||
Known issues (to be addressed in v2.0.3):
|
||
|
||
- See the list of fixes slated for v2.0.3 here:
|
||
https://github.com/open-mpi/ompi/milestone/23
|
||
|
||
2.0.1 -- 2 September 2016
|
||
-----------------------
|
||
|
||
Bug fixes/minor improvements:
|
||
|
||
- Short message latency and message rate performance improvements for
|
||
all transports.
|
||
- Fix shared memory performance when using RDMA-capable networks.
|
||
Thanks to Tetsuya Mishima and Christoph Niethammer for reporting.
|
||
- Fix bandwith performance degredation in the yalla (MXM) PML. Thanks
|
||
to Andreas Kempf for reporting the issue.
|
||
- Fix OpenSHMEM crash when running on non-Mellanox MXM-based networks.
|
||
Thanks to Debendra Das for reporting the issue.
|
||
- Fix a crash occuring after repeated calls to MPI_FILE_SET_VIEW with
|
||
predefined datatypes. Thanks to Eric Chamberland and Matthew
|
||
Knepley for reporting and helping chase down this issue.
|
||
- Fix stdin propagation to MPI processes. Thanks to Jingchao Zhang
|
||
for reporting the issue.
|
||
- Fix various runtime and portability issues by updating the PMIx
|
||
internal component to v1.1.5.
|
||
- Fix process startup failures on Intel MIC platforms due to very
|
||
large entries in /proc/mounts.
|
||
- Fix a problem with use of relative path for specifing executables to
|
||
mpirun/oshrun. Thanks to David Schneider for reporting.
|
||
- Various improvements when running over portals-based networks.
|
||
- Fix thread-based race conditions with GNI-based networks.
|
||
- Fix a problem with MPI_FILE_CLOSE and MPI_FILE_SET_SIZE. Thanks
|
||
to Cihan Altinay for reporting.
|
||
- Remove all use of rand(3) from within Open MPI so as not to perturb
|
||
applications use of it. Thanks to Matias Cabral and Noel Rycroft
|
||
for reporting.
|
||
- Fix crash in MPI_COMM_SPAWN.
|
||
- Fix types for MPI_UNWEIGHTED and MPI_WEIGHTS_EMPTY. Thanks to
|
||
Lisandro Dalcin for reporting.
|
||
- Correctly report the name of MPI_INTEGER16.
|
||
- Add some missing MPI constants to the Fortran bindings.
|
||
- Fixed compile error when configuring Open MPI with --enable-timing.
|
||
- Correctly set the shared library version of libompitrace.so. Thanks
|
||
to Alastair McKinstry for reporting.
|
||
- Fix errors in the MPI_RPUT, MPI_RGET, MPI_RACCUMULATE, and
|
||
MPI_RGET_ACCUMULATE Fortran bindings. Thanks to Alfio Lazzaro and
|
||
Joost VandeVondele for tracking this down.
|
||
- Fix problems with use of derived datatypes in non-blocking
|
||
collectives. Thanks to Yuki Matsumoto for reporting.
|
||
- Fix problems with OpenSHMEM header files when using CMake. Thanks to
|
||
Paul Kapinos for reporting the issue.
|
||
- Fix problem with use use of non-zero lower bound datatypes in
|
||
collectives. Thanks to Hristo Iliev for reporting.
|
||
- Fix a problem with memory allocation within MPI_GROUP_INTERSECTION.
|
||
Thanks to Lisandro Dalcin for reporting.
|
||
- Fix an issue with MPI_ALLGATHER for communicators that don't consist
|
||
of two ranks. Thanks to David Love for reporting.
|
||
- Various fixes for collectives when used with esoteric MPI datatypes.
|
||
- Fixed corner cases of handling DARRAY and HINDEXED_BLOCK datatypes.
|
||
- Fix a problem with filesystem type check for OpenBSD.
|
||
Thanks to Paul Hargrove for reporting.
|
||
- Fix some debug input within Open MPI internal functions. Thanks to
|
||
Durga Choudhury for reporting.
|
||
- Fix a typo in a configury help message. Thanks to Paul Hargrove for
|
||
reporting.
|
||
- Correctly support MPI_IN_PLACE in MPI_[I]ALLTOALL[V|W] and
|
||
MPI_[I]EXSCAN.
|
||
- Fix alignment issues on SPARC platforms.
|
||
|
||
Known issues (to be addressed in v2.0.2):
|
||
|
||
- See the list of fixes slated for v2.0.2 here:
|
||
https://github.com/open-mpi/ompi/milestone/20, and
|
||
https://github.com/open-mpi/ompi-release/milestone/19
|
||
(note that the "ompi-release" Github repo will be folded/absorbed
|
||
into the "ompi" Github repo at some point in the future)
|
||
|
||
|
||
2.0.0 -- 12 July 2016
|
||
---------------------
|
||
|
||
**********************************************************************
|
||
* Open MPI is now fully MPI-3.1 compliant
|
||
**********************************************************************
|
||
|
||
Major new features:
|
||
|
||
- Many enhancements to MPI RMA. Open MPI now maps MPI RMA operations
|
||
on to native RMA operations for those networks which support this
|
||
capability.
|
||
- Greatly improved support for MPI_THREAD_MULTIPLE (when configured
|
||
with --enable-mpi-thread-multiple).
|
||
- Enhancements to reduce the memory footprint for jobs at scale. A
|
||
new MCA parameter, "mpi_add_procs_cutoff", is available to set the
|
||
threshold for using this feature.
|
||
- Completely revamped support for memory registration hooks when using
|
||
OS-bypass network transports.
|
||
- Significant OMPIO performance improvements and many bug fixes.
|
||
- Add support for PMIx - Process Management Interface for Exascale.
|
||
Version 1.1.2 of PMIx is included internally in this release.
|
||
- Add support for PLFS file systems in Open MPI I/O.
|
||
- Add support for UCX transport.
|
||
- Simplify build process for Cray XC systems. Add support for
|
||
using native SLURM.
|
||
- Add a --tune mpirun command line option to simplify setting many
|
||
environment variables and MCA parameters.
|
||
- Add a new MCA parameter "orte_default_dash_host" to offer an analogue
|
||
to the existing "orte_default_hostfile" MCA parameter.
|
||
- Add the ability to specify the number of desired slots in the mpirun
|
||
--host option.
|
||
|
||
Changes in behavior compared to prior versions:
|
||
|
||
- In environments where mpirun cannot automatically determine the
|
||
number of slots available (e.g., when using a hostfile that does not
|
||
specify "slots", or when using --host without specifying a ":N"
|
||
suffix to hostnames), mpirun now requires the use of "-np N" to
|
||
specify how many MPI processes to launch.
|
||
- The MPI C++ bindings -- which were removed from the MPI standard in
|
||
v3.0 -- are no longer built by default and will be removed in some
|
||
future version of Open MPI. Use the --enable-mpi-cxx-bindings
|
||
configure option to build the deprecated/removed MPI C++ bindings.
|
||
- ompi_info now shows all components, even if they do not have MCA
|
||
parameters. The prettyprint output now separates groups with a
|
||
dashed line.
|
||
- OMPIO is now the default implementation of parallel I/O, with the
|
||
exception for Lustre parallel filesystems (where ROMIO is still the
|
||
default). The default selection of OMPI vs. ROMIO can be controlled
|
||
via the "--mca io ompi|romio" command line switch to mpirun.
|
||
- Per Open MPI's versioning scheme (see the README), increasing the
|
||
major version number to 2 indicates that this version is not
|
||
ABI-compatible with prior versions of Open MPI. You will need to
|
||
recompile MPI and OpenSHMEM applications to work with this version
|
||
of Open MPI.
|
||
- Removed checkpoint/restart code due to loss of maintainer. :-(
|
||
- Change the behavior for handling certain signals when using PSM and
|
||
PSM2 libraries. Previously, the PSM and PSM2 libraries would trap
|
||
certain signals in order to generate tracebacks. The mechanism was
|
||
found to cause issues with Open MPI's own error reporting mechanism.
|
||
If not already set, Open MPI now sets the IPATH_NO_BACKTRACE and
|
||
HFI_NO_BACKTRACE environment variables to disable PSM/PSM2's
|
||
handling these signals.
|
||
|
||
Removed legacy support:
|
||
|
||
- Removed support for OS X Leopard.
|
||
- Removed support for Cray XT systems.
|
||
- Removed VampirTrace.
|
||
- Removed support for Myrinet/MX.
|
||
- Removed legacy collective module:ML.
|
||
- Removed support for Alpha processors.
|
||
- Removed --enable-mpi-profiling configure option.
|
||
|
||
Known issues (to be addressed in v2.0.1):
|
||
|
||
- See the list of fixes slated for v2.0.1 here:
|
||
https://github.com/open-mpi/ompi/milestone/16, and
|
||
https://github.com/open-mpi/ompi-release/milestone/16
|
||
(note that the "ompi-release" Github repo will be folded/absorbed
|
||
into the "ompi" Github repo at some point in the future)
|
||
|
||
- ompi-release#986: Fix data size counter for large ops with fcoll/static
|
||
- ompi-release#987: Fix OMPIO performance on Lustre
|
||
- ompi-release#1013: Fix potential inconsistency in btl/openib default settings
|
||
- ompi-release#1014: Do not return MPI_ERR_PENDING from collectives
|
||
- ompi-release#1056: Remove dead profile code from oshmem
|
||
- ompi-release#1081: Fix MPI_IN_PLACE checking for IALLTOALL{V|W}
|
||
- ompi-release#1081: Fix memchecker in MPI_IALLTOALLW
|
||
- ompi-release#1081: Support MPI_IN_PLACE in MPI_(I)ALLTOALLW and MPI_(I)EXSCAN
|
||
- ompi-release#1107: Allow future PMIx support for RM spawn limits
|
||
- ompi-release#1108: Fix sparse group process reference counting
|
||
- ompi-release#1109: If specified to be oversubcribed, disable binding
|
||
- ompi-release#1122: Allow NULL arrays for empty datatypes
|
||
- ompi-release#1123: Fix signed vs. unsigned compiler warnings
|
||
- ompi-release#1123: Make max hostname length uniform across code base
|
||
- ompi-release#1127: Fix MPI_Compare_and_swap
|
||
- ompi-release#1127: Fix MPI_Win_lock when used with MPI_Win_fence
|
||
- ompi-release#1132: Fix typo in help message for --enable-mca-no-build
|
||
- ompi-release#1154: Ensure pairwise coll algorithms disqualify themselves properly
|
||
- ompi-release#1165: Fix typos in debugging/verbose message output
|
||
- ompi-release#1178: Fix ROMIO filesystem check on OpenBSD 5.7
|
||
- ompi-release#1197: Fix Fortran pthread configure check
|
||
- ompi-release#1205: Allow using external PMIx 1.1.4 and 2.0
|
||
- ompi-release#1215: Fix configure to support the NAG Fortran compiler
|
||
- ompi-release#1220: Fix combiner args for MPI_HINDEXED_BLOCK
|
||
- ompi-release#1225: Fix combiner args for MPI_DARRAY
|
||
- ompi-release#1226: Disable old memory hooks with recent gcc versions
|
||
- ompi-release#1231: Fix new "patcher" support for some XLC platforms
|
||
- ompi-release#1244: Fix Java error handling
|
||
- ompi-release#1250: Ensure TCP is not selected for RDMA operations
|
||
- ompi-release#1252: Fix verbose output in coll selection
|
||
- ompi-release#1253: Set a default name for user-defined MPI_Op
|
||
- ompi-release#1254: Add count==0 checks in some non-blocking colls
|
||
- ompi-release#1258: Fix "make distclean" when using external pmix/hwloc/libevent
|
||
- ompi-release#1260: Clean up/uniform mca/coll/base memory management
|
||
- ompi-release#1261: Remove "patcher" warning message for static builds
|
||
- ompi-release#1263: Fix IO MPI_Request for 0-size read/write
|
||
- ompi-release#1264: Add blocking fence for SLURM operations
|
||
|
||
Bug fixes / minor enhancements:
|
||
|
||
- Updated internal/embedded copies of third-party software:
|
||
- Update the internal copy of ROMIO to that which shipped in MPICH
|
||
3.1.4.
|
||
- Update internal copy of libevent to v2.0.22.
|
||
- Update internal copy of hwloc to v1.11.2.
|
||
- Notable new MCA parameters:
|
||
- opal_progress_lp_call_ration: Control how often low-priority
|
||
callbacks are made during Open MPI's main progress loop.
|
||
- opal_common_verbs_want_fork_support: This replaces the
|
||
btl_openib_want_fork_support parameter.
|
||
- Add --with-platform-patches-dir configure option.
|
||
- Add --with-pmi-libdir configure option for environments that install
|
||
PMI libs in a non-default location.
|
||
- Various configure-related compatibility updates for newer versions
|
||
of libibverbs and OFED.
|
||
- Numerous fixes/improvements to orte-dvm. Special thanks to Mark
|
||
Santcroos for his help.
|
||
- Fix a problem with timer code on ia32 platforms. Thanks to
|
||
Paul Hargrove for reporting this and providing a patch.
|
||
- Fix a problem with use of a 64 bit atomic counter. Thanks to
|
||
Paul Hargrove for reporting.
|
||
- Fix a problem with singleton job launching. Thanks to Lisandro
|
||
Dalcin for reporting.
|
||
- Fix a problem with use of MPI_UNDEFINED with MPI_COMM_SPLIT_TYPE.
|
||
Thanks to Lisandro Dalcin for reporting.
|
||
- Silence a compiler warning in PSM MTL. Thanks to Adrian Reber for
|
||
reporting this.
|
||
- Properly detect Intel TrueScale and OmniPath devices in the ACTIVE
|
||
state. Thanks to Durga Choudhury for reporting the issue.
|
||
- Fix detection and use of Solaris Studio 12.5 (beta) compilers.
|
||
Thanks to Paul Hargrove for reporting and debugging.
|
||
- Fix various small memory leaks.
|
||
- Allow NULL arrays when creating empty MPI datatypes.
|
||
- Replace use of alloca with malloc for certain datatype creation
|
||
functions. Thanks to Bogdan Sataric for reporting this.
|
||
- Fix use of MPI_LB and MPI_UB in creation of of certain MPI datatypes.
|
||
Thanks to Gus Correa for helping to fix this.
|
||
- Implement a workaround for a GNU Libtool problem. Thanks to Eric
|
||
Schnetter for reporting and fixing.
|
||
- Improve hcoll library detection in configure. Thanks to David
|
||
Shrader and Ake Sandgren for reporting this.
|
||
- Miscellaneous minor bug fixes in the hcoll component.
|
||
- Miscellaneous minor bug fixes in the ugni component.
|
||
- Fix problems with XRC detection in OFED 3.12 and older releases.
|
||
Thanks to Paul Hargrove for his analysis of this problem.
|
||
- Update (non-standard/experimental) Java MPI interfaces to support
|
||
MPI-3.1 functionality.
|
||
- Fix an issue with MCA parameters for Java bindings. Thanks to
|
||
Takahiro Kawashima and Siegmar Gross for reporting this issue.
|
||
- Fix a problem when using persistent requests in the Java bindings.
|
||
Thanks to Nate Chambers for reporting.
|
||
- Fix problem with Java bindings on OX X 10.11. Thanks to Alexander
|
||
Daryin for reporting this issue.
|
||
- Fix a performance problem for large messages for Cray XC systems.
|
||
Thanks to Jerome Vienne for reporting this.
|
||
- Fix an issue with MPI_WIN_LOCK_ALL. Thanks to Thomas Jahns for
|
||
reporting.
|
||
- Fix an issue with passing a parameter to configure multiple times.
|
||
Thanks to QuesarVII for reporting and supplying a fix.
|
||
- Add support for ALPS resource allocation system on Cray CLE 5.2 and
|
||
later. Thanks to Mark Santcroos.
|
||
- Corrections to the HACKING file. Thanks to Maximilien Levesque.
|
||
- Fix an issue with user supplied reduction operator functions.
|
||
Thanks to Rupert Nash for reporting this.
|
||
- Fix an issue with an internal list management function. Thanks to
|
||
Adrian Reber for reporting this.
|
||
- Fix a problem with MPI-RMA PSCW epochs. Thanks to Berk Hess for
|
||
reporting this.
|
||
- Fix a problem in neighborhood collectives. Thanks to Lisandro
|
||
Dalcin for reporting.
|
||
- Fix MPI_IREDUCE_SCATTER_BLOCK for a one-process communicator. Thanks
|
||
to Lisandro Dalcin for reporting.
|
||
- Add (Open MPI-specific) additional flavors to MPI_COMM_SPLIT_TYPE.
|
||
See MPI_Comm_split_type(3) for details. Thanks to Nick Andersen for
|
||
supplying this enhancement.
|
||
- Improve closing of file descriptors during the job launch phase.
|
||
Thanks to Piotr Lesnicki for reporting and providing this
|
||
enhancement.
|
||
- Fix a problem in MPI_GET_ACCUMULATE and MPI_RGET_ACCUMULATE when
|
||
using Portals4. Thanks to Nicolas Chevalier for reporting.
|
||
- Use correct include file for lstat prototype in ROMIO. Thanks to
|
||
William Throwe for finding and providing a fix.
|
||
- Add missing Fortran bindings for MPI_WIN_ALLOCATE. Thanks to Christoph
|
||
Niethammer for reporting and fixing.
|
||
- Fortran related fixes to handle Intel 2016 compiler. Thanks to
|
||
Fabrice Roy for reporting this.
|
||
- Fix a Fortran linkage issue. Thanks to Macro Atzeri for finding and
|
||
suggesting a fix.
|
||
- Fix problem with using BIND(C) for Fortran bindings with logical
|
||
parameters. Thanks to Paul Romano for reporting.
|
||
- Fix an issue with use of DL-related macros in opal library. Thanks to
|
||
Scott Atchley for finding this.
|
||
- Fix an issue with parsing mpirun command line options which contain
|
||
colons. Thanks to Lev Given for reporting.
|
||
- Fix a problem with Open MPI's package configury files. Thanks to
|
||
Christoph Junghans for reporting.
|
||
- Fix a typo in the MPI_INTERCOMM_MERGE man page. Thanks To Harald
|
||
Servat for reporting and correcting.
|
||
- Update man pages for non-blocking sends per MPI 3.1 standard.
|
||
Thanks to Alexander Pozdneev for reporting.
|
||
- Fix problem when compiling against PVFS2. Thanks to Dave Love for
|
||
reporting.
|
||
- Fix problems with MPI_NEIGHBOR_ALLTOALL{V,W}. Thanks to Willem
|
||
Vermin for reporting this issue.
|
||
- Fix various compilation problems on Cygwin. Thanks to Marco Atzeri
|
||
for supplying these fixes.
|
||
- Fix problem with resizing of subarray and darray data types. Thanks
|
||
to Keith Bennett and Dan Garmann for reporting.
|
||
- Fix a problem with MPI_COMBINER_RESIZED. Thanks to James Ramsey for
|
||
the report.
|
||
- Fix an hwloc binding issue. Thanks to Ben Menadue for reporting.
|
||
- Fix a problem with the shared memory (sm) BTL. Thanks to Peter Wind
|
||
for the report.
|
||
- Fixes for heterogeneous support. Thanks to Siegmar Gross for reporting.
|
||
- Fix a problem with memchecker. Thanks to Clinton Simpson for reporting.
|
||
- Fix a problem with MPI_UNWEIGHTED in topology functions. Thanks to
|
||
Jun Kudo for reporting.
|
||
- Fix problem with a MCA parameter base filesystem types. Thanks to
|
||
Siegmar Gross for reporting.
|
||
- Fix a problem with some windows info argument types. Thanks to
|
||
Alastair McKinstry for reporting.
|
||
|
||
|
||
1.10.7 - 16 May 2017
|
||
------
|
||
- Fix bug in TCP BTL that impacted performance on 10GbE (and faster)
|
||
networks by not adjusting the TCP send/recv buffer sizes and using
|
||
system default values
|
||
- Add missing MPI_AINT_ADD and MPI_AINT_DIFF function delcarations in
|
||
mpif.h
|
||
- Fixed time reported by MPI_WTIME; it was previously reported as
|
||
dependent upon the CPU frequency.
|
||
- Fix platform detection on FreeBSD
|
||
- Fix a bug in the handling of MPI_TYPE_CREATE_DARRAY in
|
||
MPI_(R)(GET_)ACCUMULATE
|
||
- Fix openib memory registration limit calculation
|
||
- Add missing MPI_T_PVAR_SESSION_NULL in mpi.h
|
||
- Fix "make distcheck" when using external hwloc and/or libevent packages
|
||
- Add latest ConnectX-5 vendor part id to OpenIB device params
|
||
- Fix race condition in the UCX PML
|
||
- Fix signal handling for rsh launcher
|
||
- Fix Fortran compilation errors by removing MPI_SIZEOF in the Fortran
|
||
interfaces when the compiler does not support it
|
||
- Fixes for the pre-ignore-TKR "mpi" Fortran module implementation
|
||
(i.e., for older Fortran compilers -- these problems did not exist
|
||
in the "mpi" module implementation for modern Fortran compilers):
|
||
- Add PMPI_* interfaces
|
||
- Fix typo in MPI_FILE_WRITE_AT_ALL_BEGIN interface name
|
||
- Fix typo in MPI_FILE_READ_ORDERED_BEGIN interface name
|
||
- Fixed the type of MPI_DISPLACEMENT_CURRENT in all Fortran interfaces
|
||
to be an INTEGER(KIND=MPI_OFFSET_KIND).
|
||
- Fixed typos in MPI_INFO_GET_* man pages. Thanks to Nicolas Joly for
|
||
the patch
|
||
- Fix typo bugs in wrapper compiler script
|
||
|
||
|
||
1.10.6 - 17 Feb 2017
|
||
------
|
||
- Fix bug in timer code that caused problems at optimization settings
|
||
greater than 2
|
||
- OSHMEM: make mmap allocator the default instead of sysv or verbs
|
||
- Support MPI_Dims_create with dimension zero
|
||
- Update USNIC support
|
||
- Prevent 64-bit overflow on timer counter
|
||
- Add support for forwarding signals
|
||
- Fix bug that caused truncated messages on large sends over TCP BTL
|
||
- Fix potential infinite loop when printing a stacktrace
|
||
|
||
|
||
1.10.5 - 19 Dec 2016
|
||
------
|
||
- Update UCX APIs
|
||
- Fix bug in darray that caused MPI/IO failures
|
||
- Use a MPI_Get_library_version() like string to tag the debugger DLL.
|
||
Thanks to Alastair McKinstry for the report
|
||
- Fix multi-threaded race condition in coll/libnbc
|
||
- Several fixes to OSHMEM
|
||
- Fix bug in UCX support due to uninitialized field
|
||
- Fix MPI_Ialltoallv with MPI_IN_PLACE and without MPI param check
|
||
- Correctly reset receive request type before init. Thanks Chris Pattison
|
||
for the report and test case.
|
||
- Fix bug in iallgather[v]
|
||
- Fix concurrency issue with MPI_Comm_accept. Thanks to Pieter Noordhuis
|
||
for the patch
|
||
- Fix ompi_coll_base_{gather,scatter}_intra_binomial
|
||
- Fixed an issue with MPI_Type_get_extent returning the wrong extent
|
||
for distributed array datatypes.
|
||
- Re-enable use of rtdtsc instruction as a monotonic clock source if
|
||
the processor has a core-invariant tsc. This is a partial fix for a
|
||
performance regression introduced in Open MPI v1.10.3.
|
||
|
||
|
||
1.10.4 - 01 Sept 2016
|
||
------
|
||
|
||
- Fix assembler support for MIPS
|
||
- Improve memory handling for temp buffers in collectives
|
||
- Fix [all]reduce with non-zero lower bound datatypes
|
||
Thanks Hristo Iliev for the report
|
||
- Fix non-standard ddt handling. Thanks Yuki Matsumoto for the report
|
||
- Various libnbc fixes. Thanks Yuki Matsumoto for the report
|
||
- Fix typos in request RMA bindings for Fortran. Thanks to @alazzaro
|
||
and @vondele for the assist
|
||
- Various bug fixes and enhancements to collective support
|
||
- Fix predefined types mapping in hcoll
|
||
- Revive the coll/sync component to resolve unexpected message issues
|
||
during tight loops across collectives
|
||
- Fix typo in wrapper compiler for Fortran static builds
|
||
|
||
|
||
1.10.3 - 15 June 2016
|
||
------
|
||
|
||
- Fix zero-length datatypes. Thanks to Wei-keng Liao for reporting
|
||
the issue.
|
||
- Minor manpage cleanups
|
||
- Implement atomic support in OSHMEM/UCX
|
||
- Fix support of MPI_COMBINER_RESIZED. Thanks to James Ramsey
|
||
for the report
|
||
- Fix computation of #cpus when --use-hwthread-cpus is used
|
||
- Add entry points for Allgatherv, iAllgatherv, Reduce, and iReduce
|
||
for the HCOLL library
|
||
- Fix an HCOLL integration bug that could signal completion of request
|
||
while still being worked
|
||
- Fix computation of cores when SMT is enabled. Thanks to Ben Menadue
|
||
for the report
|
||
- Various USNIC fixes
|
||
- Create a datafile in the per-proc directory in order to make it
|
||
unique per communicator. Thanks to Peter Wind for the report
|
||
- Fix zero-size malloc in one-sided pt-to-pt code. Thanks to Lisandro
|
||
Dalcin for the report
|
||
- Fix MPI_Get_address when passed MPI_BOTTOM to not return an error.
|
||
Thanks to Lisandro Dalcin for the report
|
||
- Fix MPI_TYPE_SET_ATTR with NULL value. Thanks to Lisandro Dalcin for
|
||
the report
|
||
- Fix various Fortran08 binding issues
|
||
- Fix memchecker no-data case. Thanks to Clinton Stimpson for the report
|
||
- Fix CUDA support under OS-X
|
||
- Fix various OFI/MTL integration issues
|
||
- Add MPI_T man pages
|
||
- Fix one-sided pt-to-pt issue by preventing communication from happening
|
||
before a target enters a fence, even in the no-precede case
|
||
- Fix a bug that disabled Totalview for MPMD use-case
|
||
- Correctly support MPI_UNWEIGHTED in topo-graph-neighbors. Thanks to
|
||
Jun Kudo for the report
|
||
- Fix singleton operations under SLURM when PMI2 is enabled
|
||
- Do not use MPI_IN_PLACE in neighborhood collectives for non-blocking
|
||
collectives (libnbc). Thanks to Jun Kudo for the report
|
||
- Silence autogen deprecation warnings for newer versions of Perl
|
||
- Do not return MPI_ERR_PENDING from collectives
|
||
- Use type int* for MPI_WIN_DISP_UNIT, MPI_WIN_CREATE_FLAVOR, and MPI_WIN_MODEL.
|
||
Thanks to Alastair McKinstry for the report
|
||
- Fix register_datarep stub function in IO/OMPIO. Thanks to Eric
|
||
Chamberland for the report
|
||
- Fix a bus error on MPI_WIN_[POST,START] in the shared memory one-sided component
|
||
- Add several missing MPI_WIN_FLAVOR constants to the Fortran support
|
||
- Enable connecting processes from different subnets using the openib BTL
|
||
- Fix bug in basic/barrier algorithm in OSHMEM
|
||
- Correct process binding for the --map-by node case
|
||
- Include support for subnet-to-subnet routing over InfiniBand networks
|
||
- Fix usnic resource check
|
||
- AUTHORS: Fix an errant reference to Subversion IDs
|
||
- Fix affinity for MPMD jobs running under LSF
|
||
- Fix many Fortran binding bugs
|
||
- Fix `MPI_IN_PLACE`-related bugs
|
||
- Fix PSM/PSM2 support for singleton operations
|
||
- Ensure MPI transports continue to progress during RTE barriers
|
||
- Update HWLOC to 1.9.1 end-of-series
|
||
- Fix a bug in the Java command line parser when the
|
||
-Djava.library.path options was given by the user
|
||
- Update the MTL/OFI provider selection behavior
|
||
- Add support for clock_gettime on Linux.
|
||
- Correctly detect and configure for Solaris Studio 12.5
|
||
beta compilers
|
||
- Correctly compute #slots when -host is used for MPMD case
|
||
- Fix a bug in the hcoll collectives due to an uninitialized field
|
||
- Do not set a binding policy when oversubscribing a node
|
||
- Fix hang in intercommunicator operations when oversubscribed
|
||
- Speed up process termination during MPI_Abort
|
||
- Disable backtrace support by default in the PSM/PSM2 libraries to
|
||
prevent unintentional conflicting behavior.
|
||
|
||
|
||
|
||
1.10.2: 26 Jan 2016
|
||
-------------------
|
||
|
||
**********************************************************************
|
||
* OSHMEM is now 1.2 compliant
|
||
**********************************************************************
|
||
|
||
- Fix NBC_Copy for legitimate zero-size messages
|
||
- Fix multiple bugs in OSHMEM
|
||
- Correctly handle mpirun --host <user>@<ip-address>
|
||
- Centralize two MCA params to avoid duplication between OMPI and
|
||
OSHMEM layers: opal_abort_delay and opal_abort_print_stack
|
||
- Add support for Fujitsu compilers
|
||
- Add UCX support for OMPI and OSHMEM
|
||
- Correctly handle oversubscription when not given directives
|
||
to permit it. Thanks to @ammore1 for reporting it
|
||
- Fix rpm spec file to not include the /usr directory
|
||
- Add Intel HFI1 default parameters for the openib BTL
|
||
- Resolve symbol conflicts in the PSM2 library
|
||
- Add ability to empty the rgpusm cache when full if requested
|
||
- Fix another libtool bug when -L requires a space between it
|
||
and the path. Thanks to Eric Schnetter for the patch.
|
||
- Add support for OSHMEM v1.2 APIs
|
||
- Improve efficiency of oshmem_preconnect_all algorithm
|
||
- Fix bug in buffered sends support
|
||
- Fix double free in edge case of mpirun. Thanks to @jsharpe for
|
||
the patch
|
||
- Multiple one-sided support fixes
|
||
- Fix integer overflow in the tuned "reduce" collective when
|
||
using buffers larger than INT_MAX in size
|
||
- Fix parse of user environment variables in mpirun. Thanks to
|
||
Stefano Garzarella for the patch
|
||
- Performance improvements in PSM2 support
|
||
- Fix NBS iBarrier for inter-communicators
|
||
- Fix bug in vader BTL during finalize
|
||
- Improved configure support for Fortran compilers
|
||
- Fix rank_file mapper to support default --slot-set. Thanks
|
||
to Matt Thompson for reporting it
|
||
- Update MPI_Testsome man page. Thanks to Eric Schnetter for
|
||
the suggestion
|
||
- Fix missing resize of the returned type for subarray and
|
||
darray types. Thanks to Keith Bennett and Dan Garmann for
|
||
reporting it
|
||
- Fix Java support on OSX 10.11. Thanks to Alexander Daryin
|
||
for reporting the problem
|
||
- Fix some compilation issues on Solaris 11.2. Thanks to
|
||
Paul Hargrove for his continued help in such areas
|
||
|
||
|
||
1.10.1: 4 Nov 2015
|
||
------------------
|
||
|
||
- Workaround an optimization problem with gcc compilers >= 4.9.2 that
|
||
causes problems with memory registration, and forced
|
||
mpi_leave_pinned to default to 0 (i.e., off). Thanks to @oere for
|
||
the fix.
|
||
- Fix use of MPI_LB and MPI_UB in subarray and darray datatypes.
|
||
Thanks to Gus Correa and Dimitar Pashov for pointing out the issue.
|
||
- Minor updates to mpi_show_mpi_alloc_mem_leaks and
|
||
ompi_debug_show_handle_leaks functionality.
|
||
- Fix segv when invoking non-blocking reductions with a user-defined
|
||
operation. Thanks to Rupert Nash and Georg Geiser for identifying
|
||
the issue.
|
||
- No longer probe for PCI topology on Solaris (unless running as root).
|
||
- Fix for Intel Parallel Studio 2016 ifort partial support of the
|
||
!GCC$ pragma. Thanks to Fabrice Roy for reporting the problem.
|
||
- Bunches of Coverity / static analysis fixes.
|
||
- Fixed ROMIO to look for lstat in <sys/stat.h>. Thanks to William
|
||
Throwe for submitting the patch both upstream and to Open MPI.
|
||
- Fixed minor memory leak when attempting to open plugins.
|
||
- Fixed type in MPI_IBARRIER C prototype. Thanks to Harald Servat for
|
||
reporting the issue.
|
||
- Add missing man pages for MPI_WIN_CREATE_DYNAMIC, MPI_WIN_ATTACH,
|
||
MPI_WIN_DETACH, MPI_WIN_ALLOCATE, MPI_WIN_ALLOCATE_SHARED.
|
||
- When mpirun-launching new applications, only close file descriptors
|
||
that are actually open (resulting in a faster launch in some
|
||
environments).
|
||
- Fix "test ==" issues in Open MPI's configure script. Thank to Kevin
|
||
Buckley for pointing out the issue.
|
||
- Fix performance issue in usnic BTL: ensure progress thread is
|
||
throttled back to not aggressively steal CPU cycles.
|
||
- Fix cache line size detection on POWER architectures.
|
||
- Add missing #include in a few places. Thanks to Orion Poplawski for
|
||
supplying the patch.
|
||
- When OpenSHMEM building is disabled, no longer install its header
|
||
files, help files, or man pages. Add man pages for oshrun, oshcc,
|
||
and oshfort.
|
||
- Fix mpi_f08 implementations of MPI_COMM_SET_INFO, and profiling
|
||
versions of MPI_BUFFER_DETACH, MPI_WIN_ALLOCATE,
|
||
MPI_WIN_ALLOCATE_SHARED, MPI_WTICK, and MPI_WTIME.
|
||
- Add orte_rmaps_dist_device MCA param, allowing users to map near a
|
||
specific device.
|
||
- Various updates/fixes to the openib BTL.
|
||
- Add missing defaults for the Mellanox ConnectX 3 card to the openib BTL.
|
||
- Minor bug fixes in the OFI MTL.
|
||
- Various updates to Mellanox's MXM, hcoll, and FCA components.
|
||
- Add OpenSHMEM man pages. Thanks to Tony Curtis for sharing the man
|
||
pages files from openshmem.org.
|
||
- Add missing "const" attributes to MPI_COMPARE_AND_SWAP,
|
||
MPI_FETCH_AND_OP, MPI_RACCUMULATE, and MPI_WIN_DETACH prototypes.
|
||
Thanks to Michael Knobloch and Takahiro Kawashima for bringing this
|
||
to our attention.
|
||
- Fix linking issues on some platforms (e.g., SLES 12).
|
||
- Fix hang on some corner cases when MPI applications abort.
|
||
- Add missing options to mpirun man page. Thanks to Daniel Letai
|
||
for bringing this to our attention.
|
||
- Add new --with-platform-patches-dir configure option
|
||
- Adjust relative selection priorities to ensure that MTL
|
||
support is favored over BTL support when both are available
|
||
- Use CUDA IPC for all sized messages for performance
|
||
|
||
|
||
1.10.0: 25 Aug 2015
|
||
-------------------
|
||
|
||
** NOTE: The v1.10.0 release marks the transition to Open MPI's new
|
||
** version numbering scheme. The v1.10.x release series is based on
|
||
** the v1.8.x series, but with a few new features. v2.x will be the
|
||
** next series after the v1.10.x series, and complete the transition
|
||
** to the new version numbering scheme. See README for more details
|
||
** on the new versioning scheme.
|
||
**
|
||
** NOTE: In accordance with OMPI version numbering, the v1.10 is *not*
|
||
** API compatible with the v1.8 release series.
|
||
|
||
- Added libfabric support (see README for more details):
|
||
- usNIC BTL updated to use libfabric.
|
||
- Added OFI MTL (usable with PSM in libfabric v1.1.0).
|
||
- Added Intel Omni-Path support via new PSM2 MTL.
|
||
- Added "yalla" PML for faster MXM support.
|
||
- Removed support for MX
|
||
- Added persistent distributed virtual machine (pDVM) support for fast
|
||
workflow executions.
|
||
- Fixed typo in GCC inline assembly introduced in Open MPI v1.8.8.
|
||
Thanks to Paul Hargrove for pointing out the issue.
|
||
- Add missing man pages for MPI_Win_get|set_info(3).
|
||
- Ensure that session directories are cleaned up at the end of a run.
|
||
- Fixed linking issues on some OSs where symbols of dependent
|
||
libraries are not automatically publicly available.
|
||
- Improve hcoll and fca configury library detection. Thanks to David
|
||
Shrader for helping track down the issue.
|
||
- Removed the LAMA mapper (for use in setting affinity). Its
|
||
functionality has been largely superseded by other mpirun CLI
|
||
options.
|
||
- CUDA: Made the asynchronous copy mode be the default.
|
||
- Fix a malloc(0) warning in MPI_IREDUCE_SCATTER_BLOCK. Thanks to
|
||
Lisandro Dalcin for reporting the issue.
|
||
- Fix typo in MPI_Scatter(3) man page. Thanks to Akshay Venkatesh for
|
||
noticing the mistake.
|
||
- Add rudimentary protection from TCP port scanners.
|
||
- Fix typo in Open MPI error handling. Thanks to <20>ke Sandgren for
|
||
pointing out the error.
|
||
- Increased the performance of the CM PML (i.e., the Portals, PSM,
|
||
PSM2, MXM, and OFI transports).
|
||
- Restored visibility of blocking send requests in message queue
|
||
debuggers (e.g., TotalView, DDT).
|
||
- Fixed obscure IPv6-related bug in the TCP BTL.
|
||
- Add support for the "no_locks" MPI_Info key for one-sided
|
||
functionality.
|
||
- Fixed ibv_fork support for verbs-based networks.
|
||
- Fixed a variety of small bugs in OpenSHMEM.
|
||
- Fixed MXM configure with additional CPPFLAGS and LDFLAGS. Thanks to
|
||
David Shrader for the patch.
|
||
- Fixed incorrect memalign threshhold in the openib BTL. Thanks to
|
||
Xavier Besseron for pointing out the issue.
|
||
|
||
|
||
1.8.8: 5 Aug 2015
|
||
-----------------
|
||
|
||
- Fix a segfault in MPI_FINALIZE with the PSM MTL.
|
||
- Fix mpi_f08 sentinels (e.g., MPI_STATUS_IGNORE) handling.
|
||
- Set some additional MXM default values for OSHMEM.
|
||
- Fix an invalid memory access in MPI_MRECV and MPI_IMRECV.
|
||
- Include two fixes that were mistakenly left out of the official
|
||
v1.8.7 tarball:
|
||
- Fixed MPI_WIN_POST and MPI_WIN_START for zero-size messages
|
||
- Protect the OOB TCP ports from segfaulting when accessed by port
|
||
scanners
|
||
|
||
|
||
1.8.7: 15 Jul 2015
|
||
------------------
|
||
|
||
** NOTE: v1.8.7 technically breaks ABI with prior versions
|
||
** in the 1.8 series because it repairs two incorrect API
|
||
** signatures. However, users will only need to recompile
|
||
** if they were using those functions - which they couldn't
|
||
** have been, because the signatures were wrong :-)
|
||
|
||
- Plugged a memory leak that impacted blocking sends
|
||
- Fixed incorrect declaration for MPI_T_pvar_get_index and added
|
||
missing return code MPI_T_INVALID_NAME.
|
||
- Fixed an uninitialized variable in PMI2 support
|
||
- Added new vendor part id for Mellanox ConnectX4-LX
|
||
- Fixed NBC_Copy for legitimate zero-size messages
|
||
- Fixed MPI_Win_post and MPI_Win_start for zero-size messages
|
||
- Protect the OOB ports from segfaulting when accessed by port scanners
|
||
- Fixed several Fortran typos
|
||
- Fixed configure detection of XRC support
|
||
- Fixed support for highly heterogeneous systems to avoid
|
||
memory corruption when printing out the bindings
|
||
|
||
1.8.6: 17 Jun 2015
|
||
------------------
|
||
|
||
- Fixed memory leak on Mac OS-X exposed by TCP keepalive
|
||
- Fixed keepalive support to ensure that daemon/node failure
|
||
results in complete job cleanup
|
||
- Update Java binding support
|
||
- Fixed MPI_THREAD_MULTIPLE bug in vader shared memory BTL
|
||
- Fixed issue during shutdown when CUDA initialization wasn't complete
|
||
- Fixed orted environment when no prefix given
|
||
- Fixed trivial typo in MPI_Neighbor_allgather manpage
|
||
- Fixed tree-spawn support for sh and ksh shells
|
||
- Several data type fixes
|
||
- Fixed IPv6 support bug
|
||
- Cleaned up an unlikely build issue
|
||
- Fixed PMI2 process map parsing for cyclic mappings
|
||
- Fixed memalign threshold in openib BTL
|
||
- Fixed debugger access to message queues for blocking send/recv
|
||
|
||
|
||
1.8.5: 5 May 2015
|
||
-----------------
|
||
|
||
- Fixed configure problems in some cases when using an external hwloc
|
||
installation. Thanks to Erick Schnetter for reporting the error and
|
||
helping track down the source of the problem.
|
||
- Fixed linker error on OS X when using the clang compiler. Thanks to
|
||
Erick Schnetter for reporting the error and helping track down the
|
||
source of the problem.
|
||
- Fixed MPI_THREAD_MULTIPLE deadlock error in the vader BTL. Thanks
|
||
to Thomas Klimpel for reporting the issue.
|
||
- Fixed several Valgrind warnings. Thanks for Lisandro Dalcin for
|
||
contributing a patch fixing some one-sided code paths.
|
||
- Fixed version compatibility test in OOB that broke ABI within the
|
||
1.8 series. NOTE: this will not resolve the problem between pre-1.8.5
|
||
versions, but will fix it going forward.
|
||
- Fix some issues related to running on Intel Xeon Phi coprocessors.
|
||
- Opportunistically switch away from using GNU Libtool's libltdl
|
||
library when possible (by default).
|
||
- Fix some VampirTrace errors. Thanks to Paul Hargrove for reporting
|
||
the issues.
|
||
- Correct default binding patterns when --use-hwthread-cpus was
|
||
specified and nprocs <= 2.
|
||
- Fix warnings about -finline-functions when compiling with clang.
|
||
- Updated the embedded hwloc with several bug fixes, including the
|
||
"duplicate Lhwloc1 symbol" that multiple users reported on some
|
||
platforms.
|
||
- Do not error when mpirun is invoked with with default bindings
|
||
(i.e., no binding was specified), and one or more nodes do not
|
||
support bindings. Thanks to Annu Desari for pointing out the
|
||
problem.
|
||
- Let root invoke "mpirun --version" to check the version without
|
||
printing the "Don't run as root!" warnings. Thanks to Robert McLay
|
||
for the suggestion.
|
||
- Fixed several bugs in OpenSHMEM support.
|
||
- Extended vader shared memory support to 32-bit architectures.
|
||
- Fix handling of very large datatypes. Thanks to Bogdan Sataric for
|
||
the bug report.
|
||
- Fixed a bug in handling subarray MPI datatypes, and a bug when using
|
||
MPI_LB and MPI_UB. Thanks to Gus Correa for pointing out the issue.
|
||
- Restore user-settable bandwidth and latency PML MCA variables.
|
||
- Multiple bug fixes for cleanup during MPI_FINALIZE in unusual
|
||
situations.
|
||
- Added support for TCP keepalive signals to ensure timely termination
|
||
when sockets between daemons cannot be created (e.g., due to a
|
||
firewall).
|
||
- Added MCA parameter to allow full use of a SLURM allocation when
|
||
started from a tool (supports LLNL debugger).
|
||
- Fixed several bugs in the configure logic for PMI and hwloc.
|
||
- Fixed incorrect interface index in TCP communications setup. Thanks
|
||
to Mark Kettenis for spotting the problem and providing a patch.
|
||
- Fixed MPI_IREDUCE_SCATTER with single-process communicators when
|
||
MPI_IN_PLACE was not used.
|
||
- Added XRC support for OFED v3.12 and higher.
|
||
- Various updates and bug fixes to the Mellanox hcoll collective
|
||
support.
|
||
- Fix problems with Fortran compilers that did not support
|
||
REAL*16/COMPLEX*32 types. Thanks to Orion Poplawski for identifying
|
||
the issue.
|
||
- Fixed problem with rpath/runpath support in pkg-config files.
|
||
Thanks to Christoph Junghans for notifying us of the issue.
|
||
- Man page fixes:
|
||
- Removed erroneous "color" discussion from MPI_COMM_SPLIT_TYPE.
|
||
Thanks to Erick Schnetter for spotting the outdated text.
|
||
- Fixed prototypes for MPI_IBARRIER. Thanks to Maximilian for
|
||
finding the issue.
|
||
- Updated docs about buffer usage in non-blocking communications.
|
||
Thanks to Alexander Pozdneev for citing the outdated text.
|
||
- Added documentation about the 'ompi_unique' MPI_Info key with
|
||
MPI_PUBLISH_NAME.
|
||
- Fixed typo in MPI_INTERCOMM_MERGE. Thanks to Harald Servat for
|
||
noticing and sending a patch.
|
||
- Updated configure paths in HACKING. Thanks to Maximilien Levesque
|
||
for the fix.
|
||
- Fixed Fortran typo in MPI_WIN_LOCK_ALL. Thanks to Thomas Jahns
|
||
for pointing out the issue.
|
||
- Fixed a number of MPI one-sided bugs.
|
||
- Fixed MPI_COMM_SPAWN when invoked from a singleton job.
|
||
- Fixed a number of minor issues with CUDA support, including
|
||
registering of shared memory and supporting reduction support for
|
||
GPU buffers.
|
||
- Improved support for building OMPI on Cray platforms.
|
||
- Fixed performance regression introduced by the inadvertent default
|
||
enabling of MPI_THREAD_MULTIPLE support.
|
||
|
||
|
||
1.8.4: 19 Dec 2014
|
||
------------------
|
||
|
||
- Fix MPI_SIZEOF; now available in mpif.h for modern Fortran compilers
|
||
(see README for more details). Also fixed various compiler/linker
|
||
errors.
|
||
- Fixed inadvertant Fortran ABI break between v1.8.1 and v1.8.2 in the
|
||
mpi interface module when compiled with gfortran >= v4.9.
|
||
- Fix various MPI_THREAD_MULTIPLE issues in the TCP BTL.
|
||
- mpirun no longer requires the --hetero-nodes switch; it will
|
||
automatically detect when running in heterogeneous scenarios.
|
||
- Update LSF support, to include revamped affinity functionality.
|
||
- Update embedded hwloc to v1.9.1.
|
||
- Fixed max registerable memory computation in the openib BTL.
|
||
- Updated error message when debuggers are unable to find various
|
||
symbols/types to be more clear. Thanks to Dave Love for raising the
|
||
issue.
|
||
- Added proper support for LSF and PBS/Torque libraries in static builds.
|
||
- Rankfiles now support physical processor IDs.
|
||
- Fixed potential hang in MPI_ABORT.
|
||
- Fixed problems with the PSM MTL and "re-connect" scenarios, such as
|
||
MPI_INTERCOMM_CREATE.
|
||
- Fix MPI_IREDUCE_SCATTER with a single process.
|
||
- Fix (rare) race condition in stdout/stderr funneling to mpirun where
|
||
some trailing output could get lost when a process terminated.
|
||
- Removed inadvertent change that set --enable-mpi-thread-multiple "on"
|
||
by default, thus impacting performance for non-threaded apps.
|
||
- Significantly reduced startup time by optimizing internal hash table
|
||
implementation.
|
||
- Fixed OS X linking with the Fortran mpi module when used with
|
||
gfortran >= 4.9. Thanks to Github user yafshar for raising the
|
||
issue.
|
||
- Fixed memory leak on Cygwin platforms. Thanks for Marco Atzeri for
|
||
reporting the issue.
|
||
- Fixed seg fault in neighborhood collectives when the degree of the
|
||
topology is higher than the communicator size. Thanks to Lisandro
|
||
Dalcin for reporting the issue.
|
||
- Fixed segfault in neighborhood collectives under certain use-cases.
|
||
- Fixed various issues regarding Solaris support. Thanks to Siegmar
|
||
Gross for patiently identifying all the issues.
|
||
- Fixed PMI configure tests for certain Slurm installation patterns.
|
||
- Fixed param registration issue in Java bindings. Thanks to Takahiro
|
||
Kawashima and Siegmar Gross for identifying the issue.
|
||
- Several man page fixes.
|
||
- Silence several warnings and close some memory leaks (more remain,
|
||
but it's better than it was).
|
||
- Re-enabled the use of CMA and knem in the shared memory BTL.
|
||
- Updated mpirun manpage to correctly explain new map/rank/binding options.
|
||
- Fixed MPI_IALLGATHER problem with intercommunicators. Thanks for
|
||
Takahiro Kawashima for the patch.
|
||
- Numerous updates and performance improvements to OpenSHMEM.
|
||
- Turned off message coalescing in the openib BTL until a proper fix
|
||
for that capability can be provided (tentatively expected for 1.8.5)
|
||
- Fix a bug in iof output that dates back to the dinosaurs which would
|
||
output extra bytes if the system was very heavily loaded
|
||
- Fix a bug where specifying mca_component_show_load_errors=0 could
|
||
cause ompi_info to segfault
|
||
- Updated valgrind suppression file
|
||
|
||
|
||
1.8.3: 26 Sep 2014
|
||
------------------
|
||
|
||
- Fixed application abort bug to ensure that MPI_Abort exits appropriately
|
||
and returns the provided exit status
|
||
- Fixed some alignment (not all) issues identified by Clang
|
||
- Allow CUDA-aware to work with nonblocking collectives. Forces packing to
|
||
happen when using GPU buffers.
|
||
- Fixed configure test issue with Intel 2015 Fortran compiler
|
||
- Fixed some PGI-related errors
|
||
- Provide better help message when encountering a firewall
|
||
- Fixed MCA parameter quoting to protect multi-word params and params
|
||
that contain special characters
|
||
- Improved the bind-to help message to clarify the defaults
|
||
- Add new MPI-3.1 tools interface
|
||
- Several performance optimizations and memory leak cleanups
|
||
- Turn off the coll/ml plugin unless specifically requested as it
|
||
remains in an experimental state
|
||
- Fix LSF support by adding required libraries for the latest LSF
|
||
releases. Thanks to Joshua Randal for supplying the initial
|
||
patches.
|
||
|
||
|
||
1.8.2: 25 Aug 2014
|
||
------------------
|
||
|
||
- Fix auto-wireup of OOB, allowing ORTE to automatically
|
||
test all available NICs
|
||
- "Un-deprecate" pernode, npernode, and npersocket options
|
||
by popular demand
|
||
- Add missing Fortran bindings for MPI_WIN_LOCK_ALL,
|
||
MPI_WIN_UNLOCK_ALL, and MPI_WIN_SYNC.
|
||
- Fix cascading/over-quoting in some cases with the rsh/ssh-based
|
||
launcher. Thanks to multiple users for raising the issue.
|
||
- Properly add support for gfortran 4.9 ignore TKR pragma (it was
|
||
erroneously only partially added in v1.7.5). Thanks to Marcus
|
||
Daniels for raising the issue.
|
||
- Update/improve help messages in the usnic BTL.
|
||
- Resolve a race condition in MPI_Abort.
|
||
- Fix obscure cases where static linking from wrapper compilers would
|
||
fail.
|
||
- Clarify the configure --help message about when OpenSHMEM is
|
||
enabled/disabled by default. Thanks to Paul Hargrove for the
|
||
suggestion.
|
||
- Align pages properly where relevant. Thanks to Paul Hargrove for
|
||
identifying the issue.
|
||
- Various compiler warning and minor fixes for OpenBSD, FreeBSD, and
|
||
Solaris/SPARC. Thanks to Paul Hargrove for the patches.
|
||
- Properly pass function pointers from Fortran to C in the mpi_f08
|
||
module, thereby now supporting gfortran 4.9. Thanks to Tobias
|
||
Burnus for assistance and testing with this issue.
|
||
- Improve support for Cray CLE 5.
|
||
- Fix mpirun regression: ensure exit status is non-zero if mpirun is
|
||
terminated due to signal.
|
||
- Improved CUDA efficiency of asynchronous copies.
|
||
- Fix to parameter type in MPI_Type_indexed.3. Thanks to Bastian
|
||
Beischer for reporting the mistake.
|
||
- Fix NUMA distance calculations in the openib BTL.
|
||
- Decrease time required to shut down mpirun at the end of a job.
|
||
- More RMA fixes.
|
||
- More hostfile fixes from Tetsuya Mishima.
|
||
- Fix darray issue where UB was not computed correctly.
|
||
- Fix mpi_f08 parameter name for MPI_GET_LIBRARY_VERSION. Thanks to
|
||
Junchao Zhang for pointing out the issue.
|
||
- Ensure mpirun aborts properly when unable to map processes in
|
||
scheduled environments.
|
||
- Ensure that MPI RMA error codes show up properly. Thanks to
|
||
Lisandro Dalcin for reporting the issue.
|
||
- Minor bug fixes and improvements to the bash and zsh mpirun
|
||
autocompletion scripts.
|
||
- Fix sequential mpirun process mapper. Thanks to Bill Chen for
|
||
reporting the issue.
|
||
- Correct SLURM stdout/stderr redirection.
|
||
- Added missing portals 4 files.
|
||
- Performance improvements for blocking sends and receives.
|
||
- Lots of cleanup to the ml collective component
|
||
- Added new Java methods to provide full MPI coverage
|
||
- Many OSHMEM cleanups
|
||
- Prevent comm_spawn from automatically launching a VM across
|
||
all available nodes
|
||
- Close many memory leaks to achieve valgrind-clean operation
|
||
- Better handling of TCP connection discovery for mismatched networks
|
||
where we don't have a direct 1:1 subnet match between nodes
|
||
- Prevent segfault when OMPI info tools are used in pipes and user
|
||
exits one step of that pipe before completing output
|
||
|
||
|
||
1.8.1: 23 Apr 2014
|
||
------------------
|
||
|
||
- Fix for critical bug: mpirun removed files (but not directories)
|
||
from / when run as root. Thanks to Jay Fenlason and Orion Poplawski
|
||
for bringing the issue to our attention and helping identify the
|
||
fix.
|
||
|
||
|
||
1.8: 31 Mar 2014
|
||
----------------
|
||
|
||
- Commit upstream ROMIO fix for mixed NFS+local filesystem environments.
|
||
- Several fixes for MPI-3 one-sided support. For example,
|
||
arbitrary-length datatypes are now supported.
|
||
- Add config support for the Mellanox ConnectX 4 card.
|
||
- Add missing MPI_COMM_GET|SET_INFO functions, and missing
|
||
MPI_WEIGHTS_EMPTY and MPI_ERR_RMA_SHARED constants. Thanks to
|
||
Lisandro Dalcin for pointing out the issue.
|
||
- Update some help messages in OSHMEM, the usnic BTL, the TCP BTL, and
|
||
ORTE, and update documentation about ompi_info's --level option.
|
||
- Fix some compiler warnings.
|
||
- Ensure that ORTE daemons are not bound to a single processor
|
||
if TaskAffinity is set on by default in Slurm. Thanks to Artem Polyakov
|
||
for identifying the problem and providing a patch
|
||
|
||
|
||
1.7.5 20 Mar 2014
|
||
-----------------
|
||
|
||
**********************************************************************
|
||
* Open MPI is now fully MPI-3.0 compliant
|
||
**********************************************************************
|
||
|
||
- Add Linux OpenSHMEM support built on top of Open MPI's MPI
|
||
layer. Thanks to Mellanox for contributing this new feature.
|
||
- Allow restricting ORTE daemons to specific cores using the
|
||
orte_daemon_cores MCA param.
|
||
- Ensure to properly set "locality" flags for processes launched via
|
||
MPI dynamic functions such as MPI_COMM_SPAWN.
|
||
- Fix MPI_GRAPH_CREATE when nnodes is smaller than the size of the old
|
||
communicator.
|
||
- usnic BTL now supports underlying UDP transport.
|
||
- usnic BTL now checks for common connectivty errors at first send to
|
||
a remote server.
|
||
- Minor scalability improvements in the usnic BTL.
|
||
- ompi_info now lists whether the Java MPI bindings are available or not.
|
||
- MPI-3: mpi.h and the Fortran interfaces now report MPI_VERSION==3
|
||
and MPI_SUBVERSION==0.
|
||
- MPI-3: Added support for new RMA functions and functionality.
|
||
- Fix MPI_Info "const buglet. Thanks to Orion Poplawski for
|
||
identifying the issue.
|
||
- Multiple fixes to mapping/binding options. Thanks to Tetsuya Mishima
|
||
for his assistance.
|
||
- Multiple fixes for normal and abnormal process termination,
|
||
including singleton MPI_Abort and ensuring to kill entire process
|
||
groups when abnormally terminating a job.
|
||
- Fix DESTDIR install for javadocs. Thanks to Orion Poplawski for
|
||
pointing out the issue.
|
||
- Various performance improvements for the MPI Java bindings.
|
||
- OMPI now uses its own internal random number generator and will not
|
||
perturb srand() and friends.
|
||
- Some cleanups for Cygwin builds. Thanks to Marco Atzeri for the
|
||
patches.
|
||
- Add a new collective component (coll/ml) that provides substantially
|
||
improved performance. It is still experimental, and requires
|
||
setting coll_ml_priority > 0 to become active.
|
||
- Add version check during startup to ensure you are using the same
|
||
version of Open MPI on all nodes in a job.
|
||
- Significantly improved the performance of MPI_DIMS_CREATE for large
|
||
values. Thanks to Andreas Sch<63>fer for the contribution.
|
||
- Removed ASYNCHRONOUS keyword from the "ignore TKR" mpi_f08 module.
|
||
- Deprecated the following mpirun options:
|
||
--bynode, --bycore, --byslot: replaced with --map-by node|core|slot.
|
||
--npernode, --npersocket: replaced with --map-by ppr:N:node and
|
||
--map-by ppr:N:socket, respectively
|
||
- Pick NFS "infinitely stale" fix from ROMIO upstream.
|
||
- Various PMI2 fixes and extension to support broader range of mappings.
|
||
- Improve launch performance at large scale.
|
||
- Add support for PBS/Torque environments that set environment
|
||
variables to indicate the number of slots available on each nodes.
|
||
Set the ras_tm_smp MCA parameter to "1" to enable this mode.
|
||
- Add new, more scalable endpoint exchange (commonly called "modex")
|
||
method that only exchanges endpoint data on a per-peer basis
|
||
on first message. Not all transports have been updated to use
|
||
this feature. Set the rte_orte_direct_modex parameter to "1"
|
||
to enable this mode.
|
||
|
||
|
||
1.7.4: 5 Feb 2014
|
||
-----------------
|
||
|
||
**********************************************************************
|
||
* CRITICAL CHANGE
|
||
*
|
||
* As of release 1.7.4, OpenMPI's default mapping, ranking, and binding
|
||
* settings have changed:
|
||
*
|
||
* Mapping:
|
||
* if #procs <= 2, default to map-by core
|
||
* if #procs > 2, default to map-by socket
|
||
* Ranking:
|
||
* if default mapping is used, then default to rank-by slot
|
||
* if map-by <obj> is given, then default to rank-by <obj>,
|
||
* where <obj> is whatever object we mapped against
|
||
* Binding:
|
||
* default to bind-to core
|
||
*
|
||
* Users can override any of these settings individually using the
|
||
* corresponding MCA parameter. Note that multi-threaded applications
|
||
* in particular may want to override at least the binding default
|
||
* to allow threads to use multiple cores.
|
||
**********************************************************************
|
||
|
||
- Restore version number output in "ompi_info --all".
|
||
- Various bug fixes for the mpi_f08 Fortran bindings.
|
||
- Fix ROMIO compile error with Lustre 2.4. Thanks to Adam Moody for
|
||
reporting the issue.
|
||
- Various fixes for 32 bit platforms.
|
||
- Add ability to selectively disable building the mpi or mpi_f08
|
||
module. See the README file for details.
|
||
- Fix MX MTL finalization issue.
|
||
- Fix ROMIO issue when opening a file with MPI_MODE_EXCL.
|
||
- Fix PowerPC and MIPS assembly issues.
|
||
- Various fixes to the hcoll and FCA collective offload modules.
|
||
- Prevent integer overflow when creating datatypes. Thanks to
|
||
original patch from Gilles Gouaillardet.
|
||
- Port some upstream hwloc fixes to Open MPI's embedded copy for
|
||
working around buggy NUMA node cpusets and including mising header
|
||
files. Thanks to Jeff Becker and Paul Hargrove for reporting the
|
||
issues.
|
||
- Fix recursive invocation issues in the MXM MTL.
|
||
- Various bug fixes to the new MCA parameter back-end system.
|
||
- Have the posix fbtl module link against -laio on NetBSD platforms.
|
||
Thanks to Paul Hargrove for noticing the issue.
|
||
- Various updates and fixes to network filesystem detection to support
|
||
more operating systems.
|
||
- Add gfortran v4.9 "ignore TKR" syntax to the mpi Fortran module.
|
||
- Various compiler fixes for several BSD-based platforms. Thanks to
|
||
Paul Hargrove for reporting the issues.
|
||
- Fix when MPI_COMM_SPAWN[_MULTIPLE] is used on oversubscribed
|
||
systems.
|
||
- Change the output from --report bindings to simply state that a
|
||
process is not bound, instead of reporting that it is bound to all
|
||
processors.
|
||
- Per MPI-3.0 guidance, remove support for all MPI subroutines with
|
||
choice buffers from the TKR-based mpi Fortran module. Thanks to Jed
|
||
Brown for raising the issue.
|
||
- Only allow the usnic BTL to build on 64 bit platforms.
|
||
- Various bug fixes to SLURM support, to include ensuring proper
|
||
exiting on abnormal termination.
|
||
- Ensure that MPI_COMM_SPAWN[_MULTIPLE] jobs get the same mapping
|
||
directives that were used with mpirun.
|
||
- Fixed the application of TCP_NODELAY.
|
||
- Change the TCP BTL to not warn if a non-existent interface is
|
||
ignored.
|
||
- Restored the "--bycore" mpirun option for backwards compatibility.
|
||
- Fixed debugger attach functionality. Thanks to Ashley Pittman for
|
||
reporting the issue and suggesting the fix.
|
||
- Fixed faulty MPI_IBCAST when invoked on a communicator with only
|
||
one process.
|
||
- Add new Mellanox device IDs to the openib BTL.
|
||
- Progress towards cleaning up various internal memory leaks as
|
||
reported by Valgrind.
|
||
- Fixed some annoying flex-generated warnings that have been there for
|
||
years. Thanks to Tom Fogal for the initial patch.
|
||
- Support user-provided environment variables via the "env" info key
|
||
to MPI_COMM_SPAWN[_MULTIPLE]. Thanks to Tom Fogal for the feature
|
||
request.
|
||
- Fix uninitialized variable in MPI_DIST_GRAPH_CREATE.
|
||
- Fix a variety of memory errors on SPARC platforms. Thanks to
|
||
Siegmar Gross for reporting and testing all the issues.
|
||
- Remove Solaris threads support. When building on Solaris, pthreads
|
||
will be used.
|
||
- Correctly handle the convertor internal stack for persistent
|
||
receives. Thanks to Guillaume Gouaillardet for identifying the
|
||
problem.
|
||
- Add support for using an external libevent via --with-libevent. See
|
||
the README for more details.
|
||
- Various OMPIO updates and fixes.
|
||
- Add support for the MPIEXEC_TIMEOUT environment variable. If set,
|
||
mpirun will terminate the job after this many seconds.
|
||
- Update the internal copy of ROMIO to that which shipped in MPICH
|
||
3.0.4.
|
||
- Various performance tweaks and improvements in the usnic BTL,
|
||
including now reporting MPI_T performance variables for each usnic
|
||
device.
|
||
- Fix to not access send datatypes for non-root processes with
|
||
MPI_ISCATTER[V] and MPI_IGATHER[V]. Thanks to Pierre Jolivet for
|
||
supplying the initial patch.
|
||
- Update VampirTrace to 5.14.4.9.
|
||
- Fix ptmalloc2 hook disable when used with ummunotify.
|
||
- Change the default connection manager for the openib BTL to be based
|
||
on UD verbs data exchanges instead of ORTE OOB data exchanges.
|
||
- Fix Fortran compile error when compiling with 8-byte INTEGERs and
|
||
4-byte ints.
|
||
- Fix C++11 issue identified by Jeremiah Willcock.
|
||
- Many changes, updates, and bug fixes to the ORTE run-time layer.
|
||
- Correctly handle MPI_REDUCE_SCATTER with recvcounts of 0.
|
||
- Update man pages for MPI-3, and add some missing man pages for
|
||
MPI-2.x functions.
|
||
- Updated mpi_f08 module in accordance with post-MPI-3.0 errata which
|
||
basically removed BIND(C) from all interfaces.
|
||
- Fixed MPI_IN_PLACE detection for MPI_SCATTER[V] in Fortran
|
||
routines. Thanks to Charles Gerlach for identifying the issue.
|
||
- Added support for routable RoCE to the openib BTL.
|
||
- Update embedded hwloc to v1.7.2.
|
||
- ErrMgr framework redesigned to better support fault tolerance development
|
||
activities. See the following RFC for details:
|
||
http://www.open-mpi.org/community/lists/devel/2010/03/7589.php
|
||
- Added database framework to OPAL and changed all modex operations
|
||
to flow thru it, also included additional system info in the
|
||
available data
|
||
- Added staged state machine to support sequential work flows
|
||
- Added distributed file system support for accessing files across
|
||
nodes that do not have networked file systems
|
||
- Extended filem framework to support scalable pre-positioning of
|
||
files for use by applications, adding new "raw" component that
|
||
transmits files across the daemon network
|
||
- Native Windows support has been removed. A cygwin package is
|
||
available from that group for Windows-based use.
|
||
- Added new MPI Java bindings. See the Javadocs for more details on
|
||
the API.
|
||
- Wrapper compilers now add rpath support by default to generated
|
||
executables on systems that support it. This behavior can be
|
||
disabled via --disable-wrapper-rpath. See note in README about ABI
|
||
issues when using rpath in MPI applications.
|
||
- Added a new parallel I/O component and multiple new frameworks to
|
||
support parallel I/O operations.
|
||
- Fixed MPI_STATUS_SIZE Fortran issue when used with 8-byte Fortran
|
||
INTEGERs and 4-byte C ints. Since this issue affects ABI, it is
|
||
only enabled if Open MPI is configured with
|
||
--enable-abi-breaking-fortran-status-i8-fix. Thanks to Jim Parker
|
||
for supplying the initial patch.
|
||
- Add support for Intel Phi SCIF transport.
|
||
- For CUDA-aware MPI configured with CUDA 6.0, use new pointer
|
||
attribute to avoid extra synchronization in stream 0 when using
|
||
CUDA IPC between GPUs on the same node.
|
||
- For CUDA-aware MPI configured with CUDA 6.0, compile in support
|
||
of GPU Direct RDMA in openib BTL to improve small message latency.
|
||
- Updated ROMIO from MPICH v3.0.4.
|
||
- MPI-3: Added support for remaining non-blocking collectives.
|
||
- MPI-3: Added support for neighborhood collectives.
|
||
- MPI-3: Updated C bindings with consistent use of [].
|
||
- MPI-3: Added the const keyword to read-only buffers.
|
||
- MPI-3: Added support for non-blocking communicator duplication.
|
||
- MPI-3: Added support for non-collective communicator creation.
|
||
|
||
|
||
1.7.3: 17 Oct 2013
|
||
------------------
|
||
|
||
- Make CUDA-aware support dynamically load libcuda.so so CUDA-aware
|
||
MPI library can run on systems without CUDA software.
|
||
- Fix various issues with dynamic processes and intercommunicator
|
||
operations under Torque. Thanks to Suraj Prabhakaran for reporting
|
||
the problem.
|
||
- Enable support for the Mellanox MXM2 library by default.
|
||
- Improve support for Portals 4.
|
||
- Various Solaris fixes. Many thanks to Siegmar Gross for his
|
||
incredible patience in reporting all the issues.
|
||
- MPI-2.2: Add reduction support for MPI_C_*COMPLEX and MPI::*COMPLEX.
|
||
- Fixed internal accounting when openpty() fails. Thanks to Michal
|
||
Peclo for reporting the issue and providing a patch.
|
||
- Fixed too-large memory consumption in XRC mode of the openib BTL.
|
||
Thanks to Alexey Ryzhikh for the patch.
|
||
- Add bozo check for negative np values to mpirun to prevent a
|
||
deadlock. Thanks to Upinder Malhi for identifying the issue.
|
||
- Fixed MPI_IS_THREAD_MAIN behavior. Thanks to Lisandro Dalcin for
|
||
pointing out the problem.
|
||
- Various rankfile fixes.
|
||
- Fix functionality over iWARP devices.
|
||
- Various memory and performance optimizations and tweaks.
|
||
- Fix MPI_Cancel issue identified by Fujitsu.
|
||
- Add missing support for MPI_Get_address in the "use mpi" TKR
|
||
implementation. Thanks to Hugo Gagnon for identifying the issue.
|
||
- MPI-3: Add support for MPI_Count.
|
||
- MPI-2.2: Add missing MPI_IN_PLACE support for MPI_ALLTOALL.
|
||
- Added new usnic BTL to support the Cisco usNIC device.
|
||
- Minor VampirTrace update to 5.14.4.4.
|
||
- Removed support for ancient OS X systems (i.e., prior to 10.5).
|
||
- Fixed obscure packing/unpacking datatype bug. Thanks to Takahiro
|
||
Kawashima for identifying the issue.
|
||
- Add run-time support for PMI2 environments.
|
||
- Update openib BTL default parameters to include support for Mellanox
|
||
ConnectX3-Pro devices.
|
||
- Update libevent to v2.0.21.
|
||
- "ompi_info --param TYPE PLUGIN" now only shows a small number of MCA
|
||
parameters by default. Add "--level 9" or "--all" to see *all* MCA
|
||
parameters. See README for more details.
|
||
- Add support for asynchronous CUDA-aware copies.
|
||
- Add support for Mellanox MPI collective operation offload via the
|
||
"hcoll" library.
|
||
- MPI-3: Add support for the MPI_T interface. Open MPI's MCA
|
||
parameters are now accessible via the MPI_T control variable
|
||
interface. Support has been added for a small number of MPI_T
|
||
performance variables.
|
||
- Add Gentoo memory hooks override. Thanks to Justin Bronder for the
|
||
patch.
|
||
- Added new "mindist" process mapper, allowing placement of processes
|
||
via PCI locality information reported by the BIOS.
|
||
- MPI-2.2: Add support for MPI_Dist_graph functionality.
|
||
- Enable generic, client-side support for PMI2 implementations. Can
|
||
be leveraged by any resource manager that implements PMI2; e.g. SLURM,
|
||
versions 2.6 and higher.
|
||
|
||
1.7.2: 26 Jun 2013
|
||
------------------
|
||
|
||
- Major VampirTrace update to 5.14.4.2.
|
||
(** also appeared: 1.6.5)
|
||
- Fix to set flag==1 when MPI_IPROBE is called with MPI_PROC_NULL.
|
||
(** also appeared: 1.6.5)
|
||
- Set the Intel Phi device to be ignored by default by the openib BTL.
|
||
(** also appeared: 1.6.5)
|
||
- Decrease the internal memory storage used by intrinsic MPI datatypes
|
||
for Fortran types. Thanks to Takahiro Kawashima for the initial
|
||
patch.
|
||
(** also appeared: 1.6.5)
|
||
- Fix total registered memory calculation for Mellanox ConnectIB and
|
||
OFED 2.0.
|
||
(** also appeared: 1.6.5)
|
||
- Fix possible data corruption in the MXM MTL component.
|
||
(** also appeared: 1.6.5)
|
||
- Remove extraneous -L from hwloc's embedding. Thanks to Stefan
|
||
Friedel for reporting the issue.
|
||
(** also appeared: 1.6.5)
|
||
- Fix contiguous datatype memory check. Thanks to Eric Chamberland
|
||
for reporting the issue.
|
||
(** also appeared: 1.6.5)
|
||
- Make the openib BTL more friendly to ignoring verbs devices that are
|
||
not RC-capable.
|
||
(** also appeared: 1.6.5)
|
||
- Fix some MPI datatype engine issues. Thanks to Thomas Jahns for
|
||
reporting the issue.
|
||
(** also appeared: 1.6.5)
|
||
- Add INI information for Chelsio T5 device.
|
||
(** also appeared: 1.6.5)
|
||
- Integrate MXM STREAM support for MPI_ISEND and MPI_IRECV, and other
|
||
minor MXM fixes.
|
||
(** also appeared: 1.6.5)
|
||
- Fix to not show amorphous "MPI was already finalized" error when
|
||
failing to MPI_File_close an open file. Thanks to Brian Smith for
|
||
reporting the issue.
|
||
(** also appeared: 1.6.5)
|
||
- Add a distance-based mapping component to find the socket "closest"
|
||
to the PCI bus.
|
||
- Fix an error that caused epoll to automatically be disabled
|
||
in libevent.
|
||
- Upgrade hwloc to 1.5.2.
|
||
- *Really* fixed XRC compile issue in Open Fabrics support.
|
||
- Fix MXM connection establishment flow.
|
||
- Fixed parallel debugger ability to attach to MPI jobs.
|
||
- Fixed some minor memory leaks.
|
||
- Fixed datatype corruption issue when combining datatypes of specific
|
||
formats.
|
||
- Added Location Aware Mapping Algorithm (LAMA) mapping component.
|
||
- Fixes for MPI_STATUS handling in corner cases.
|
||
- Add a distance-based mapping component to find the socket "closest"
|
||
to the PCI bus.
|
||
|
||
|
||
1.7.1: 16 Apr 2013
|
||
------------------
|
||
|
||
- Fixed compile error when --without-memory-manager was specified
|
||
on Linux
|
||
- Fixed XRC compile issue in Open Fabrics support.
|
||
|
||
|
||
1.7: 1 Apr 2013
|
||
---------------
|
||
|
||
- Added MPI-3 functionality:
|
||
- MPI_GET_LIBRARY_VERSION
|
||
- Matched probe
|
||
- MPI_TYPE_CREATE_HINDEXED_BLOCK
|
||
- Non-blocking collectives
|
||
- MPI_INFO_ENV support
|
||
- Fortran '08 bindings (see below)
|
||
- Dropped support for checkpoint/restart due to loss of maintainer :-(
|
||
- Enabled compile-time warning of deprecated MPI functions by default
|
||
(in supported compilers).
|
||
- Revamped Fortran MPI bindings (see the README for details):
|
||
- "mpifort" is now the preferred wrapper compiler for Fortran
|
||
- Added "use mpi_f08" bindings (for compilers that support it)
|
||
- Added better "use mpi" support (for compilers that support it)
|
||
- Removed incorrect MPI_SCATTERV interface from "mpi" module that
|
||
was added in the 1.5.x series for ABI reasons.
|
||
- Lots of VampirTrace upgrades and fixes; upgrade to v5.14.3.
|
||
- Modified process affinity system to provide warning when bindings
|
||
result in being "bound to all", which is equivalent to not being
|
||
bound.
|
||
- Removed maffinity, paffinity, and carto frameworks (and associated
|
||
MCA params).
|
||
- Upgraded to hwloc v1.5.1.
|
||
- Added performance improvements to the OpenIB (OpenFabrics) BTL.
|
||
- Made malloc hooks more friendly to IO interprosers. Thanks to the
|
||
bug report and suggested fix from Darshan maintainer Phil Carns.
|
||
- Added support for the DMTCP checkpoint/restart system.
|
||
- Added support for the Cray uGNI interconnect.
|
||
- Fixed header file problems on OpenBSD.
|
||
- Fixed issue with MPI_TYPE_CREATE_F90_REAL.
|
||
- Wrapper compilers now explicitly list/link all Open MPI libraries if
|
||
they detect static linking CLI arguments.
|
||
- Open MPI now requires a C99 compiler to build. Please upgrade your
|
||
C compiler if you do not have a C99-compliant compiler.
|
||
- Fix MPI_GET_PROCESSOR_NAME Fortran binding to set ierr properly.
|
||
Thanks to LANL for spotting the error.
|
||
- Many MXM and FCA updates.
|
||
- Fixed erroneous free of putenv'ed string that showed up in Valgrind
|
||
reports.
|
||
- Fixed MPI_IN_PLACE case for MPI_ALLGATHER.
|
||
- Fixed a bug that prevented MCA params from being forwarded to
|
||
daemons upon launch.
|
||
- Fixed issues with VT and CUDA --with-cuda[-libdir] configuration CLI
|
||
parameters.
|
||
- Entirely new implementation of many MPI collective routines focused
|
||
on better performance.
|
||
- Revamped autogen / build system.
|
||
- Add new sensor framework to ORTE that includes modules for detecting
|
||
stalled applications and processes that consume too much memory.
|
||
- Added new state machine framework to ORTE that converts ORTE into an
|
||
event-driven state machine using the event library.
|
||
- Added a new MCA parameter (ess_base_stream_buffering) that allows the user
|
||
to override the system default for buffering of stdout/stderr streams
|
||
(via setvbuf). Parameter is not visible via ompi_info.
|
||
- Revamped the launch system to allow consideration of node hardware
|
||
in assigning process locations and bindings.
|
||
- Added the -novm option to preserve the prior launch behavior.
|
||
- Revamped the process mapping system to utilize node hardware by adding
|
||
new map-by, rank-by, and bind-to cmd line options.
|
||
- Added new MCA parameter to provide protection against IO forwarding
|
||
backlog.
|
||
- Dropped support for native Windows due to loss of maintainers. :-(
|
||
- Added a new parallel I/O component and multiple new frameworks to
|
||
support parallel I/O operations.
|
||
- Fix typo in orte_setup_hadoop.m4. Thanks to Aleksej Saushev for
|
||
reporting it
|
||
- Fix a very old error in opal_path_access(). Thanks to Marco Atzeri
|
||
for chasing it down.
|
||
|
||
|
||
1.6.6: Not released
|
||
-------------------
|
||
|
||
- Prevent integer overflow in datatype creation. Thanks to Gilles
|
||
Gouaillardet for identifying the problem and providing a preliminary
|
||
version of the patch.
|
||
- Ensure help-opal-hwloc-base.txt is included in distribution
|
||
tarballs. Thanks to Gilles Gouaillardet for supplying the patch.
|
||
- Correctly handle the invalid status for NULL and inactive requests.
|
||
Thanks to KAWASHIMA Takahiro for submitting the initial patch.
|
||
- Fixed MPI_STATUS_SIZE Fortran issue when used with 8-byte Fortran
|
||
INTEGERs and 4-byte C ints. Since this issue affects ABI, it is
|
||
only enabled if Open MPI is configured with
|
||
--enable-abi-breaking-fortran-status-i8-fix. Thanks to Jim Parker
|
||
for supplying the initial patch.
|
||
- Fix datatype issue for sending from the middle of non-contiguous
|
||
data.
|
||
- Fixed failure error with pty support. Thanks to Michal Pecio for
|
||
the patch.
|
||
- Fixed debugger support for direct-launched jobs.
|
||
- Fix MPI_IS_THREAD_MAIN to return the correct value. Thanks to
|
||
Lisandro Dalcin for pointing out the issue.
|
||
- Update VT to 5.14.4.4:
|
||
- Fix C++-11 issue.
|
||
- Fix support for building RPMs on Fedora with CUDA libraries.
|
||
- Add openib part number for ConnectX3-Pro HCA.
|
||
- Ensure to check that all resolved IP addresses are local.
|
||
- Fix MPI_COMM_SPAWN via rsh when mpirun is on a different server.
|
||
- Add Gentoo "sandbox" memory hooks override.
|
||
|
||
|
||
1.6.5: 26 Jun 2013
|
||
------------------
|
||
|
||
- Updated default SRQ parameters for the openib BTL.
|
||
(** also to appear: 1.7.2)
|
||
- Major VampirTrace update to 5.14.4.2.
|
||
(** also to appear: 1.7.2)
|
||
- Fix to set flag==1 when MPI_IPROBE is called with MPI_PROC_NULL.
|
||
(** also to appear: 1.7.2)
|
||
- Set the Intel Phi device to be ignored by default by the openib BTL.
|
||
(** also to appear: 1.7.2)
|
||
- Decrease the internal memory storage used by intrinsic MPI datatypes
|
||
for Fortran types. Thanks to Takahiro Kawashima for the initial
|
||
patch.
|
||
(** also to appear: 1.7.2)
|
||
- Fix total registered memory calculation for Mellanox ConnectIB and
|
||
OFED 2.0.
|
||
(** also to appear: 1.7.2)
|
||
- Fix possible data corruption in the MXM MTL component.
|
||
(** also to appear: 1.7.2)
|
||
- Remove extraneous -L from hwloc's embedding. Thanks to Stefan
|
||
Friedel for reporting the issue.
|
||
(** also to appear: 1.7.2)
|
||
- Fix contiguous datatype memory check. Thanks to Eric Chamberland
|
||
for reporting the issue.
|
||
(** also to appear: 1.7.2)
|
||
- Make the openib BTL more friendly to ignoring verbs devices that are
|
||
not RC-capable.
|
||
(** also to appear: 1.7.2)
|
||
- Fix some MPI datatype engine issues. Thanks to Thomas Jahns for
|
||
reporting the issue.
|
||
(** also to appear: 1.7.2)
|
||
- Add INI information for Chelsio T5 device.
|
||
(** also to appear: 1.7.2)
|
||
- Integrate MXM STREAM support for MPI_ISEND and MPI_IRECV, and other
|
||
minor MXM fixes.
|
||
(** also to appear: 1.7.2)
|
||
- Improved alignment for OpenFabrics buffers.
|
||
- Fix to not show amorphous "MPI was already finalized" error when
|
||
failing to MPI_File_close an open file. Thanks to Brian Smith for
|
||
reporting the issue.
|
||
(** also to appear: 1.7.2)
|
||
|
||
|
||
1.6.4: 21 Feb 2013
|
||
------------------
|
||
|
||
- Fix Cygwin shared memory and debugger plugin support. Thanks to
|
||
Marco Atzeri for reporting the issue and providing initial patches.
|
||
- Fix to obtaining the correct available nodes when a rankfile is
|
||
providing the allocation. Thanks to Siegmar Gross for reporting the
|
||
problem.
|
||
- Fix process binding issue on Solaris. Thanks to Siegmar Gross for
|
||
reporting the problem.
|
||
- Updates for MXM 2.0.
|
||
- Major VT update to 5.14.2.3.
|
||
- Fixed F77 constants for Cygwin/Cmake build.
|
||
- Fix a linker error when configuring --without-hwloc.
|
||
- Automatically provide compiler flags that compile properly on some
|
||
types of ARM systems.
|
||
- Fix slot_list behavior when multiple sockets are specified. Thanks
|
||
to Siegmar Gross for reporting the problem.
|
||
- Fixed memory leak in one-sided operations. Thanks to Victor
|
||
Vysotskiy for letting us know about this one.
|
||
- Added performance improvements to the OpenIB (OpenFabrics) BTL.
|
||
- Improved error message when process affinity fails.
|
||
- Fixed MPI_MINLOC on man pages for MPI_REDUCE(_LOCAL). Thanks to Jed
|
||
Brown for noticing the problem and supplying a fix.
|
||
- Made malloc hooks more friendly to IO interprosers. Thanks to the
|
||
bug report and suggested fix from Darshan maintainer Phil Carns.
|
||
- Restored ability to direct launch under SLURM without PMI support.
|
||
- Fixed MPI datatype issues on OpenBSD.
|
||
- Major VT update to 5.14.2.3.
|
||
- Support FCA v3.0+.
|
||
- Fixed header file problems on OpenBSD.
|
||
- Fixed issue with MPI_TYPE_CREATE_F90_REAL.
|
||
- Fix an issue with using external libltdl installations. Thanks to
|
||
opolawski for identifying the problem.
|
||
- Fixed MPI_IN_PLACE case for MPI_ALLGATHER for FCA.
|
||
- Allow SLURM PMI support to look in lib64 directories. Thanks to
|
||
Guillaume Papaure for the patch.
|
||
- Restore "use mpi" ABI compatibility with the rest of the 1.5/1.6
|
||
series (except for v1.6.3, where it was accidentally broken).
|
||
- Fix a very old error in opal_path_access(). Thanks to Marco Atzeri
|
||
for chasing it down.
|
||
|
||
|
||
1.6.3: 30 Oct 2012
|
||
------------------
|
||
|
||
- Fix mpirun --launch-agent behavior when a prefix is specified.
|
||
Thanks to Reuti for identifying the issue.
|
||
- Fixed memchecker configury.
|
||
- Brought over some compiler warning squashes from the development trunk.
|
||
- Fix spawning from a singleton to multiple hosts when the "add-host"
|
||
MPI_Info key is used. Thanks to Brian Budge for pointing out the
|
||
problem.
|
||
- Add Mellanox ConnextIB IDs and max inline value.
|
||
- Fix rankfile when no -np is given.
|
||
- FreeBSD detection improvement. Thanks to Brooks Davis for the
|
||
patch.
|
||
- Removed TCP warnings on Windows.
|
||
- Improved collective algorithm selection for very large messages.
|
||
- Fix PSM MTL affinity settings.
|
||
- Fix issue with MPI_OP_COMMUTATIVE in the mpif.h bindings. Thanks to
|
||
Ake Sandgren for providing a patch to fix the issue.
|
||
- Fix issue with MPI_SIZEOF when using CHARACTER and LOGICAL types in
|
||
the mpi module. Thanks to Ake Sandgren for providing a patch to fix
|
||
the issue.
|
||
|
||
|
||
1.6.2: 25 Sep 2012
|
||
------------------
|
||
|
||
- Fix issue with MX MTL. Thanks to Doug Eadline for raising the issue.
|
||
- Fix singleton MPI_COMM_SPAWN when the result job spans multiple nodes.
|
||
- Fix MXM hang, and update for latest version of MXM.
|
||
- Update to support Mellanox FCA 2.5.
|
||
- Fix startup hang for large jobs.
|
||
- Ensure MPI_TESTANY / MPI_WAITANY properly set the empty status when
|
||
count==0.
|
||
- Fix MPI_CART_SUB behavior of not copying periods to the new
|
||
communicator properly. Thanks to John Craske for the bug report.
|
||
- Add btl_openib_abort_not_enough_reg_mem MCA parameter to cause Open
|
||
MPI to abort MPI jobs if there is not enough registered memory
|
||
available on the system (vs. just printing a warning). Thanks to
|
||
Brock Palen for raising the issue.
|
||
- Minor fix to Fortran MPI_INFO_GET: only copy a value back to the
|
||
user's buffer if the flag is .TRUE.
|
||
- Fix VampirTrace compilation issue with the PGI compiler suite.
|
||
|
||
|
||
1.6.1: 22 Aug 2012
|
||
------------------
|
||
|
||
- A bunch of changes to eliminate hangs on OpenFabrics-based networks.
|
||
Users with Mellanox hardware are ***STRONGLY ENCOURAGED*** to check
|
||
their registered memory kernel module settings to ensure that the OS
|
||
will allow registering more than 8GB of memory. See this FAQ item
|
||
for details:
|
||
|
||
http://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem
|
||
|
||
- Fall back to send/receive semantics if registered memory is
|
||
unavilable for RDMA.
|
||
- Fix two fragment leaks when registered memory is exhausted.
|
||
- Hueristically determine how much registered memory is available
|
||
and warn if it's significantly less than all of RAM.
|
||
- Artifically limit the amount of registered memory each MPI process
|
||
can use to about 1/Nth to total registered memory available.
|
||
- Improve error messages when events occur that are likely due to
|
||
unexpected registered memory exhaustion.
|
||
|
||
- Fix double semicolon error in the C++ in <mpi.h>. Thanks to John
|
||
Foster for pointing out the issue.
|
||
- Allow -Xclang to be specified multiple times in CFLAGS. Thanks to
|
||
P. Martin for raising the issue.
|
||
- Break up a giant "print *" statement in the ABI-preserving incorrect
|
||
MPI_SCATTER interface in the "large" Fortran "mpi" module. Thanks
|
||
to Juan Escobar for the initial patch.
|
||
- Switch the MPI_ALLTOALLV default algorithm to a pairwise exchange.
|
||
- Increase the openib BTL default CQ length to handle more types of
|
||
OpenFabrics devices.
|
||
- Lots of VampirTrace fixes; upgrade to v5.13.0.4.
|
||
- Map MPI_2INTEGER to underlying MPI_INTEGERs, not MPI_INTs.
|
||
- Ensure that the OMPI version number is toleant of handling spaces.
|
||
Thanks to dragonboy for identifying the issue.
|
||
- Fixed IN parameter marking on Fortran "mpi" module
|
||
MPI_COMM_TEST_INTER interface.
|
||
- Various MXM improvements.
|
||
- Make the output of "mpirun --report-bindings" much more friendly /
|
||
human-readable.
|
||
- Properly handle MPI_COMPLEX8|16|32.
|
||
- More fixes for mpirun's processor affinity options (--bind-to-core
|
||
and friends).
|
||
- Use aligned memory for OpenFabrics registered memory.
|
||
- Multiple fixes for parameter checking in MPI_ALLGATHERV,
|
||
MPI_REDUCE_SCATTER, MPI_SCATTERV, and MPI_GATHERV. Thanks to the
|
||
mpi4py community (Bennet Fauber, Lisandro Dalcin, Jonathan Dursi).
|
||
- Fixed file positioning overflows in MPI_FILE_GET_POSITION,
|
||
MPI_FILE_GET_POSITION_SHARED, FILE_GET_SIZE, FILE_GET_VIEW.
|
||
- Removed the broken --cpu-set mpirun option.
|
||
- Fix cleanup of MPI errorcodes. Thanks to Alexey Bayduraev for the
|
||
patch.
|
||
- Fix default hostfile location. Thanks to G<>tz Waschk for noticing
|
||
the issue.
|
||
- Improve several error messages.
|
||
|
||
|
||
1.6: 14 May 2012
|
||
----------------
|
||
|
||
- Fix some process affinity issues. When binding a process, Open MPI
|
||
will now bind to all available hyperthreads in a core (or socket,
|
||
depending on the binding options specified).
|
||
--> Note that "mpirun --bind-to-socket ..." does not work on POWER6-
|
||
and POWER7-based systems with some Linux kernel versions. See
|
||
the FAQ on the Open MPI web site for more information.
|
||
- Add support for ARM5 and ARM6 (in addition to the existing ARM7
|
||
support). Thanks to Evan Clinton for the patch.
|
||
- Minor Mellanox MXM fixes.
|
||
- Properly detect FDR10, FDR, and EDR OpenFabrics devices.
|
||
- Minor fixes to the mpirun(1) and MPI_Comm_create(3) man pages.
|
||
- Prevent segv if COMM_SPAWN_MULTIPLE fails. Thanks to Fujitsu for
|
||
the patch.
|
||
- Disable interposed memory management in fakeroot environments. This
|
||
fixes a problem in some build environments.
|
||
- Minor hwloc updates.
|
||
- Array versions of MPI_TEST and MPI_WAIT with a count==0 will now
|
||
return immediately with MPI_SUCCESS. Thanks to Jeremiah Willcock
|
||
for the suggestion.
|
||
- Update VampirTrace to v5.12.2.
|
||
- Properly handle forwarding stdin to all processes when "mpirun
|
||
--stdin all" is used.
|
||
- Workaround XLC assembly bug.
|
||
- OS X Tiger (10.4) has not been supported for a while, so forcibly
|
||
abort configure if we detect it.
|
||
- Fix segv in the openib BTL when running on SPARC 64 systems.
|
||
- Fix some include file ordering issues on some BSD-based platforms.
|
||
Thanks to Paul Hargove for this (and many, many other) fixes.
|
||
- Properly handle .FALSE. return parameter value to attribute copy
|
||
callback functions.
|
||
- Fix a bunch of minor C++ API issues; thanks to Fujitsu for the patch.
|
||
- Fixed the default hostfile MCA parameter behavior.
|
||
- Per the MPI spec, ensure not to touch the port_name parameter to
|
||
MPI_CLOSE_PORT (it's an IN parameter).
|
||
|
||
|
||
1.5.5: 27 Mar 2012
|
||
------------------
|
||
|
||
- Many, many portability configure/build fixes courtesy of Paul
|
||
Hargrove. Thanks, Paul!
|
||
- Fixed shared memory fault tolerance support compiler errors.
|
||
- Removed not-production-quality rshd and tmd PLM launchers.
|
||
- Minor updates to the Open MPI SRPM spec file.
|
||
- Fixed mpirun's --bind-to-socket option.
|
||
- A few MPI_THREAD_MULTIPLE fixes in the shared memory BTL.
|
||
- Upgrade the GNU Autotools used to bootstrap the 1.5/1.6 series to
|
||
all the latest versions at the time of this release.
|
||
- Categorically state in the README that if you're having a problem
|
||
with Open MPI with the Linux Intel 12.1 compilers, *upgrade your
|
||
Intel Compiler Suite to the latest patch version*, and the problems
|
||
will go away. :-)
|
||
- Fix the --without-memory-manager configure option.
|
||
- Fixes for Totalview/DDT MPI-capable debuggers.
|
||
- Update rsh/ssh support to properly handle the Mac OS X library path
|
||
(i.e., DYLD_LIBRARY_PATH).
|
||
- Make warning about shared memory backing files on a networked file
|
||
system be optional (i.e., can be disabled via MCA parameter).
|
||
- Several fixes to processor and memory affinity.
|
||
- Various shared memory infrastructure improvements.
|
||
- Various checkpoint/restart fixes.
|
||
- Fix MPI_IN_PLACE (and other MPI sentinel values) on OS X. Thanks to
|
||
Dave Goodell for providing the magic OS X gcc linker flags necessary.
|
||
- Various man page corrections and typo fixes. Thanks to Fujitsu for
|
||
the patch.
|
||
- Updated wrapper compiler man pages to list the various --showme
|
||
options that are available.
|
||
- Add PMI direct-launch support (e.g., "srun mpi_application" under
|
||
SLURM).
|
||
- Correctly compute the aligned address when packing the
|
||
datatype description. Thanks to Fujitsu for the patch.
|
||
- Fix MPI obscure corner case handling in packing MPI datatypes.
|
||
Thanks to Fujitsu for providing the patch.
|
||
- Workaround an Intel compiler v12.1.0 2011.6.233 vector optimization
|
||
bug.
|
||
- Output the MPI API in ompi_info output.
|
||
- Major VT update to 5.12.1.4.
|
||
- Upgrade embedded Hardware Locality (hwloc) v1.3.2, plus some
|
||
post-1.3.2-release bug fixes. All processor and memory binding is
|
||
now done through hwloc. Woo hoo! Note that this fixes core binding
|
||
on AMD Opteron 6200 and 4200 series-based systems (sometimes known
|
||
as Interlagos, Valencia, or other Bulldozer-based chips).
|
||
- New MCA parameters to control process-wide memory binding policy:
|
||
hwloc_base_mem_alloc_policy, hwloc_base_mem_bind_failure_action (see
|
||
ompi_info --param hwloc base).
|
||
- Removed direct support for libnuma. Libnuma support may now be
|
||
picked up through hwloc.
|
||
- Added MPI_IN_PLACE support to MPI_EXSCAN.
|
||
- Various fixes for building on Windows, including MinGW support.
|
||
- Removed support for the OpenFabrics IBCM connection manager.
|
||
- Updated Chelsio T4 and Intel NE OpenFabrics default buffer settings.
|
||
- Increased the default RDMA CM timeout to 30 seconds.
|
||
- Issue a warning if both btl_tcp_if_include and btl_tcp_if_exclude
|
||
are specified.
|
||
- Many fixes to the Mellanox MXM transport.
|
||
|
||
|
||
1.5.4: 18 Aug 2011
|
||
------------------
|
||
|
||
- Add support for the (as yet unreleased) Mellanox MXM transport.
|
||
- Add support for dynamic service levels (SLs) in the openib BTL.
|
||
- Fixed C++ bindings cosmetic/warnings issue with
|
||
MPI::Comm::NULL_COPY_FN and MPI::Comm::NULL_DELETE_FN. Thanks to
|
||
J<>lio Hoffimann for identifying the issues.
|
||
- Also allow the word "slots" in rankfiles (i.e., not just "slot").
|
||
(** also to appear in 1.4.4)
|
||
- Add Mellanox ConnectX 3 device IDs to the openib BTL defaults.
|
||
(** also to appear in 1.4.4)
|
||
- Various FCA updates.
|
||
- Fix 32 bit SIGBUS errors on Solaris SPARC platforms.
|
||
- Add missing ARM assembly code files.
|
||
- Update to allow more than 128 entries in an appfile.
|
||
(** also to appear in 1.4.4)
|
||
- Various VT updates and bug fixes.
|
||
- Update description of btl_openib_cq_size to be more accurate.
|
||
(** also to appear in 1.4.4)
|
||
- Various assembly "clobber" fixes.
|
||
- Fix a hang in carto selection in obscure situations.
|
||
- Guard the inclusion of execinfo.h since not all platforms have it. Thanks
|
||
to Aleksej Saushev for identifying this issue.
|
||
(** also to appear in 1.4.4)
|
||
- Support Solaris legacy munmap prototype changes.
|
||
(** also to appear in 1.4.4)
|
||
- Updated to Automake 1.11.1 per
|
||
http://www.open-mpi.org/community/lists/devel/2011/07/9492.php.
|
||
- Fix compilation of LSF support.
|
||
- Update MPI_Comm_spawn_multiple.3 man page to reflect what it
|
||
actually does.
|
||
- Fix for possible corruption of the environment. Thanks to Peter
|
||
Thompson for the suggestion. (** also to appear in 1.4.4)
|
||
- Enable use of PSM on direct-launch SLURM jobs.
|
||
- Update paffinity hwloc to v1.2, and to fix minor bugs affinity
|
||
assignment bugs on PPC64/Linux platforms.
|
||
- Let the openib BTL auto-detect its bandwidth.
|
||
- Support new MPI-2.2 datatypes.
|
||
- Updates to support more datatypes in MPI one-sided communication.
|
||
- Fix recursive locking bug when MPI-IO was used with
|
||
MPI_THREAD_MULTIPLE. (** also to appear in 1.4.4)
|
||
- Fix mpirun handling of prefix conflicts.
|
||
- Ensure mpirun's --xterm options leaves sessions attached.
|
||
(** also to appear in 1.4.4)
|
||
- Fixed type of sendcounts and displs in the "use mpi" F90 module.
|
||
ABI is preserved, but applications may well be broken. See the
|
||
README for more details. Thanks to Stanislav Sazykin for
|
||
identifying the issue. (** also to appear in 1.4.4)
|
||
- Fix indexed datatype leaks. Thanks to Pascal Deveze for supplying
|
||
the initial patch. (** also to appear in 1.4.4)
|
||
- Fix debugger mapping when mpirun's -npernode option is used.
|
||
- Fixed support for configure's --disable-dlopen option when used with
|
||
"make distclean".
|
||
- Fix segv associated with MPI_Comm_create with MPI_GROUP_EMPTY.
|
||
Thanks to Dominik Goeddeke for finding this.
|
||
(** also to appear in 1.4.4)
|
||
- Improved LoadLeveler ORTE support.
|
||
- Add new WinVerbs BTL plugin, supporting native OpenFabrics verbs on
|
||
Windows (the "wv" BTL).
|
||
- Add new btl_openib_gid_index MCA parameter to allow selecting which
|
||
GID to use on an OpenFabrics device's GID table.
|
||
- Add support for PCI relaxed ordering in the OpenFabrics BTL (when
|
||
available).
|
||
- Update rsh logic to allow correct SGE operation.
|
||
- Ensure that the mca_paffinity_alone MCA parameter only appears once
|
||
in the ompi_info output. Thanks to Gus Correa for identifying the
|
||
issue.
|
||
- Fixed return codes from MPI_PROBE and MPI_IPROBE.
|
||
(** also to appear in 1.4.4)
|
||
- Remove --enable-progress-thread configure option; it doesn't work on
|
||
the v1.5 branch. Rename --enable-mpi-threads to
|
||
--enable-mpi-thread-multiple. Add new --enable-opal-multi-threads
|
||
option.
|
||
- Updates for Intel Fortran compiler version 12.
|
||
- Remove bproc support. Farewell bproc!
|
||
- If something goes wrong during MPI_INIT, fix the error
|
||
message to say that it's illegal to invoke MPI_INIT before
|
||
MPI_INIT.
|
||
|
||
|
||
1.5.3: 16 Mar 2011
|
||
------------------
|
||
|
||
- Add missing "affinity" MPI extension (i.e., the OMPI_Affinity_str()
|
||
API) that was accidentally left out of the 1.5.2 release.
|
||
|
||
|
||
1.5.2: 9 Mar 2011
|
||
-----------------
|
||
|
||
- Replaced all custom topology / affinity code with initial support
|
||
for hwloc v1.1.1 (PLPA has been removed -- long live hwloc!). Note
|
||
that hwloc is bundled with Open MPI, but an external hwloc can be
|
||
used, if desired. See README for more details.
|
||
- Many CMake updates for Windows builds.
|
||
- Updated opal_cr_thread_sleep_wait MCA param default value to make it
|
||
less aggressive.
|
||
- Updated debugger support to allow Totalview attaching from jobs
|
||
launched directly via srun (not mpirun). Thanks to Nikolay Piskun
|
||
for the patch.
|
||
- Added more FTB/CIFTS support.
|
||
- Fixed compile error with the PGI compiler.
|
||
- Portability fixes to allow the openib BTL to run on the Solaris
|
||
verbs stack.
|
||
- Fixed multi-token command-line issues when using the mpirun
|
||
--debug switch. For example:
|
||
mpirun --debug -np 2 a.out "foo bar"
|
||
Thanks to Gabriele Fatigati for reporting the issue.
|
||
- Added ARM support.
|
||
- Added the MPI_ROOT environment variable in the Open MPI Linux SRPM
|
||
for customers who use the BPS and LSF batch managers.
|
||
- Updated ROMIO from MPICH v1.3.1 (plus one additional patch).
|
||
- Fixed some deprecated MPI API function notification messages.
|
||
- Added new "bfo" PML that provides failover on OpenFabrics networks.
|
||
- Fixed some buffer memcheck issues in MPI_*_init.
|
||
- Added Solaris-specific chip detection and performance improvements.
|
||
- Fix some compile errors on Solaris.
|
||
- Updated the "rmcast" framework with bug fixes, new functionality.
|
||
- Updated the Voltaire FCA component with bug fixes, new
|
||
functionality. Support for FCA version 2.1.
|
||
- Fix gcc 4.4.x and 4.5.x over-aggressive warning notifications on
|
||
possibly freeing stack variables. Thanks to the Gentoo packagers
|
||
for reporting the issue.
|
||
- Make the openib component be verbose when it disqualifies itself due
|
||
to MPI_THREAD_MULTIPLE.
|
||
- Minor man page fixes.
|
||
- Various checkpoint / restart fixes.
|
||
- Fix race condition in the one-sided unlock code. Thanks to
|
||
Guillaume Thouvenin for finding the issue.
|
||
- Improve help message aggregation.
|
||
- Add OMPI_Affinity_str() optional user-level API function (i.e., the
|
||
"affinity" MPI extension). See README for more details.
|
||
- Added btl_tcp_if_seq MCA parameter to select a different ethernet
|
||
interface for each MPI process on a node. This parameter is only
|
||
useful when used with virtual ethernet interfaces on a single
|
||
network card (e.g., when using virtual interfaces give dedicated
|
||
hardware resources on the NIC to each process).
|
||
- Changed behavior of mpirun to terminate if it receives 10 (or more)
|
||
SIGPIPEs.
|
||
- Fixed oversubscription detection.
|
||
- Added new mtl_mx_board and mtl_mx_endpoint MCA parameters.
|
||
- Added ummunotify support for OpenFabrics-based transports. See the
|
||
README for more details.
|
||
|
||
|
||
1.5.1: 15 Dec 2010
|
||
------------------
|
||
|
||
- Fixes for the Oracle Studio 12.2 Fortran compiler.
|
||
- Fix SPARC and SPARCv9 atomics. Thanks to Nicola Stange for the
|
||
initial patch.
|
||
- Fix Libtool issues with the IBM XL compiler in 64-bit mode.
|
||
- Restore the reset of the libevent progress counter to avoid
|
||
over-sampling the event library.
|
||
- Update memory barrier support.
|
||
- Use memmove (instead of memcpy) when necessary (e.g., source and
|
||
destination overlap).
|
||
- Fixed ompi-top crash.
|
||
- Fix to handle Autoconf --program-transforms properly and other
|
||
m4/configury updates. Thanks to the GASNet project for the
|
||
--program transforms fix.
|
||
- Allow hostfiles to specify usernames on a per-host basis.
|
||
- Update wrapper compiler scripts to search for perl during configure,
|
||
per request from the BSD maintainers.
|
||
- Minor man page fixes.
|
||
- Added --with-libltdl option to allow building Open MPI with an
|
||
external installation of libltdl.
|
||
- Fixed various issues with -D_FORTIFY_SOURCE=2.
|
||
- Various VT fixes and updates.
|
||
|
||
|
||
1.5: 10 Oct 2010
|
||
----------------
|
||
|
||
- Added "knem" support: direct process-to-process copying for shared
|
||
memory message passing. See http://runtime.bordeaux.inria.fr/knem/
|
||
and the README file for more details.
|
||
- Updated shared library versioning scheme and linking style of MPI
|
||
applications. The MPI application ABI has been broken from the
|
||
v1.3/v1.4 series. MPI applications compiled against any prior
|
||
version of Open MPI will need to, at a minimum, re-link. See the
|
||
README file for more details.
|
||
- Added "fca" collective component, enabling MPI collective offload
|
||
support for Voltaire switches.
|
||
- Fixed MPI one-sided operations with large target displacements.
|
||
Thanks to Brian Price and Jed Brown for reporting the issue.
|
||
- Fixed MPI_GET_COUNT when used with large counts. Thanks to Jed
|
||
Brown for reporting the issue.
|
||
- Made the openib BTL safer if extremely low SRQ settings are used.
|
||
- Fixed handling of the array_of_argv parameter in the Fortran
|
||
binding of MPI_COMM_SPAWN_MULTIPLE (** also to appear: 1.4.3).
|
||
- Fixed malloc(0) warnings in some collectives.
|
||
- Fixed a problem with the Fortran binding for
|
||
MPI_FILE_CREATE_ERRHANDLER. Thanks to Secretan Yves for identifying
|
||
the issue (** also to appear: 1.4.3).
|
||
- Updates to the LSF PLM to ensure that the path is correctly passed.
|
||
Thanks to Teng Lin for the patch (** also to appear: 1.4.3).
|
||
- Fixes for the F90 MPI_COMM_SET_ERRHANDLER and MPI_WIN_SET_ERRHANDLER
|
||
bindings. Thanks to Paul Kapinos for pointing out the issue
|
||
(** also to appear: 1.4.3).
|
||
- Fixed extra_state parameter types in F90 prototypes for
|
||
MPI_COMM_CREATE_KEYVAL, MPI_GREQUEST_START, MPI_REGISTER_DATAREP,
|
||
MPI_TYPE_CREATE_KEYVAL, and MPI_WIN_CREATE_KEYVAL.
|
||
- Fixes for Solaris oversubscription detection.
|
||
- If the PML determines it can't reach a peer process, print a
|
||
slightly more helpful message. Thanks to Nick Edmonds for the
|
||
suggestion.
|
||
- Make btl_openib_if_include/exclude function the same way
|
||
btl_tcp_if_include/exclude works (i.e., supplying an _include list
|
||
overrides supplying an _exclude list).
|
||
- Apply more scalable reachability algorithm on platforms with more
|
||
than 8 TCP interfaces.
|
||
- Various assembly code updates for more modern platforms / compilers.
|
||
- Relax restrictions on using certain kinds of MPI datatypes with
|
||
one-sided operations. Users beware; not all MPI datatypes are valid
|
||
for use with one-sided operations!
|
||
- Improve behavior of MPI_COMM_SPAWN with regards to --bynode.
|
||
- Various threading fixes in the openib BTL and other core pieces of
|
||
Open MPI.
|
||
- Various help file and man pages updates.
|
||
- Various FreeBSD and NetBSD updates and fixes. Thanks to Kevin
|
||
Buckley and Aleksej Saushev for their work.
|
||
- Fix case where freeing communicators in MPI_FINALIZE could cause
|
||
process failures.
|
||
- Print warnings if shared memory state files are opened on what look
|
||
like networked filesystems.
|
||
- Update libevent to v1.4.13.
|
||
- Allow propagating signals to processes that call fork().
|
||
- Fix bug where MPI_GATHER was sometimes incorrectly examining the
|
||
datatype on non-root processes. Thanks to Michael Hofmann for
|
||
investigating the issue.
|
||
- Various Microsoft Windows fixes.
|
||
- Various Catamount fixes.
|
||
- Various checkpoint / restart fixes.
|
||
- Xgrid support has been removed until it can be fixed (patches
|
||
would be welcome).
|
||
- Added simplistic "libompitrace" contrib package. Using the MPI
|
||
profiling interface, it essentially prints out to stderr when select
|
||
MPI functions are invoked.
|
||
- Update bundled VampirTrace to v5.8.2.
|
||
- Add pkg-config(1) configuration files for ompi, ompi-c, ompi-cxx,
|
||
ompi-f77, ompi-f90. See the README for more details.
|
||
- Removed the libopenmpi_malloc library (added in the v1.3 series)
|
||
since it is no longer necessary
|
||
- Add several notifier plugins (generally used when Open MPI detects
|
||
system/network administrator-worthy problems); each have their own
|
||
MCA parameters to govern their usage. See "ompi_info --param
|
||
notifier <name>" for more details.
|
||
- command to execute arbitrary commands (e.g., run a script).
|
||
- file to send output to a file.
|
||
- ftb to send output to the Fault Tolerant Backplane (see
|
||
http://wiki.mcs.anl.gov/cifts/index.php/CIFTS)
|
||
- hnp to send the output to mpirun.
|
||
- smtp (requires libesmtp) to send an email.
|
||
|
||
1.4.5: 12 Feb 2012
|
||
------------------
|
||
|
||
- Fixed the --disable-memory-manager configure switch.
|
||
(** also to appear in 1.5.5)
|
||
- Fix typos in code and man pages. Thanks to Fujitsu for these fixes.
|
||
(** also to appear in 1.5.5)
|
||
- Improve management of the registration cache; when full, try freeing
|
||
old entries and attempt to re-register.
|
||
- Fixed a data packing pointer alignment issue. Thanks to Fujitsu
|
||
for the patch.
|
||
(** also to appear in 1.5.5)
|
||
- Add ability to turn off warning about having the shared memory backing
|
||
store over a networked filesystem. Thanks to Chris Samuel for this
|
||
suggestion.
|
||
(** also to appear in 1.5.5)
|
||
- Removed an unnecessary memmove() and plugged a couple of small memory leaks
|
||
in the openib OOB connection setup code.
|
||
- Fixed some QLogic bugs. Thanks to Mark Debbage from QLogic for the patches.
|
||
- Fixed problem with MPI_IN_PLACE and other sentinel Fortran constants
|
||
on OS X.
|
||
(** also to appear in 1.5.5)
|
||
- Fix SLURM cpus-per-task allocation.
|
||
(** also to appear in 1.5.5)
|
||
- Fix the datatype engine for when data left over from the previous
|
||
pack was larger than the allowed space in the pack buffer. Thanks to
|
||
Yuki Matsumoto and Takahiro Kawashima for the bug report and the
|
||
patch.
|
||
- Fix Fortran value for MPI_MAX_PORT_NAME. Thanks to Enzo Dari for
|
||
raising the issue.
|
||
- Workaround an Intel compiler v12.1.0 2011.6.233 vector optimization
|
||
bug.
|
||
- Fix issues on Solaris with the openib BTL.
|
||
- Fixes for the Oracle Studio 12.2 Fortran compiler.
|
||
- Update iWARP parameters for the Intel NICs.
|
||
(** also to appear in 1.5.5)
|
||
- Fix obscure cases where MPI_ALLGATHER could crash. Thanks to Andrew
|
||
Senin for reporting the problem.
|
||
(** also to appear in 1.5.5)
|
||
|
||
|
||
1.4.4: 11 Oct 2011
|
||
------------------
|
||
|
||
- Modified a memcpy() call in the openib btl connection setup to use
|
||
memmove() instead because of the possibility of an overlapping
|
||
copy (as identified by valgrind).
|
||
- Changed use of sys_timer_get_cycles() to the more appropriate
|
||
wrapper: opal_timer_base_get_cycles(). Thanks to Jani Monoses
|
||
for this fix.
|
||
- Corrected the reported default value of btl_openib_ib_timeout
|
||
in the "IB retries exceeded" error message. Thanks to Kevin Buckley
|
||
for this correction.
|
||
- Increased rdmacm address resolution timeout from 1s to 30s &
|
||
updated Chelsio T4 openib BTL defaults. Thanks to Steve Wise
|
||
for these updates.
|
||
(** also to appear in 1.5.5)
|
||
- Ensure that MPI_Accumulate error return in 1.4 is consistent with
|
||
1.5.x and trunk.
|
||
- Allow the word "slots" in rankfiles (i.e., not just "slot").
|
||
(** also appeared in 1.5.4)
|
||
- Add Mellanox ConnectX 3 device IDs to the openib BTL defaults.
|
||
(** also appeared in 1.5.4)
|
||
- Update description of btl_openib_cq_size to be more accurate.
|
||
- Ensure mpirun's --xterm options leaves sessions attached.
|
||
(** also appeared in 1.5.4)
|
||
- Update to allow more than 128 entries in an appfile.
|
||
(** also appeared in 1.5.4)
|
||
- Update description of btl_openib_cq_size to be more accurate.
|
||
(** also appeared in 1.5.4)
|
||
- Fix for deadlock when handling recursive attribute keyval deletions
|
||
(e.g., when using ROMIO with MPI_THREAD_MULTIPLE).
|
||
- Fix indexed datatype leaks. Thanks to Pascal Deveze for supplying
|
||
the initial patch. (** also appeared in 1.5.4)
|
||
- Fixed the F90 types of the sendcounts and displs parameters to
|
||
MPI_SCATTERV. Thanks to Stanislav Sazykin for identifying the issue.
|
||
(** also appeared in 1.5.4)
|
||
- Exclude opal/libltdl from "make distclean" when --disable-dlopen is
|
||
used. Thanks to David Gunter for reporting the issue.
|
||
- Fixed a segv in MPI_Comm_create when called with GROUP_EMPTY.
|
||
Thanks to Dominik Goeddeke for finding this.
|
||
(** also appeared in 1.5.4)
|
||
- Fixed return codes from MPI_PROBE and MPI_IPROBE.
|
||
(** also appeared in 1.5.4)
|
||
- Fixed undefined symbol error when using the vtf90 profiling tool.
|
||
- Fix for referencing an uninitialized variable in DPM ORTE. Thanks
|
||
to Avinash Malik for reporting the issue.
|
||
- Fix for correctly handling multi-token args when using debuggers.
|
||
- Eliminated the unneeded u_int*_t datatype definitions.
|
||
- Change in ORTE DPM to get around gcc 4.[45].x compiler wanrings
|
||
about possibly calling free() on a non-heap variable, even though it
|
||
will never happen because the refcount will never go to zero.
|
||
- Fixed incorrect text in MPI_File_set_view man page.
|
||
- Fix in MPI_Init_thread for checkpoint/restart.
|
||
- Fix for libtool issue when using pgcc to compile ompi in conjunction
|
||
with the -tp option.
|
||
- Fixed a race condition in osc_rdma_sync. Thanks to Guillaume
|
||
Thouvenin for finding this issue.
|
||
- Clarification of MPI_Init_thread man page.
|
||
- Fixed an indexing problem in precondition_transports.
|
||
- Fixed a problem in which duplicated libs were being specified for
|
||
linking. Thanks to Hicham Mouline for noticing it.
|
||
- Various autogen.sh fixes.
|
||
- Fix for memchecking buffers during MPI_*INIT.
|
||
- Man page cleanups. Thanks to Jeremiah Willcock and Jed Brown.
|
||
- Fix for VT rpmbuild on RHEL5.
|
||
- Support Solaris legacy munmap prototype changes.
|
||
(** also appeared in 1.5.4)
|
||
- Expands app_idx to int32_t to allow more than 127 app_contexts.
|
||
- Guard the inclusion of execinfo.h since not all platforms have it. Thanks
|
||
to Aleksej Saushev for identifying this issue.
|
||
(** also appeared in 1.5.4)
|
||
- Fix to avoid possible environment corruption. Thanks to Peter Thompson
|
||
for identifying the issue and supplying a patch.
|
||
(** also appeared in 1.5.4)
|
||
- Fixed paffinity base MCA duplicate registrations. Thanks to Gus
|
||
Correa for bringing this to our attention.
|
||
- Fix recursive locking bug when MPI-IO was used with
|
||
MPI_THREAD_MULTIPLE. (** also appeared in 1.5.4)
|
||
- F90 MPI API fixes.
|
||
- Fixed a misleading MPI_Bcast error message. Thanks to Jeremiah
|
||
Willcock for reporting this.
|
||
- Added <sys/stat.h> to ptmalloc's hooks.c (it's not always included
|
||
by default on some systems).
|
||
- Libtool patch to get around a build problem when using the IBM XL
|
||
compilers.
|
||
- Fix to detect and avoid overlapping memcpy(). Thanks to
|
||
Francis Pellegrini for identifying the issue.
|
||
- Fix to allow ompi to work on top of RoCE vLANs.
|
||
- Restored a missing debugger flag to support TotalView. Thanks to
|
||
David Turner and the TV folks for supplying the fix.
|
||
- Updated SLURM support to 1.5.1.
|
||
- Removed an extraneous #include from the TCP BTL.
|
||
- When specifying OOB ports, fix to convert the ports into network
|
||
byte order before binding.
|
||
- Fixed use of memory barriers in the SM BTL. This fixed segv's when
|
||
compiling with Intel 10.0.025 or PGI 9.0-3.
|
||
- Fix to prevent the SM BTL from creating its mmap'd file in
|
||
directories that are remotely mounted.
|
||
|
||
|
||
1.4.3: 6 Sep 2010
|
||
-----------------
|
||
|
||
- Fixed handling of the array_of_argv parameter in the Fortran
|
||
binding of MPI_COMM_SPAWN_MULTIPLE (** also to appear: 1.5).
|
||
- Fixed a problem with the Fortran binding for
|
||
MPI_FILE_CREATE_ERRHANDLER. Thanks to Secretan Yves for identifying
|
||
the issue (** also to appear: 1.5).
|
||
- Updates to the LSF PLM to ensure that the path is correctly passed.
|
||
Thanks to Teng Lin for the patch (** also to appear: 1.5).
|
||
- Fixes for the F90 MPI_COMM_SET_ERRHANDLER and MPI_WIN_SET_ERRHANDLER
|
||
bindings. Thanks to Paul Kapinos for pointing out the issue.
|
||
(** also to appear: 1.5).
|
||
- Fixed various MPI_THREAD_MULTIPLE race conditions.
|
||
- Fixed an issue with an undeclared variable from ptmalloc2 munmap on
|
||
BSD systems.
|
||
- Fixes for BSD interface detection.
|
||
- Various other BSD fixes. Thanks to Kevin Buckley helping to track.
|
||
all of this down.
|
||
- Fixed issues with the use of the -nper* mpirun command line arguments.
|
||
- Fixed an issue with coll tuned dynamic rules.
|
||
- Fixed an issue with the use of OPAL_DESTDIR being applied too aggressively.
|
||
- Fixed an issue with one-sided xfers when the displacement exceeds 2GBytes.
|
||
- Change to ensure TotalView works properly on Darwin.
|
||
- Added support for Visual Studio 2010.
|
||
- Fix to ensure proper placement of VampirTrace header files.
|
||
- Needed to add volatile keyword to a varialbe used in debugging
|
||
(MPIR_being_debugged).
|
||
- Fixed a bug in inter-allgather.
|
||
- Fixed malloc(0) warnings.
|
||
- Corrected a typo the MPI_Comm_size man page (intra -> inter). Thanks
|
||
to Simon number.cruncher for pointing this out.
|
||
- Fixed a SegV in orted when given more than 127 app_contexts.
|
||
- Removed xgrid source code from the 1.4 branch since it is no longer
|
||
supported in the 1.4 series.
|
||
- Removed the --enable-opal-progress-threads config option since
|
||
opal progress thread support does not work in 1.4.x.
|
||
- Fixed a defect in VampirTrace's vtfilter.
|
||
- Fixed wrong Windows path in hnp_contact.
|
||
- Removed the requirement for a paffinity component.
|
||
- Removed a hardcoded limit of 64 interconnected jobs.
|
||
- Fix to allow singletons to use ompi-server for rendezvous.
|
||
- Fixed bug in output-filename option.
|
||
- Fix to correctly handle failures in mx_init().
|
||
- Fixed a potential Fortran memory leak.
|
||
- Fixed an incorrect branch in some ppc32 assembly code. Thanks
|
||
to Matthew Clark for this fix.
|
||
- Remove use of undocumented AS_VAR_GET macro during configuration.
|
||
- Fixed an issue with VampirTrace's wrapper for MPI_init_thread.
|
||
- Updated mca-btl-openib-device-params.ini file with various new vendor id's.
|
||
- Configuration fixes to ensure CPPFLAGS in handled properly if a non-standard
|
||
valgrind location was specified.
|
||
- Various man page updates
|
||
|
||
|
||
1.4.2: 4 May 2010
|
||
-----------------
|
||
|
||
- Fixed problem when running in heterogeneous environments. Thanks to
|
||
Timur Magomedov for helping to track down this issue.
|
||
- Update LSF support to ensure that the path is passed correctly.
|
||
Thanks to Teng Lin for submitting a patch.
|
||
- Fixed some miscellaneous oversubscription detection bugs.
|
||
- IBM re-licensed its LoadLeveler code to be BSD-compliant.
|
||
- Various OpenBSD and NetBSD build and run-time fixes. Many thanks to
|
||
the OpenBSD community for their time, expertise, and patience
|
||
getting these fixes incorporated into Open MPI's main line.
|
||
- Various fixes for multithreading deadlocks, race conditions, and
|
||
other nefarious things.
|
||
- Fixed ROMIO's handling of "nearly" contiguous issues (e.g., with
|
||
non-zero true_lb). Thanks for Pascal Deveze for the patch.
|
||
- Bunches of Windows build fixes. Many thanks to several Windows
|
||
users for their help in improving our support on Windows.
|
||
- Now allow the graceful failover from MTLs to BTLs if no MTLs can
|
||
initialize successfully.
|
||
- Added "clobber" information to various atomic operations, fixing
|
||
erroneous behavior in some newer versions of the GNU compiler suite.
|
||
- Update various iWARP and InfiniBand device specifications in the
|
||
OpenFabrics .ini support file.
|
||
- Fix the use of hostfiles when a username is supplied.
|
||
- Various fixes for rankfile support.
|
||
- Updated the internal version of VampirTrace to 5.4.12.
|
||
- Fixed OS X TCP wireup issues having to do with IPv4/IPv6 confusion
|
||
(see https://svn.open-mpi.org/trac/ompi/changeset/22788 for more
|
||
details).
|
||
- Fixed some problems in processor affinity support, including when
|
||
there are "holes" in the processor namespace (e.g., offline
|
||
processors).
|
||
- Ensure that Open MPI's "session directory" (usually located in /tmp)
|
||
is cleaned up after process termination.
|
||
- Fixed some problems with the collective "hierarch" implementation
|
||
that could occur in some obscure conditions.
|
||
- Various MPI_REQUEST_NULL, API parameter checking, and attribute
|
||
error handling fixes. Thanks to Lisandro Dalcín for reporting the
|
||
issues.
|
||
- Fix case where MPI_GATHER erroneously used datatypes on non-root
|
||
nodes. Thanks to Michael Hofmann for investigating the issue.
|
||
- Patched ROMIO support for PVFS2 > v2.7 (patch taken from MPICH2
|
||
version of ROMIO).
|
||
- Fixed "mpirun --report-bindings" behavior when used with
|
||
mpi_paffinity_alone=1. Also fixed mpi_paffinity_alone=1 behavior
|
||
with non-MPI applications. Thanks to Brice Goglin for noticing the
|
||
problem.
|
||
- Ensure that all OpenFabrics devices have compatible receive_queues
|
||
specifications before allowing them to communicate. See the lengthy
|
||
comment in https://svn.open-mpi.org/trac/ompi/changeset/22592 for
|
||
more details.
|
||
- Fix some issues with checkpoint/restart.
|
||
- Improve the pre-MPI_INIT/post-MPI_FINALIZE error messages.
|
||
- Ensure that loopback addresses are never advertised to peer
|
||
processes for RDMA/OpenFabrics support.
|
||
- Fixed a CSUM PML false positive.
|
||
- Various fixes for Catamount support.
|
||
- Minor update to wrapper compilers in how user-specific argv is
|
||
ordered on the final command line. Thanks to Jed Brown for the
|
||
suggestions.
|
||
- Removed flex.exe binary from Open MPI tarballs; now generate flex
|
||
code from a newer (Windows-friendly) flex when we make official
|
||
tarballs.
|
||
|
||
|
||
1.4.1: 15 Jan 2010
|
||
------------------
|
||
|
||
- Update to PLPA v1.3.2, addressing a licensing issue identified by
|
||
the Fedora project. See
|
||
https://svn.open-mpi.org/trac/plpa/changeset/262 for details.
|
||
- Add check for malformed checkpoint metadata files (Ticket #2141).
|
||
- Fix error path in ompi-checkpoint when not able to checkpoint
|
||
(Ticket #2138).
|
||
- Cleanup component release logic when selecting checkpoint/restart
|
||
enabled components (Ticket #2135).
|
||
- Fixed VT node name detection for Cray XT platforms, and fixed some
|
||
broken VT documentation files.
|
||
- Fix a possible race condition in tearing down RDMA CM-based
|
||
connections.
|
||
- Relax error checking on MPI_GRAPH_CREATE. Thanks to David Singleton
|
||
for pointing out the issue.
|
||
- Fix a shared memory "hang" problem that occurred on x86/x86_64
|
||
platforms when used with the GNU >=4.4.x compiler series.
|
||
- Add fix for Libtool 2.2.6b's problems with the PGI 10.x compiler
|
||
suite. Inspired directly from the upstream Libtool patches that fix
|
||
the issue (but we need something working before the next Libtool
|
||
release).
|
||
|
||
|
||
1.4: 8 Dec 2009
|
||
---------------
|
||
|
||
The *only* change in the Open MPI v1.4 release (as compared to v1.3.4)
|
||
was to update the embedded version of Libtool's libltdl to address a
|
||
potential security vulnerability. Specifically: Open MPI v1.3.4 was
|
||
created with GNU Libtool 2.2.6a; Open MPI v1.4 was created with GNU
|
||
Libtool 2.2.6b. There are no other changes between Open MPI v1.3.4
|
||
and v1.4.
|
||
|
||
|
||
1.3.4: 13 Feb 2010
|
||
------------------
|
||
|
||
- Fix some issues in OMPI's SRPM with regard to shell_scripts_basename
|
||
and its use with mpi-selector. Thanks to Bill Johnstone for
|
||
pointing out the problem.
|
||
- Added many new MPI job process affinity options to mpirun. See the
|
||
newly-updated mpirun(1) man page for details.
|
||
- Several updates to mpirun's XML output.
|
||
- Update to fix a few Valgrind warnings with regards to the ptmalloc2
|
||
allocator and Open MPI's use of PLPA.
|
||
- Many updates and fixes to the (non-default) "sm" collective
|
||
component (i.e., native shared memory MPI collective operations).
|
||
- Updates and fixes to some MPI_COMM_SPAWN_MULTIPLE corner cases.
|
||
- Fix some internal copying functions in Open MPI's use of PLPA.
|
||
- Correct some SLURM nodelist parsing logic that may have interfered
|
||
with large jobs. Additionally, per advice from the SLURM team,
|
||
change the environment variable that we use for obtaining the job's
|
||
allocation.
|
||
- Revert to an older, safer (but slower) communicator ID allocation
|
||
algorithm.
|
||
- Fixed minimum distance finding for OpenFabrics devices in the openib
|
||
BTL.
|
||
- Relax the parameter checking MPI_CART_CREATE a bit.
|
||
- Fix MPI_COMM_SPAWN[_MULTIPLE] to only error-check the info arguments
|
||
on the root process. Thanks to Federico Golfre Andreasi for
|
||
reporting the problem.
|
||
- Fixed some BLCR configure issues.
|
||
- Fixed a potential deadlock when the openib BTL was used with
|
||
MPI_THREAD_MULTIPLE.
|
||
- Fixed dynamic rules selection for the "tuned" coll component.
|
||
- Added a launch progress meter to mpirun (useful for large jobs; set
|
||
the orte_report_launch_progress MCA parameter to 1 to see it).
|
||
- Reduced the number of file descriptors consumed by each MPI process.
|
||
- Add new device IDs for Chelsio T3 RNICs to the openib BTL config file.
|
||
- Fix some CRS self component issues.
|
||
- Added some MCA parameters to the PSM MTL to tune its run-time
|
||
behavior.
|
||
- Fix some VT issues with MPI_BOTTOM/MPI_IN_PLACE.
|
||
- Man page updates from the Debain Open MPI package maintainers.
|
||
- Add cycle counter support for the Alpha and Sparc platforms.
|
||
- Pass visibility flags to libltdl's configure script, resulting in
|
||
those symbols being hidden. This appears to mainly solve the
|
||
problem of applications attempting to use different versions of
|
||
libltdl from that used to build Open MPI.
|
||
|
||
|
||
1.3.3: 14 Jul 2009
|
||
------------------
|
||
|
||
- Fix a number of issues with the openib BTL (OpenFabrics) RDMA CM,
|
||
including a memory corruption bug, a shutdown deadlock, and a route
|
||
timeout. Thanks to David McMillen and Hal Rosenstock for help in
|
||
tracking down the issues.
|
||
- Change the behavior of the EXTRA_STATE parameter that is passed to
|
||
Fortran attribute callback functions: this value is now stored
|
||
internally in MPI -- it no longer references the original value
|
||
passed by MPI_*_CREATE_KEYVAL.
|
||
- Allow the overriding RFC1918 and RFC3330 for the specification of
|
||
"private" networks, thereby influencing Open MPI's TCP
|
||
"reachability" computations.
|
||
- Improve flow control issues in the sm btl, by both tweaking the
|
||
shared memory progression rules and by enabling the "sync" collective
|
||
to barrier every 1,000th collective.
|
||
- Various fixes for the IBM XL C/C++ v10.1 compiler.
|
||
- Allow explicit disabling of ptmalloc2 hooks at runtime (e.g., enable
|
||
support for Debian's builtroot system). Thanks to Manuel Prinz and
|
||
the rest of the Debian crew for helping identify and fix this issue.
|
||
- Various minor fixes for the I/O forwarding subsystem.
|
||
- Big endian iWARP fixes in the Open Fabrics RDMA CM support.
|
||
- Update support for various OpenFabrics devices in the openib BTL's
|
||
.ini file.
|
||
- Fixed undefined symbol issue with Open MPI's parallel debugger
|
||
message queue support so it can be compiled by Sun Studio compilers.
|
||
- Update MPI_SUBVERSION to 1 in the Fortran bindings.
|
||
- Fix MPI_GRAPH_CREATE Fortran 90 binding.
|
||
- Fix MPI_GROUP_COMPARE behavior with regards to MPI_IDENT. Thanks to
|
||
Geoffrey Irving for identifying the problem and supplying the fix.
|
||
- Silence gcc 4.1 compiler warnings about type punning. Thanks to
|
||
Number Cruncher for the fix.
|
||
- Added more Valgrind and other memory-cleanup fixes. Thanks to
|
||
various Open MPI users for help with these issues.
|
||
- Miscellaneous VampirTrace fixes.
|
||
- More fixes for openib credits in heavy-congestion scenarios.
|
||
- Slightly decrease the latency in the openib BTL in some conditions
|
||
(add "send immediate" support to the openib BTL).
|
||
- Ensure to allow MPI_REQUEST_GET_STATUS to accept an
|
||
MPI_STATUS_IGNORE parameter. Thanks to Shaun Jackman for the bug
|
||
report.
|
||
- Added Microsoft Windows support. See README.WINDOWS file for
|
||
details.
|
||
|
||
|
||
1.3.2: 22 Apr 2009
|
||
------------------
|
||
|
||
- Fixed a potential infinite loop in the openib BTL that could occur
|
||
in senders in some frequent-communication scenarios. Thanks to Don
|
||
Wood for reporting the problem.
|
||
- Add a new checksum PML variation on ob1 (main MPI point-to-point
|
||
communication engine) to detect memory corruption in node-to-node
|
||
messages
|
||
- Add a new configuration option to add padding to the openib
|
||
header so the data is aligned
|
||
- Add a new configuration option to use an alternative checksum algo
|
||
when using the checksum PML
|
||
- Fixed a problem reported by multiple users on the mailing list that
|
||
the LSF support would fail to find the appropriate libraries at
|
||
run-time.
|
||
- Allow empty shell designations from getpwuid(). Thanks to Sergey
|
||
Koposov for the bug report.
|
||
- Ensure that mpirun exits with non-zero status when applications die
|
||
due to user signal. Thanks to Geoffroy Pignot for suggesting the
|
||
fix.
|
||
- Ensure that MPI_VERSION / MPI_SUBVERSION match what is returned by
|
||
MPI_GET_VERSION. Thanks to Rob Egan for reporting the error.
|
||
- Updated MPI_*KEYVAL_CREATE functions to properly handle Fortran
|
||
extra state.
|
||
- A variety of ob1 (main MPI point-to-point communication engine) bug
|
||
fixes that could have caused hangs or seg faults.
|
||
- Do not install Open MPI's signal handlers in MPI_INIT if there are
|
||
already signal handlers installed. Thanks to Kees Verstoep for
|
||
bringing the issue to our attention.
|
||
- Fix GM support to not seg fault in MPI_INIT.
|
||
- Various VampirTrace fixes.
|
||
- Various PLPA fixes.
|
||
- No longer create BTLs for invalid (TCP) devices.
|
||
- Various man page style and lint cleanups.
|
||
- Fix critical OpenFabrics-related bug noted here:
|
||
http://www.open-mpi.org/community/lists/announce/2009/03/0029.php.
|
||
Open MPI now uses a much more robust memory intercept scheme that is
|
||
quite similar to what is used by MX. The use of "-lopenmpi-malloc"
|
||
is no longer necessary, is deprecated, and is expected to disappear
|
||
in a future release. -lopenmpi-malloc will continue to work for the
|
||
duration of the Open MPI v1.3 and v1.4 series.
|
||
- Fix some OpenFabrics shutdown errors, both regarding iWARP and SRQ.
|
||
- Allow the udapl BTL to work on Solaris platforms that support
|
||
relaxed PCI ordering.
|
||
- Fix problem where the mpirun would sometimes use rsh/ssh to launch on
|
||
the localhost (instead of simply forking).
|
||
- Minor SLURM stdin fixes.
|
||
- Fix to run properly under SGE jobs.
|
||
- Scalability and latency improvements for shared memory jobs: convert
|
||
to using one message queue instead of N queues.
|
||
- Automatically size the shared-memory area (mmap file) to match
|
||
better what is needed; specifically, so that large-np jobs will start.
|
||
- Use fixed-length MPI predefined handles in order to provide ABI
|
||
compatibility between Open MPI releases.
|
||
- Fix building of the posix paffinity component to properly get the
|
||
number of processors in loosely tested environments (e.g.,
|
||
FreeBSD). Thanks to Steve Kargl for reporting the issue.
|
||
- Fix --with-libnuma handling in configure. Thanks to Gus Correa for
|
||
reporting the problem.
|
||
|
||
|
||
1.3.1: 19 Mar 2009
|
||
------------------
|
||
|
||
- Added "sync" coll component to allow users to synchronize every N
|
||
collective operations on a given communicator.
|
||
- Increased the default values of the IB and RNR timeout MCA parameters.
|
||
- Fix a compiler error noted by Mostyn Lewis with the PGI 8.0 compiler.
|
||
- Fix an error that prevented stdin from being forwarded if the
|
||
rsh launcher was in use. Thanks to Branden Moore for pointing out
|
||
the problem.
|
||
- Correct a case where the added datatype is considered as contiguous but
|
||
has gaps in the beginning.
|
||
- Fix an error that limited the number of comm_spawns that could
|
||
simultaneously be running in some environments
|
||
- Correct a corner case in OB1's GET protocol for long messages; the
|
||
error could sometimes cause MPI jobs using the openib BTL to hang.
|
||
- Fix a bunch of bugs in the IO forwarding (IOF) subsystem and add some
|
||
new options to output to files and redirect output to xterm. Thanks to
|
||
Jody Weissmann for helping test out many of the new fixes and
|
||
features.
|
||
- Fix SLURM race condition.
|
||
- Fix MPI_File_c2f(MPI_FILE_NULL) to return 0, not -1. Thanks to
|
||
Lisandro Dalcin for the bug report.
|
||
- Fix the DSO build of tm PLM.
|
||
- Various fixes for size disparity between C int's and Fortran
|
||
INTEGER's. Thanks to Christoph van Wullen for the bug report.
|
||
- Ensure that mpirun exits with a non-zero exit status when daemons or
|
||
processes abort or fail to launch.
|
||
- Various fixes to work around Intel (NetEffect) RNIC behavior.
|
||
- Various fixes for mpirun's --preload-files and --preload-binary
|
||
options.
|
||
- Fix the string name in MPI::ERRORS_THROW_EXCEPTIONS.
|
||
- Add ability to forward SIFTSTP and SIGCONT to MPI processes if you
|
||
set the MCA parameter orte_forward_job_control to 1.
|
||
- Allow the sm BTL to allocate larger amounts of shared memory if
|
||
desired (helpful for very large multi-core boxen).
|
||
- Fix a few places where we used PATH_MAX instead of OPAL_PATH_MAX,
|
||
leading to compile problems on some platforms. Thanks to Andrea Iob
|
||
for the bug report.
|
||
- Fix mca_btl_openib_warn_no_device_params_found MCA parameter; it
|
||
was accidentally being ignored.
|
||
- Fix some run-time issues with the sctp BTL.
|
||
- Ensure that RTLD_NEXT exists before trying to use it (e.g., it
|
||
doesn't exist on Cygwin). Thanks to Gustavo Seabra for reporting
|
||
the issue.
|
||
- Various fixes to VampirTrace, including fixing compile errors on
|
||
some platforms.
|
||
- Fixed missing MPI_Comm_accept.3 man page; fixed minor issue in
|
||
orterun.1 man page. Thanks to Dirk Eddelbuettel for identifying the
|
||
problem and submitting a patch.
|
||
- Implement the XML formatted output of stdout/stderr/stddiag.
|
||
- Fixed mpirun's -wdir switch to ensure that working directories for
|
||
multiple app contexts are properly handled. Thanks to Geoffroy
|
||
Pignot for reporting the problem.
|
||
- Improvements to the MPI C++ integer constants:
|
||
- Allow MPI::SEEK_* constants to be used as constants
|
||
- Allow other MPI C++ constants to be used as array sizes
|
||
- Fix minor problem with orte-restart's command line options. See
|
||
ticket #1761 for details. Thanks to Gregor Dschung for reporting
|
||
the problem.
|
||
|
||
|
||
1.3: 19 Jan 2009
|
||
----------------
|
||
|
||
- Extended the OS X 10.5.x (Leopard) workaround for a problem when
|
||
assembly code is compiled with -g[0-9]. Thanks to Barry Smith for
|
||
reporting the problem. See ticket #1701.
|
||
- Disabled MPI_REAL16 and MPI_COMPLEX32 support on platforms where the
|
||
bit representation of REAL*16 is different than that of the C type
|
||
of the same size (usually long double). Thanks to Julien Devriendt
|
||
for reporting the issue. See ticket #1603.
|
||
- Increased the size of MPI_MAX_PORT_NAME to 1024 from 36. See ticket #1533.
|
||
- Added "notify debugger on abort" feature. See tickets #1509 and #1510.
|
||
Thanks to Seppo Sahrakropi for the bug report.
|
||
- Upgraded Open MPI tarballs to use Autoconf 2.63, Automake 1.10.1,
|
||
Libtool 2.2.6a.
|
||
- Added missing MPI::Comm::Call_errhandler() function. Thanks to Dave
|
||
Goodell for bringing this to our attention.
|
||
- Increased MPI_SUBVERSION value in mpi.h to 1 (i.e., MPI 2.1).
|
||
- Changed behavior of MPI_GRAPH_CREATE, MPI_TOPO_CREATE, and several
|
||
other topology functions per MPI-2.1.
|
||
- Fix the type of the C++ constant MPI::IN_PLACE.
|
||
- Various enhancements to the openib BTL:
|
||
- Added btl_openib_if_[in|ex]clude MCA parameters for
|
||
including/excluding comma-delimited lists of HCAs and ports.
|
||
- Added RDMA CM support, includng btl_openib_cpc_[in|ex]clude MCA
|
||
parameters
|
||
- Added NUMA support to only use "near" network adapters
|
||
- Added "Bucket SRQ" (BSRQ) support to better utilize registered
|
||
memory, including btl_openib_receive_queues MCA parameter
|
||
- Added ConnectX XRC support (and integrated with BSRQ)
|
||
- Added btl_openib_ib_max_inline_data MCA parameter
|
||
- Added iWARP support
|
||
- Revamped flow control mechansisms to be more efficient
|
||
- "mpi_leave_pinned=1" is now the default when possible,
|
||
automatically improving performance for large messages when
|
||
application buffers are re-used
|
||
- Elimiated duplicated error messages when multiple MPI processes fail
|
||
with the same error.
|
||
- Added NUMA support to the shared memory BTL.
|
||
- Add Valgrind-based memory checking for MPI-semantic checks.
|
||
- Add support for some optional Fortran datatypes (MPI_LOGICAL1,
|
||
MPI_LOGICAL2, MPI_LOGICAL4 and MPI_LOGICAL8).
|
||
- Remove the use of the STL from the C++ bindings.
|
||
- Added support for Platform/LSF job launchers. Must be Platform LSF
|
||
v7.0.2 or later.
|
||
- Updated ROMIO with the version from MPICH2 1.0.7.
|
||
- Added RDMA capable one-sided component (called rdma), which
|
||
can be used with BTL components that expose a full one-sided
|
||
interface.
|
||
- Added the optional datatype MPI_REAL2. As this is added to the "end of"
|
||
predefined datatypes in the fortran header files, there will not be
|
||
any compatibility issues.
|
||
- Added Portable Linux Processor Affinity (PLPA) for Linux.
|
||
- Addition of a finer symbols export control via the visibiliy feature
|
||
offered by some compilers.
|
||
- Added checkpoint/restart process fault tolerance support. Initially
|
||
support a LAM/MPI-like protocol.
|
||
- Removed "mvapi" BTL; all InfiniBand support now uses the OpenFabrics
|
||
driver stacks ("openib" BTL).
|
||
- Added more stringent MPI API parameter checking to help user-level
|
||
debugging.
|
||
- The ptmalloc2 memory manager component is now by default built as
|
||
a standalone library named libopenmpi-malloc. Users wanting to
|
||
use leave_pinned with ptmalloc2 will now need to link the library
|
||
into their application explicitly. All other users will use the
|
||
libc-provided allocator instead of Open MPI's ptmalloc2. This change
|
||
may be overriden with the configure option enable-ptmalloc2-internal
|
||
- The leave_pinned options will now default to using mallopt on
|
||
Linux in the cases where ptmalloc2 was not linked in. mallopt
|
||
will also only be available if munmap can be intercepted (the
|
||
default whenever Open MPI is not compiled with --without-memory-
|
||
manager.
|
||
- Open MPI will now complain and refuse to use leave_pinned if
|
||
no memory intercept / mallopt option is available.
|
||
- Add option of using Perl-based wrapper compilers instead of the
|
||
C-based wrapper compilers. The Perl-based version does not
|
||
have the features of the C-based version, but does work better
|
||
in cross-compile environments.
|
||
|
||
|
||
1.2.9: 14 Feb 2009
|
||
------------------
|
||
|
||
- Fix a segfault when using one-sided communications on some forms of derived
|
||
datatypes. Thanks to Dorian Krause for reporting the bug. See #1715.
|
||
- Fix an alignment problem affecting one-sided communications on
|
||
some architectures (e.g., SPARC64). See #1738.
|
||
- Fix compilation on Solaris when thread support is enabled in Open MPI
|
||
(e.g., when using --with-threads). See #1736.
|
||
- Correctly take into account the MTU that an OpenFabrics device port
|
||
is using. See #1722 and
|
||
https://bugs.openfabrics.org/show_bug.cgi?id=1369.
|
||
- Fix two datatype engine bugs. See #1677.
|
||
Thanks to Peter Kjellstrom for the bugreport.
|
||
- Fix the bml r2 help filename so the help message can be found. See #1623.
|
||
- Fix a compilation problem on RHEL4U3 with the PGI 32 bit compiler
|
||
caused by <infiniband/driver.h>. See ticket #1613.
|
||
- Fix the --enable-cxx-exceptions configure option. See ticket #1607.
|
||
- Properly handle when the MX BTL cannot open an endpoint. See ticket #1621.
|
||
- Fix a double free of events on the tcp_events list. See ticket #1631.
|
||
- Fix a buffer overun in opal_free_list_grow (called by MPI_Init).
|
||
Thanks to Patrick Farrell for the bugreport and Stephan Kramer for
|
||
the bugfix. See ticket #1583.
|
||
- Fix a problem setting OPAL_PREFIX for remote sh-based shells.
|
||
See ticket #1580.
|
||
|
||
|
||
1.2.8: 14 Oct 2008
|
||
------------------
|
||
|
||
- Tweaked one memory barrier in the openib component to be more conservative.
|
||
May fix a problem observed on PPC machines. See ticket #1532.
|
||
- Fix OpenFabrics IB partition support. See ticket #1557.
|
||
- Restore v1.1 feature that sourced .profile on remote nodes if the default
|
||
shell will not do so (e.g. /bin/sh and /bin/ksh). See ticket #1560.
|
||
- Fix segfault in MPI_Init_thread() if ompi_mpi_init() fails. See ticket #1562.
|
||
- Adjust SLURM support to first look for $SLURM_JOB_CPUS_PER_NODE instead of
|
||
the deprecated $SLURM_TASKS_PER_NODE environment variable. This change
|
||
may be *required* when using SLURM v1.2 and above. See ticket #1536.
|
||
- Fix the MPIR_Proctable to be in process rank order. See ticket #1529.
|
||
- Fix a regression introduced in 1.2.6 for the IBM eHCA. See ticket #1526.
|
||
|
||
|
||
1.2.7: 28 Aug 2008
|
||
------------------
|
||
|
||
- Add some Sun HCA vendor IDs. See ticket #1461.
|
||
- Fixed a memory leak in MPI_Alltoallw when called from Fortran.
|
||
Thanks to Dave Grote for the bugreport. See ticket #1457.
|
||
- Only link in libutil when it is needed/desired. Thanks to
|
||
Brian Barret for diagnosing and fixing the problem. See ticket #1455.
|
||
- Update some QLogic HCA vendor IDs. See ticket #1453.
|
||
- Fix F90 binding for MPI_CART_GET. Thanks to Scott Beardsley for
|
||
bringing it to our attention. See ticket #1429.
|
||
- Remove a spurious warning message generated in/by ROMIO. See ticket #1421.
|
||
- Fix a bug where command-line MCA parameters were not overriding
|
||
MCA parameters set from environment variables. See ticket #1380.
|
||
- Fix a bug in the AMD64 atomics assembly. Thanks to Gabriele Fatigati
|
||
for the bug report and bugfix. See ticket #1351.
|
||
- Fix a gather and scatter bug on intercommunicators when the datatype
|
||
being moved is 0 bytes. See ticket #1331.
|
||
- Some more man page fixes from the Debian maintainers.
|
||
See tickets #1324 and #1329.
|
||
- Have openib BTL (OpenFabrics support) check for the presence of
|
||
/sys/class/infiniband before allowing itself to be used. This check
|
||
prevents spurious "OMPI did not find RDMA hardware!" notices on
|
||
systems that have the software drivers installed, but no
|
||
corresponding hardware. See tickets #1321 and #1305.
|
||
- Added vendor IDs for some ConnectX openib HCAs. See ticket #1311.
|
||
- Fix some RPM specfile inconsistencies. See ticket #1308.
|
||
Thanks to Jim Kusznir for noticing the problem.
|
||
- Removed an unused function prototype that caused warnings on
|
||
some systems (e.g., OS X). See ticket #1274.
|
||
- Fix a deadlock in inter-communicator scatter/gather operations.
|
||
Thanks to Martin Audet for the bug report. See ticket #1268.
|
||
|
||
|
||
1.2.6: 7 Apr 2008
|
||
-----------------
|
||
|
||
- Fix a bug in the inter-allgather for asymmetric inter-communicators.
|
||
Thanks to Martin Audet for the bug report. See ticket #1247.
|
||
- Fix a bug in the openib BTL when setting the CQ depth. Thanks
|
||
to Jon Mason for the bug report and fix. See ticket #1245.
|
||
- On Mac OS X Leopard, the execinfo component will be used for
|
||
backtraces, making for a more durable solution. See ticket #1246.
|
||
- Added vendor IDs for some QLogic DDR openib HCAs. See ticket #1227.
|
||
- Updated the URL to get the latest config.guess and config.sub files.
|
||
Thanks to Ralf Wildenhues for the bug report. See ticket #1226.
|
||
- Added shared contexts support to PSM MTL. See ticket #1225.
|
||
- Added pml_ob1_use_early_completion MCA parameter to allow users
|
||
to turn off the OB1 early completion semantic and avoid "stall"
|
||
problems seen on InfiniBand in some cases. See ticket #1224.
|
||
- Sanitized some #define macros used in mpi.h to avoid compiler warnings
|
||
caused by MPI programs built with different autoconf versions.
|
||
Thanks to Ben Allan for reporting the problem, and thanks to
|
||
Brian Barrett for the fix. See ticket #1220.
|
||
- Some man page fixes from the Debian maintainers. See ticket #1219.
|
||
- Made the openib BTL a bit more resilient in the face of driver
|
||
errors. See ticket #1217.
|
||
- Fixed F90 interface for MPI_CART_CREATE. See ticket #1208.
|
||
Thanks to Michal Charemza for reporting the problem.
|
||
- Fixed some C++ compiler warnings. See ticket #1203.
|
||
- Fixed formatting of the orterun man page. See ticket #1202.
|
||
Thanks to Peter Breitenlohner for the patch.
|
||
|
||
|
||
1.2.5: 8 Jan 2008
|
||
-----------------
|
||
|
||
- Fixed compile issue with open() on Fedora 8 (and newer) platforms.
|
||
Thanks to Sebastian Schmitzdorff for noticing the problem.
|
||
- Added run-time warnings during MPI_INIT when MPI_THREAD_MULTIPLE
|
||
and/or progression threads are used (the OMPI v1.2 series does not
|
||
support these well at all).
|
||
- Better handling of ECONNABORTED from connect on Linux. Thanks to
|
||
Bob Soliday for noticing the problem; thanks to Brian Barrett for
|
||
submitting a patch.
|
||
- Reduce extraneous output from OOB when TCP connections must
|
||
be retried. Thanks to Brian Barrett for submitting a patch.
|
||
- Fix for ConnectX devices and OFED 1.3. See ticket #1190.
|
||
- Fixed a configure problem for Fortran 90 on Cray systems. Ticket #1189.
|
||
- Fix an uninitialized variable in the error case in opal_init.c.
|
||
Thanks to Ake Sandgren for pointing out the mistake.
|
||
- Fixed a hang in configure if $USER was not defined. Thanks to
|
||
Darrell Kresge for noticing the problem. See ticket #900.
|
||
- Added support for parallel debuggers even when we have an optimized build.
|
||
See ticket #1178.
|
||
- Worked around a bus error in the Mac OS X 10.5.X (Leopard) linker when
|
||
compiling Open MPI with -g. See ticket #1179.
|
||
- Removed some warnings about 'rm' from Mac OS X 10.5 (Leopard) builds.
|
||
- Fix the handling of mx_finalize(). See ticket #1177.
|
||
Thanks to Ake Sandgren for bringing this issue to our attention.
|
||
- Fixed minor file descriptor leak in the Altix timer code. Thanks to
|
||
Paul Hargrove for noticing the problem and supplying the fix.
|
||
- Fix a problem when using a different compiler for C and Objective C.
|
||
See ticket #1153.
|
||
- Fix segfault in MPI_COMM_SPAWN when the user specified a working
|
||
directory. Thanks to Murat Knecht for reporting this and suggesting
|
||
a fix.
|
||
- A few manpage fixes from the Debian Open MPI maintainers. Thanks to
|
||
Tilman Koschnick, Sylvestre Ledru, and Dirk Eddelbuettel.
|
||
- Fixed issue with pthread detection when compilers are not all
|
||
from the same vendor. Thanks to Ake Sandgren for the bug
|
||
report. See ticket #1150.
|
||
- Fixed vector collectives in the self module. See ticket #1166.
|
||
- Fixed some data-type engine bugs: an indexing bug, and an alignment bug.
|
||
See ticket #1165.
|
||
- Only set the MPI_APPNUM attribute if it is defined. See ticket
|
||
#1164.
|
||
|
||
|
||
1.2.4: 26 Sep 2007
|
||
------------------
|
||
|
||
- Really added support for TotalView/DDT parallel debugger message queue
|
||
debugging (it was mistakenly listed as "added" in the 1.2 release).
|
||
- Fixed a build issue with GNU/kFreeBSD. Thanks to Petr Salinger for
|
||
the patch.
|
||
- Added missing MPI_FILE_NULL constant in Fortran. Thanks to
|
||
Bernd Schubert for bringing this to our attention.
|
||
- Change such that the UDAPL BTL is now only built in Linux when
|
||
explicitly specified via the --with-udapl configure command line
|
||
switch.
|
||
- Fixed an issue with umask not being propagated when using the TM
|
||
launcher.
|
||
- Fixed behavior if number of slots is not the same on all bproc nodes.
|
||
- Fixed a hang on systems without GPR support (ex. Cray XT3/4).
|
||
- Prevent users of 32-bit MPI apps from requesting >= 2GB of shared
|
||
memory.
|
||
- Added a Portals MTL.
|
||
- Fix 0 sized MPI_ALLOC_MEM requests. Thanks to Lisandro Dalcin for
|
||
pointing out the problem.
|
||
- Fixed a segfault crash on large SMPs when doing collectives.
|
||
- A variety of fixes for Cray XT3/4 class of machines.
|
||
- Fixed which error handler is used when MPI_COMM_SELF is passed
|
||
to MPI_COMM_FREE. Thanks to Lisandro Dalcini for the bug report.
|
||
- Fixed compilation on platforms that don't have hton/ntoh.
|
||
- Fixed a logic problem in the fortran binding for MPI_TYPE_MATCH_SIZE.
|
||
Thanks to Jeff Dusenberry for pointing out the problem and supplying
|
||
the fix.
|
||
- Fixed a problem with MPI_BOTTOM in various places of the f77-interface.
|
||
Thanks to Daniel Spangberg for bringing this up.
|
||
- Fixed problem where MPI-optional Fortran datatypes were not
|
||
correctly initialized.
|
||
- Fixed several problems with stdin/stdout forwarding.
|
||
- Fixed overflow problems with the sm mpool MCA parameters on large SMPs.
|
||
- Added support for the DDT parallel debugger via orterun's --debug
|
||
command line option.
|
||
- Added some sanity/error checks to the openib MCA parameter parsing
|
||
code.
|
||
- Updated the udapl BTL to use RDMA capabilities.
|
||
- Allow use of the BProc head node if it was allocated to the user.
|
||
Thanks to Sean Kelly for reporting the problem and helping debug it.
|
||
- Fixed a ROMIO problem where non-blocking I/O errors were not properly
|
||
reported to the user.
|
||
- Made remote process launch check the $SHELL environment variable if
|
||
a valid shell was not otherwise found for the user.
|
||
Thanks to Alf Wachsmann for the bugreport and suggested fix.
|
||
- Added/updated some vendor IDs for a few openib HCAs.
|
||
- Fixed a couple of failures that could occur when specifying devices
|
||
for use by the OOB.
|
||
- Removed dependency on sysfsutils from the openib BTL for
|
||
libibverbs >=v1.1 (i.e., OFED 1.2 and beyond).
|
||
|
||
|
||
1.2.3: 20 Jun 2007
|
||
------------------
|
||
|
||
- Fix a regression in comm_spawn functionality that inadvertently
|
||
caused the mapping of child processes to always start at the same
|
||
place. Thanks to Prakash Velayutham for helping discover the
|
||
problem.
|
||
- Fix segfault when a user's home directory is unavailable on a remote
|
||
node. Thanks to Guillaume Thomas-Collignon for bringing the issue
|
||
to our attention.
|
||
- Fix MPI_IPROBE to properly handle MPI_STATUS_IGNORE on mx and psm
|
||
MTLs. Thanks to Sophia Corwell for finding this and supplying a
|
||
reproducer.
|
||
- Fix some error messages in the tcp BTL.
|
||
- Use _NSGetEnviron instead of environ on Mac OS X so that there
|
||
are no undefined symbols in the shared libraries.
|
||
- On OS X, when MACOSX_DEPLOYMENT_TARGET is 10.3 or higher, support
|
||
building the Fortran 90 bindings as a shared library. Thanks to
|
||
Jack Howarth for his advice on making this work.
|
||
- No longer require extra include flag for the C++ bindings.
|
||
- Fix detection of weak symbols support with Intel compilers.
|
||
- Fix issue found by Josh England: ompi_info would not show framework
|
||
MCA parameters set in the environment properly.
|
||
- Rename the oob_tcp_include/exclude MCA params to oob_tcp_if_include/exclude
|
||
so that they match the naming convention of the btl_tcp_if_include/exclude
|
||
params. The old names are depreciated, but will still work.
|
||
- Add -wd as a synonym for the -wdir orterun/mpirun option.
|
||
- Fix the mvapi BTL to compile properly with compilers that do not support
|
||
anonymous unions. Thanks to Luis Kornblueh for reporting the bug.
|
||
|
||
|
||
1.2.2: 16 May 2007
|
||
------------------
|
||
|
||
- Fix regression in 1.2.1 regarding the handling of $CC with both
|
||
absolute and relative path names.
|
||
- Fix F90 array of status dimensions. Thanks to Randy Bramley for
|
||
noticing the problem.
|
||
- Add btl_openib_ib_pkey_value MCA parameter for controlling IB port selection.
|
||
- Fixed a variety of threading/locking bugs.
|
||
- Fixed some compiler warnings associated with ROMIO, OS X, and gridengine.
|
||
- If pbs-config can be found, use it to look for TM support. Thanks
|
||
to Bas van der Vlies for the inspiration and preliminary work.
|
||
- Fixed a deadlock in orterun when the rsh PLS encounters some errors.
|
||
|
||
|
||
1.2.1: 25 Apr 2007
|
||
------------------
|
||
|
||
- Fixed a number of connection establishment errors in the TCP out-
|
||
of-band messaging system.
|
||
- Fixed a memory leak when using mpi_comm calls.
|
||
Thanks to Bas van der Vlies for reporting the problem.
|
||
- Fixed various memory leaks in OPAL and ORTE.
|
||
- Improved launch times when using TM (PBS Pro, Torque, Open PBS).
|
||
- Fixed mpi_leave_pinned to work for all datatypes.
|
||
- Fix functionality allowing users to disable sbrk() (the
|
||
mpool_base_disable_sbrk MCA parameter) on platforms that support it.
|
||
- Fixed a pair of problems with the TCP "listen_thread" mode for the
|
||
oob_tcp_listen_mode MCA parameter that would cause failures when
|
||
attempting to launch applications.
|
||
- Fixed a segfault if there was a failure opening a BTL MX endpoint.
|
||
- Fixed a problem with mpirun's --nolocal option introduced in 1.2.
|
||
- Re-enabled MPI_COMM_SPAWN_MULTIPLE from singletons.
|
||
- LoadLeveler and TM configure fixes, Thanks to Martin Audet for the
|
||
bug report.
|
||
- Various C++ MPI attributes fixes.
|
||
- Fixed issues with backtrace code on 64 bit Intel & PPC OS X builds.
|
||
- Fixed issues with multi-word CC variables and libtool.
|
||
Thanks to Bert Wesarg for the bug reports.
|
||
- Fix issue with non-uniform node naming schemes in SLURM.
|
||
- Fix file descriptor leak in the Grid Engine/N1GE support.
|
||
- Fix compile error on OS X 10.3.x introduced with Open MPI 1.1.5.
|
||
- Implement MPI_TYPE_CREATE_DARRAY function (was in 1.1.5 but not 1.2).
|
||
- Recognize zsh shell when using rsh/ssh for launching MPI jobs.
|
||
- Ability to set the OPAL_DESTDIR or OPAL_PREFIX environment
|
||
variables to "re-root" an existing Open MPI installation.
|
||
- Always include -I for Fortran compiles, even if the prefix is
|
||
/usr/local.
|
||
- Support for "fork()" in MPI applications that use the
|
||
OpenFabrics stack (OFED v1.2 or later).
|
||
- Support for setting specific limits on registered memory.
|
||
|
||
|
||
1.2: 15 Mar 2007
|
||
----------------
|
||
|
||
- Fixed race condition in the shared memory fifo's, which led to
|
||
orphaned messages.
|
||
- Corrected the size of the shared memory file - subtracted out the
|
||
space the header was occupying.
|
||
- Add support for MPI_2COMPLEX and MPI_2DOUBLE_COMPLEX.
|
||
- Always ensure to create $(includedir)/openmpi, even if the C++
|
||
bindings are disabled so that the wrapper compilers don't point to
|
||
a directory that doesn't exist. Thanks to Martin Audet for
|
||
identifying the problem.
|
||
- Fixes for endian handling in MPI process startup.
|
||
- Openib BTL initialization fixes for cases where MPI processes in the
|
||
same job has different numbers of active ports on the same physical
|
||
fabric.
|
||
- Print more descriptive information when displaying backtraces on
|
||
OS's that support this functionality, such as the hostname and PID
|
||
of the process in question.
|
||
- Fixes to properly handle MPI exceptions in C++ on communicators,
|
||
windows, and files.
|
||
- Much more reliable runtime support, particularly with regards to MPI
|
||
job startup scalability, BProc support, and cleanup in failure
|
||
scenarios (e.g., MPI_ABORT, MPI processes abnormally terminating,
|
||
etc.).
|
||
- Significant performance improvements for MPI collectives,
|
||
particularly on high-speed networks.
|
||
- Various fixes in the MX BTL component.
|
||
- Fix C++ typecast problems with MPI_ERRCODES_IGNORE. Thanks to
|
||
Satish Balay for bringing this to our attention.
|
||
- Allow run-time specification of the maximum amount of registered
|
||
memory for OpenFabrics and GM.
|
||
- Users who utilize the wrapper compilers (e.g., mpicc and mpif77)
|
||
will not notice, but the underlying library names for ORTE and OPAL
|
||
have changed to libopen-rte and libopen-pal, respectively (listed
|
||
here because there are undoubtedly some users who are not using the
|
||
wrapper compilers).
|
||
- Many bug fixes to MPI-2 one-sided support.
|
||
- Added support for TotalView message queue debugging.
|
||
- Fixes for MPI_STATUS_SET_ELEMENTS.
|
||
- Print better error messages when mpirun's "-nolocal" is used when
|
||
there is only one node available.
|
||
- Added man pages for several Open MPI executables and the MPI API
|
||
functions.
|
||
- A number of fixes for Alpha platforms.
|
||
- A variety of Fortran API fixes.
|
||
- Build the Fortran MPI API as a separate library to allow these
|
||
functions to be profiled properly.
|
||
- Add new --enable-mpirun-prefix-by-default configure option to always
|
||
imply the --prefix option to mpirun, preventing many rsh/ssh-based
|
||
users from needing to modify their shell startup files.
|
||
- Add a number of missing constants in the C++ bindings.
|
||
- Added tight integration with Sun N1 Grid Engine (N1GE) 6 and the
|
||
open source Grid Engine.
|
||
- Allow building the F90 MPI bindings as shared libraries for most
|
||
compilers / platforms. Explicitly disallow building the F90
|
||
bindings as shared libraries on OS X because of complicated
|
||
situations with Fortran common blocks and lack of support for
|
||
unresolved common symbols in shared libraries.
|
||
- Added stacktrace support for Solaris and Mac OS X.
|
||
- Update event library to libevent-1.1b.
|
||
- Fixed standards conformance issues with MPI_ERR_TRUNCATED and
|
||
setting MPI_ERROR during MPI_TEST/MPI_WAIT.
|
||
- Addition of "cm" PML to better support library-level matching
|
||
interconnects, with support for Myrinet/MX, and QLogic PSM-based
|
||
networks.
|
||
- Addition of "udapl" BTL for transport across uDAPL interconnects.
|
||
- Really check that the $CXX given to configure is a C++ compiler
|
||
(not a C compiler that "sorta works" as a C++ compiler).
|
||
- Properly check for local host only addresses properly, looking
|
||
for 127.0.0.0/8, rather than just 127.0.0.1.
|
||
|
||
|
||
1.1.5: 19 Mar 2007
|
||
------------------
|
||
|
||
- Implement MPI_TYPE_CREATE_DARRAY function.
|
||
- Fix race condition in shared memory BTL startup that could cause MPI
|
||
applications to hang in MPI_INIT.
|
||
- Fix syntax error in a corner case of the event library. Thanks to
|
||
Bert Wesarg for pointing this out.
|
||
- Add new MCA parameter (mpi_preconnect_oob) for pre-connecting the
|
||
"out of band" channels between all MPI processes. Most helpful for
|
||
MPI applications over InfiniBand where process A sends an initial
|
||
message to process B, but process B does not enter the MPI library
|
||
for a long time.
|
||
- Fix for a race condition in shared memory locking semantics.
|
||
- Add major, minor, and release version number of Open MPI to mpi.h.
|
||
Thanks to Martin Audet for the suggestion.
|
||
- Fix the "restrict" compiler check in configure.
|
||
- Fix a problem with argument checking in MPI_TYPE_CREATE_SUBARRAY.
|
||
- Fix a problem with compiling the XGrid components with non-gcc
|
||
compilers.
|
||
|
||
|
||
1.1.4: 30 Jan 2007
|
||
------------------
|
||
|
||
- Fixed 64-bit alignment issues with TCP interface detection on
|
||
intel-based OS X machines.
|
||
- Adjusted TCP interface selection to automatically ignore Linux
|
||
channel-bonded slave interfaces.
|
||
- Fixed the type of the first parameter to the MPI F90 binding for
|
||
MPI_INITIALIZED. Thanks to Tim Campbell for pointing out the
|
||
problem.
|
||
- Fix a bunch of places in the Fortran MPI bindings where (MPI_Fint*)
|
||
was mistakenly being used instead of (MPI_Aint*).
|
||
- Fixes for fortran MPI_STARTALL, which could sometimes return
|
||
incorrect request values. Thanks to Tim Campbell for pointing out
|
||
the problem.
|
||
- Include both pre- and post-MPI-2 errata bindings for
|
||
MPI::Win::Get_attr.
|
||
- Fix math error on Intel OS X platforms that would greatly increase
|
||
shared memory latency.
|
||
- Fix type casting issue with MPI_ERRCODES_IGNORE that would cause
|
||
errors when using a C++ compiler. Thanks to Barry Smith for
|
||
bringing this to our attention.
|
||
- Fix possible segmentation fault during shutdown when using the
|
||
MX BTL.
|
||
|
||
|
||
1.1.3: 26 Jan 2007
|
||
------------------
|
||
|
||
- Remove the "hierarch" coll component; it was not intended to be
|
||
included in stable releases yet.
|
||
- Fix a race condition with stdout/stderr not appearing properly from
|
||
all processes upon termination of an MPI job.
|
||
- Fix internal accounting errors with the self BTL.
|
||
- Fix typos in the code path for when sizeof(int) != sizeof(INTEGER)
|
||
in the MPI F77 bindings functions. Thanks to Pierre-Matthieu
|
||
Anglade for bringing this problem to our attention.
|
||
- Fix for a memory leak in the derived datatype function
|
||
ompi_ddt_duplicate(). Thanks to Andreas Schafer for reporting,
|
||
diagnosing, and patching the leak.
|
||
- Used better performing basic algorithm for MPI_ALLGATHERV.
|
||
- Added a workaround for a bug in the Intel 9.1 C++ compiler (all
|
||
versions up to and including 20060925) in the MPI C++ bindings that
|
||
caused run-time failures. Thanks to Scott Weitzenkamp for reporting
|
||
this problem.
|
||
- Fix MPI_SIZEOF implementation in the F90 bindings for COMPLEX
|
||
variable types.
|
||
- Fixes for persistent requests involving MPI_PROC_NULL. Thanks to
|
||
Lisandro Dalcin for reporting the problem.
|
||
- Fixes to MPI_TEST* and MPI_WAIT* for proper MPI exception reporting.
|
||
Thanks to Lisandro Dalcin for finding the issue.
|
||
- Various fixes for MPI generalized request handling; addition of
|
||
missing MPI::Grequest functionality to the C++ bindings.
|
||
- Add "mpi_preconnect_all" MCA parameter to force wireup of all MPI
|
||
connections during MPI_INIT (vs. making connections lazily whenever
|
||
the first MPI communication occurs between a pair of peers).
|
||
- Fix a problem for when $FC and/or $F77 were specified as multiple
|
||
tokens. Thanks to Orion Poplawski for identifying the problem and
|
||
to Ralf Wildenhues for suggesting the fix.
|
||
- Fix several MPI_*ERRHANDLER* functions and MPI_GROUP_TRANSLATE_RANKS
|
||
with respect to what arguments they allowed and the behavior that
|
||
they effected. Thanks to Lisandro Dalcin for reporting the
|
||
problems.
|
||
|
||
|
||
1.1.2: 18 Oct 2006
|
||
------------------
|
||
|
||
- Really fix Fortran status handling in MPI_WAITSOME and MPI_TESTSOME.
|
||
- Various datatype fixes, reported by several users as causing
|
||
failures in the BLACS testing suite. Thanks to Harald Forbert, Ake
|
||
Sandgren and, Michael Kluskens for reporting the problem.
|
||
- Correctness and performance fixes for heterogeneous environments.
|
||
- Fixed a error in command line parsing on some platforms (causing
|
||
mpirun to crash without doing anything).
|
||
- Fix for initialization hangs on 64 bit Mac OS X PowerPC systems.
|
||
- Fixed some memory allocation problems in mpirun that could cause
|
||
random problems if "-np" was not specified on the command line.
|
||
- Add Kerberos authentication support for XGrid.
|
||
- Added LoadLeveler support for jobs larger than 128 tasks.
|
||
- Fix for large-sized Fortran LOGICAL datatypes.
|
||
- Fix various error checking in MPI_INFO_GET_NTHKEY and
|
||
MPI_GROUP_TRANSLATE_RANKS, and some collective operations
|
||
(particularly with regards to MPI_IN_PLACE). Thanks to Lisandro
|
||
Dalcin for reporting the problems.
|
||
- Fix receiving messages to buffers allocated by MPI_ALLOC_MEM.
|
||
- Fix a number of race conditions with the MPI-2 Onesided
|
||
interface.
|
||
- Fix the "tuned" collective componenete where some cases where
|
||
MPI_BCAST could hang.
|
||
- Update TCP support to support non-uniform TCP environments.
|
||
- Allow the "poe" RAS component to be built on AIX or Linux.
|
||
- Only install mpif.h if the rest of the Fortran bindings are
|
||
installed.
|
||
- Fixes for BProc node selection.
|
||
- Add some missing Fortran MPI-2 IO constants.
|
||
|
||
|
||
1.1.1: 28 Aug 2006
|
||
------------------
|
||
|
||
- Fix for Fortran string handling in various MPI API functions.
|
||
- Fix for Fortran status handling in MPI_WAITSOME and MPI_TESTSOME.
|
||
- Various fixes for the XL compilers.
|
||
- Automatically disable using mallot() on AIX.
|
||
- Memory fixes for 64 bit platforms with registering MCA parameters in
|
||
the self and MX BTL components.
|
||
- Fixes for BProc to support oversubscription and changes to the
|
||
mapping algorithm so that mapping processes "by slot" works as
|
||
expected.
|
||
- Fixes for various abort cases to not hang and clean up nicely.
|
||
- If using the Intel 9.0 v20051201 compiler on an IA64 platform, the
|
||
ptmalloc2 memory manager component will automatically disable
|
||
itself. Other versions of the Intel compiler on this platform seem
|
||
to work fine (e.g., 9.1).
|
||
- Added "host" MPI_Info key to MPI_COMM_SPAWN and
|
||
MPI_COMM_SPAWN_MULTIPLE.
|
||
- Add missing C++ methods: MPI::Datatype::Create_indexed_block,
|
||
MPI::Datatype::Create_resized, MPI::Datatype::Get_true_extent.
|
||
- Fix OSX linker issue with Fortran bindings.
|
||
- Fixed MPI_COMM_SPAWN to start spawning new processes in slots that
|
||
(according to Open MPI) are not already in use.
|
||
- Added capability to "mpirun a.out" (without specifying -np) that
|
||
will run on all currently-allocated resources (e.g., within a batch
|
||
job such as SLURM, Torque, etc.).
|
||
- Fix a bug with one particular case of MPI_BCAST. Thanks to Doug
|
||
Gregor for identifying the problem.
|
||
- Ensure that the shared memory mapped file is only created when there
|
||
is more than one process on a node.
|
||
- Fixed problems with BProc stdin forwarding.
|
||
- Fixed problem with MPI_TYPE_INDEXED datatypes. Thanks to Yven
|
||
Fournier for identifying this problem.
|
||
- Fix some thread safety issues in MPI attributes and the openib BTL.
|
||
- Fix the BProc allocator to not potentially use the same resources
|
||
across multiple ORTE universes.
|
||
- Fix gm resource leak.
|
||
- More latency reduction throughout the code base.
|
||
- Make the TM PLS (PBS Pro, Torque, Open PBS) more scalable, and fix
|
||
some latent bugs that crept in v1.1. Thanks to the Thunderbird crew
|
||
at Sandia National Laboratories and Martin Schaffoner for access to
|
||
testing facilities to make this happen.
|
||
- Added new command line options to mpirun:
|
||
--nolocal: Do not run any MPI processes on the same node as mpirun
|
||
(compatibility with the OSC mpiexec launcher)
|
||
--nooversubscribe: Abort if the number of processes requested would
|
||
cause oversubscription
|
||
--quiet / -q: do not show spurious status messages
|
||
--version / -V: show the version of Open MPI
|
||
- Fix bus error in XGrid process starter. Thanks to Frank from the
|
||
Open MPI user's list for identifying the problem.
|
||
- Fix data size mismatches that caused memory errors on PPC64
|
||
platforms during the startup of the openib BTL.
|
||
- Allow propagation of SIGUSR1 and SIGUSR2 signals from mpirun to
|
||
back-end MPI processes.
|
||
- Add missing MPI::Is_finalized() function.
|
||
|
||
|
||
1.1: 23 Jun 2006
|
||
----------------
|
||
|
||
- Various MPI datatype fixes, optimizations.
|
||
- Fixed various problems on the SPARC architecture (e.g., not
|
||
correctly aligning addresses within structs).
|
||
- Improvements in various run-time error messages to be more clear
|
||
about what they mean and where the errors are occurring.
|
||
- Various fixes to mpirun's handling of --prefix.
|
||
- Updates and fixes for Cray/Red Storm support.
|
||
- Major improvements to the Fortran 90 MPI bindings:
|
||
- General improvements in compile/linking time and portability
|
||
between different F90 compilers.
|
||
- Addition of "trivial", "small" (the default), and "medium"
|
||
Fortran 90 MPI module sizes (v1.0.x's F90 module was
|
||
equivalent to "medium"). See the README file for more
|
||
explanation.
|
||
- Fix various MPI F90 interface functions and constant types to
|
||
match. Thanks to Michael Kluskens for pointing out the problems
|
||
to us.
|
||
- Allow short messagees to use RDMA (vs. send/receive semantics) to a
|
||
limited number peers in both the mvapi and openib BTL components.
|
||
This reduces communication latency over IB channels.
|
||
- Numerous performance improvements throughout the entire code base.
|
||
- Many minor threading fixes.
|
||
- Add a define OMPI_SKIP_CXX to allow the user to skip the mpicxx.h from
|
||
being included in mpi.h. It allows the user to compile C code with a CXX
|
||
compiler without including the CXX bindings.
|
||
- PERUSE support has been added. In order to activate it add
|
||
--enable-peruse to the configure options. All events described in
|
||
the PERUSE 2.0 draft are supported, plus one Open MPI
|
||
extension. PERUSE_COMM_REQ_XFER_CONTINUE allow to see how the data
|
||
is segmented internally, using multiple interfaces or the pipeline
|
||
engine. However, this version only support one event of each type
|
||
simultaneously attached to a communicator.
|
||
- Add support for running jobs in heterogeneous environments.
|
||
Currently supports environments with different endianness and
|
||
different representations of C++ bool and Fortran LOGICAL.
|
||
Mismatched sizes for other datatypes is not supported.
|
||
- Open MPI now includes an implementation of the MPI-2 One-Sided
|
||
Communications specification.
|
||
- Open MPI is now configurable in cross-compilation environments.
|
||
Several Fortran 77 and Fortran 90 tests need to be pre-seeded with
|
||
results from a config.cache-like file.
|
||
- Add --debug option to mpirun to generically invoke a parallel debugger.
|
||
|
||
|
||
1.0.3: Not released (all fixes included in 1.1)
|
||
-----------------------------------------------
|
||
|
||
- Fix a problem noted by Chris Hennes where MPI_INFO_SET incorrectly
|
||
disallowed long values.
|
||
- Fix a problem in the launch system that could cause inconsistent
|
||
launch behavior, particularly when launching large jobs.
|
||
- Require that the openib BTL find <sysfs/libsysfs.h>. Thanks to Josh
|
||
Aune for the suggestion.
|
||
- Include updates to support the upcoming Autoconf 2.60 and Libtool
|
||
2.0. Thanks to Ralf Wildenhues for all the work!
|
||
- Fix bug with infinite loop in the "round robin" process mapper.
|
||
Thanks to Paul Donohue for reporting the problem.
|
||
- Enusre that memory hooks are removed properly during MPI_FINALIZE.
|
||
Thanks to Neil Ludban for reporting the problem.
|
||
- Various fixes to the included support for ROMIO.
|
||
- Fix to ensure that MPI_LONG_LONG and MPI_LONG_LONG_INT are actually
|
||
synonyms, as defined by the MPI standard. Thanks to Martin Audet
|
||
for reporting this.
|
||
- Fix Fortran 90 configure tests to properly utilize LDFLAGS and LIBS.
|
||
Thanks to Terry Reeves for reporting the problem.
|
||
- Fix shared memory progression in asynchronous progress scenarios.
|
||
Thanks to Mykael Bouquey for reporting the problem.
|
||
- Fixed back-end operations for predefined MPI_PROD for some
|
||
datatypes. Thanks to Bert Wesarg for reporting this.
|
||
- Adapted configure to be able to handle Torque 2.1.0p0's (and above)
|
||
new library name. Thanks to Brock Palen for pointing this out and
|
||
providing access to a Torque 2.1.0p0 cluster to test with.
|
||
- Fixed situation where mpirun could set a shell pipeline's stdout
|
||
to non-blocking, causing the shell pipeline to prematurely fail.
|
||
Thanks to Darrell Kresge for figuring out what was happening.
|
||
- Fixed problems with leave_pinned that could cause Badness with the
|
||
mvapi BTL.
|
||
- Fixed problems with MPI_FILE_OPEN and non-blocking MPI-2 IO access.
|
||
- Fixed various InfiniBand port matching issues during startup.
|
||
Thanks to Scott Weitzenkamp for identifying these problems.
|
||
- Fixed various configure, build and run-time issues with ROMIO.
|
||
Thanks to Dries Kimpe for bringing them to our attention.
|
||
- Fixed error in MPI_COMM_SPLIT when dealing with intercommunicators.
|
||
Thanks to Bert Wesarg for identifying the problem.
|
||
- Fixed backwards handling of "high" parameter in MPI_INTERCOMM_MERGE.
|
||
Thanks to Michael Kluskens for pointing this out to us.
|
||
- Fixed improper handling of string arguments in Fortran bindings
|
||
for MPI-IO functionality
|
||
- Fixed segmentation fault with 64 bit applications on Solaris when
|
||
using the shared memory transports.
|
||
- Fixed MPI_COMM_SELF attributes to free properly at the beginning of
|
||
MPI_FINALIZE. Thanks to Martin Audet for bringing this to our
|
||
attention.
|
||
- Fixed alignment tests for cross-compiling to not cause errors with
|
||
recent versions of GCC.
|
||
|
||
|
||
1.0.2: 7 Apr 2006
|
||
-----------------
|
||
|
||
- Fixed assembly race condition on AMD64 platforms.
|
||
- Fixed residual .TRUE. issue with copying MPI attributes set from
|
||
Fortran.
|
||
- Remove unnecessary logic from Solaris pty I/O forwarding. Thanks to
|
||
Francoise Roch for bringing this to our attention.
|
||
- Fixed error when count = 0 was given for multiple completion MPI
|
||
functions (MPI_TESTSOME, MPI_TESTANY, MPI_TESTALL, MPI_WAITSOME,
|
||
MPI_WAITANY, MPI_WAITALL).
|
||
- Better handling in MPI_ABORT for when peer processes have already
|
||
died, especially under some resource managers.
|
||
- Random updates to README file, to include notes about the Portland
|
||
compilers.
|
||
- Random, small threading fixes to prevent deadlock.
|
||
- Fixed a problem with handling long mpirun app files. Thanks to Ravi
|
||
Manumachu for identifying the problem.
|
||
- Fix handling of strings in several of the Fortran 77 bindings.
|
||
- Fix LinuxPPC assembly issues. Thanks to Julian Seward for reporting
|
||
the problem.
|
||
- Enable pty support for standard I/O forwarding on platforms that
|
||
have ptys but do not have openpty(). Thanks to Pierre Valiron for
|
||
bringing this to our attention.
|
||
- Disable inline assembly for PGI compilers to avoid compiler errors.
|
||
Thanks to Troy Telford for bringing this to our attention.
|
||
- Added MPI_UNSIGNED_CHAR and MPI_SIGNED_CHAR to the allowed reduction
|
||
types.
|
||
- Fix a segv in variable-length message displays on Opterons running
|
||
Solaris. Thanks to Pierre Valiron for reporting the issue.
|
||
- Added MPI_BOOL to the intrinsic reduction operations MPI_LAND,
|
||
MPI_LOR, MPI_LXOR. Thanks to Andy Selle for pointing this out to us.
|
||
- Fixed TCP BTL network matching logic during MPI_INIT; in some cases
|
||
on multi-NIC nodes, a NIC could get paired with a NIC on another
|
||
network (typically resulting in deadlock). Thanks to Ken Mighell
|
||
for pointing this out to us.
|
||
- Change the behavior of orterun (mpirun, mpirexec) to search for
|
||
argv[0] and the cwd on the target node (i.e., the node where the
|
||
executable will be running in all systems except BProc, where the
|
||
searches are run on the node where orterun is invoked).
|
||
- Fix race condition in shared memory transport that could cause
|
||
crashes on machines with weak memory consistency models (including
|
||
POWER/PowerPC machines).
|
||
- Fix warnings about setting read-only MCA parameters on bproc systems.
|
||
- Change the exit status set by mpirun when an application process is
|
||
killed by a signal. The exit status is now set to signo + 128, which
|
||
conforms with the behavior of (almost) all shells.
|
||
- Correct a datatype problem with the convertor when partially
|
||
unpacking data. Now we can position the convertor to any position
|
||
not only on the predefined types boundaries. Thanks to Yvan Fournier
|
||
for reporting this to us.
|
||
- Fix a number of standard I/O forwarding issues, including the
|
||
ability to background mpirun and a loss of data issue when
|
||
redirecting mpirun's standard input from a file.
|
||
- Fixed bug in ompi_info where rcache and bml MCA parameters would not
|
||
be displayed.
|
||
- Fixed umask issues in the session directory. Thanks to Glenn Morris
|
||
for reporting this to us.
|
||
- Fixed tcsh-based LD_LIBRARY_PATH issues with --prefix. Thanks to
|
||
Glen Morris for identifying the problem and suggesting the fix.
|
||
- Removed extraneous \n's when setting PATH and LD_LIBRARY_PATH in the
|
||
rsh startup. Thanks to Glen Morris for finding these typos.
|
||
- Fixed missing constants in MPI C++ bindings.
|
||
- Fixed some errors caused by threading issues.
|
||
- Fixed openib BTL flow control logic to not overrun the number of
|
||
send wqes available.
|
||
- Update to match newest OpenIB user-level library API. Thanks to
|
||
Roland Dreier for submitting this patch.
|
||
- Report errors properly when failing to register memory in the openib
|
||
BTL.
|
||
- Reduce memory footprint of openib BTL.
|
||
- Fix parsing problem with mpirun's "-tv" switch. Thanks to Chris
|
||
Gottbrath for supplying the fix.
|
||
- Fix Darwin net/if.h configure warning.
|
||
- The GNU assembler unbelievably defaults to making stacks executable.
|
||
So when using gas, add flags to explicitly tell it to not make
|
||
stacks executable (lame but necessary).
|
||
- Add missing MPI::Request::Get_status() methods. Thanks to Bill
|
||
Saphir for pointing this out to us.
|
||
- Improved error messages on memory registration errors (e.g., when
|
||
using high-speed networks).
|
||
- Open IB support now checks firmware for how many outstanding RDMA
|
||
requests are supported. Thanks to Mellanox for pointing this out to
|
||
us.
|
||
- Enable printing of stack traces in MPI processes upon SIGBUS,
|
||
SIGSEGV, and SIGFPE if the platform supports it.
|
||
- Fixed F90 compilation support for the Lahey compiler.
|
||
- Fixed issues with ROMIO shared library support.
|
||
- Fixed internal accounting problems with rsh support.
|
||
- Update to GNU Libtool 1.5.22.
|
||
- Fix error in configure script when setting CCAS to ias (the Intel
|
||
assembler).
|
||
- Added missing MPI::Intercomm collectives.
|
||
- Fixed MPI_IN_PLACE handling for Fortran collectives.
|
||
- Fixed some more C++ const_cast<> issues. Thanks for Martin Audet
|
||
(again) for bringing this to our attention.
|
||
- Updated ROMIO with the version from MPICH 1.2.7p1, marked as version
|
||
2005-06-09.
|
||
- Fixes for some cases where the use of MPI_BOTTOM could cause
|
||
problems.
|
||
- Properly handle the case where an mVAPI does not have shared receive
|
||
queue support (such as the one shipped by SilverStorm / Infinicon
|
||
for OS X).
|
||
|
||
|
||
1.0.1: 12 Dec 2005
|
||
------------------
|
||
|
||
- Fixed assembly on Solaris AMD platforms. Thanks to Pierre Valiron
|
||
for bringing this to our attention.
|
||
- Fixed long messages in the send-to-self case.
|
||
- Ensure that when the "leave_pinned" option is used, the memory hooks
|
||
are also enabled. Thanks to Gleb Natapov for pointing this out.
|
||
- Fixed compile errors for IRIX.
|
||
- Allow hostfiles to have integer host names (for BProc clusters).
|
||
- Fixed a problem with message matching of out-of-order fragments in
|
||
multiple network device scenarios.
|
||
- Converted all the C++ MPI bindings to use proper const_cast<>'s
|
||
instead of old C-style casts to get rid of const-ness. Thanks to
|
||
Martin Audet for raising the issue with us.
|
||
- Converted MPI_Offset to be a typedef instead of a #define because it
|
||
causes problems for some C++ parsers. Thanks to Martin Audet for
|
||
bringing this to our attention.
|
||
- Improved latency of TCP BTL.
|
||
- Fixed index value in MPI_TESTANY to be MPI_UNDEFINED if some
|
||
requests were not MPI_REQUEST_NULL, but no requests finished.
|
||
- Fixed several Fortran MPI API implementations that incorrectly used
|
||
integers instead of logicals or address-sized integers.
|
||
- Fix so that Open MPI correctly handles the Fortran value for .TRUE.,
|
||
regardless of what the Fortran compiler's value for .TRUE. is.
|
||
- Improved scalability of MX startup.
|
||
- Fix datatype offset handling in the coll basic component's
|
||
MPI_SCATTERV implementation.
|
||
- Fix EOF handling on stdin.
|
||
- Fix missing MPI_F_STATUS_IGNORE and MPI_F_STATUSES_IGNORE
|
||
instanatiations. Thanks to Anthony Chan for pointing this out.
|
||
- Add a missing value for MPI_WIN_NULL in mpif.h.
|
||
- Bring over some fixes for the sm btl that somehow didn't make it
|
||
over from the trunk before v1.0. Thanks to Beth Tibbitts and Bill
|
||
Chung for helping identify this issue.
|
||
- Bring over some fixes for the iof that somehow didn't make it over
|
||
from the trunk before v1.0.
|
||
- Fix for --with-wrapper-ldflags handling. Thanks to Dries Kimpe for
|
||
pointing this out to us.
|
||
|
||
|
||
1.0: 17 Nov 2005
|
||
----------------
|
||
|
||
Initial public release.
|