Copyright (c) 2004-2006 The Trustees of Indiana University and Indiana University Research and Technology Corporation. All rights reserved. Copyright (c) 2004-2006 The University of Tennessee and The University of Tennessee Research Foundation. All rights reserved. Copyright (c) 2004-2006 High Performance Computing Center Stuttgart, University of Stuttgart. All rights reserved. Copyright (c) 2004-2006 The Regents of the University of California. All rights reserved. Copyright (c) 2006 Cisco Systems, Inc. All rights reserved. Copyright (c) 2006 Voltaire, Inc. All rights reserved. $COPYRIGHT$ Additional copyrights may follow $HEADER$ =========================================================================== This file contains the main features as well as overviews of specific bug fixes (and other actions) for each version of Open MPI since version 1.0. 1.2 --- - Really check that the $CXX given to configure is a C++ compiler (not a C compiler that "sorta works" as a C++ compiler). - Properly check for local host only addresses properly, looking for 127.0.0.0/8, rather than just 127.0.0.1. - Addition of OpenIB-based unreliable datagram BTL ("ud"). - Addition of UDAPL BTL ("udapl"). 1.1.1 ----- - Addition of "dr" (data reliability) PML. - Added capability to "mpirun a.out" (without specifying -np) that will run on all currently-allocated resources (e.g., within a batch job such as SLURM, Torque, etc.). - Fix a bug with one particular case of MPI_BCAST. Thanks to Doug Gregor for identifying the problem. - Ensure that the shared memory mapped file is only created when there is more than one process on a node. - Fixed problems with BProc stdin forwarding. - Fixed problem with MPI_TYPE_INDEXED datatypes. Thanks to Yven Fournier for identifying this problem. - Fix some thread safety issues in MPI attributes and the openib BTL. - Fix the BProc allocator to not potentially use the same resources across multiple ORTE universes. - Fix gm resource leak. - More latency reduction throughout the code base. - Make the TM PLS (PBS Pro, Torque, Open PBS) more scalable, and fix some latent bugs that crept in v1.1. Thanks to the Thunderbird crew at Sandia National Laboratories and Martin Schaffoner for access to testing facilities to make this happen. - Added new command line options to mpirun: --nolocal: Do not run any MPI processes on the same node as mpirun (compatibility with the OSC mpiexec launcher) --quiet / -q: do not show spurious status messages --version / -V: show the version of Open MPI - Fix bus error in XGrid process starter. Thanks to Eric Thibodeau for the fix. - Fix data size mismatches that caused memory errors on PPC64 platforms during the startup of the openib BTL. - Allow propagation of SIGUSR1 and SIGUSR2 signals from mpirun to back-end MPI processes. - Add missing MPI::Is_finalized() function. 1.1 --- - Various MPI datatype fixes, optimizations. - Fixed various problems on the SPARC architecture (e.g., not correctly aligning addresses within structs). - Addition of "dr" pml component (data reliability) for running in unreliable networks (such as those at high altitudes). Includes functionality for retransmission due to problems with end-to-end data content checking or loss of data in the network. - Improvements in various run-time error messages to be more clear about what they mean and where the errors are occurring. - Various fixes to mpirun's handling of --prefix. - Updates and fixes for Cray/Red Storm support. - Major improvements to the Fortran 90 MPI bindings: - General improvements in compile/linking time and portability between different F90 compilers. - Addition of "trivial", "small" (the default), and "medium" Fortran 90 MPI module sizes (v1.0.x's F90 module was equivalent to "medium"). See the README file for more explanation. - Fix various MPI F90 interface functions and constant types to match. Thanks to Michael Kluskens for pointing out the problems to us. - Allow short messagees to use RDMA (vs. send/receive semantics) to a limited number peers in both the mvapi and openib BTL components. This reduces communication latency over IB channels. - Numerous performance improvements throughout the entire code base. - Many minor threading fixes. - Add a define OMPI_SKIP_CXX to allow the user to skip the mpicxx.h from being included in mpi.h. It allows the user to compile C code with a CXX compiler without including the CXX bindings. - PERUSE support has been added. In order to activate it add --enable-peruse to the configure options. All events described in the PERUSE 2.0 draft are supported, plus one Open MPI extension. PERUSE_COMM_REQ_XFER_CONTINUE allow to see how the data is segmented internally, using multiple interfaces or the pipeline engine. However, this version only support one event of each type simultaneously attached to a communicator. - Add support for running jobs in heterogeneous environments. Currently supports environments with different endianness and different representations of C++ bool and Fortran LOGICAL. Mismatched sizes for other datatypes is not supported. - Open MPI now includes an implementation of the MPI-2 One-Sided Communications specification. - Open MPI is now configurable in cross-compilation environments. Several Fortran 77 and Fortran 90 tests need to be pre-seeded with results from a config.cache-like file. - Add --debug option to mpirun to generically invoke a parallel debugger. 1.0.3 ----- - Fix a problem in the launch system that could cause inconsistent launch behavior, particularly when launching large jobs. - Require that the openib BTL find . Thanks to Josh Aune for the suggestion. - Include updates to support the upcoming Autoconf 2.60 and Libtool 2.0. Thanks to Ralf Wildenhues for all the work! - Fix bug with infinite loop in the "round robin" process mapper. Thanks to Paul Donohue for reporting the problem. - Enusre that memory hooks are removed properly during MPI_FINALIZE. Thanks to Neil Ludban for reporting the problem. - Various fixes to the included support for ROMIO. - Fix to ensure that MPI_LONG_LONG and MPI_LONG_LONG_INT are actually synonyms, as defined by the MPI standard. Thanks to Martin Audet for reporting this. - Fix Fortran 90 configure tests to properly utilize LDFLAGS and LIBS. Thanks to Terry Reeves for reporting the problem. - Fix shared memory progression in asynchronous progress scenarios. Thanks to Mykael Bouquey for reporting the problem. - Fixed back-end operations for predefined MPI_PROD for some datatypes. Thanks to Bert Wesarg for reporting this. - Adapted configure to be able to handle Torque 2.1.0p0's (and above) new library name. Thanks to Brock Palen for pointing this out and providing access to a Torque 2.1.0p0 cluster to test with. - Fixed situation where mpirun could set a shell pipeline's stdout to non-blocking, causing the shell pipeline to prematurely fail. Thanks to Darrell Kresge for figuring out what was happening. - Fixed problems with leave_pinned that could cause Badness with the mvapi BTL. - Fixed problems with MPI_FILE_OPEN and non-blocking MPI-2 IO access. - Fixed various InfiniBand port matching issues during startup. Thanks to Scott Weitzenkamp for identifying these problems. - Fixed various configure, build and run-time issues with ROMIO. Thanks to Dries Kimpe for bringing them to our attention. - Fixed error in MPI_COMM_SPLIT when dealing with intercommunicators. Thanks to Bert Wesarg for identifying the problem. - Fixed backwards handling of "high" parameter in MPI_INTERCOMM_MERGE. Thanks to Michael Kluskens for pointing this out to us. - Fixed improper handling of string arguments in Fortran bindings for MPI-IO functionality - Fixed segmentation fault with 64 bit applications on Solaris when using the shared memory transports. - Fixed MPI_COMM_SELF attributes to free properly at the beginning of MPI_FINALIZE. Thanks to Martin Audet for bringing this to our attention. - Fixed alignment tests for cross-compiling to not cause errors with recent versions of GCC. 1.0.2 ----- - Fixed assembly race condition on AMD64 platforms. - Fixed residual .TRUE. issue with copying MPI attributes set from Fortran. - Remove unnecessary logic from Solaris pty I/O forwarding. Thanks to Francoise Roch for bringing this to our attention. - Fixed error when count = 0 was given for multiple completion MPI functions (MPI_TESTSOME, MPI_TESTANY, MPI_TESTALL, MPI_WAITSOME, MPI_WAITANY, MPI_WAITALL). - Better handling in MPI_ABORT for when peer processes have already died, especially under some resource managers. - Random updates to README file, to include notes about the Portland compilers. - Random, small threading fixes to prevent deadlock. - Fixed a problem with handling long mpirun app files. Thanks to Ravi Manumachu for identifying the problem. - Fix handling of strings in several of the Fortran 77 bindings. - Fix LinuxPPC assembly issues. Thanks to Julian Seward for reporting the problem. - Enable pty support for standard I/O forwarding on platforms that have ptys but do not have openpty(). Thanks to Pierre Valiron for bringing this to our attention. - Disable inline assembly for PGI compilers to avoid compiler errors. Thanks to Troy Telford for bringing this to our attention. - Added MPI_UNSIGNED_CHAR and MPI_SIGNED_CHAR to the allowed reduction types. - Fix a segv in variable-length message displays on Opterons running Solaris. Thanks to Pierre Valiron for reporting the issue. - Added MPI_BOOL to the intrinsic reduction operations MPI_LAND, MPI_LOR, MPI_LXOR. Thanks to Andy Selle for pointing this out to us. - Fixed TCP BTL network matching logic during MPI_INIT; in some cases on multi-NIC nodes, a NIC could get paired with a NIC on another network (typically resulting in deadlock). Thanks to Ken Mighell for pointing this out to us. - Change the behavior of orterun (mpirun, mpirexec) to search for argv[0] and the cwd on the target node (i.e., the node where the executable will be running in all systems except BProc, where the searches are run on the node where orterun is invoked). - Fix race condition in shared memory transport that could cause crashes on machines with weak memory consistency models (including POWER/PowerPC machines). - Fix warnings about setting read-only MCA parameters on bproc systems. - Change the exit status set by mpirun when an application process is killed by a signal. The exit status is now set to signo + 128, which conforms with the behavior of (almost) all shells. - Correct a datatype problem with the convertor when partially unpacking data. Now we can position the convertor to any position not only on the predefined types boundaries. Thanks to Yvan Fournier for reporting this to us. - Fix a number of standard I/O forwarding issues, including the ability to background mpirun and a loss of data issue when redirecting mpirun's standard input from a file. - Fixed bug in ompi_info where rcache and bml MCA parameters would not be displayed. - Fixed umask issues in the session directory. Thanks to Glenn Morris for reporting this to us. - Fixed tcsh-based LD_LIBRARY_PATH issues with --prefix. Thanks to Glen Morris for identifying the problem and suggesting the fix. - Removed extraneous \n's when setting PATH and LD_LIBRARY_PATH in the rsh startup. Thanks to Glen Morris for finding these typos. - Fixed missing constants in MPI C++ bindings. - Fixed some errors caused by threading issues. - Fixed openib BTL flow control logic to not overrun the number of send wqes available. - Update to match newest OpenIB user-level library API. Thanks to Roland Dreier for submitting this patch. - Report errors properly when failing to register memory in the openib BTL. - Reduce memory footprint of openib BTL. - Fix parsing problem with mpirun's "-tv" switch. Thanks to Chris Gottbrath for supplying the fix. - Fix Darwin net/if.h configure warning. - The GNU assembler unbelievably defaults to making stacks executable. So when using gas, add flags to explicitly tell it to not make stacks executable (lame but necessary). - Add missing MPI::Request::Get_status() methods. Thanks to Bill Saphir for pointing this out to us. - Improved error messages on memory registration errors (e.g., when using high-speed networks). - Open IB support now checks firmware for how many outstanding RDMA requests are supported. Thanks to Mellanox for pointing this out to us. - Enable printing of stack traces in MPI processes upon SIGBUS, SIGSEGV, and SIGFPE if the platform supports it. - Fixed F90 compilation support for the Lahey compiler. - Fixed issues with ROMIO shared library support. - Fixed internal accounting problems with rsh support. - Update to GNU Libtool 1.5.22. - Fix error in configure script when setting CCAS to ias (the Intel assembler). - Added missing MPI::Intercomm collectives. - Fixed MPI_IN_PLACE handling for Fortran collectives. - Fixed some more C++ const_cast<> issues. Thanks for Martin Audet (again) for bringing this to our attention. - Updated ROMIO with the version from MPICH 1.2.7p1, marked as version 2005-06-09. - Fixes for some cases where the use of MPI_BOTTOM could cause problems. - Properly handle the case where an mVAPI does not have shared receive queue support (such as the one shipped by SilverStorm / Infinicon for OS X). 1.0.1 ----- - Fixed assembly on Solaris AMD platforms. Thanks to Pierre Valiron for bringing this to our attention. - Fixed long messages in the send-to-self case. - Ensure that when the "leave_pinned" option is used, the memory hooks are also enabled. Thanks to Gleb Natapov for pointing this out. - Fixed compile errors for IRIX. - Allow hostfiles to have integer host names (for BProc clusters). - Fixed a problem with message matching of out-of-order fragments in multiple network device scenarios. - Converted all the C++ MPI bindings to use proper const_cast<>'s instead of old C-style casts to get rid of const-ness. Thanks to Martin Audet for raising the issue with us. - Converted MPI_Offset to be a typedef instead of a #define because it causes problems for some C++ parsers. Thanks to Martin Audet for bringing this to our attention. - Improved latency of TCP BTL. - Fixed index value in MPI_TESTANY to be MPI_UNDEFINED if some requests were not MPI_REQUEST_NULL, but no requests finished. - Fixed several Fortran MPI API implementations that incorrectly used integers instead of logicals or address-sized integers. - Fix so that Open MPI correctly handles the Fortran value for .TRUE., regardless of what the Fortran compiler's value for .TRUE. is. - Improved scalability of MX startup. - Fix datatype offset handling in the coll basic component's MPI_SCATTERV implementation. - Fix EOF handling on stdin. - Fix missing MPI_F_STATUS_IGNORE and MPI_F_STATUSES_IGNORE instanatiations. Thanks to Anthony Chan for pointing this out. - Add a missing value for MPI_WIN_NULL in mpif.h. - Bring over some fixes for the sm btl that somehow didn't make it over from the trunk before v1.0. Thanks to Beth Tibbitts and Bill Chung for helping identify this issue. - Bring over some fixes for the iof that somehow didn't make it over from the trunk before v1.0. - Fix for --with-wrapper-ldflags handling. Thanks to Dries Kimpe for pointing this out to us. 1.0 --- Initial public release.