openmpi/ompi/runtime/help-mpi-runtime.txt

# -*- text -*-
#
# Copyright (c) 2004-2005 The Trustees of Indiana University and Indiana
#                         University Research and Technology
#                         Corporation.  All rights reserved.
# Copyright (c) 2004-2005 The University of Tennessee and The University
#                         of Tennessee Research Foundation.  All rights
#                         reserved.
# Copyright (c) 2004-2005 High Performance Computing Center Stuttgart, 
#                         University of Stuttgart.  All rights reserved.
# Copyright (c) 2004-2005 The Regents of the University of California.
#                         All rights reserved.
# Copyright (c) 2007-2012 Cisco Systems, Inc.  All rights reserved.
# Copyright (c) 2013      NVIDIA Corporation.  All rights reserved.
# $COPYRIGHT$
# 
# Additional copyrights may follow
# 
# $HEADER$
#
# This is the US/English general help file for Open MPI.
#
[mpi_init:startup:internal-failure]
It looks like %s failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during %s; some of which are due to configuration or environment
problems.  This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):

  %s
  --> Returned "%s" (%d) instead of "Success" (0)
#
[mpi_init:startup:pml-add-procs-fail]
MPI_INIT has failed because at least one MPI process is unreachable
from another.  This *usually* means that an underlying communication
plugin -- such as a BTL or an MTL -- has either not loaded or not
allowed itself to be used.  Your MPI job will now abort.

You may wish to try to narrow down the problem;

 * Check the output of ompi_info to see which BTL/MTL plugins are
   available.
 * Run your application with MPI_THREAD_SINGLE.
 * Set the MCA parameter btl_base_verbose to 100 (or mtl_base_verbose,
   if using MTL-based communications) to see exactly which
   communication plugins were considered and/or discarded.
#
[mpi-param-check-enabled-but-compiled-out]
WARNING: The MCA parameter mpi_param_check has been set to true, but
parameter checking has been compiled out of Open MPI.  The
mpi_param_check value has therefore been ignored.
[mpi_finalize:invoked_multiple_times]
The function MPI_FINALIZE was invoked multiple times in a single
process on host %s, PID %d.  

This indicates an erroneous MPI program; MPI_FINALIZE is only allowed
to be invoked exactly once in a process.
#
[proc:heterogeneous-support-unavailable]
The build of Open MPI running on host %s was not 
compiled with heterogeneous support.  A process running on host 
%s appears to have a different architecture,
which will not work.  Please recompile Open MPI with the
configure option --enable-heterogeneous or use a homogeneous
environment.
#
[sparse groups enabled but compiled out]
WARNING: The MCA parameter mpi_use_sparse_group_storage has been set
to true, but sparse group support was not compiled into Open MPI.  The
mpi_use_sparse_group_storage value has therefore been ignored.
#
[heterogeneous-support-unavailable]
This installation of Open MPI was configured without support for
heterogeneous architectures, but at least one node in the allocation
was detected to have a different architecture. The detected node was:

Node: %s

In order to operate in a heterogeneous environment, please reconfigure
Open MPI with --enable-heterogeneous.
#
[ompi mpi abort:cannot guarantee all killed]
An MPI process is aborting at a time when it cannot guarantee that all
of its peer processes in the job will be killed properly.  You should
double check that everything has shut down cleanly.

  Reason:     %s
  Local host: %s
  PID:        %d
#
[no cuda support]
The user requested CUDA support with the --mca mpi_cuda_support 1 flag
but the library was not compiled with any support.
First cut of the Show Help Subsystem (SHS) - see src/util/show_help.h for details (doxygen); main function call is ompi_show_help() - text message files are expected to be located in $pkgdatadir (usually $prefix/share/openmpi). Anyone can install a text file in $pkgdatadir with their message(s) in it and then have them displayed via ompi_show_help(). "pkgdata_DATA" is the keyword to use in Makefile.am's, for example (from src/mca/base/Makefile.am): pkgdata_DATA = help-mca-base.txt - added a few examples in the code base using ompi_show_help(), but not too many -- can convert more "show_help" comments in the code over time; no huge rush. :-) - no i18n-like support yet; waiting for advice and consensus from other developers This commit was SVN r2519. 2004-09-05 20:05:37 +04:00			`# -- text --`
			`#`
Update the copyright notices for IU and UTK. This commit was SVN r7999. 2005-11-05 22:57:48 +03:00			`# Copyright (c) 2004-2005 The Trustees of Indiana University and Indiana`
			`# University Research and Technology`
			`# Corporation. All rights reserved.`
			`# Copyright (c) 2004-2005 The University of Tennessee and The University`
			`# of Tennessee Research Foundation. All rights`
			`# reserved.`
Add HLRS copyright This commit was SVN r3665. 2004-11-28 23:09:25 +03:00			`# Copyright (c) 2004-2005 High Performance Computing Center Stuttgart,`
			`# University of Stuttgart. All rights reserved.`
Add UC copyright This commit was SVN r5009. 2005-03-24 15:43:37 +03:00			`# Copyright (c) 2004-2005 The Regents of the University of California.`
			`# All rights reserved.`
Per RFC, bring in the following changes: * Remove paffinity, maffinity, and carto frameworks -- they've been wholly replaced by hwloc. * Move ompi_mpi_init() affinity-setting/checking code down to ORTE. * Update sm, smcuda, wv, and openib components to no longer use carto. Instead, use hwloc data. There are still optimizations possible in the sm/smcuda BTLs (i.e., making multiple mpools). Also, the old carto-based code found out how many NUMA nodes were ''available'' -- not how many were used ''in this job''. The new hwloc-using code computes the same value -- it was not updated to calculate how many NUMA nodes are used ''by this job.'' * Note that I cannot compile the smcuda and wv BTLs -- I ''think'' they're right, but they need to be verified by their owners. * The openib component now does a bunch of stuff to figure out where "near" OpenFabrics devices are. '''THIS IS A CHANGE IN DEFAULT BEHAVIOR!!''' and still needs to be verified by OpenFabrics vendors (I do not have a NUMA machine with an OpenFabrics device that is a non-uniform distance from multiple different NUMA nodes). * Completely rewrite the OMPI_Affinity_str() routine from the "affinity" mpiext extension. This extension now understands hyperthreads; the output format of it has changed a bit to reflect this new information. * Bunches of minor changes around the code base to update names/types from maffinity/paffinity-based names to hwloc-based names. * Add some helper functions into the hwloc base, mainly having to do with the fact that we have the hwloc data reporting ''all'' topology information, but sometimes you really only want the (online \| available) data. This commit was SVN r26391. 2012-05-07 18:52:54 +04:00			`# Copyright (c) 2007-2012 Cisco Systems, Inc. All rights reserved.`
Remove tabs for spaces, fix some error messages. This commit was SVN r28141. 2013-03-01 23:13:06 +04:00			`# Copyright (c) 2013 NVIDIA Corporation. All rights reserved.`
First cut at copyrights: IU, UTK, and some OSU. LANL and HLRS still pending. This commit was SVN r3655. 2004-11-22 04:38:40 +03:00			`# $COPYRIGHT$`
			`#`
			`# Additional copyrights may follow`
			`#`
First cut of the Show Help Subsystem (SHS) - see src/util/show_help.h for details (doxygen); main function call is ompi_show_help() - text message files are expected to be located in $pkgdatadir (usually $prefix/share/openmpi). Anyone can install a text file in $pkgdatadir with their message(s) in it and then have them displayed via ompi_show_help(). "pkgdata_DATA" is the keyword to use in Makefile.am's, for example (from src/mca/base/Makefile.am): pkgdata_DATA = help-mca-base.txt - added a few examples in the code base using ompi_show_help(), but not too many -- can convert more "show_help" comments in the code over time; no huge rush. :-) - no i18n-like support yet; waiting for advice and consensus from other developers This commit was SVN r2519. 2004-09-05 20:05:37 +04:00			`# $HEADER$`
			`#`
			`# This is the US/English general help file for Open MPI.`
			`#`
			`[mpi_init:startup:internal-failure]`
			`It looks like %s failed for some reason; your parallel process is`
			`likely to abort. There are many reasons that a parallel process can`
			`fail during %s; some of which are due to configuration or environment`
			`problems. This failure appears to be an internal failure; here's some`
			`additional information (which may only be relevant to an Open MPI`
			`developer):`

			`%s`
Print the string name of the return code This commit was SVN r7789. 2005-10-18 00:47:44 +04:00			`--> Returned "%s" (%d) instead of "Success" (0)`
If we get OMPI_ERR_UNREACH from the PML, print a slightly more specific error. Suggested by Nick Edmonds: http://www.open-mpi.org/community/lists/users/2010/03/12339.php This commit was SVN r22828. 2010-03-14 03:09:55 +03:00			`#`
			`[mpi_init:startup:pml-add-procs-fail]`
			`MPI_INIT has failed because at least one MPI process is unreachable`
			`from another. This usually means that an underlying communication`
			`plugin -- such as a BTL or an MTL -- has either not loaded or not`
			`allowed itself to be used. Your MPI job will now abort.`

			`You may wish to try to narrow down the problem;`

			`* Check the output of ompi_info to see which BTL/MTL plugins are`
			`available.`
			`* Run your application with MPI_THREAD_SINGLE.`
			`* Set the MCA parameter btl_base_verbose to 100 (or mtl_base_verbose,`
			`if using MTL-based communications) to see exactly which`
			`communication plugins were considered and/or discarded.`
			`#`
Change and add new features to the MCA parameter system: - new preferred API calls for registering MCA parameters are mca_base_param_reg_{int\|string} and mca_base_param_reg_{int\|string}_name. - See opal/mca/base/mca_base_param.h for docs on new calls. - Can now register and lookup a value at the same time. - Can now mark a parameter "read only" at registration time - Can now mark a parameter "internal" at registration time - Can now associate a help message with the parameter at registration time; displayed in the ompi_info output. The old API calls are still available for backwards compatibility (mca_base_param_register_{int\|string}. They will eventually be removed -- all developers are encouraged to use the new APIs from here on out and replace any old calls with the new API. Some params were also renamed -- the previous convention of using "base_" as a prefix for any param that was not associated with a component is henceforth deprecated. Instead, use one of the following prefixes: mca: for anything in the MCA base itself opal: for anything in OPAL orte: for anything in ORTE mpi: for anything in OMPI This commit was SVN r6698. 2005-08-02 02:38:17 +04:00			`[mpi-param-check-enabled-but-compiled-out]`
			`WARNING: The MCA parameter mpi_param_check has been set to true, but`
			`parameter checking has been compiled out of Open MPI. The`
			`mpi_param_check value has therefore been ignored.`
After extensive conversations about this... - My original patch stands: MPI_FINALIZE directly invokes the attribute callbacks on MPI_COMM_SELF - We added some user-level checks to ensure that they don't call MPI_FINALIZE twice (this isn't really required, but it will prevent whacky segv's -- they'll at least get a nice error message) - Removed the attribute callbacks on MPI_COMM_SELF from ompi_mpi_comm_finalize (i.e., we just moved them from ompi_mpi_comm_finalize to ompi_mpi_finalize -- we just moved this process up earlier in the MPI_FINALIZE sequence of events) - Because there were so many conversations about this, here's the rationale: - MPI-2:4.8 says that we have to MPI_COMM_FREE MPI_COMM_SELF so that the attribute callbacks are invoked. - After considerable discussion, we came to the conclusion that FREE'ing COMM_SELF is not the issue -- calling the callbacks is the issue. - So it is sufficent for MPI_FINALIZE to directly invoke these attribute callbacks - The attribute callbacks are not invoked on other communicators because said communicators are not MPI_COMM_FREE'ed This commit was SVN r9628. 2006-04-13 21:00:36 +04:00			`[mpi_finalize:invoked_multiple_times]`
			`The function MPI_FINALIZE was invoked multiple times in a single`
			`process on host %s, PID %d.`

			`This indicates an erroneous MPI program; MPI_FINALIZE is only allowed`
			`to be invoked exactly once in a process.`
Fix a few issues with error messages: * If something goes wrong during ompi_mpi_init, don't erroneously report that it is illegal to invoke MPI_INIT* before MPI_INIT * Aggregate help messages when possible when something goes wring during ompi_mpi_init This commit was SVN r24492. 2011-03-07 19:45:45 +03:00			`#`
Heterogeneous support changes: * Add line about heterogeneous support to ompi_info output * Print warning and abort if heterogeneous detected and no heterogeneous support available. Refs trac:587 This commit was SVN r12943. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587 2006-12-30 20:13:18 +03:00			`[proc:heterogeneous-support-unavailable]`
			`The build of Open MPI running on host %s was not`
			`compiled with heterogeneous support. A process running on host`
			`%s appears to have a different architecture,`
			`which will not work. Please recompile Open MPI with the`
			`configure option --enable-heterogeneous or use a homogeneous`
			`environment.`
Merging in the Sparse Groups.. This commit includes config changes.. This commit was SVN r15764. 2007-08-04 04:41:26 +04:00			`#`
			`[sparse groups enabled but compiled out]`
			`WARNING: The MCA parameter mpi_use_sparse_group_storage has been set`
			`to true, but sparse group support was not compiled into Open MPI. The`
			`mpi_use_sparse_group_storage value has therefore been ignored.`
Fold in the revised modex scheme. Move the ompi_proc_t modex portions to the RTE level since the daemons already have that info. Provide each process with the equivalent of a "nidmap" - both a map of what nodes are in the job, and a map of which node each process is on. This enables the use of static ports, though that hasn't been turned "on" in this commit. Update the rsh tree spawn capability so we spawn the next wave of daemons before launching our own local procs. Add an ability to encode nodenames for large clusters with contiguous node name numbering schemes - this allows communication of all node names in a few bytes instead of tens-of-bytes/node. This commit was SVN r18338. 2008-04-30 23:49:53 +04:00			`#`
			`[heterogeneous-support-unavailable]`
			`This installation of Open MPI was configured without support for`
			`heterogeneous architectures, but at least one node in the allocation`
			`was detected to have a different architecture. The detected node was:`

			`Node: %s`

			`In order to operate in a heterogeneous environment, please reconfigure`
			`Open MPI with --enable-heterogeneous.`
Provide a warning message if a user's app executes a "fork" operation while using subsystems that may not cleanly support it - e.g., the openib btl. The provided warning is a generic one indicating that use of fork in current conditions is not recommended. This is setup so that it only is issued once (as opposed to every time they do it), and goes through orte_show_help so the user doesn't get hammered by #procs copies of the warning. In addition, there is a new MCA param (can't have too many!) to shut the warning off altogether. This closes ticket #1244 This commit was SVN r19196. 2008-08-06 18:22:03 +04:00			`#`
Fix a few issues with error messages: * If something goes wrong during ompi_mpi_init, don't erroneously report that it is illegal to invoke MPI_INIT* before MPI_INIT * Aggregate help messages when possible when something goes wring during ompi_mpi_init This commit was SVN r24492. 2011-03-07 19:45:45 +03:00			`[ompi mpi abort:cannot guarantee all killed]`
			`An MPI process is aborting at a time when it cannot guarantee that all`
			`of its peer processes in the job will be killed properly. You should`
			`double check that everything has shut down cleanly.`

			`Reason: %s`
			`Local host: %s`
			`PID: %d`
Remove tabs for spaces, fix some error messages. This commit was SVN r28141. 2013-03-01 23:13:06 +04:00			`#`
			`[no cuda support]`
			`The user requested CUDA support with the --mca mpi_cuda_support 1 flag`
			`but the library was not compiled with any support.`