1
1
openmpi/opal/tools/opal-restart/help-opal-restart.txt
Josh Hursey b749ecbab8 This commit fixes trac:2190.
Originally the patch was to improve the error message, but when digging into the code I found a subtle bug. If the daemon does not tell the HNP what CRS component it used, then the HNP tries to figure it out from the metadata (this is an uncommon case). The path the HNP used was not complete, so it was unable to find the metadata information. This patch fixes this by adding the 'snapshot_reference' to the 'snapshot_location' which completes the path for this search.

cmr:v1.4 (needs a custom patch)

cmr:v1.5

This commit was SVN r22479.

The following Trac tickets were found above:
  Ticket 2190 --> https://svn.open-mpi.org/trac/ompi/ticket/2190
2010-01-25 20:28:38 +00:00

73 строки
2.3 KiB
Plaintext

# -*- text -*-
#
# Copyright (c) 2004-2010 The Trustees of Indiana University and Indiana
# University Research and Technology
# Corporation. All rights reserved.
# Copyright (c) 2004-2005 The University of Tennessee and The University
# of Tennessee Research Foundation. All rights
# reserved.
# Copyright (c) 2004-2005 High Performance Computing Center Stuttgart,
# University of Stuttgart. All rights reserved.
# Copyright (c) 2004-2005 The Regents of the University of California.
# All rights reserved.
# Copyright (c) 2007 Evergrid, Inc. All rights reserved.
#
# $COPYRIGHT$
#
# Additional copyrights may follow
#
# $HEADER$
#
# This is the US/English help file for Open MPI checkpoint tool
#
[usage]
opal-restart FILENAME
Open PAL Single Process Restart Tool
%s
[invalid_filename]
Error: The filename (%s) is invalid because either you have not provided a filename
or provided an invalid filename.
Please see --help for usage.
[invalid_metadata]
Error: The local checkpoint contains invalid or incomplete metadata.
This usually indicates that the original checkpoint was invalid.
Check the metadata file (%s) in the following directory:
%s
[restart_cmd_failure]
Error: Unable to obtain the proper restart command to restart from the
checkpoint file (%s). Returned %d.
[comp_open_failure]
Error: Unable to open the %s framework.
[comp_select_failure]
Error: Unable to select the %s component needed to restart this
application. (Returned %d)
This likely indicates that the checkpointer needed is not
available on this machine. You should move to a machine that
has this checkpointer enabled.
[comp_select_mismatch]
Error: For an unknown reason the selected and requested components do
not match.
Expected Component: %s
Selected Component: %s
[restart_failure]
Error: The restart command:
shell$ %s
returned an error code %d, and was unable to restart properly.
[failed-to-exec]
Error: The restart command failed to properly exec the process per
the user's request. It is possible that the incorrect OPAL CRS
component was selected. Please confirm the following:
Expected Component: %s
Selected Component: %s