e4c4e7df5e
To have any effect, ibv_fork_init() must be called at the beginning of the verbs initialization flow, before any call to the ibv_create_qp and ibv_create_cq verbs. The oob/ud code calls these functions, so by the time the other verbs components (btl openib, pml yalla, ...) call ibv_fork_init(), it is too late.

This commit forces the call to ibv_fork_init() (if requested) right at the beginning of every component that uses verbs. (ibv_fork_init() can safely be called multiple times.)

This commit also removes the btl_openib_want_fork_support MCA parameter and adds a new MCA parameter in its place: opal_verbs_want_fork_support. Through this new parameter, fork support may be requested for ALL components. Its default value is 1.

Before this commit, setting btl_openib_want_fork_support to 1 did not actually provide fork support for the openib btl: by the time openib called ibv_fork_init(), oob/ud had already made its ibv_create_* calls, and the call therefore failed.
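The ordering rule described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper name `request_fork_support` and the two stub functions are hypothetical, and a function pointer stands in for ibv_fork_init() (which takes no arguments and returns 0 on success or an errno value on failure) so that the sketch compiles without libibverbs. The failure message mirrors the [ibv_fork_init fail] help text.

```c
#include <errno.h>
#include <stdio.h>
#include <string.h>

/* Same shape as ibv_fork_init(): no arguments, 0 on success, errno on failure. */
typedef int (*fork_init_fn)(void);

/* Hypothetical stand-ins for ibv_fork_init(), used only for illustration. */
static int stub_calls = 0;
static int stub_fork_init_ok(void)   { stub_calls++; return 0; }
static int stub_fork_init_fail(void) { stub_calls++; return ENOMEM; }

/*
 * Hypothetical helper: each verbs-using component would call this as its
 * first step, before creating any QP or CQ.
 * Returns 0 if fork support was not requested or was enabled successfully,
 * otherwise the error code returned by fork_init.
 */
static int request_fork_support(int want_fork_support, fork_init_fn fork_init,
                                const char *hostname)
{
    if (!want_fork_support) {
        return 0;            /* not requested: nothing to do */
    }

    /* Repeating this call is harmless, per the commit message. */
    int rc = fork_init();
    if (rc != 0) {
        fprintf(stderr,
                "Fork support was requested but the library call "
                "ibv_fork_init() failed.\n\n"
                "Hostname: %s\n"
                "Error (%d): %s\n",
                hostname, rc, strerror(rc));
    }
    return rc;
}
```

In a real component the caller would pass ibv_fork_init itself and the value of opal_verbs_want_fork_support, and would invoke the helper before its first ibv_create_cq or ibv_create_qp call.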
55 lines
1.5 KiB
Plaintext
#
# Copyright (c) 2012-2014 Cisco Systems, Inc. All rights reserved.
#
# $COPYRIGHT$
#
# Additional copyrights may follow
#
# $HEADER$
#
[ibv_open_device fail]
Open MPI failed to open an OpenFabrics device. This is an unusual
error; the system reported the OpenFabrics device as being present,
but then later failed to access it successfully. This usually
indicates either a misconfiguration or a failed OpenFabrics hardware
device.

All OpenFabrics support has been disabled in this MPI process; your
job may or may not continue.

Hostname: %s
Device name: %s
Error (%d): %s
#
[ibv_query_device fail]
Open MPI failed to query an OpenFabrics device. This is an unusual
error; the system reported the OpenFabrics device as being present,
Open MPI was able to open it successfully, but then later failed to
query it successfully. This usually indicates either a
misconfiguration or a failed OpenFabrics hardware device.

All OpenFabrics support has been disabled in this MPI process; your
job may or may not continue.

Hostname: %s
Device name: %s
Error (%d): %s
#
[nonexistent port]
WARNING: One or more nonexistent OpenFabrics devices/ports were
specified:

Host: %s
MCA parameter: ompi_common_verbs_%sclude
Nonexistent entities: %s

These entities will be ignored. You can disable this warning by
setting the ompi_common_verbs_warn_nonexistent_if MCA parameter to 0.
#
[ibv_fork_init fail]
Fork support was requested but the library call ibv_fork_init() failed.

Hostname: %s
Error (%d): %s
#