openmpi/ompi/runtime
Ralph Castain 45e695928f As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time:
* add a new MCA param orte_hostname_cutoff to specify the number of nodes at which we stop including hostnames. This defaults to INT_MAX => always include hostnames. If a value is given, then we will include hostnames for any allocation smaller than the given limit.

* remove ompi_proc_get_hostname. Replace all occurrences with a direct reference to ompi_proc_t's proc_hostname, protected by an appropriate "if NULL" check

* modify the OMPI-ORTE integration component so that any call to modex_recv automatically loads the ompi_proc_t->proc_hostname field as well as returning the requested info. Thus, any process whose modex info you retrieve will automatically receive the hostname. Note that on-demand retrieval is still enabled - i.e., if we are running under direct launch with PMI, the hostname will be fetched upon first call to modex_recv, and then the ompi_proc_t->proc_hostname field will be loaded

* removed a stale MCA param "mpi_keep_peer_hostnames" that was no longer used anywhere in the code base

* added an envar lookup in ess/pmi for the number of nodes in the allocation. Sadly, PMI itself doesn't provide that info, so we have to get it a different way. Currently, we support PBS-based systems and SLURM. For any other system, rank 0 will emit a warning and we assume the max number of daemons, so we will always retain hostnames
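
The node-count lookup and the cutoff decision described in the list above can be sketched roughly as follows. This is an illustrative sketch only, not the code from this commit: the helper names `lookup_num_nodes` and `include_hostnames` are hypothetical, and the environment variable names `SLURM_NNODES` and `PBS_NUM_NODES` are assumptions about what SLURM and PBS-based systems export. The unknown case is modeled simply as "retain hostnames", matching the outcome the commit message describes.

```c
#include <limits.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical helper: return the number of nodes in the allocation as
 * reported by the resource manager's environment, or -1 if unknown.
 * The variable names below are assumptions, not taken from the commit. */
static int lookup_num_nodes(void)
{
    static const char *vars[] = { "SLURM_NNODES", "PBS_NUM_NODES" };

    for (size_t i = 0; i < sizeof(vars) / sizeof(vars[0]); i++) {
        const char *val = getenv(vars[i]);
        if (val != NULL && *val != '\0') {
            char *end = NULL;
            long n = strtol(val, &end, 10);
            /* Accept only a fully parsed, positive value that fits in int. */
            if (*end == '\0' && n > 0 && n <= INT_MAX) {
                return (int)n;
            }
        }
    }
    return -1; /* unsupported environment: caller should warn */
}

/* Hypothetical helper: decide whether hostnames are included.
 * cutoff plays the role of orte_hostname_cutoff (default INT_MAX). */
static bool include_hostnames(int num_nodes, int cutoff)
{
    if (num_nodes < 0) {
        return true; /* node count unknown: always retain hostnames */
    }
    return num_nodes < cutoff; /* include only below the cutoff */
}
```

With the default cutoff of `INT_MAX`, `include_hostnames(lookup_num_nodes(), INT_MAX)` returns true for any realistic allocation, which corresponds to "always include hostnames".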

This commit was SVN r29052.
2013-08-20 18:59:36 +00:00
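
As a rough illustration of the "if NULL" protection mentioned in the commit message above, a caller that reads the hostname field directly might guard it as shown here. This is a minimal sketch under stated assumptions: `struct example_proc` is a hypothetical stand-in for `ompi_proc_t` that keeps only the `proc_hostname` field, and `hostname_or_unknown` is an invented helper, not a function from the Open MPI code base.

```c
#include <stddef.h>

/* Hypothetical stand-in for ompi_proc_t; only the field named in the
 * commit message is modeled here. */
struct example_proc {
    const char *proc_hostname; /* may be NULL if hostnames were not retained */
};

/* Hypothetical helper: return a printable hostname, falling back to a
 * placeholder when the field has not been loaded. */
static const char *hostname_or_unknown(const struct example_proc *proc)
{
    if (proc == NULL || proc->proc_hostname == NULL) {
        return "unknown";
    }
    return proc->proc_hostname;
}
```

A caller building an error message would pass the result of `hostname_or_unknown(proc)` to its format string instead of dereferencing `proc->proc_hostname` unconditionally.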
help-mpi-runtime.txt Remove tabs for spaces, fix some error messages. 2013-03-01 19:13:06 +00:00
Makefile.am Per RFC add initial support for the MPI 3.0 tools interface. 2013-04-24 15:59:23 +00:00
mpiruntime.h Per RFC add initial support for the MPI 3.0 tools interface. 2013-04-24 15:59:23 +00:00
ompi_cr.c MCA/base: Add new MCA variable system 2013-03-27 21:09:41 +00:00
ompi_cr.h Move the RTE framework change into the trunk. With this change, all non-CR 2013-01-27 23:25:10 +00:00
ompi_info_support.c Per Geoffroy Vallee, use the OPAL constant 2013-04-25 14:18:18 +00:00
ompi_info_support.h Add file missing from last commit 2013-04-24 16:01:23 +00:00
ompi_module_exchange.c Only free the opal_byte_object_t if the fetch operation was successful. 2013-03-25 00:36:57 +00:00
ompi_module_exchange.h Move the RTE framework change into the trunk. With this change, all non-CR 2013-01-27 23:25:10 +00:00
ompi_mpi_abort.c Move the RTE framework change into the trunk. With this change, all non-CR 2013-01-27 23:25:10 +00:00
ompi_mpi_finalize.c Update OMPI frameworks to use the MCA framework system. 2013-03-27 21:17:31 +00:00
ompi_mpi_init.c Per the RFC and discussion on the devel list, update the RTE-MPI error handling interface. There are a few differences in the code from the original RFC that came out of the discussion - I've captured those in the following writeup 2013-07-19 01:08:53 +00:00
ompi_mpi_params.c As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: 2013-08-20 18:59:36 +00:00
ompi_mpi_preconnect.c MCA/base: Add new MCA variable system 2013-03-27 21:09:41 +00:00
params.h As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: 2013-08-20 18:59:36 +00:00