1
1
openmpi/ompi/mca/mtl/psm
Ralph Castain 45e695928f As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time:
* add a new MCA param orte_hostname_cutoff to specify the number of nodes at which we stop including hostnames. This defaults to INT_MAX => always include hostnames. If a value is given, then we will include hostnames for any allocation smaller than the given limit.

* remove ompi_proc_get_hostname. Replace all occurrences with a direct link to ompi_proc_t's proc_hostname, protected by appropriate "if NULL"

* modify the OMPI-ORTE integration component so that any call to modex_recv automatically loads the ompi_proc_t->proc_hostname field as well as returning the requested info. Thus, any process whose modex info you retrieve will automatically receive the hostname. Note that on-demand retrieval is still enabled - i.e., if we are running under direct launch with PMI, the hostname will be fetched upon first call to modex_recv, and then the ompi_proc_t->proc_hostname field will be loaded

* removed a stale MCA param "mpi_keep_peer_hostnames" that was no longer used anywhere in the code base

* added an envar lookup in ess/pmi for the number of nodes in the allocation. Sadly, PMI itself doesn't provide that info, so we have to get it a different way. Currently, we support PBS-based systems and SLURM - for any other, rank0 will emit a warning and we assume max number of daemons so we will always retain hostnames

This commit was SVN r29052.
2013-08-20 18:59:36 +00:00
..
configure.m4 Revamp the handling of wrapper compiler flags. The user flags, main configure 2013-01-29 00:00:43 +00:00
help-mtl-psm.txt Add in support to specify IB path record query mechanism and IB Application/Service ID for PSM MTL. Also fix a minor bug in calculating the minimum connection timeout. 2010-01-13 18:58:00 +00:00
Makefile.am WARNING: Work on the temp branch being merged here encountered problems with bugs in subversion. Considerable effort has gone into validating the branch. However, not all conditions can be checked, so users are cautioned that it may be advisable to not update from the trunk for a few days to allow MTT to identify platform-specific issues. 2010-09-17 23:04:06 +00:00
mtl_psm_cancel.c ... Delayed due to notifier commits earlier this day ... 2009-04-29 01:32:14 +00:00
mtl_psm_component.c fix non-orte builds of PSM 2013-07-21 19:12:32 +00:00
mtl_psm_endpoint.c ... Delayed due to notifier commits earlier this day ... 2009-04-29 01:32:14 +00:00
mtl_psm_endpoint.h Update libevent to the 2.0 series, currently at 2.0.7rc. We will update to their final release when it becomes available. Currently known errors exist in unused portions of the libevent code. This revision passes the IBM test suite on a Linux machine and on a standalone Mac. 2010-10-24 18:35:54 +00:00
mtl_psm_probe.c fix copy/paste typo 2012-05-24 16:06:20 +00:00
mtl_psm_recv.c mtl/psm: add missing header for opal_show_help (one more) 2013-03-05 00:18:51 +00:00
mtl_psm_request.h - Split the datatype engine into two parts: an MPI specific part in 2009-07-13 04:56:31 +00:00
mtl_psm_send.c Convert a few opal_output() calls to instead use orte_show_help() as well as do some minor cosmetic changes dealing with tab spacing and c-blocks being enclosed with \{\}. There was also a long standing bug with the PSM mtl if the number of hardware contexts on adapter were less than the number of cores on a node (The default case is they are the same hence no issues were reported). For completeness we take care of this case as well but it requires us to tell PSM how many local processes are running on a node and the local rank of the process on a node so it can allocate the available hardware contexts appropriately. 2009-07-30 02:55:20 +00:00
mtl_psm_types.h MCA/base: Add new MCA variable system 2013-03-27 21:09:41 +00:00
mtl_psm.c As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: 2013-08-20 18:59:36 +00:00
mtl_psm.h Checking for NULL function pointers and direct-call semantics can't work 2012-03-26 19:27:03 +00:00
post_configure.sh Initial commit of QLogic PSM MTL. 2006-09-14 16:44:02 +00:00