openmpi/orte/mca/plm/base
Ralph Castain b59ae14a2a Fix static port and partial allocation operations
Fix static port wireup by recording the TCP port mpirun is using and correctly passing the regex of hosts to the daemons. Do a better job of closing sockets on failed connection attempts. Correctly identify the remote host in the associated error message.

Fix partial allocation operations by not attempting to set #slots on nodes that were not used, and thus don't have a daemon or topology assigned to them

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-28 10:09:44 -08:00
| File | Last commit | Date |
| --- | --- | --- |
| .. | | |
| base.h | Per the discussion on the telecon, change the -host behavior so we only run one instance if no slots were provided and the user didn't specify #procs to run. However, if no slots are given and the user does specify #procs, then let the number of slots default to the #found processing elements | 2016-03-29 11:21:57 -07:00 |
| help-plm-base.txt | Purge whitespace from the repo | 2015-06-23 20:59:57 -07:00 |
| Makefile.am | Remove the last involvement of the OOB system from the MPI layer, remove the no-longer-needed usock/oob component, and have procs no longer open the RML, OOB, ROUTED, and GRPCOMM frameworks as PMIx now provides all required app-mpirun cmds | 2015-09-15 13:08:35 -07:00 |
| owner.txt | add owner files to opa/ompi/orte mca directories | 2015-02-22 15:10:23 -07:00 |
| plm_base_frame.c | Integrate PMIx 1.0 with OMPI. | 2015-08-29 16:04:10 -07:00 |
| plm_base_jobid.c | Convert the orte_job_data pointer array to a hash table so it doesn't grow forever as we run lots and lots of jobs in the persistent DVM. | 2016-02-21 11:55:49 -08:00 |
| plm_base_launch_support.c | Fix static port and partial allocation operations | 2017-01-28 10:09:44 -08:00 |
| plm_base_orted_cmds.c | more c99 updates | 2015-06-25 10:14:13 -06:00 |
| plm_base_receive.c | Revert e9bc293 and try a different approach for scalably dealing with hetero clusters. Have each orted send back its topo "signature". If mpirun detects that this signature has not been seen before, then ask for that daemon to send back its full topology description. This allows the system to only get the topology once for each unique topo in the cluster. | 2017-01-18 10:22:15 -08:00 |
| plm_base_select.c | mca/base: add priority output to mca_base_select | 2015-10-19 12:32:41 -06:00 |
| plm_private.h | Revert e9bc293 and try a different approach for scalably dealing with hetero clusters. Have each orted send back its topo "signature". If mpirun detects that this signature has not been seen before, then ask for that daemon to send back its full topology description. This allows the system to only get the topology once for each unique topo in the cluster. | 2017-01-18 10:22:15 -08:00 |