19 Commits

Author SHA1 Message Date
Ralph Castain
6d6bc9bd07 Update alps module to new APIs
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-12 09:43:07 -07:00
Ralph Castain
48fc339718 Create an alternative mapping method that pushes responsibility
onto the backend daemons. By default, let mpirun only pack the app_context
info and send that to the backend daemons where the mapping will
be done. This significantly reduces the computational time on mpirun as it isn't
running up/down the topology tree computing thousands of binding
locations, and it reduces the launch message to a very small number of
bytes.

When running -novm, fall back to the old way of doing things
where mpirun computes the entire map and binding, and then sends
the full info to the backend daemon.

Add a new cmd line option/mca param --fwd-mpirun-port that allows
mpirun to dynamically select a port, but then passes that back to
all the other daemons so they will use that port as a static port
for their own wireup. In this mode, we no longer "phone home" directly
to mpirun, but instead use the static port to wireup at daemon
start. We then use the routing tree to rollup the initial
launch report, and limit the number of open sockets on mpirun's node.

Update ras simulator to track the new nidmap code

Cleanup some bugs in the nidmap regex code, and enhance the error message for not enough slots to include the host on which the problem is found.

Update gadget platform file

Initialize the range count when starting a new range

Fix the no-np case in managed allocation

Ensure DVM node usage gets cleaned up after each job

Update scaling.pl script to use --fwd-mpirun-port. Pre-connect the daemon to its parent during launch while we are otherwise waiting for the daemon's children to send their "phone home" rollup messages

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-07 20:43:12 -08:00
Ralph Castain
28abe78f8c Add new platform files. Modify scaling.pl to support ppn option
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-29 15:55:49 -08:00
Jeff Squyres
1187212f5d scaling.pl: minor change to perl quoting
Makes Emacs syntax highlighting work better.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-12-08 09:25:08 -08:00
Ralph Castain
d5a428b646 Scaling test should only launch one proc/node
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-12-08 09:24:22 -08:00
Ralph Castain
af9a55ccf1 Fix the session directory cleanup - only remove the jobfam session dir level if we are the local daemon and are cleaning up our own session directory.
Update the scaling test to run more trials and report the options being tested each time

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-12-03 09:59:18 -08:00
Ralph Castain
b11c9574d4 Remove debug and update copyright
2016-10-11 23:28:16 -07:00
Ralph Castain
a2326e3ba0 Update the scaling test to properly use orterun for orte-dvm tests, and extend by adding params for async mpi init/finalize
2016-10-11 23:24:52 -07:00
Ralph Castain
84eb21d6bf Update the script to properly run on the Cray. Add rawout option to retain the raw timing output in case the formats don't match
2015-11-12 12:11:17 -08:00
Ralph Castain
1607daeb10 Update the scaling script to output data into a CSV file for easy import into Excel
2015-11-11 13:29:37 -08:00
Ralph Castain
efbea40a8b Minor typo for slurm scaling test support, add aprun for use on Cray
2015-11-11 13:29:37 -08:00
Ralph Castain
187fa9b131 Extend the scaling test script to support multiple starters, including mpirun, orterun (if mpirun not present), orte-dvm, and srun. Auto-detect which are present and allow the user to run all of them. Auto-detect the number of nodes in the allocation.
2015-11-08 11:34:06 -08:00
Ralph Castain
73c8c30c5d Update the scaling.pl test script to support orte-dvm and srun
2015-11-07 13:13:36 -08:00
Ralph Castain
18c5cb48ff Update the scaling test script
2015-11-06 21:51:40 -08:00
Ralph Castain
8ebf235a56 Use preconnect as a better test of startup scaling than barrier
This commit was SVN r26530.
2012-06-01 02:35:15 +00:00
Ralph Castain
978897ade2 Little more cleanup, working now
This commit was SVN r26522.
2012-05-29 21:33:06 +00:00
Ralph Castain
e4d80001dc Little cleanup to handle the Mac
This commit was SVN r26513.
2012-05-29 18:21:47 +00:00
Ralph Castain
197f923ce3 Update scaling script
This commit was SVN r26510.
2012-05-29 17:41:38 +00:00
Ralph Castain
3068438022 Add scaling tests and script
This commit was SVN r26509.
2012-05-29 15:21:44 +00:00