openmpi

Автор	SHA1	Сообщение	Дата
Ralph Castain	9898332ae0	Allow individual jobs to set their map/rank/bind policies Override the defaults when provided. Ignore LSF binding file if user overrides by specifying a policy. Fixes #6631 Signed-off-by: Ralph Castain <rhc@pmix.org> (cherry picked from commit `ea0dfc3218`)	2019-08-07 05:51:06 -07:00
Ralph Castain	dae71d3a75	Correct parsing of ppr directives Needed to apply commit from PR #5778 to get this commit from PR #6238 to apply cleanly. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit `b19e5edf76`)	2019-01-29 11:34:44 -07:00
Ralph Castain	18afb8e8a6	Update mapping system Correctly transfer job-level mapping directives for dynamically spawned jobs to the mapping system. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit `45f23ca5c9`)	2019-01-29 10:04:30 -07:00
Aurélien Bouteiller	d9b0dad828	Correctly propagate the oversubscribe flag to the spawnees This is a cherry-pick of master (`2820aef`). The propagation is intended to resolve issue #6130 Signed-off-by: Aurélien Bouteiller <bouteill@icl.utk.edu>	2018-12-21 14:53:25 -05:00
Ralph Castain	98c8492057	Fix typo for rmaps_base_oversubscribe Causes the MCA param to be ignored, while the cmd line option still works. Thanks to @iassiour for the report! Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-11-29 07:40:51 -08:00
Jeff Squyres	8be14b9b07	orte-rmaps-base: slightly amend help message Follow on to `430c659908`: clarify the help message and fix one typo. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit `e9bf318dcb`)	2018-11-08 18:20:28 -05:00
Jeff Squyres	76d4c1843e	orte-rmaps-base: update out-of-slots show_help message Update the show_help message for when there are not enough slots to run an application. Also, remove a bunch of copies of this message in various show_help text files that aren't used/referred to anywhere in the code. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit `430c659908`)	2018-11-08 16:03:28 -05:00
Jeff Squyres	37a9cf5c82	Squash a bunch of harmless compiler warnings. Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit `6bb356ab87`)	2018-09-26 14:42:13 -07:00
Boris Karasev	31ca3842da	Fixed copyrights of prev commit. Signed-off-by: Boris Karasev <karasev.b@gmail.com> (cherry picked from commit `beb0697f24`)	2018-08-28 12:29:16 +03:00
Boris Karasev	d995fb1b3f	Fixed the NUMA obj detection for hwloc ver >= 2.0.0 Since version hwloc 2.0.0 has a new organization of NUMA nodes on the topology tree. This commit adds the detection of local NUMA object for hwloc => 2.0.0, which fixes the procs bindings policy for rmaps mindist component. Signed-off-by: Boris Karasev <karasev.b@gmail.com> (cherry picked from commit `e5291ccc34`)	2018-08-28 12:29:08 +03:00
Ralph Castain	511319c316	Fix the multiple pe/proc option Things got a little out of whack and we weren't actually processing the map-by modifiers, plus an error crept into the display of the binding report. So clean those up. Thanks to @tonyreina for the error report Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit `bcdb1f45ac`)	2018-07-25 19:55:28 -07:00
Ralph Castain	6b6e63a346	Control inheritance of launch directives by child jobs Do not have child jobs inherit launch directives unless requested to do so. This affects the map-by, rank-by, bind-to, npernode, pernode, npersocket, persocket, and cpus-per-rank directives. Values provided in the spawn call always take precedence - if a particular value isn't specified, then the ORTE defaults will be used if inheritance is not requested, and the values specified by MCA param will be used if inheritance is set. Always inherit oversubscribe for now as otherwise MTT will break Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-07-10 15:12:05 -07:00
Ralph Castain	3b2390e5d5	Silence coverity warnings, remove/ignore build product Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-25 08:01:28 -07:00
Ralph Castain	f17d47087a	Define a new binding method and qualifier Allow users to request that procs be bound to a cpu in a given cpu-list based on their corresponding local rank Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-20 21:26:09 -07:00
Ralph Castain	98b4ed9a3a	Fix the no-disconnect test A race condition exists based on whether or not the userdata object attached to a hwloc_obj_t has been initialized. These objects are setup whenever we scan for resources under that location. You therefore must not set a variable to the pointer to the userdata object and then call a function that will initialize the data in it - you need to set the variable after the function call, and protect against a NULL pointer Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-19 13:52:34 -07:00
Ralph Castain	f0a0d606a0	Correct accounting for tools Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 1be080f7b92bad39745f42628a8cb6afefad2d2a)	2018-06-18 13:24:25 -07:00
Ralph Castain	ea21f7175a	Silence warnings and remove unused code Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-16 17:42:48 -07:00
Ralph Castain	7c0ec7e851	Cleanup warnings in binding code This still leaves two unresolved warnings: base/rmaps_base_binding.c:577:22: warning: variable ‘clvm’ set but not used [-Wunused-but-set-variable] unsigned clvl=0, clvm=0; ^~~~ base/rmaps_base_binding.c:576:27: warning: variable ‘hwm’ set but not used [-Wunused-but-set-variable] hwloc_obj_type_t hwb, hwm; ^~~ The problem is that these values are used in the OPAL_HWLOC_MAKE_OBJ_CACHE macro to form a variable name. Thus, the compiler doesn't recognize the values as being "used". I'm not entirely sure how to resolve it cleanly. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-06-03 11:47:14 -07:00
Brice Goglin	c4dffa1d0f	rmaps: simplify the lookup for the binding object and fix for hwloc 2.0 Don't bother doing a lookup upwards or downwards for the target object type. Just use the target depth, iterate over the level until we find the min_bound object that intersects the locale cpuset. Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2018-05-24 11:53:07 +02:00
Ralph Castain	d2040497b8	Silence Coverity warning Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-04-27 07:30:00 -07:00
Ralph Castain	1e8add52d7	Silence warning Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-04-26 11:45:25 -07:00
Ralph Castain	9ae80596f6	Fix rank-by option and improve npernode/skt This fixes a problem reported by @bgoglin where rank-by was incorrectly generating values when ranking by a type of object (e.g., socket). It also corrects the handling of the pernode, npernode, and npersocket options - these should only set the #procs and the default mapping pattern. They specifically should not prohibit the user from requesting a different mapping. Thus, the following should be valid: mpirun -npernode 2 --map-by socket ... should put 2 procs on each node, mapping them by-socket on each node. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-04-25 20:35:43 -07:00
Ralph Castain	d644f7ee26	Correctly fix the ranking policy Shorten the loops as much as possible - if someone wants to further optimize, they are welcome to do so. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-03-26 16:06:46 -07:00
Ralph Castain	322f6c5056	Fix a breakage in the ranking system While it may be faster to reverse the order of the assignment loops, it also results in the wrong answer Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-03-25 15:55:56 -07:00
Ralph Castain	cb221b6f6f	Correct mapping errors Since we now support the dynamic addition of hosts to the orte_node_pool, there is no longer any reason to require advanced specification of all possible nodes. Instead, use a precedence method to initially allocate only those hosts that were specified in the cmd line: * rankfile, if given, as that will specify the nodes * -host, aggregated across all app_contexts * -hostfile, aggregated across all app_contexts * default hostfile * assign local node Fix slots_inuse accounting so that the nodes are correctly reset upon error termination - e.g., when oversubscribed without permission. Ensure we accurately track the user's specified desires for oversubscribe and no-use-local when dynamically spawning jobs. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit c9b3e68ce596a68a2ed2fbf73f211b3334b0a6a8)	2018-02-07 11:29:21 -08:00
Ralph Castain	73ef976ead	Silence warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-02-03 00:29:06 -08:00
Boris Karasev	52e81ee4b1	rmaps: fixed the ordering of `mpirun` target nodes Fixed the desync of job-nodelists between mpirun and orted daemons. The issue was observed when using RSH launching because user can provide arbitrary order of nodes regarding HNP placement. The mpirun process propagate the daemon's nodelist order to nodes. The problem was that HNP itself is assembling the nodelist based on user provided order. As the result ranks assignment was calculated differently on orted and mpirun. Consider following example: * User launches mpirun on node cn2. * Hostlist is cn1,cn2,cn3,cn4; ppn=1 * mpirun is passing hostlist cn[2:2,1,3-4]@0(4) to orteds So as result mpirun will assing rank 0 on cn1 while orted will assign rank 0 on cn2 (because orted sees cn2 as the first element in the node list) Signed-off-by: Boris Karasev <karasev.b@gmail.com>	2018-02-01 17:16:05 +02:00
Ralph Castain	8a7a57d4e2	Remove debug from rmaps base Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-20 00:22:51 -08:00
Yu Feng	6aaf62584b	Mention --oversubscribe The current error message when the number of slots is insufficient (e.g. running mpirun -n 4 on a dual core machine) does not mention the use of `--oversubscribe`. In earlier version of Open MPI, the over-subscription was automatic (albeit buggy?); but the important point was no error message was printed and the application runs. Mentioning the oversubscibe flag in the message will ease up the transition to the current behaviour where explicit request is required. Also make a few other minor tweaks / cleanups to the orte-rmaps-seq:alloc-error help message. Signed-off-by: Yu Feng <rainwoodman@gmail.com> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-27 10:14:13 -05:00
Boris Karasev	d2a568afa5	rmaps/mindist: reworked the job map binding The following issues have been fixed for `mindist`: - computing the job map on the backend nodes - using slots count (`-host node1:<s1>,nodeN:<sN>`) - fixed `dist:span` job mapping method - fixed `oversubcribe` option with `-host` Signed-off-by: Boris Karasev <karasev.b@gmail.com>	2017-11-09 08:56:44 +02:00
Ralph Castain	dcf389b6fa	We now add all nodes to the job data object when we map, so don't do it twice Fixes #4449 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-05 17:17:50 -08:00
Ralph Castain	d7d127b9b5	Correctly assign locales when mapping ppr Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-03 03:03:56 -07:00
Ralph Castain	d5ce3c38e1	Begin cleaning up debugger support Debugger daemons do not count against available slots. Clean up some leftover errors from the upgrade to HWLOC 2 in the mappers. Properly flag debugger jobs that come in via PMIx. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-27 16:18:43 -05:00
Ralph Castain	fe9b584c05	Fully support OMPI spawn options. Fix a bug in the round-robin mappers where we weren't adding nodes to the job map node array, and so resources were not released Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 285d8cfef74ffc899e9c51e1d9c597b7fb2ceb89)	2017-09-21 10:29:27 -07:00
Jeff Squyres	7cccee9d92	rmaps/base: remove debugging "DONE" message Thanks for Ben Menadue for reporting and supplying the patch. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-09-19 07:10:00 -07:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Ralph Castain	7a83fdb9bb	Update to hwloc 2.0.0a with shmem support. Update to support passing of HWLOC shmem topology to client procs Update use of distance API per @bgoglin Have the openib component lookup its object in the distance matrix Bring usnic up-to-date Restore binding for hwloc2 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-25 20:26:22 -07:00
Gilles Gouaillardet	60aa9cfcb6	hwloc: add support for hwloc v2 API Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-20 17:39:44 +09:00
Gilles Gouaillardet	9f29f3bff4	hwloc: since WHOLE_SYSTEM is no more used, remove useless checks related to offline and disallowed elements Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-20 17:39:21 +09:00
Geoffrey Paulsen	71333a4b14	Transitioning ownership of rmaps/seq and rmaps/rank_file from Intel to IBM.	2017-07-18 21:31:01 -04:00
Ralph Castain	7b39f19f60	Fix the backend mapper algorithm for comm_spawn. The front and back ends need to get the nodes into the job map in the same order so that the ranking algorithms will reach the same results Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-08 08:00:52 -07:00
Ralph Castain	93cf3c7203	Update OPAL and ORTE for thread safety (I swear, if I look this over one more time, I'll puke) Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-06 12:30:57 -07:00
Ralph Castain	ad108ba44d	Fix the DVM Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 11:42:42 -07:00
Ralph Castain	87201a80ff	Silence coverity warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-27 11:45:53 -07:00
Ralph Castain	657e701c65	Add debug verbosity to the orte data server and pmix pub/lookup functions Start updating the various mappers to the new procedure. Remove the stale lama component as it is now very out-of-date. Bring round_robin and PPR online, and modify the mindist component (but cannot test/debug it). Remove unneeded test Fix memory corruption by re-initializing variable to NULL in loop Resolve the race condition identified by @ggouaillardet by resetting the mapped flag within the same event where it was set. There is no need to retain the flag beyond that point as it isn't used again. Add a new job attribute ORTE_JOB_FULLY_DESCRIBED to indicate that all the job information (including locations and binding) is included in the launch message. Thus, the backend daemons do not need to do any map computation for the job. Use this for the seq, rankfile, and mindist mappers until someone decides to update them. Note that this will maintain functionality, but means that users of those three mappers will see large launch messages and less performant scaling than those using the other mappers. Have the mindist module add procs to the job's proc array as it is a fully described module Protect the hnp-not-in-allocation case Per path suggested by Gilles - protect the HNP node when it gets added in the absence of any other allocation or hostfile Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-25 18:41:27 -07:00
Ralph Castain	b527c40dae	Remove debug Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-12 12:41:36 -07:00
Ralph Castain	23af6c9d02	Merge pull request #3519 from rhc54/topic/nolocal Fix --nolocal	2017-05-12 09:57:52 -07:00
Ralph Castain	45bbd598c1	Fix --nolocal Fix the --nolocal option by ensuring we always check/remove the HNP from the list of available nodes if the flag is set Ensure that the HNP node is included as available when nothing else is given Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-12 09:03:26 -07:00
Ralph Castain	29e083bffd	Fix total_slots_allocated computation On unmanaged allocations, we need to update the total_slots_allocated once the daemons have been launched and "discovered" their topology Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-12 08:21:52 -07:00
Ralph Castain	911961ee21	Sigh - remove debug Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-10 11:26:42 -07:00

1 2 3 4 5 ...

658 Коммитов