openmpi

Автор	SHA1	Сообщение	Дата
Ralph Castain	7a818a26a9	Whitespace cleanup Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-12 10:32:49 -08:00
Ralph Castain	e35347f9e3	Merge pull request #4704 from ggouaillardet/topic/regx_misc orte/regx: fix, revamp and enhancement	2018-01-12 06:50:58 -08:00
Ralph Castain	ac522a521f	Ensure that prun doesn't prematurely exit Ensure that prun doesn't exit until notified that its own child job terminated. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-11 19:03:32 -08:00
Gilles Gouaillardet	4130c93976	regx/reverse: add the reverse component Search for the digits to be compressed from the end of the node names. For example, if the nodelist is c712f6n01,c712f6n02,c712f6n03 the regx/fwd component generates c[3:712]f6n01,c[3:712]f6n02,c[3:712]f6n03@(3) when the regx/reverse component generates c712f6n[2:1-3]@0(3) which is a better fit here. Josh Hursey authored the changes and must be credited. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-12 11:45:49 +09:00
Gilles Gouaillardet	c2a358ff45	regx: move most functions from the fwd component to base Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-12 11:45:48 +09:00
Gilles Gouaillardet	0c686f01e5	regx: add the extract_node_names callback typedef int (orte_regx_base_module_extract_node_names_fn_t)(char regexp, char ***names); among other things, that will make testing way easier. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-12 10:58:41 +09:00
Gilles Gouaillardet	a056fdea2d	regx/fwd: correctly handle node names with multiple set of digits Refs. open-mpi/ompi#4689 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-12 10:58:36 +09:00
Ralph Castain	6216225bda	Ensure cleanup of registered files/dirs Resolve a race condition between registering for a file to be removed upon termination and actual creation of that file by providing attributes that identify whether the path is a file or directory. This removes the need for PMIx to detect the difference. Refs #4686 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-11 11:05:30 -08:00
Ralph Castain	4cd7f3b202	Convert nidmap to regx framework Handle the need for different regex generator/parsers by moving the orte/util/nidmap and orte/util/regex code into a new "regx" framework. Use the original code to complete a "fwd" component, and create a scaffold for IBM's "reverse" component. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-10 20:28:21 -08:00
Ralph Castain	e2bc941f1e	Silence some warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-05 11:28:20 -08:00
Gilles Gouaillardet	03da5218ea	orte: remove some dead code related to the new tree_spawn method Now that the daemon calls remote_spawn itself, there is no longer a need for the "tree_spawn" command nor the associated command processing code since the HNP is no longer sending a tree-spawn message to the orted. Thanks Ralph for the guidance ! Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-04 09:35:17 +09:00
Gilles Gouaillardet	4527584840	orted: fix tree-spawn when the node regex is too long When the node regex is too long to be sent on the command line, retrieve it first from the parent, and then spawn the remote orted Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-04 09:33:46 +09:00
Gilles Gouaillardet	799152e7fb	plm/base: add the orte_plm_base_node_regex_threshold MCA parameter This parameter can be used to set the node regex max length that can be passed to the orted command line. For testing purpose, it can be set to zero in order to force the node regex being retrieved by orted from its parent. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-04 09:33:46 +09:00
Gilles Gouaillardet	f7e29127bc	sstore/stage: fix parameter handling in sstore_stage_local_compress_waitpid_cb() since open-mpi/ompi@8f496b01b7 sstore_stage_local_compress_waitpid_cb is invoked with an orte_wait_tracker_t , that must be used to reach the orte_sstore_stage_local_app_snapshot_info_t . Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-04 09:33:46 +09:00
Gilles Gouaillardet	c4cd12bc43	plm/rsh: fix parameter handling in rsh_wait_daemon() since open-mpi/ompi@8f496b01b7 rsh_wait_daemon is invoked with an orte_wait_tracker_t , that must be used to reach the orte_plm_rsh_caddy_t . Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-04 09:33:46 +09:00
Ralph Castain	ad96fa19d4	Merge pull request #4642 from rhc54/topic/validate Detect/warn of illegal node names	2017-12-20 10:18:43 -08:00
Ralph Castain	8a7a57d4e2	Remove debug from rmaps base Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-20 00:22:51 -08:00
Ralph Castain	3269f2de66	Detect/warn of illegal node names If we detect that someone has given us an incorrect node name, provide a helpful message telling them as it is almost certainly a typo. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-19 12:55:04 -08:00
Ralph Castain	b37315658b	Merge pull request #4636 from rhc54/topic/attrs Fix the optnone attribute, add extension attribute	2017-12-19 10:18:59 -08:00
Ralph Castain	ccc2fcdfdf	Merge pull request #4627 from ggouaillardet/topic/nidmap orte/nidmap: correctly handle '-' as a valid hostname character	2017-12-19 09:09:58 -08:00
Ralph Castain	db8ebd33ad	Fix the optnone attribute, add extension attribute See how the various compilers handle these Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-18 19:18:53 -08:00
Ralph Castain	07427c6d89	Update to PMIx v3.0 PR for cleanup registration If available, have apps use registration capability to cleanup their session directories. Setup capability for vader to register its shared memory file location - let someone familiar with that code do so. Final cleanup to track uid/gid, update the opal/pmix API to pass flags for ignore and leave top directory alone Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-18 06:53:11 -08:00
Ralph Castain	7a58f91ab9	Fix the tree-spawn-with-rollup Somehow, the code for passing a daemon's parent was accidentally removed, thus breaking the tree-spawn callback sequence and causing all daemons to phone directly home. Note that this is noticeably slower than no-tree-spawn for small clusters where directly ssh launch of the child daemons from the HNP doesn't overload the available file descriptors. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-15 16:03:43 -08:00
Gilles Gouaillardet	f3e2a313af	orte/nidmap: correctly handle '-' as a valid hostname character '-' is not an alpha character nor a digit, but it is a valid hostname character and should be handled as an alpha character, otherwise, nodes such as node-001 do not get "compressed" in the regex. Refs open-mpi/ompi#4621 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-15 15:28:50 +09:00
Ralph Castain	5c4185abd8	Add the __optnone__ attribute to help avoid optimizing out MPIR_Breakpoint Thanks to @kiranchandramohan for the suggestion Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-14 13:14:21 -08:00
Ralph Castain	cfa810f125	Close the shmemfd to avoid leaking it Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-13 10:20:05 -08:00
Ralph Castain	c4501185b7	Merge pull request #4614 from rhc54/topic/hwloc Silence error messages and ensure we still support binding	2017-12-12 12:57:07 -08:00
Ralph Castain	9a7b0d8d9c	Merge pull request #4586 from rhc54/topic/addhosts Fix add-host support by including the location for procs of prior jobs when spawning new daemons.	2017-12-12 12:45:57 -08:00
Ralph Castain	84c51847b1	Silence error messages and ensure we still support binding, even if shmem support for hwloc isn't available Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-12 12:11:26 -08:00
Ralph Castain	4316213805	Fix add-host support by including the location for procs of prior jobs when spawning new daemons. Thanks to CalugaruVaxile for the report Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-07 14:48:58 -08:00
Ralph Castain	ee2a93cb2e	Ensure we don't send a kill signal to pid=0 as that hits ourselves and initiates an infinite loop. Thanks to Michael Fenn for the report. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-12-07 10:38:11 -08:00
Gilles Gouaillardet	4a481f66e6	odls/base: fix orte_odls_base_harvest_threads() Do not try to finalize odls progress threads if they have not been started yet Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-04 15:18:04 +09:00
Gilles Gouaillardet	3496897961	odls/base: fix handling of the odls_base_num_threads MCA param If a number of odls threads is explicitly required, then use that number no matter what. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-04 11:19:25 +09:00
Gilles Gouaillardet	a4755b694b	odls/pspawn: record the pid of the spawn'ed process Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-01 15:00:50 +09:00
Ralph Castain	b5bf0a7f1d	Add a new posix_spawn component to the ODLS framework. Only selectable when specifically requested via "-mca odls pspawn" Note that there are several concerns: * we aren't getting SIGCHLD calls when the procs terminate * we aren't seeing the IO pipes close on termination, though we are getting output forwarded to mpirun * I haven't found a way to bind the child process prior to exec. If we want to use this method, we probably need someone to implement a cgroup component for the orte/rtc framework Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-30 18:01:31 -08:00
Ralph Castain	335fc96f42	Remove debug Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-29 21:21:35 -08:00
Gilles Gouaillardet	8e17127258	plm/alps: fix orte_wait_cb() usage Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-30 13:49:22 +09:00
Ralph Castain	8f496b01b7	Try automatically adding local spawn threads to parallelize the fork/exec process to speed up the launch on large SMPs. Harvest the threads after initial spawn to minimize any impact on running jobs. Change the determination of #spawn threads to be done on basis of #local procs in first job being spawned. Someone can look at an optimization that handles subsequent dynamic spawns that might be larger in size. Leave the threads running, but blocked, for the life of the daemon, and use them to harvest the local procs as they terminate. This helps short-lived jobs in particular. Add MCA params to set: * max number of spawn threads (default: 4) * set a specific number of spawn threads (default: -1, indicating no set number) * cutoff - minimum number of local procs before using spawn threads (default: 32) Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-29 19:54:00 -08:00
Josh Hursey	38aefd2ec4	Merge pull request #4439 from mattaezell/lsf_csm Disable the LSF plm if CSM is detected	2017-11-29 07:43:16 -06:00
Yu Feng	6aaf62584b	Mention --oversubscribe The current error message when the number of slots is insufficient (e.g. running mpirun -n 4 on a dual core machine) does not mention the use of `--oversubscribe`. In earlier version of Open MPI, the over-subscription was automatic (albeit buggy?); but the important point was no error message was printed and the application runs. Mentioning the oversubscibe flag in the message will ease up the transition to the current behaviour where explicit request is required. Also make a few other minor tweaks / cleanups to the orte-rmaps-seq:alloc-error help message. Signed-off-by: Yu Feng <rainwoodman@gmail.com> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-27 10:14:13 -05:00
Ralph Castain	1de0421e48	Provide a more robust way of checking for proct completion Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-26 10:39:53 -08:00
Ralph Castain	a25a7bcba7	Handle the case where stdout and stderr get merged into a file Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-26 08:18:34 -08:00
Ralph Castain	e3c308dfc8	Update the odls/alps component Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-25 19:51:07 -08:00
Ralph Castain	3906aaf41a	Silence warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-25 11:50:18 -08:00
Ralph Castain	30f23ac67a	Save one more file descriptor per process by not opening one for stddiag if PMIx (version > 1.x) is active since all diagnostic messages will instead flow thru the PMIx connection. Unfortunately, PMIx v1 does not support this feature, but we can remove the stddiag support once PMIx v1 slides out of the support window Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-25 11:48:53 -08:00
Gilles Gouaillardet	e88767866e	iof: optimize handling of stderr when iof_base_redirect_app_stderr_to_stdout is set avoid creating a pipe for a task stderr when we know it will be redirected to stdout Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-24 13:20:03 +09:00
Gilles Gouaillardet	84e96522f2	iof: optimize handling of stdin since some tasks migth end up having /dev/null as their stdin, simply avoid pipe creation and destruction for these tasks. From a pragmatic and MPI point of view, and unless explicitly required otherwise, all MPI tasks but (the first) one end up with /dev/null as their stdin. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-22 13:18:32 +09:00
Gilles Gouaillardet	47bf0d6f9d	iof/base: do not assume fileno(stdin) is zero Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-22 11:45:13 +09:00
Ralph Castain	6eb3c124e1	Merge pull request #4498 from rhc54/topic/pmixup Some minor cleanups of the DVM	2017-11-12 19:01:15 -08:00
Ralph Castain	9c84e1485b	Some minor cleanups of the DVM Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-12 16:27:37 -08:00
Boris Karasev	d2a568afa5	rmaps/mindist: reworked the job map binding The following issues have been fixed for `mindist`: - computing the job map on the backend nodes - using slots count (`-host node1:<s1>,nodeN:<sN>`) - fixed `dist:span` job mapping method - fixed `oversubcribe` option with `-host` Signed-off-by: Boris Karasev <karasev.b@gmail.com>	2017-11-09 08:56:44 +02:00
Ralph Castain	dcf389b6fa	We now add all nodes to the job data object when we map, so don't do it twice Fixes #4449 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-05 17:17:50 -08:00
Matt Ezell	e45761d498	Disable the LSF plm if CSM is detected LSF running on top of CSM does not provide LSF daemons on the compute nodes. Signed-off-by: Matt Ezell <ezellma@ornl.gov>	2017-11-02 13:48:46 -04:00
Ralph Castain	b97caf8f05	Correct copy/paste error in example Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-02 10:33:28 -07:00
Ralph Castain	6be74bfa7e	Add another test program for cross-lib coordination, this one based on native PMIx commands Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-29 11:33:25 -07:00
Ralph Castain	3b71be4db4	Update the scaling script to avoid use of "system" command, thus ensuring that each command sees the same environment. Fix prun to pickup and propagate OMPI MCA params Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-23 16:27:41 -07:00
bosilca	ac348da13a	Merge pull request #4374 from bosilca/topic/osx_syslog Topic/osx syslog	2017-10-23 18:06:36 -04:00
Ralph Castain	e33f319380	Update example to show tests of various APIs Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-23 12:02:54 -07:00
Ralph Castain	6ea3c8a0bd	Update the interlib example to show an alternative method for model declaration. Add a missing range value to the OPAL layer. Make it easier to see OMPI model callbacks Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-23 11:27:42 -07:00
George Bosilca	8f32b345de	Address syslog issues on OSX 10.13 with gcc 7.x gcc 7.[1,2] (at least) fails to correctly parse the OSX 10.13 sys/syslog.h header. As a results we need to potect syslog support in OPAL, PMIX and ORTE. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-10-23 14:02:10 -04:00
Ralph Castain	a63904d47f	Updates to support cross-version operations with OMPI v2.x Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-22 08:38:33 -07:00
Ralph Castain	f8ce31f13c	Fix event registration so OpenMP/MPI coordination sides can both get notification of model declarations Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-19 18:06:38 -07:00
Ralph Castain	75d411f3ea	Ensure we update the routing plan so that tree spawn works! Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-19 14:02:06 -07:00
Ralph Castain	6ffb0d0507	Ensure that the pmix server system-level rendezvous file is only output by the HNP as (at least for slurm on cray) a daemon could be colocated with the HNP and overwrite the file. Update the scaling.pl script to only use the system-level rendezvous so it doesn't get rejected by a colocated daemon Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-14 10:16:49 -07:00
Ralph Castain	60b338e857	Sync to PMIx v3. Ensure prun uses the ess/tool component. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-14 08:24:57 -07:00
Ralph Castain	388034c814	Add support for the -v (verbose) option to prun and silence the "executing" and "completed" output otherwise. Debounce "unreachable" notifications for tools when they disconnect Enable the -x cmd line option for prun Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 0a5b36180a22959654461ac1303cec35313f8b4a)	2017-10-10 12:54:49 -07:00
Ralph Castain	c696e04c5e	Since PMIx is moving to release v3.0, embed the new release candidate in opal/pmix framework. Move the pmix2x code over to the ext2x component. Create a new ext3x component Remove some build product. Tell PMIx that we don't need a new nspace generated when OMPI calls connect Add missing Makefile Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-09 13:51:08 -07:00
Ralph Castain	5352c31914	Enable remote tool connections for the DVM. Fix notifications so we "de-bounce" termination calls Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-06 10:47:05 -07:00
Ralph Castain	d7d127b9b5	Correctly assign locales when mapping ppr Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-03 03:03:56 -07:00
Ralph Castain	7d538c661c	Initialize variable and correctly compare against success Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-29 16:07:42 -07:00
Ralph Castain	d5ce3c38e1	Begin cleaning up debugger support Debugger daemons do not count against available slots. Clean up some leftover errors from the upgrade to HWLOC 2 in the mappers. Properly flag debugger jobs that come in via PMIx. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-27 16:18:43 -05:00
Ralph Castain	fcb7a2f29b	Minor cleanups for when using external pmix Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-24 09:53:04 -07:00
Ralph Castain	fe9b584c05	Fully support OMPI spawn options. Fix a bug in the round-robin mappers where we weren't adding nodes to the job map node array, and so resources were not released Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 285d8cfef74ffc899e9c51e1d9c597b7fb2ceb89)	2017-09-21 10:29:27 -07:00
Ralph Castain	e575c4d6f9	Fix tool connection logic so we properly search for default session server, perform specified number of retries, etc. Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 7c755e01004f8b86c71f1729662979ea45ab1adb)	2017-09-19 13:35:46 -07:00
Ralph Castain	16de607607	Merge pull request #4234 from rhc54/topic/upstream Ensure we update the total_slots_alloc field on each job. Correct the client example	2017-09-19 09:03:04 -07:00
Ralph Castain	658c3d1d51	Ensure we update the total_slots_alloc field on each job. Correct the client example Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit bcedd12a8a24dd246f04ff13b4fd2f1bbac6ce5a)	2017-09-19 08:14:14 -07:00
Jeff Squyres	7cccee9d92	rmaps/base: remove debugging "DONE" message Thanks for Ben Menadue for reporting and supplying the patch. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-09-19 07:10:00 -07:00
Ralph Castain	5708872112	Implement support for "local" range when publishing data Signed-off-by: Ralph Castain <rhc@open-mpi.org> (cherry picked from commit 2d54f7e0dd3a47260b0b2634aae3361316005933)	2017-09-18 19:34:08 -07:00
Ralph Castain	08c93091f7	Merge pull request #4223 from rhc54/topic/stale Remove stale tools	2017-09-18 09:43:06 -07:00
Josh Hursey	252be7ffb0	Merge pull request #4215 from jjhursey/fix/plm-lsf-rc plm/lsf: Improve error message if lsb_launch fails	2017-09-18 11:14:25 -05:00
Ralph Castain	ed508010b4	Remove stale tools Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-18 07:30:47 -07:00
Ralph Castain	3c914a7a97	Complete the fix of the ORTE DVM. We will now use "prun" instead of "orterun -hnp foo" to execute jobs. This provides the feature of automatic discovery of the orte-dvm so you don't need to manually enter URI's or contact file locations. All IO is forwarded to prun. Still in the "needs to be done" category: * mapping/ranking/binding options aren't correctly supported * if the DVM encounters some errors (e.g., not enough resources for the job), the resulting error is globally set and impacts any subsequent job submission Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-16 13:13:07 -07:00
Joshua Hursey	89c1aaf646	plm/lsf: Improve error message if lsb_launch fails Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-09-15 09:45:58 -05:00
Ralph Castain	7c7d8a69a0	Backport changes from PMIx reference server Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-14 11:48:56 -07:00
Ralph Castain	8d336ddcc0	Merge pull request #4209 from rhc54/topic/foobar Only build prun if building --with-devel-headers	2017-09-13 13:07:29 -07:00
Ralph Castain	3f8908871b	Since the DVM is now tied to prun, don't build the DVM either unless prun can be built Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-13 11:55:10 -07:00
Ralph Castain	589cc03d8e	Only build prun if building --with-devel-headers Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-13 11:38:11 -07:00
Ralph Castain	0a3d8af4c2	Merge pull request #4202 from anandhis/master Choosing provider when user requests generic transport "fabric"	2017-09-13 11:21:24 -07:00
Ralph Castain	bbd83fd4c0	Add a new launcher "prun" for starting applications against the ORTE DVM. Unlike "orterun", "prun" is a PMIx-only program that discovers the DVM connection instead of requiring that we explicitly provide it. Only build "prun" if PMIx v2.x is available. This gets the DVM working again, but still is showing problems for multiple executions. I'll detail those in a separate issue. Thus, the DVM should still be considered "broken". Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-12 21:40:41 -07:00
anandhi	4d7de8882f	Checking for generic transport "fabric" in mca parameter rml_ofi_transports to choose the first available non-socket provider. modified: orte/mca/rml/ofi/rml_ofi_component.c modified: orte/mca/rml/ofi/rml_ofi_send.c Signed-off-by: Anandhi Jayakumar <anandhi.s.jayakumar@intel.com>	2017-09-12 15:39:55 -07:00
Ralph Castain	3477079804	Repair the ORTE DVM Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-11 17:38:21 -07:00
Joshua Hursey	420ca65f4f	orte/pmix: Always seed environment with global rank * Even if we are only launching one app context, we might call spawn later and the remote groups might want their global rank information. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-09-08 08:53:49 -05:00
Howard Pritchard	5db9416724	rml/ofi: swat a compiler warning On the path to -Werror passing builds! Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-08-30 09:16:49 -06:00
Josh Hursey	ad87aa2674	Merge pull request #4121 from jjhursey/explore/dlopen-local mca: Dynamic components link against project lib	2017-08-25 13:15:51 -05:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Ralph Castain	68029b27e4	Fix the orte-dvm operations so that orterun can connect and execute an application. There is a lingering problem, though. The first invocation of orterun succeeds every time. However, subsequent invocations have a high probability of hanging in the OOB connection handshake. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-23 17:31:08 -07:00
Ralph Castain	d80b0c7990	If the HWLOC shared memory system is unable to connect, then fallback to providing the topology via XML. Do not automatically provide the XML to every process as that defeats the purpose of the shared memory system. Instead, use PMIx_Query_info_nb to get the info from the server when required. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-22 18:12:26 -07:00
Brice Goglin	046d870124	rtc/hwloc/shmem: add Inria copyrights The code for finding the hole for the shmem region actually came from me. Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2017-08-21 23:09:57 +02:00
Brice Goglin	baf762d99d	rtc/hwloc/shmem: dump /proc/self/maps if failed to find a hole and verbosity > 4 Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2017-08-21 19:57:38 +02:00
Brice Goglin	8f6afbb641	rtc/hwloc/shmem: fix "heap" hole search kind There can be multiple [heap] consecutively in proc/<pid>/maps, and there's no room between them. Don't use a hole after the first [heap] is there's another [heap] immediately after it. This code would fail to find the last [heap] if there were multiple [heap] interleaved with non-heap VMA, but our kind "after heap" wouldn't be meaningful anymore anyway. Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2017-08-21 15:42:38 +02:00
Brice Goglin	b8b46b253b	rtc/hwloc/shmem: fix "libs" hole search kind We want the biggest hole between heap and stack, not outside. Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2017-08-21 15:40:36 +02:00
Ralph Castain	d515f48885	The local PMIx server is notifying its clients of all events, but for some reason I don't recall, the broadcast notification was marked for delivery only to non-default event handlers. This creates a discrepancy between the two behaviors, so don't restrict the broadcast notifications. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-18 17:26:11 -07:00
Ralph Castain	b67b1e88a5	Merge pull request #4111 from rhc54/topic/multiconnect Cleanup some issues in connect/accept support across jobs started by …	2017-08-17 12:49:01 -07:00
Ralph Castain	d85239e052	Cleanup some issues in connect/accept support across jobs started by different mpirun commands. Still not fully operational, but someone else will have to finish debugging it Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-17 11:58:48 -07:00
Ralph Castain	088b6cdeee	Silence coverity warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-17 09:49:35 -07:00
Ralph Castain	41df973359	Add diagnostics for hwloc get_topology Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-16 14:21:27 -07:00
Ralph Castain	65fb6070d9	Update tool support by adding MCA params to direct orted's to drop session and/or system-level tool rendezous files. Ensure PMIx is enabled for tools Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-15 17:49:47 -07:00
Ralph Castain	2fbce9d93c	Fix hostfile filtering in allocated environments to preserve slot assignments Refs #3984 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-15 14:41:12 -07:00
Artem Polyakov	10d6e90bf5	Revert "plm/rsh: Propagate PMIx prefix to orted's" This reverts commit `71da0fcbef`. (per https://github.com/open-mpi/ompi/pull/4052). Refs: https://github.com/open-mpi/ompi/issues/3980 Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-08-14 21:37:57 +07:00
Ralph Castain	edccb258cb	Provide the mapping, ranking, binding patterns Apps might want to make use of the relative patterns used to place/assign their procs Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-09 11:34:43 -07:00
Nathan Hjelm	76320a8ba5	opal: rename opal_atomic_init to opal_atomic_lock_init This function is used to initalize and opal atomic lock. The old name was confusing. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-08-07 14:15:11 -06:00
Ralph Castain	9921237f99	Merge pull request #4012 from rhc54/topic/p3 Cover the use-cases for OPAL_PREFIX and PMIX_INSTALL_PREFIX options	2017-08-07 11:42:53 -07:00
Ralph Castain	d1b7c3d8d5	Silence some compile-time warnings. Update scripts now that AUTHORS is gone Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-04 20:08:31 -07:00
Ralph Castain	a239b4c3c3	Per discussion on the PMIx side, do a better job of detecting mismatches between location directives for OPAL and PMIx. Provide a more helpful error message and error out if we find a mismatch. If any OPAL values are set and the PMIx equivalent is not, then transfer it. Do not clear PMIX_INSTALL_PREFIX from the daemon's launch environment Fixes #3980 Closes #4007 Refs #3985 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-04 19:36:00 -07:00
Ralph Castain	88a7c9dca3	Merge pull request #4013 from rhc54/topic/hwloc Silence warning on Mac - we know Mac doesn't support hwloc, and so it…	2017-08-03 15:52:44 -06:00
Howard Pritchard	897c62756b	Merge pull request #3999 from hppritcha/topic/slurmd_controls_them_all SLURM: launch all processes via slurmd	2017-08-03 15:33:44 -06:00
Gilles Gouaillardet	6b6e65a5bc	rtc/hwloc: fix MCA parameter handling always re-initialize vmhole before mca_base_component_var_register() otherwise the vmhole gets NULL'ified if orte is initialized a second time. that typically occurs when Open MPI is configure'd with --disable-dlopen and the app does MPI_T_init_thread(); MPI_T_finalize(); MPI_T_init_thread(); Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-03 14:45:43 +09:00
Ralph Castain	9f926b8083	Silence warning on Mac - we know Mac doesn't support hwloc, and so it doesn't matter if a VM hole isn't found. It also doesn't matter in general as all it really means is that we have to turn the hwloc shmem support "off". Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-02 20:20:45 -06:00
Howard Pritchard	d08be74573	SLURM: launch all processes via slurmd It turns out that the approach of having the HNP do the fork/exec of MPI ranks on the head node in a SLURM environment introduces problems when users/sysadmins want to use the SLURM scancl tool or sbatch --signal option to signal a job. This commit disables use of the HNP fork/exec procedure when a job is launched into a SLURM controlled allocation. update NEWS with a blurb about new ras framework mca parameter. related to #3998 Signed-off-by: Howard Pritchard <hppritcha@gmail.com>	2017-08-02 14:56:55 -06:00
Artem Polyakov	71da0fcbef	plm/rsh: Propagate PMIx prefix to orted's Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-08-02 08:06:13 +03:00
Ralph Castain	f39ce67982	Merge pull request #3951 from rhc54/topic/hwloc2 Update to hwloc 2.0.0a	2017-08-01 15:18:31 -06:00
Ralph Castain	e94786f4b7	Revert "Check for OPAL_PREFIX and set corresponding PMIX_PREFIX if found" This reverts commit `3744967adb`. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-01 08:14:12 -06:00
Ralph Castain	3744967adb	Check for OPAL_PREFIX and set corresponding PMIX_PREFIX if found Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-31 09:14:01 -06:00
Boris Karasev	e20b581529	pmix: fixed immediate request This commit fixes a hang when using external PMIx v1 module Signed-off-by: Boris Karasev <karasev.b@gmail.com>	2017-07-28 15:53:48 +06:00
Ralph Castain	7a83fdb9bb	Update to hwloc 2.0.0a with shmem support. Update to support passing of HWLOC shmem topology to client procs Update use of distance API per @bgoglin Have the openib component lookup its object in the distance matrix Bring usnic up-to-date Restore binding for hwloc2 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-25 20:26:22 -07:00
Ralph Castain	0042c758f1	Update the tools support so it allows tools to access PMIx Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-25 17:10:08 -07:00
Ralph Castain	af85e48dd7	Silence Coverity warning, silence pmix_error_log of success Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-21 15:33:16 -07:00
Ralph Castain	f7e8780a42	Remove fortran support from platform file Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-20 21:02:30 -07:00
Ralph Castain	b225366012	Bring the ofi/rml component online by completing the wireup protocol for the daemons. Cleanup the current confusion over how connection info gets created and passed to make it all flow thru the opal/pmix "put/get" operations. Update the PMIx code to latest master to pickup some required behaviors. Remove the no-longer-required get_contact_info and set_contact_info from the RML layer. Add an MCA param to allow the ofi/rml component to route messages if desired. This is mainly for experimentation at this point as we aren't sure if routing wi ll be beneficial at large scales. Leave it "off" by default. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-20 21:01:57 -07:00
Artem Polyakov	79c10c884d	orte/pmix/server: Fix direct modex response with error status `send_error()` is only packing status and peer info in the reply. While remote counterpart in `pmix_server_dmdx_resp()` expects the "hotel room number" to proceed correctly. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-07-20 23:50:57 +07:00
Gilles Gouaillardet	60aa9cfcb6	hwloc: add support for hwloc v2 API Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-20 17:39:44 +09:00
Gilles Gouaillardet	9f29f3bff4	hwloc: since WHOLE_SYSTEM is no more used, remove useless checks related to offline and disallowed elements Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-20 17:39:21 +09:00
Gilles Gouaillardet	1a34224948	hwloc: do not set the HWLOC_TOPOLOGY_FLAG_WHOLE_SYSTEM flag Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-20 17:39:16 +09:00
Ralph Castain	fca68b070b	Merge pull request #3934 from rhc54/topic/singleton Fix the isolated pmix component. Cleanup the ess/singleton component …	2017-07-19 16:02:37 -05:00
Ralph Castain	543c16b28d	Fix the isolated pmix component. Cleanup the ess/singleton component - we shouldn't be automatically discovering the local topology as that is now done on-demand. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-19 12:14:29 -07:00
Geoffrey Paulsen	71333a4b14	Transitioning ownership of rmaps/seq and rmaps/rank_file from Intel to IBM.	2017-07-18 21:31:01 -04:00
Gilles Gouaillardet	da34e2f109	ess/base: silence a warning by fixing a static initializer Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-19 09:30:53 +09:00
Ralph Castain	8a98aab6cc	Fix signal forwarding on ORTE daemons so that _all_ daemons do it, regardless of environment. Add missing support for SIGTSTP and a few others. Thanks to Eugene Dedits for reporting the problem. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-18 09:58:55 -07:00
Jeff Squyres	ccf17808b6	Merge pull request #3258 from markalle/pr/symbol_name_pollution symbol name pollution	2017-07-12 16:19:25 -05:00
Artem Polyakov	832f1b03a4	Merge pull request #3790 from artpol84/orte/iof_sbatch orte/iof: Address the case when output is a regular file	2017-07-12 09:38:01 -05:00
Gilles Gouaillardet	626e94b689	oob/tcp: make mca_oob_tcp_msg_type_t an uint8_t so no conversion is required when heterogeneous mode is enabled Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-12 10:27:45 +09:00
Mark Allen	552216f9ba	scripted symbol name change (ompi_ prefix) Passed the below set of symbols into a script that added ompi_ to them all. Note that if processing a symbol named "foo" the script turns foo into ompi_foo but doesn't turn foobar into ompi_foobar But beyond that the script is blind to C syntax, so it hits strings and comments etc as well as vars/functions. coll_base_comm_get_reqs comm_allgather_pml comm_allreduce_pml comm_bcast_pml fcoll_base_coll_allgather_array fcoll_base_coll_allgatherv_array fcoll_base_coll_bcast_array fcoll_base_coll_gather_array fcoll_base_coll_gatherv_array fcoll_base_coll_scatterv_array fcoll_base_sort_iovec mpit_big_lock mpit_init_count mpit_lock mpit_unlock netpatterns_base_err netpatterns_base_verbose netpatterns_cleanup_narray_knomial_tree netpatterns_cleanup_recursive_doubling_tree_node netpatterns_cleanup_recursive_knomial_allgather_tree_node netpatterns_cleanup_recursive_knomial_tree_node netpatterns_init netpatterns_register_mca_params netpatterns_setup_multinomial_tree netpatterns_setup_narray_knomial_tree netpatterns_setup_narray_tree netpatterns_setup_narray_tree_contigous_ranks netpatterns_setup_recursive_doubling_n_tree_node netpatterns_setup_recursive_doubling_tree_node netpatterns_setup_recursive_knomial_allgather_tree_node netpatterns_setup_recursive_knomial_tree_node pml_v_output_close pml_v_output_open intercept_extra_state_t odls_base_default_wait_local_proc _event_debug_mode_on _evthread_cond_fns _evthread_id_fn _evthread_lock_debugging_enabled _evthread_lock_fns cmd_line_option_t cmd_line_param_t crs_base_self_checkpoint_fn crs_base_self_continue_fn crs_base_self_restart_fn event_enable_debug_output event_global_current_base_ event_module_include eventops sync_wait_mt trigger_user_inc_callback var_type_names var_type_sizes Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-07-11 02:13:23 -04:00
Mark Allen	efc25168cd	symbol name pollution: making some vars static As part of addressing symbol name pollution, I'm switching a few vars/functions to static. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-07-11 02:13:22 -04:00
Gilles Gouaillardet	823382f5d7	plm/base: do not abort when configure'd with --enable-heterogeneous and a mix of BE/LE is detected Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-07-07 10:43:54 +09:00
Ralph Castain	2a580fa71e	Merge pull request #3801 from rhc54/topic/hetero Detect that we have a mix of BE/LE in the system	2017-07-06 15:29:06 -07:00
Ralph Castain	8979bfe71e	Silence Coverity warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-06 06:07:28 -07:00
anandhi	793ebc272e	When opening conduit, checking for the transport preference in below order - (1) rml_ofi_transports mca parameter. This parameter should have the list of transports (currently ethernet,fabric are valid) fabric is higher priority if provided. (2) ORTE_RML_TRANSPORT_TYPE key with values "ethernet" or "fabric". "fabric" is higher priority. If specific provider is required use ORTE_RML_OFI_PROV_NAME key with values "socket" or "OPA" or any other supported in system. modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_component.c modified: ../orte/mca/rml/ofi/rml_ofi_send.c On send_msg choose the provider on local and peer to follow below rules - 1. if the user specified the transport for this conduit (even giving us a prioritized list of candidates), then the one we selected is the _only_ one we will use. If the remote peer has a matching endpoint, then we use it - otherwise, we error out 2. if the user didn't specify a transport, then we look for matches against _all_ of our available transports, starting with fabric and then going to Ethernet, taking the first one that matches. 3. if we can't find any match, then we error out modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_component.c modified: ../orte/mca/rml/ofi/rml_ofi_send.c send_msg() -> Fixed case when the local provider chosen at time of opening conduit is not present in peer (destination) node modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_send.c When opening conduit, checking for the transport preference in below order - (1) rml_ofi_transports mca parameter. This parameter should have the list of transports (currently ethernet,fabric are valid) fabric is higher priority if provided. (2) ORTE_RML_TRANSPORT_TYPE key with values "ethernet" or "fabric". "fabric" is higher priority. If specific provider is required use ORTE_RML_OFI_PROV_NAME key with values "socket" or "OPA" or any other supported in system. modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_component.c modified: ../orte/mca/rml/ofi/rml_ofi_send.c On send_msg choose the provider on local and peer to follow below rules - 1. if the user specified the transport for this conduit (even giving us a prioritized list of candidates), then the one we selected is the _only_ one we will use. If the remote peer has a matching endpoint, then we use it - otherwise, we error out 2. if the user didn't specify a transport, then we look for matches against _all_ of our available transports, starting with fabric and then going to Ethernet, taking the first one that matches. 3. if we can't find any match, then we error out modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_component.c modified: ../orte/mca/rml/ofi/rml_ofi_send.c send_msg() -> Fixed case when the local provider chosen at time of opening conduit is not present in peer (destination) node modified: ../orte/mca/rml/ofi/rml_ofi.h modified: ../orte/mca/rml/ofi/rml_ofi_send.c Signed-off-by: Anandhi Jayakumar <anandhi.s.jayakumar@intel.com>	2017-07-05 15:40:14 -07:00
Ralph Castain	2753f53e6d	Detect that we have a mix of BE/LE in the system, provide a warning that OMPI doesn't currently support this environment, and error out Fixes #2817 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-07-03 15:47:05 -07:00
Artem Polyakov	374c824a5c	orte/iof: Generalize the fix related to always-ready fds Reference: https://bugzilla.kernel.org/show_bug.cgi?id=15272. Work with both stdin/stdout fds that are known to be always ready using libevent timers. Such fds can not be effectively used with non-blocking I/O functions like epoll, poll, select: - for poll/select the event will be triggered immediately; - for epoll `epoll_ctl` will reject an attempt to add this fd to the working set. Reference: http://www.wangafu.net/~nickm/libevent-book/Ref4_event.html Libevent suggests to use timers over event_active for the reasons provided by the link above. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-07-01 02:24:14 +07:00
Artem Polyakov	d9ad918a14	orte/iof: Address the case when output is a regular file Regular files are always write-ready, so non-blocking I/O does not give any benefits for them. More than that - if libevent is using "epoll" to track fd events, epoll_ctl will refuse attempt to add an fd pointing to a regular file descriptor with EPERM. This fix checks the object referenced by fd and avoids event_add using event_active instead. In the original configuration that uncovered this issue "epoll" was used in libevent, it was triggering the following warning message: "[warn] Epoll ADD(1) on fd 0 failed. Old events were 0; read change was 1 (add); write change was 0 (none): Operation not permitted" And the side effect was accumulation of all output in mpirun memory and actually writing it only at mpirun exit. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-07-01 02:24:14 +07:00
Ralph Castain	7cbea77238	Merge pull request #3778 from rhc54/topic/warn Attempt to detect when we are direct-launched without the necessary P…	2017-06-29 16:53:12 -07:00
Ralph Castain	85f8eb4c6b	Stop all progress threads prior to releasing the peer objects to avoid a race condition whereby a lost connection could be reported after a peer object was freed and before the threads were stopped. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-29 15:48:18 -07:00
Ralph Castain	bd4a6fee22	Attempt to detect when we are direct-launched without the necessary PMI support, and thus are incorrectly identified as being "singleton". Advise the user on the required PMI(x) support and error out. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-29 15:26:53 -07:00
Ralph Castain	9178219e6b	Deregister event handlers only on final call to finalize. Ensure we pass PMIx mca params Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-28 15:00:43 -07:00
Ralph Castain	c6c0258cd8	Need to signal -pgrp to get to all members of a process group. Thanks to Ted Sussman for the report and patience in tracking it down Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-27 12:10:34 -07:00
Ralph Castain	8a4565874e	Enable ORTE to continue running when a node fails - user takes responsibility for zombies. Minor cleanup to orte-clean Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-27 09:05:26 -07:00
Ralph Castain	6e2778ad3b	Silence coverity warnings, correctly transfer the endpoint blob bytes Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-26 08:32:06 -07:00
Ralph Castain	9dad3f7cbf	Add the modex code to combine all info from local providers into a single modex send, and then retrieve them on recv Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-25 07:24:29 -07:00
Ralph Castain	f4411c4393	Enable use of OFI fabrics for launch and other collective operations. Update the PMIx repo to the latest master to get the required support for the server to "push" modex info, and to retrieve all its own "modex" values for sending back to mpirun. Have mpirun cache them in its local modex hash as OFI goes point-to-point direct and doesn't route - so the remote daemons don't need a copy of this connection info. Remove the opal_ignore from the RML/OFI component, but disable that component unless the user specifically requests it via the "rml_ofi_desired=1" MCA param. This will let us test compile in various environments without interfering with operations while we continue to debug Fix an error when computing the number of infos during server init Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-23 19:57:21 -07:00
Ralph Castain	168e50bc13	Also need to avoid calling destruct on the opal_process_info struct after finalize Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-23 07:49:14 -07:00
Ralph Castain	3af9344764	Remove stale field Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-23 06:22:31 -07:00
Ralph Castain	38636f4f0a	Ensure we properly cleanup on termination, including when terminating due to ctrl-c Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-21 06:33:37 -07:00
Ralph Castain	2aa286c9d0	Update orte-clean so it cleans legacy session directories as well as pmix artifacts Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-20 17:46:39 -07:00
Ralph Castain	501ba8faad	Merge pull request #3704 from rhc54/topic/signal Control distribution of signals to children vs grandchildren	2017-06-20 11:11:43 -07:00
Ralph Castain	952726c121	Update to latest PMIx master - equivalent to 2.0rc2. Update the thread support in the opal/pmix framework to protect the framework-level structures. This now passes the loop test, and so we believe it resolves the random hangs in finalize. Changes in PMIx master that are included here: * Fixed a bug in the PMIx_Get logic * Fixed self-notification procedure * Made pmix_output functions thread safe * Fixed a number of thread safety issues * Updated configury to use 'uname -n' when hostname is unavailable Work on cleaning up the event handler thread safety problem Rarely used functions, but protect them anyway Fix the last part of the intercomm problem Ensure we don't cover any PMIx calls with the framework-level lock. Protect against NULL argv comm_spawn Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-20 09:02:15 -07:00
Ralph Castain	206aec6083	By default, apply signals to all direct children _and_ any children they might have spawned (so long as they remain in the same process group). Provide an MCA param (odls_base_signal_direct_children_only) to indicate that the signal is to go _only_ to our direct children, and not be delivered to any children spawned by those procs. Refs https://www.mail-archive.com/users@lists.open-mpi.org/msg31221.html Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-15 12:26:11 -07:00
Ralph Castain	8f09929469	Fix rank-file mapper launch by correctly setting up the remote map from the provided data Put a simple protection for the case where procs fail while we are trying to deregister handlers Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-15 08:33:29 -07:00
Ralph Castain	8afa1433b8	Only set the "bound" flag if we wre actually bound Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-14 13:22:01 -07:00
Gilles Gouaillardet	72c7329462	configury: use 'uname -n' when 'hostname' is not available the 'hostname' command might not be available on some platforms such as Fedora Core 26, so mimick config/libtool.m4 and fallback to 'uname -n' if needed Refs. #3680 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-06-12 15:04:32 +09:00
Ralph Castain	1f0f03b45b	Print a better error message when srun isn't found in the path. Ensure we don't segfault if -host specifies a node not included in the allocation Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-09 07:46:47 -07:00
Ralph Castain	00ba6a1be6	Protect against NULL topology Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-08 20:56:44 -07:00
Ralph Castain	7b39f19f60	Fix the backend mapper algorithm for comm_spawn. The front and back ends need to get the nodes into the job map in the same order so that the ranking algorithms will reach the same results Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-08 08:00:52 -07:00
Ralph Castain	81ab79f311	Ensure the orted doesn't go into an infinite loop during force-terminate Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-07 21:44:49 -07:00
Ralph Castain	7002535059	Merge pull request #3671 from rhc54/topic/ofi We cannot use OFI to determine when daemons can finalize as we don't …	2017-06-07 15:08:56 -07:00
George Bosilca	484004b03d	simple_spawn should be independent of ORTE.	2017-06-07 17:51:46 -04:00
Ralph Castain	919d7fcf49	We cannot use OFI to determine when daemons can finalize as we don't see the "sockets" go away. So always use the OOB for the mgmt conduit - this provides the necessary termination signal AND ensures that IOF and other mgmt messages go solely across TCP. Cleanup the way we look for matching OFI addresses by using the opal_net_samenetwork helper function. This now works for multi-network environments, but only using the socket provider Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-07 13:51:30 -07:00
Ralph Castain	bd1793ad17	Get the pmix/ext2x component to work. Fix a minor problem in the libevent external component. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-06 20:06:28 -07:00
Ralph Castain	93cf3c7203	Update OPAL and ORTE for thread safety (I swear, if I look this over one more time, I'll puke) Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-06 12:30:57 -07:00
Ralph Castain	a28eaf914a	Silence warnings when terminating Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-05 13:53:07 -07:00
Ralph Castain	8f526968c2	Do not hang if we cannot relay messages. Eliminate extra error log message Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-05 06:35:19 -07:00
Ralph Castain	51b4078b70	Merge pull request #3648 from rhc54/topic/ofi Clean up the conduit open code so we return detectable errors when co…	2017-06-02 18:08:55 -07:00
Ralph Castain	e884cbf5f5	Even though the ofi component doesn't do any routing itself, the rest of the code base (e.g., grpcomm) needs to know what routing module this component is using. So set it to the "direct" module, and don't allow ofi to be used if that module isn't available. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-02 15:47:25 -07:00
Ralph Castain	ba9a6078c2	Add ability to select transport, and only compare the first one in the conduit list for a match. This lets you select which conduit to use for OFI - if you set "-mca rml_ofi_transports ethernet" you'll pickup the mgmt conduit. If you set "-mca rml_ofi_transports fabric", you'll get the coll conduit Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-02 14:31:23 -07:00
Jeff Squyres	af9565ec25	ess: add missing <signal.h> header Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-06-02 14:11:40 -07:00
Ralph Castain	066d5eedce	Shift the signal forwarding code to ess/base so it can be available to more than just the hnp component. Extend the slurm component to use it so that any signals given directly to the daemons by their slurmstepd get forwarded to their local clients Check for NULL Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-02 10:59:14 -07:00
Ralph Castain	6b3bbd30c5	Clean up the conduit open code so we return detectable errors when conduit not opened. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-02 10:40:51 -07:00
Ralph Castain	2ab4f93f6a	Instead of "forced_terminate" just quietly causing the daemon to disappear, let's at least attempt to let the user know where the problem occurred. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-06-02 08:28:16 -07:00
anandhi	6ddb487744	Cleaned up the send_msg(), moved checking for send to self into the send_nb() and send_buffer_nb() modified: orte/mca/rml/ofi/rml_ofi_send.c Signed-off-by: Anandhi Jayakumar <anandhi.s.jayakumar@intel.com>	2017-06-01 17:50:54 -07:00
Ralph Castain	9d6b929894	Fix uninitialized variable. Set exit codes for failed launch so we get pretty error messages Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-31 07:38:37 -07:00
Ralph Castain	26e7515a5e	Don't sweat the "sync" settings on file descriptors as those flags aren't apparently fully portable Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 20:37:26 -07:00
Ralph Castain	5d990b557c	Reorg ordering so that bare executable names also are found Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 15:58:55 -07:00
Ralph Castain	321abfc8c6	Fix cwd and preload-binary options Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 14:07:22 -07:00
Ralph Castain	ad108ba44d	Fix the DVM Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 11:42:42 -07:00
Ralph Castain	9a8811a246	Ensure that data from a job that was stored in ompi-server is purged once that job completes. Cleanup a few typos. Silence a Coverity warning Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-30 09:43:01 -07:00
Ralph Castain	e8759ca66b	Add minor test to ORTE test suite Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-29 15:43:52 -07:00
Ralph Castain	f3ab326b4a	Add some debug code for detecting leaking file descriptors. At the end of each job (and if MCA param is set), have each daemon compute the number of open fds and their characteristics and print a summary Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-29 11:25:20 -07:00
Ralph Castain	87201a80ff	Silence coverity warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-27 11:45:53 -07:00
Ralph Castain	9f60cd0fe7	Update the connect/accept support so we check to see if we have the proper infrastructure and RTE support, including whether we have ompi-server available if the connect/accept spans multiple applications. Print pretty help messages in all cases where we do not have support Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-27 10:47:08 -07:00
Ralph Castain	8c2a06477c	Fix ompi-server operations Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-26 08:57:55 -07:00
Ralph Castain	657e701c65	Add debug verbosity to the orte data server and pmix pub/lookup functions Start updating the various mappers to the new procedure. Remove the stale lama component as it is now very out-of-date. Bring round_robin and PPR online, and modify the mindist component (but cannot test/debug it). Remove unneeded test Fix memory corruption by re-initializing variable to NULL in loop Resolve the race condition identified by @ggouaillardet by resetting the mapped flag within the same event where it was set. There is no need to retain the flag beyond that point as it isn't used again. Add a new job attribute ORTE_JOB_FULLY_DESCRIBED to indicate that all the job information (including locations and binding) is included in the launch message. Thus, the backend daemons do not need to do any map computation for the job. Use this for the seq, rankfile, and mindist mappers until someone decides to update them. Note that this will maintain functionality, but means that users of those three mappers will see large launch messages and less performant scaling than those using the other mappers. Have the mindist module add procs to the job's proc array as it is a fully described module Protect the hnp-not-in-allocation case Per path suggested by Gilles - protect the HNP node when it gets added in the absence of any other allocation or hostfile Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-25 18:41:27 -07:00

... 2 3 4 5 6 ...

5867 Коммитов