openmpi

Author	SHA1	Message	Date
Ralph Castain	43a3baad5e	Ensure we use the first compute node's topology for mapping Don't filter the topology by cpuset if you are mpirun until you know that no other compute nodes are involved. This deals with the corner case where mpirun is executing on a node of different topology from the compute nodes. Simplify - don't mandate that all cpus in the given cpuset be present on every node. We can then run everything thru the filter as before, which ensures that any procs run on mpirun are also contained within the specified cpuset. Correctly count the number of available PUs under each object when given a cpuset Fix the default binding settings, and correctly count PUs when no cpuset is given Ensure the binding policy gets set in all cases	2015-03-19 16:30:36 -07:00
Gilles Gouaillardet	2ab9a411f8	plm/base: fix misc memory leaks as reported by Coverity with CIDs 1196733 and 1196745	2015-03-09 16:25:07 +09:00
Gilles Gouaillardet	7de3f35b90	plm/rsh: fix misc memory leaks as reported by Coverity with CIDs 71091, 71230, 71231, 72274, 72389, 1196718 and 1196719	2015-03-05 20:03:37 +09:00
Jeff Squyres	05f00aface	plm base: ensure mca_base_var_get_value() and mca_base_var_find() succeed This was CID 993712	2015-02-24 15:48:50 -05:00
Jeff Squyres	e2223cd9bf	plm_rsh: ensure cwd array is \0-terminated This was CID 72257	2015-02-24 15:24:08 -05:00
Howard Pritchard	bf89131f9e	add owner files to opal/ompi/orte mca directories This commit adds an owner file in each of the component directories for each framework. This allows for a simple script to parse the contents of the files and generate, among other things, tables to be used on the project's wiki page. Currently there are two "fields" in the file, an owner and a status. A tool to parse the files and generate tables for the wiki page will be added in a subsequent commit.	2015-02-22 15:10:23 -07:00
Ralph Castain	3ae3b96c17	Fix master compilation - a buried header dependency must have been removed.	2015-02-10 07:22:10 -08:00
Ralph Castain	a3275aa867	Once again, fix the blasted singleton comm_spawn	2015-02-05 17:34:25 -08:00
Ralph Castain	2b0b012460	Continue refinement of the DVM operations. Send the spawn request to the right place (it helps) as it isn't a comm_spawn request and has to be treated a little differently. Ensure IO gets forwarded back to the tool. Ensure the tool outputs show_help locally as there is no place to send it.	2015-02-04 06:21:54 -08:00
Ralph Castain	ec5ccb76cf	Enable persistent ORTE DVM so users can execute multiple OMPI jobs within an allocation without restarting the DVM every time.	2015-01-30 11:00:43 -08:00
Howard Pritchard	f34dd5f5fd	plm/alps: update copyright	2015-01-07 12:33:38 -07:00
Howard Pritchard	c454d11b01	plm/alps: fix orted abort hang problem Turns out the alps plm component wasn't changing the state of the job upon terminating the orted's in the case of an abnormal termination. This caused mpirun to hang with a zombie'd aprun process if an orted on a node in the job was killed via signal.	2015-01-07 12:31:41 -07:00
Jeff Squyres	7b43bdc984	plm base: move flag inside the #if in which it is used Avoid a compiler warning by declaring the tflag only inside the #if in which it is used (i.e., if hwloc support is built).	2014-12-18 10:56:23 -08:00
Ralph Castain	bb529ebd8e	Revise the way we handle hetero nodes as users are finding this (a) a significant surprise, and (b) confusing as to when it is required. So try to automate it a bit by creating a topology "signature" that mpirun can share on the cmd line with the remote daemons, thus allowing them to check to see if they match. This isn't comprehensive of course - for now, it only checks the number of each type of hwloc object on the node. This is good enough to pick up major differences (e.g., where we have different numbers of sockets or assigned core bindings). Retain the hetero-nodes flag for those cases where the user knows that there are differences and our automated system isn't good enough to see it. Will obviously require further refinement as we find out which variances it can detect, and which it cannot.	2014-12-08 15:38:14 -08:00
Ralph Castain	c4002a8485	Further cleanups on the LSF integration - the affinity file is apparently always present, but simply empty if affinity wasn't set.	2014-12-04 12:24:35 -08:00
Ralph Castain	c88f181efe	Fix singleton comm-spawn, yet again. The new grpcomm collectives require a complete knowledge of every active proc in the system in case they participate in a collective. So ensure we pass the required job info when we spawn new daemons, and construct the necessary connections to allow grpcomm to operate.	2014-12-03 18:11:17 -08:00
Jeff Squyres	a3af7d6dbb	Revert "lsf configury: add dependent libraries for static linking" This reverts commit `56cfa90dda`.	2014-12-03 13:32:56 -08:00
Jeff Squyres	92c2ff91ec	Revert "Cleanup static build requirements by adding the wrapper flags back to the component configure.m4's. Minor cleanup of the lsf configure logic." This reverts commit open-mpi/ompi@32bf0e7b7e.	2014-12-03 13:15:20 -08:00
Ralph Castain	54c955c92d	Fix a race condition that only appears to be affecting certain setups. The pmix.finalize function closes the file descriptor to the server, which then triggers the errhandler callback. Since the errmgr is about to be unloaded, it might be getting hit.	2014-12-03 12:19:00 -08:00
Ralph Castain	32bf0e7b7e	Cleanup static build requirements by adding the wrapper flags back to the component configure.m4's. Minor cleanup of the lsf configure logic.	2014-12-03 07:14:06 -08:00
Jeff Squyres	56cfa90dda	lsf configury: add dependent libraries for static linking Ensure to add the LSF dependent libraries and LD flags for the wrapper compiler static linking case.	2014-12-01 14:59:10 -08:00
Ralph Castain	48f702827e	First part of memory leak cleanups from Gilles	2014-11-24 16:53:33 -08:00
Ralph Castain	738c3e1d72	Ensure that mpirun correctly selects the HNP ess component without attempting to init the PMI subsystem as mpirun won't be supported anyway, so let's avoid the error message. Also, daemons launched by the plm/slurm component must use the ess/slurm module as we cannot trust the Slurm PMI_Init functions to correctly tell us when PMI support is available.	2014-11-03 21:35:42 -08:00
Ralph Castain	526682e2f9	Add the ability for a tool that requests spawn of a job to also request forwarding of all output to the tool. The tool is responsible for its own call to push its stdin to the new job. The push request can come -after- the job is started, but the pull request has to be done during the spawn procedure or else output can be lost.	2014-10-23 08:16:49 -07:00
Ralph Castain	894acb0aa8	configury: new OPAL_SET_MCA_PREFIX/ORTE_SET_MCA_CMD_LINE_ID macros These two macros set the MCA prefix and MCA cmd line id, respectively. Specifically, MCA parameters will be named PREFIX<foo> in the environment, and the cmd line will use -ID foo bar. These macros must be called during configure.ac and a value supplied. In the case of Open MPI, the values given are PREFIX=OMPI_MCA_ and ID=mca. Other projects (such as ORCM) will call these macros with their own unique values. For example, ORCM uses PREFIX=ORCM_MCA_ and ID=omca This scheme is necessary to allow running Open MPI applications under systems that use their own versions of ORTE and OPAL. For example, when running OMPI applications under ORCM, we need the MCA params passed to the ORCM daemons to be separated from those recognized by the OMPI application.	2014-10-22 18:57:40 -07:00
Ralph Castain	b6aa691e0a	Fix incorrect implementation of new MCA param mca_base_env_list - it was not picking up envars and forwarding them, but only worked if you explicitly set a value for the envar. Ensure it works for both direct and indirect launch modes. Remove stale code as this replaced orte_forward_envars. Ensure it doesn't get passed to the ORTE daemons.	2014-10-16 12:58:56 -07:00
hpp	8ded59ce0f	fix alps plm to allow explicit host placement It turns out that the alps plm code was developed only on cray systems that were running batch schedulers. However, for bring up and development systems, it's not at all uncommon for there to be no batch scheduler, and thus to orte it appears that orte_num_allocated_nodes is always zero. This forces a user using mpirun on such a system to always specify a host list: mpirun -n 4 -N 1 -host 32,45,68 .... just to get the job to run, but then since the -L argument for aprun is never built, the app always runs on the first batch of nodes that aprun finds available.	2014-10-02 10:42:01 -06:00
Howard Pritchard	1508a01325	Fixes to enable mpirun to work again on Cray The ess pmi module was not handling aprun launched daemons. All daemons were thinking they were vpid 1. Also, turns out that on cray systems using MOM nodes for launched jobs, just detecting whether or not a process is in a PAGG container is not sufficient. Crank up the priority of the alps PLM component in the event that the configure detected the presence of both slurm and alps. Have the ESS pmi component open the pmix framework and select a pmix component. This commit was SVN r32773.	2014-09-23 15:37:26 +00:00
Ralph Castain	dfb952fa78	[Contribution from Artem - moved it to svn from git for him] Replace our old, clunky timing setup with a much nicer one that is only available if configured with --enable-timing. Add a tool for profiling clock differences between the nodes so you can get more precise timing measurements. I'll ask Artem to update the Github wiki with full instructions on how to use this setup. This commit was SVN r32738.	2014-09-15 18:00:46 +00:00
Jeff Squyres	e95ed94a94	plm_rsh_module.c: output to the framework output Trivial fix from r32686: don't output to stream 0, but rather to orte_plm_base_framework.framework_output (this is the way it was before r32686). In reality, this is going to end up being stream 0, anyway, but we might as well be pedantically correct... Refs trac:4897. This commit was SVN r32726. The following SVN revision numbers were found above: r32686 --> open-mpi/ompi@4df1aa63f7 The following Trac tickets were found above: Ticket 4897 --> https://svn.open-mpi.org/trac/ompi/ticket/4897	2014-09-13 00:46:35 +00:00
Ralph Castain	4df1aa63f7	Since we've run into the situation where someone puts a script wrapper around a launcher such as srun, we need to always protect MCA cmd line params with quotes. This means we also need to protect the backend from quotes coming into the system as part of a value, or else the parser gets confused. So add a new function for wrapping MCA arguments, and tell the backend parser to ignore/remove leading/trailing quotes. cmr=v1.8.3:reviewer=jsquyres This commit was SVN r32686.	2014-09-08 20:38:46 +00:00
Ralph Castain	039b7acfb5	Fix the quoting algorithm so only rsh command lines get quoted values cmr=v1.8.2:reviewer=jsquyres This commit was SVN r32586.	2014-08-22 22:47:38 +00:00
Ralph Castain	aec5cd08bd	Per the PMIx RFC: WHAT: Merge the PMIx branch into the devel repo, creating a new OPAL “pmix” framework to abstract PMI support for all RTEs. Replace the ORTE daemon-level collectives with a new PMIx server and update the ORTE grpcomm framework to support server-to-server collectives WHY: We’ve had problems dealing with variations in PMI implementations, and need to extend the existing PMI definitions to meet exascale requirements. WHEN: Mon, Aug 25 WHERE: https://github.com/rhc54/ompi-svn-mirror.git Several community members have been working on a refactoring of the current PMI support within OMPI. Although the APIs are common, Slurm and Cray implement a different range of capabilities, and package them differently. For example, Cray provides an integrated PMI-1/2 library, while Slurm separates the two and requires the user to specify the one to be used at runtime. In addition, several bugs in the Slurm implementations have caused problems requiring extra coding. All this has led to a slew of #if’s in the PMI code and bugs when the corner-case logic for one implementation accidentally traps the other. Extending this support to other implementations would have increased this complexity to an unacceptable level. Accordingly, we have: * created a new OPAL “pmix” framework to abstract the PMI support, with separate components for Cray, Slurm PMI-1, and Slurm PMI-2 implementations. * Replaced the current ORTE grpcomm daemon-based collective operation with an integrated PMIx server, and updated the grpcomm APIs to provide more flexible, multi-algorithm support for collective operations. At this time, only the xcast and allgather operations are supported. * Replaced the current global collective id with a signature based on the names of the participating procs. This allows an unlimited number of collectives to be executed by any group of processes, subject to the requirement that only one collective can be active at a time for a unique combination of procs. Note that a proc can be involved in any number of simultaneous collectives - it is the specific combination of procs that is subject to the constraint * removed the prior OMPI/OPAL modex code * added new macros for executing modex send/recv to simplify use of the new APIs. The send macros allow the caller to specify whether or not the BTL supports async modex operations - if so, then the non-blocking “fence” operation is used, if the active PMIx component supports it. Otherwise, the default is a full blocking modex exchange as we currently perform. * retained the current flag that directs us to use a blocking fence operation, but only to retrieve data upon demand This commit was SVN r32570.	2014-08-21 18:56:47 +00:00
Jeff Squyres	1551339eba	rsh: revert part of r32517: keep the quoting As part of reviewing CMR #4860, I talked through r32517 with Ralph. In attempt to fix various rsh quoting problems, r32517 removed all the quoting from the main code path and then only added it back in at the end in some cases. This commit puts back the quoting parts that were removed in r32517 (r32517 fixed 2 other important bugs: a) change "--<foo>" to "--mca <foo_equivalent> 1" so that de-duplication works, and b) change a != to ==). refs trac:4860 This commit was SVN r32524. The following SVN revision numbers were found above: r32517 --> open-mpi/ompi@7342bce58f The following Trac tickets were found above: Ticket 4860 --> https://svn.open-mpi.org/trac/ompi/ticket/4860	2014-08-13 19:27:10 +00:00
Ralph Castain	7342bce58f	Cleanup the over-aggressive quoting of params on the orted cmd line. Remove duplicates caused by passing on both cmd line shortcuts and the mca param version of the same thing. Fixes trac:4857 cmr=v1.8.2:reviewer=jsquyres This commit was SVN r32517. The following Trac tickets were found above: Ticket 4857 --> https://svn.open-mpi.org/trac/ompi/ticket/4857	2014-08-13 03:51:04 +00:00
George Bosilca	de7191132d	Remove few warnings. This commit was SVN r32506.	2014-08-11 13:34:44 +00:00
Ralph Castain	0cad281a92	Single-word cmd line values for orted are dealt with in orte_plm_base_orted_append_basic_args, so protect against special characters there. Have the rsh module only deal with multi-word arguments as those were skipped by orte_plm_base_orted_append_basic_args. Refs trac:4802 This commit was SVN r32293. The following Trac tickets were found above: Ticket 4802 --> https://svn.open-mpi.org/trac/ompi/ticket/4802	2014-07-23 17:06:51 +00:00
Ralph Castain	a94a97bd50	Cleanup the passing of MCA params on the orted cmd line in ssh by ensuring that we quote all values since they could be multi-word and/or contain special characters. Thanks to Dirk Schubert for pointing it out. cmr=v1.8.2:reviewer=jsquyres This commit was SVN r32280.	2014-07-22 18:22:06 +00:00
Ralph Castain	6c5e592785	Revert r32222, r32210, and r32203 as they created a problem when daemon collectives did not involve app procs on every node. Instead, modify the ompi/mca/rte/orte/rte_orte.h to add a new function that allows apps to request new daemon collective ids for use in barrier and modex operations. This will only appear in ORTE-based installations, but it is only being used by a couple of researchers at the moment. Update the orte/test/mpi/coll_test.c test to show the revised example. This commit was SVN r32234. The following SVN revision numbers were found above: r32203 --> open-mpi/ompi@a523dba41d r32210 --> open-mpi/ompi@2ce11ed5c4 r32222 --> open-mpi/ompi@d55f16db50	2014-07-15 03:48:00 +00:00
Ralph Castain	1feaffbb15	Get the blasted singleton comm_spawn working again. There remain problems with the Slurm interaction in this use-case as the PMI components (if configured to build) try to run even when a Slurm allocation hasn't been made, but I leave that to someone else to resolve. I did, however, tell the Slurm ess to quit interfering with applications launched in this use-case by ORTE daemons, so things do work when inside a Slurm allocation. Also discovered that the rsh launcher is not picking up --enable-orterun-prefix-by-default when invoked during singleton comm_spawn, but I was unable to see why that was happening and ran out of time. cmr=v1.8.2:reviewer=rhc This commit was SVN r32229.	2014-07-13 14:47:22 +00:00
Ralph Castain	a523dba41d	NOTE: this modifies the MPI-RTE interface We have been getting several requests for new collectives that need to be inserted in various places of the MPI layer, all in support of either checkpoint/restart or various research efforts. Until now, this would require that the collective id's be generated at launch, which required modifications to ORTE and other places. We chose not to make collectives reusable as the race conditions associated with resetting collective counters are daunting. This commit extends the collective system to allow self-generation of collective id's that the daemons need to support, thereby allowing developers to request any number of collectives for their work. There is one restriction: RTE collectives must occur at the process level - i.e., we don't currently have a way of tagging the collective to a specific thread. From the comment in the code: In order to allow scalable generation of collective id's, they are formed as: * top 32-bits are the jobid of the procs involved in the collective. For collectives across multiple jobs (e.g., in a connect_accept), the daemon jobid will be used as the id will be issued by mpirun. This won't cause problems because daemons don't use the collective_id * bottom 32-bits are a rolling counter that recycles when the max is hit. The daemon will cleanup each collective upon completion, so this means a job can never have more than 2^32 collectives going on at a time. If someone needs more than that - they've got a problem. Note that this means (for now) that RTE-level collectives cannot be done by individual threads - they must be done at the overall process level. This is required as there is no guaranteed ordering for the collective id's, and all the participants must agree on the id of the collective they are executing. So if thread A on one process asks for a collective id before thread B does, but B asks before A on another process, the collectives will be mixed and not result in the expected behavior. We may find a way to relax this requirement in the future by adding a thread context id to the jobid field (maybe taking the lower 16-bits of that field). This commit includes a test program (orte/test/mpi/coll_test.c) that cycles 100 times across barrier and modex collectives. This commit was SVN r32203.	2014-07-10 18:53:12 +00:00
Ralph Castain	8c85ca350e	Remove debug This commit was SVN r32200.	2014-07-10 18:28:24 +00:00
Ralph Castain	356e7ea904	Move all collective id's into the attributes and let the job pack/unpack take care of them instead of singling them out. Add the envars just prior to forking the children instead of into the launch message itself. Remove a few #if CR as the attributes functionality can handle this condition now. This commit was SVN r32133.	2014-07-03 15:58:13 +00:00
Adrian Reber	cabf1d4e68	use the orte attributes in the FT code to fix compile errors This commit was SVN r32093.	2014-06-26 03:19:17 +00:00
Nathan Hjelm	563eaf0726	Fix support for Cray alps The alps ras and plm components were broken by recent changes in ORTE. This commit resolves those issues. Changes: - Define PMI2_SUCCESS if it isn't defined. This fixes a problem with Cray's PMI implementation which does not define (for some reason) PMI2_SUCCESS. We had previously just used PMI_SUCCESS. - Add missing definition and a typo in pml_alps_module. - launch_id is no longer available in the orte_node_t structure. Use the attribute lookup to get the value. - Do not use an O(n^2) sorting algorithm when putting alps nodes in order. Use opal_list_sort instead (O(nlogn)). This commit was SVN r32076.	2014-06-24 21:29:04 +00:00
Ralph Castain	3f032d39e8	Mark the proc as alive so waitpid callback system doesn't immediately activate the callback Refs trac:4717 This commit was SVN r32026. The following Trac tickets were found above: Ticket 4717 --> https://svn.open-mpi.org/trac/ompi/ticket/4717	2014-06-18 14:04:55 +00:00
Ralph Castain	8e7c0257f0	Cleanup some missed updates to orte_wait_cb as params have changed Refs trac:4717 This commit was SVN r32025. The following Trac tickets were found above: Ticket 4717 --> https://svn.open-mpi.org/trac/ompi/ticket/4717	2014-06-17 23:40:31 +00:00
Ralph Castain	5216bd5558	Multiple sigchld reports can occur within a single event callback, so have to reap them until none remain. Also, need to ensure the daemon is flagged as alive prior to calling wait_cb Refs trac:4717 This commit was SVN r32020. The following Trac tickets were found above: Ticket 4717 --> https://svn.open-mpi.org/trac/ompi/ticket/4717	2014-06-17 18:46:40 +00:00
Ralph Castain	42bf7466fc	This isn't as big a change as it appears - a change in one place caused a whole bunch of files to require updated #include's due to some arcane linkage. Rework the orte_wait code to reflect the introduction of the state machine. If we are in cleanup mode and just want to kill all our local children, then there is no reason to be polite about it as that introduces very long delays at scale. Just kill the procs and move on. Refs trac:4717 This commit was SVN r32019. The following Trac tickets were found above: Ticket 4717 --> https://svn.open-mpi.org/trac/ompi/ticket/4717	2014-06-17 17:57:51 +00:00
Ralph Castain	b2413a6b88	Cannot update the proc state prior to activating the state machine as some callback functions need to compare the prior proc state against the new one. cmr=v1.8.2:reviewer=jsquyres This commit was SVN r31949.	2014-06-04 03:40:08 +00:00
Ralph Castain	c5384d44d7	Protect against NULL result in get_attr This commit was SVN r31947.	2014-06-04 03:09:37 +00:00
Ralph Castain	f1978fba7c	Cleanup a set of typos on the orte_get_attribute call This commit was SVN r31942.	2014-06-03 20:36:38 +00:00
Ralph Castain	5668f085a3	Silence some useless warnings, and fix a missed update in the tm plm This commit was SVN r31930.	2014-06-02 17:57:56 +00:00
Ralph Castain	742c0d2284	Fix typo that would cause a segfault if orte_startup_timeout was set This commit was SVN r31929.	2014-06-02 15:59:18 +00:00
Ralph Castain	65a35d92ef	Cleanup compile issues - missing updates to some plm components and the slurm ras component This commit was SVN r31921.	2014-06-01 17:59:06 +00:00
Ralph Castain	8736a1c138	Per RFC: http://www.open-mpi.org/community/lists/devel/2014/05/14822.php Revamp the ORTE global data structures to reduce memory footprint and add new features. Add ability to control/set cpu frequency, though this can only be done if the sys admin has setup the system to support it (or you run as root). This commit was SVN r31916.	2014-06-01 16:14:10 +00:00
Nathan Hjelm	041b72b0cc	plm/alps: better workaround for the noisy cray pmi implementation This commit is a slightly better workaround to prevent messages of the form: [unset]:_pmi_alps_get_apid:alps_app_lli_put_request failed [unset]:_pmi_alps_get_appLayout:pmi_alps_get_apid returned with error: Bad file descriptor It works by completely disabling PMI in the application process when using mpirun. This should not be an issue for any apps. cmr=v1.8.2:reviewer=rhc This commit was SVN r31882.	2014-05-22 16:04:36 +00:00
Nathan Hjelm	2a57e71a47	plm/alps: fix typo introduced in r31589 This commit was SVN r31747. The following SVN revision numbers were found above: r31589 --> open-mpi/ompi@445b552d3a	2014-05-13 22:36:54 +00:00
Ralph Castain	5602156a1c	Use the correct abstraction layer name for the data dirs This commit was SVN r31684.	2014-05-08 14:32:24 +00:00
Ralph Castain	11faab1091	The final step of the RFC: convert the <foo>libdir and friends to fit their respective code areas, and equate them all at the top. Note that we can't entirely separate things as the opal_install_dirs framework can't handle separated locations for the various trees. This commit was SVN r31679.	2014-05-08 02:01:35 +00:00
Ralph Castain	445b552d3a	Try again to get an error message printed when a daemon fails to successfully report back to mpirun. In this case, there is no guaranteed way for the daemon to output the error report itself - we don't have a connection back to the HNP, and we have tied stderr off to /dev/null (for good reasons). So the HNP has to detect the failure itself and report it. The HNP can't know the precise reason, of course - all it knows is that the daemon failed. So output a generic error message that provides guidance on probable causes. Refs trac:4571 This commit was SVN r31589. The following Trac tickets were found above: Ticket 4571 --> https://svn.open-mpi.org/trac/ompi/ticket/4571	2014-05-01 19:48:21 +00:00
Ralph Castain	238ecea311	When we comm_spawn, we really want to respect the original -host directives and not expand the daemon virtual machine unless directed to do so in the comm_spawn command. Otherwise, we will automatically launch daemons on every node in the allocation. cmr=v1.8.2:reviewer=rhc:subject=respect vm boundaries during comm_spawn This commit was SVN r31578.	2014-04-30 22:26:18 +00:00
Jeff Squyres	ea4c916096	plm_slurm_module.c: don't leave the extra fd to /dev/null open Prior to r29058, this same logic was in place (i.e., ensure that the extra fd to /dev/null is closed). It looks like it was accidentally removed in the ORTE conversion to the state machine in r29058. This ''might'' have something to do with many hangs that we're seeing in Cisco MTT with jobs that exhibit failure (e.g., call MPI_ABORT)...? cmr=v1.8.2:reviewer=rhc This commit was SVN r31469. The following SVN revision numbers were found above: r29058 --> open-mpi/ompi@a200e4f865	2014-04-21 20:09:15 +00:00
Ralph Castain	a368e84e70	Per the RFC, remove the sensor framework from the ORTE code area, relocating it offsite to the ORCM code area. Also update some ignores to ensure we don't pickup crosstalk in components This commit was SVN r31403.	2014-04-15 21:48:24 +00:00
Nathan Hjelm	9df795d1dd	plm/alps: silence annoying warning message when using Cray PMI 3.x or newer This commit adds a workaround for messages printed by the Cray PMI library when launching using mpirun. We are still talking with Cray to find a better fix but this will silence the warnings for now. cmr=v1.8.1:reviewer=manjugv This commit was SVN r31352.	2014-04-08 21:54:10 +00:00
Dave Goodell	19efa09540	plm/slurm: tweak /dev/null usage (#4489 ) See the ticket for more details. cmr=v1.8.1:reviewer=rhc:ticket=4489 This commit was SVN r31351. The following Trac tickets were found above: Ticket 4489 --> https://svn.open-mpi.org/trac/ompi/ticket/4489	2014-04-08 21:46:07 +00:00
Ralph Castain	957c9ecf53	Okay, silence the anality by simplifying the already irrelevant code, thus allowing us to turn our attention to things that actually matter Refs trac:4489 This commit was SVN r31348. The following Trac tickets were found above: Ticket 4489 --> https://svn.open-mpi.org/trac/ompi/ticket/4489	2014-04-08 19:51:11 +00:00
Ralph Castain	8ce98ccc8d	Not sure when this got messed up, but correct the stdout/stderr redirection on the srun command so we don't get all those slurm warnings cmr=v1.8.1:reviewer=dgoodell:subject=silence srun warning output This commit was SVN r31308.	2014-04-04 04:23:31 +00:00
Ralph Castain	3fdcaeab97	Fix a problem where we need to abort due to a mapping failure, but we are in a managed environment and thus the orteds have not wired up. Thus, if we send the exit message across the routed network, the remote daemons won't have a way to relay the message along - and we won't exit. If we are aborting, then set the flags so the HNP directly sends an exit command to each daemon. Make it the halt_vm command so the remote daemon doesn't try to relay it, but instead just exits without waiting for its routed children to exit first. cmr=v1.8.1:reviewer=jsquyres:subject=fix hangs due to abort prior to daemon wireup This commit was SVN r31304.	2014-04-02 04:17:55 +00:00
Ralph Castain	70ee3fb000	Ensure that orted's are not bound to single processors if the TaskAffinity option is set by default. Thanks to Artem Polyakov for the patch, and for his patience in explaining the situation. Reviewed with Moe Jette to ensure this was correct, and confirmed by me. RM-approved cmr=v1.8:reviewer=ompi-gk1.8 This commit was SVN r31288.	2014-03-29 18:30:38 +00:00
Ralph Castain	bd9bd2ff16	Be consistent in our handling of the "only HNP in allocation" case when setting up the VM. Thanks to Tetsuya Mishima for the suggestion. cmr=v1.8:reviewer=rhc This commit was SVN r31195.	2014-03-24 15:28:09 +00:00
Ralph Castain	d17f811ff5	Surrender to the tyranny of C++ and give up on enum for node states, as nice as that would be, in favor of retaining memory footprint constraints. This commit was SVN r31149.	2014-03-19 16:15:24 +00:00
Ralph Castain	0aa23cdc35	Cleanup copy/paste errors to ensure we progress the launch cmr=v1.7.5:reviewer=rhc This commit was SVN r31102.	2014-03-18 01:24:49 +00:00
Ralph Castain	45196d222b	Minor cleanup of the node state definitions - using the enum allows the debuggers to pretty-print the value This commit was SVN r31090.	2014-03-17 21:27:58 +00:00
Ralph Castain	b248b27637	Remove a check that prevented mpirun from exiting when it should in the single-node case Refs trac:4393 This commit was SVN r31080. The following Trac tickets were found above: Ticket 4393 --> https://svn.open-mpi.org/trac/ompi/ticket/4393	2014-03-15 15:25:44 +00:00
Ralph Castain	fbc5e3b773	Deal with the corner case where we encounter an error when attempting to launch a daemon. In this case, we will order abnormal termination before daemons callback to us, and thus any attempt to send them a "die" message will fail. Ensure that mpirun at least exits cleanly in this scenario, thereby allowing the remote daemons that did get launched to commit suicide when comm fails. cmr=v1.7.5:reviewer=jsquyres This commit was SVN r31068.	2014-03-14 15:32:30 +00:00
Adrian Reber	7304b700e1	Fix the newly added FT event state when compiling --with-ft This commit was SVN r30988.	2014-03-11 13:20:08 +00:00
Ralph Castain	7a44af375c	Add an FT event state and set the state machine to callback to the OOB base ft event when activated This commit was SVN r30950.	2014-03-06 02:44:29 +00:00
Ralph Castain	c9465d97b4	Resolve a race condition when responding to a SIGTERM to ensure that any final message from the application is correctly output. Remove a duplicate command, reduce the priority of the daemon exit command to MSG so that the IOF will have a chance to output cached messages. Update the signal trapping test. Thanks to Paul Kapinos for reporting the problem. cmr=v1.7.5:reviewer=jsquyres:subject=resolve a race condition This commit was SVN r30942.	2014-03-05 04:38:17 +00:00
Ralph Castain	0ac97761cc	Now that we are binding by default, the issue of #slots and what to do when oversubscribed has become a bit more complicated. This isn't a problem in managed environments as we are always provided an accurate assignment for the #slots, or when -host is used to define the allocation since we automatically assume one slot for every time a node is named. The problem arises when a hostfile is used, and the user provides host names without specifying the slots= parameter. In these cases, we assign slots=1, but automatically allow oversubscription since that number isn't confirmed. We then provide a separate parameter by which the user can direct that we assign the number of slots based on the sensed hardware - e.g., by telling us to set the #slots equal to the #cores on each node. However, this has been set to "off" by default. In order to make this a little less complex for the user, set the default such that we automatically set #slots equal to #cores (or #hwt's if use_hwthreads_as_cpus has been set) only for those cases where the user provides names in a hostfile but does not provide slot information. Also clean up a couple of issues in the mapping/binding system: * ensure we only override the binding directive if we are oversubscribed and overload is not allowed * ensure that the MPI procs don't attempt to bind themselves if they are launched by an orted as any binding directive (no matter what it was) would have been serviced by the orted on launch * minor cleanup to the warning message when oversubscribed and binding was requested cmr=v1.7.5:reviewer=rhc:subject=update mapping/binding system This commit was SVN r30909.	2014-03-03 16:46:37 +00:00
Ralph Castain	0dc5f50d27	Add a plm component for local-only operation that doesn't require rsh/ssh to be installed. Requested by Fedora packagers for testing purposes. cmr=v1.7.5:reviewer=jsquyres:subject=Add a plm component for local-only operation This commit was SVN r30645.	2014-02-09 15:53:10 +00:00
Ralph Castain	1326ed704f	Per the RFC discussed here: http://www.open-mpi.org/community/lists/devel/2014/01/13789.php add support for async modex when requested. cmr=v1.7.5:reviewer=jsquyres:subject=Add async modex support This commit was SVN r30565.	2014-02-05 14:39:27 +00:00
Adrian Reber	fde1040d2f	Use unique collective ids for the checkpoint/restart code This commit was SVN r30552.	2014-02-04 14:03:05 +00:00
Ralph Castain	53b1be5067	Only report launch progress when specifically requested to do so. Thanks to Tetsuya Mishima for spotting it. Reviewed by rhc and RM-approved cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r30434.	2014-01-27 15:17:42 +00:00
Ralph Castain	f73d23e723	Correct the location of the counter when tracking process launch for reporting progress cmr=v1.7.4:reviewer=hjelmn This commit was SVN r30415.	2014-01-24 21:03:05 +00:00
Ralph Castain	e3cb4b4a5b	Grant Nathan his wish - add a --disable-getpwuid option to the configure options and protect all users of that code so it disappears if disabled. cmr=v1.7.5:reviewer=hjelmn:subject=disable getpwuid if requested This commit was SVN r30413.	2014-01-24 19:18:37 +00:00
Ralph Castain	fcdd904af4	Simplify and update hostfile handling to correctly support hostfiles that list nodes multiple times, once for each slot, and those that list a host once and include an explicit slot count. Eliminate support for mixing those two modes as this logic became just too complex when attempting to handle all the corner cases. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30325.	2014-01-18 16:08:40 +00:00
Ralph Castain	4cdc291df1	Ensure slurm properly dies on abnormal termination cmr=v1.7.4:reviewer=jsquyres:subject=Ensure slurm properly dies on abnormal termination This commit was SVN r30182.	2014-01-09 16:52:02 +00:00
Ralph Castain	80497d73cf	Need to mark the daemon as alive so that exit commands are properly routed during abnormal terminations. Also, remove stale references to the "selected oob component" as we no longer require only one component be selected cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30162.	2014-01-08 22:35:48 +00:00
Brian Barrett	8b778903d8	Fix longstanding issue with our multi-project support. Rather than using pkg{data,lib,includedir}, use our own ompi{data,lib,includedir}, which is always set to {datadir,libdir,includedir}/openmpi. This will keep us from having help files in prefix/share/open-rte when building without Open MPI, but in prefix/share/openmpi when building with Open MPI. This commit was SVN r30140.	2014-01-07 22:11:15 +00:00
Nathan Hjelm	3be4536d9b	Cleanup various leaks in ompi_info reported by valgrind. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30058.	2013-12-23 17:47:43 +00:00
Ralph Castain	71b52fe861	Ensure that comm_spawn'd procs get user-specified forwarded envars Thanks to Tim Miller for reporting the regression from the 1.6 series cmr=v1.7.4:reviewer=jsquyres:subject=Ensure that comm_spawn'd procs get user-specified forwarded envars This commit was SVN r30012.	2013-12-20 14:47:35 +00:00
Adrian Reber	b42aad44a3	Trying to get the C/R code to compile again. This patch includes various fixes all over the C/R code which are hard to group like the other patches. Changes from V1: * explain why mca_base_component_distill_checkpoint_ready no longer works * compare return result of opal functions with OPAL_* values Changes from V2: * use orte_rml_oob_ft_event() instead of referencing through the modules * properly protect variable (thanks to --enable-picky) This commit was SVN r29922.	2013-12-16 15:35:28 +00:00
Jeff Squyres	770bf77149	Fix some minor memory leaks in error code paths. Many thanks to Tom Fogal for the patch. cmr=v1.7.4:reviewer=rhc:subject=Fix minor memory leaks in error code paths This commit was SVN r29905.	2013-12-14 00:41:21 +00:00
Jeff Squyres	2e7653e4c2	Add missing argv.h includes. Noticed these as part of #3694: external libevent's don't cause argv.h to automatically get included. Refs trac:3694 This commit was SVN r29897. The following Trac tickets were found above: Ticket 3694 --> https://svn.open-mpi.org/trac/ompi/ticket/3694	2013-12-13 21:17:36 +00:00
Ralph Castain	83e59e6761	Once again, the Slurm folks have decided to redefine their envars, reversing what they had previously told us to do. So cleanup the Slurm allocation code, and also adjust to a change in srun behavior that now aborts a job if the ntasks-per-node doesn't get specified when ORTE calls it, but the user specified it when getting an allocation. Sigh. cmr=v1.7.4:reviewer=miked:subject=Update Slurm allocation and launch This commit was SVN r29849.	2013-12-09 17:58:46 +00:00
Ralph Castain	f1e510154c	Revise the launch timeout detection so we don't mistakenly declare "failed to start". Recognize that timeout is at the per-job level, and define the timeout param as a total value instead of seconds/daemon as it otherwise can get to be an enormous (and useless) number. Resolves problems in loop_spawn where the timer was incorrectly firing and killing the overall job. cmr=v1.7.4:reviewer=hjelmn This commit was SVN r29661.	2013-11-11 23:50:40 +00:00
Ralph Castain	604970a1a2	Initialize orte_coprocessors hash table to NULL. Delay coprocessor detection on HNP until after node topology final definition in case rmaps changes it. Minor spacing change. Refs trac:3847 This commit was SVN r29504. The following Trac tickets were found above: Ticket 3847 --> https://svn.open-mpi.org/trac/ompi/ticket/3847	2013-10-24 00:08:47 +00:00
Ralph Castain	f5920e9312	Revert r29489. This function only executes in the HNP. In orte/mca/ess/hnp/ess_hnp_module.c, we already check for local coprocessors and add them to the hash table if found. Thus, r29489 simply overwrote what was already present. The data for each remote daemon is added later in the daemon callback function. Only the HNP retains info in the hash table. If it is desirable to have each daemon retain its own coprocessor info, then this must be done in orte/mca/ess/base/ess_base_std_orted.c. This commit was SVN r29497. The following SVN revision numbers were found above: r29489 --> open-mpi/ompi@2e2794fa15	2013-10-23 22:35:24 +00:00
Nathan Hjelm	2e2794fa15	Fix coprocessor detection by always adding the local daemon's co-processors to the hash table. Tested and working on a system with 2 Xeon Phi co-processors. cmr=v1.7.4:ticket=3847:reviewer=ompi-rm1.7 This commit was SVN r29489. The following Trac tickets were found above: Ticket 3847 --> https://svn.open-mpi.org/trac/ompi/ticket/3847	2013-10-23 15:56:23 +00:00
Ralph Castain	960a255e7f	Do some cleanup of the --without-hwloc build - no need to work on coprocessors since we can't detect them anyway, cleanup some unused variables in the ppr mapper This commit was SVN r29476.	2013-10-23 01:45:21 +00:00
Ralph Castain	b12167abef	Per a good suggestion from Jeff, make the coprocessor mapping more scalable by using a hash table to cache the coprocessor list, and then do a single pass thru the nodes at the end to assign hostid's. Refs trac:3847 This commit was SVN r29439. The following Trac tickets were found above: Ticket 3847 --> https://svn.open-mpi.org/trac/ompi/ticket/3847	2013-10-14 22:01:48 +00:00
Ralph Castain	24c811805f	************************************************************** This change contains a non-mandatory modification of the MPI-RTE interface. Anyone wishing to support coprocessors such as the Xeon Phi may wish to add the required definition and underlying support ************************************************************** Add locality support for coprocessors such as the Intel Xeon Phi. Detecting that we are on a coprocessor inside of a host node isn't straightforward. There are no good "hooks" provided for programmatically detecting that "we are on a coprocessor running its own OS", and the ORTE daemon just thinks it is on another node. However, in order to properly use the Phi's public interface for MPI transport, it is necessary that the daemon detect that it is colocated with procs on the host. So we have to split the locality to separately record "on the same host" vs "on the same board". We already have the board-level locality flag, but not quite enough flexibility to handle this use-case. Thus, do the following: 1. add OPAL_PROC_ON_HOST flag to indicate we share a host, but not necessarily the same board 2. modify OPAL_PROC_ON_NODE to indicate we share both a host AND the same board. Note that we have to modify the OPAL_PROC_ON_LOCAL_NODE macro to explicitly check both conditions 3. add support in opal/mca/hwloc/base/hwloc_base_util.c for the host to check for coprocessors, and for daemons to check to see if they are on a coprocessor. The former is done via hwloc, but support for the latter is not yet provided by hwloc. So the code for detecting we are on a coprocessor currently is Xeon Phi specific - hopefully, we will find more generic methods in the future. 4. modify the orted and the hnp startup so they check for coprocessors and to see if they are on a coprocessor, and have the orteds pass that info back in their callback message. 
Automatically detect that coprocessors have been found and identify which coprocessors are on which hosts. Note that this algo isn't scalable at the moment - this will hopefully be improved over time. 5. modify the ompi proc locality detection function to look for coprocessor host info IF the OMPI_RTE_HOST_ID database key has been defined. RTE's that choose not to provide this support do not have to do anything - the associated code will simply be ignored. 6. include some cleanup of the hwloc open/close code so it conforms to how we did things in other frameworks (e.g., having a single "frame" file instead of open/close). Also, fix the locality flags - e.g., being on the same node means you must also be on the same cluster/cu, so ensure those flags are also set. cmr:v1.7.4:reviewer=hjelmn This commit was SVN r29435.	2013-10-14 16:52:58 +00:00
Ralph Castain	f4f2287958	Singletons currently start out by spawning an HNP - this is required solely in the cases where the singleton subsequently calls MPI_Comm_spawn or publishes port info without support from an external orte-server. In all other cases, the HNP is of no value and can actually be a detriment by creating additional overhead on the node. This is particularly concerning for async operations where processes may begin as singletons and then dynamically wireup to perform pt2pt communications. So we now allow singletons to start on their own, only spawning an HNP when initiating an operation that actually requires it. cmr:v1.7.4:reviewer=jsquyres This commit was SVN r29354.	2013-10-04 02:58:26 +00:00
Ralph Castain	6522963b9c	Flag that a daemon has been launched when it reports back to the HNP so we avoid re-launching it on spawns against dynamic allocations cmr:v1.7.3:reviewer=jsquyres This commit was SVN r29245.	2013-09-25 16:58:19 +00:00
Ralph Castain	23c8848157	Only connect the first time thru the Torque launch, remove stale code cmr:v1.7.3:reviewer=jsquyres This commit was SVN r29227.	2013-09-22 23:53:57 +00:00
Ralph Castain	d32dfc96be	Use the rankfile to obtain list of nodes for VM launch if/when rankfile is given. cmr:v1.7.3:reviewer=jsquyres:subject=Obtain VM nodes from rankfile This commit was SVN r29119.	2013-09-04 16:37:30 +00:00
Ralph Castain	43d1cd92ac	Ensure we activate the "daemons launched" state when only the HNP is left or else we will hang. cmr:v1.7.3:reviewer=jsquyres This commit was SVN r29094.	2013-08-29 22:50:51 +00:00
Ralph Castain	a200e4f865	As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: * THIS RFC INCLUDES A MINOR CHANGE TO THE MPI-RTE INTERFACE * Note: during the course of this work, it was necessary to completely separate the MPI and RTE progress engines. There were multiple places in the MPI layer where ORTE_WAIT_FOR_COMPLETION was being used. A new OMPI_WAIT_FOR_COMPLETION macro was created (defined in ompi/mca/rte/rte.h) that simply cycles across opal_progress until the provided flag becomes false. Places where the MPI layer blocked waiting for RTE to complete an event have been modified to use this macro. *************************************************************************************** I am reissuing this RFC because of the time that has passed since its original release. Since its initial release and review, I have debugged it further to ensure it fully supports tests like loop_spawn. It therefore seems ready for merge back to the trunk. Given its prior review, I have set the timeout for one week. The code is in https://bitbucket.org/rhc/ompi-oob2 WHAT: Rewrite of ORTE OOB WHY: Support asynchronous progress and a host of other features WHEN: Wed, August 21 SYNOPSIS: The current OOB has served us well, but a number of limitations have been identified over the years. Specifically: * it is only progressed when called via opal_progress, which can lead to hangs or recursive calls into libevent (which is not supported by that code) * we've had issues when multiple NICs are available as the code doesn't "shift" messages between transports - thus, all nodes had to be available via the same TCP interface. 
* the OOB "unloads" incoming opal_buffer_t objects during the transmission, thus preventing use of OBJ_RETAIN in the code when repeatedly sending the same message to multiple recipients * there is no failover mechanism across NICs - if the selected NIC (or its attached switch) fails, we are forced to abort * only one transport (i.e., component) can be "active" The revised OOB resolves these problems: * async progress is used for all application processes, with the progress thread blocking in the event library * each available TCP NIC is supported by its own TCP module. The ability to asynchronously progress each module independently is provided, but not enabled by default (a runtime MCA parameter turns it "on") * multi-address TCP NICs (e.g., a NIC with both an IPv4 and IPv6 address, or with virtual interfaces) are supported - reachability is determined by comparing the contact info for a peer against all addresses within the range covered by the address/mask pairs for the NIC. * a message that arrives on one TCP NIC is automatically shifted to whatever NIC that is connected to the next "hop" if that peer cannot be reached by the incoming NIC. If no TCP module will reach the peer, then the OOB attempts to send the message via all other available components - if none can reach the peer, then an "error" is reported back to the RML, which then calls the errmgr for instructions. * opal_buffer_t now conforms to standard object rules re OBJ_RETAIN as we no longer "unload" the incoming object * NIC failure is reported to the TCP component, which then tries to resend the message across any other available TCP NIC. If that doesn't work, then the message is given back to the OOB base to try using other components. 
If all that fails, then the error is reported to the RML, which reports to the errmgr for instructions * obviously from the above, multiple OOB components (e.g., TCP and UD) can be active in parallel * the matching code has been moved to the RML (and out of the OOB/TCP component) so it is independent of transport * routing is done by the individual OOB modules (as opposed to the RML). Thus, both routed and non-routed transports can simultaneously be active * all blocking send/recv APIs have been removed. Everything operates asynchronously. KNOWN LIMITATIONS: * although provision is made for component failover as described above, the code for doing so has not been fully implemented yet. At the moment, if all connections for a given peer fail, the errmgr is notified of a "lost connection", which by default results in termination of the job if it was a lifeline * the IPv6 code is present and compiles, but is not complete. Since the current IPv6 support in the OOB doesn't work anyway, I don't consider this a blocker * routing is performed at the individual module level, yet the active routed component is selected on a global basis. We probably should update that to reflect that different transports may need/choose to route in different ways * obviously, not every error path has been tested nor necessarily covered * determining abnormal termination is more challenging than in the old code as we now potentially have multiple ways of connecting to a process. Ideally, we would declare "connection failed" when all transports can no longer reach the process, but that requires some additional (possibly complex) code. For now, the code replicates the old behavior only somewhat modified - i.e., if a module sees its connection fail, it checks to see if it is a lifeline. If so, it notifies the errmgr that the lifeline is lost - otherwise, it notifies the errmgr that a non-lifeline connection was lost. 
* reachability is determined solely on the basis of a shared subnet address/mask - more sophisticated algorithms (e.g., the one used in the tcp btl) are required to handle routing via gateways * the RML needs to assign sequence numbers to each message on a per-peer basis. The receiving RML will then deliver messages in order, thus preventing out-of-order messaging in the case where messages travel across different transports or a message needs to be redirected/resent due to failure of a NIC This commit was SVN r29058.	2013-08-22 16:37:40 +00:00
Nathan Hjelm	841ed962f6	fix MCA variable and component system leaks cmr=v1.7.3:reviewer=rhc This commit was SVN r29011.	2013-08-09 19:50:28 +00:00
Nathan Hjelm	299d5b3dd7	Fix two debugger attach bugs. - orte_debugger_init_after_spawn was not being called for debuggers that use the MPIR_attach_fifo to co-locate debugger daemons. - MPIR_Breakpoint was not getting called if a debugger reattached. Add a job state (ORTE_JOB_STATE_DEBUGGER_DETACH) to reset mpir_breakpoint_fired to false when a debugger detaches to ensure MPIR_Breakpoint is called if another debugger attaches. Tested with STAT 2.0/launchmon 1.0. cmr:v1.7 This commit was SVN r28665.	2013-06-20 16:18:05 +00:00
Ralph Castain	f15fe5045e	Ensure that debugger connect can occur by getting the rml contact info updated before calling init_after_spawn cmr:v1.7.3,reviewer=jsquyres This commit was SVN r28455.	2013-05-06 22:00:45 +00:00
Jeff Squyres	42a9a4c62c	After examining a lot of MTT output with Ralph, fix the cause of many, many MTT timeouts when running jobs under SLURM: send the right command at the end to cause remote orteds to shut down. This commit was SVN r28438.	2013-05-02 00:23:53 +00:00
Ralph Castain	76426285f0	Cannot retain opal_buffer_t, so use a copy This commit was SVN r28302.	2013-04-07 23:02:59 +00:00
Ralph Castain	698b4ad6e7	Fix the parameter handling so no-tree-spawn isn't getting reversed This commit was SVN r28300.	2013-04-07 15:48:25 +00:00
Ralph Castain	e6ae088813	Cleanup error outputs when a daemon fails to start This commit was SVN r28261.	2013-03-28 16:51:19 +00:00
Nathan Hjelm	c041156f60	Update ORTE frameworks to use the MCA framework system. This commit was SVN r28240.	2013-03-27 21:14:43 +00:00
Nathan Hjelm	cf377db823	MCA/base: Add new MCA variable system Features: - Support for an override parameter file (openmpi-mca-param-override.conf). Variable values in this file cannot be overridden by any file or environment value. - Support for boolean, unsigned, and unsigned long long variables. - Support for true/false values. - Support for enumerations on integer variables. - Support for MPIT scope, verbosity, and binding. - Support for command line source. - Support for setting variable source via the environment using OMPI_MCA_SOURCE_<var name>=source (either command or file:filename) - Cleaner API. - Support for variable groups (equivalent to MPIT categories). Notes: - Variables must be created with a backing store (char *, int, or bool *) that must live at least as long as the variable. - Creating a variable with the MCA_BASE_VAR_FLAG_SETTABLE flag enables the use of mca_base_var_set_value() to change the value. - String values are duplicated when the variable is registered. It is up to the caller to free the original value if necessary. The new value will be freed by the mca_base_var system and must not be freed by the user. - Variables with constant scope may not be settable. - Variable groups (and all associated variables) are deregistered when the component is closed or the component repository item is freed. This prevents a segmentation fault from accessing a variable after its component is unloaded. - After some discussion we decided we should remove the automatic registration of component priority variables. Few components actually made use of this feature. - The enumerator interface was updated to be general enough to handle future uses of the interface. - The code to generate ompi_info output has been moved into the MCA variable system. See mca_base_var_dump(). 
opal: update core and components to mca_base_var system orte: update core and components to mca_base_var system ompi: update core and components to mca_base_var system This commit also modifies the rmaps framework. The following variables were moved from ppr and lama: rmaps_base_pernode, rmaps_base_n_pernode, rmaps_base_n_persocket. Both lama and ppr create synonyms for these variables. This commit was SVN r28236.	2013-03-27 21:09:41 +00:00
Ralph Castain	2f43989d22	Add debug and handle the use-case where someone (a) uses a hostfile while in a managed allocation to sub-allocate runs, and (b) includes the HNP's node in one of those hostfiles. cmr:v1.7 This commit was SVN r28203.	2013-03-22 00:53:33 +00:00
Ralph Castain	147c6ff9e7	Clean out the cruft leftover from the use_common_ports experiment cmr:v1.7 This commit was SVN r28184.	2013-03-20 15:07:43 +00:00
Ralph Castain	cf9796accd	Remove the old configure option for disabling full rte support - we now use the OMPI rte framework for such purposes This commit was SVN r28134.	2013-02-28 01:35:55 +00:00
Ralph Castain	8d2fa3693b	First cut at removing the native Windows support. Remove all the Windows-specific components, and the .windows files sprinkled around. Remove the Windows platform files and MTT scripts. Update the NEWS to point Windows users to the cygwin package. This commit was SVN r28116.	2013-02-26 20:44:56 +00:00
Brian Barrett	b8442ba505	Revamp the handling of wrapper compiler flags. The user flags, main configure flags, and mca flags are kept separate until the very end. The main configure wrapper flags should now be modified by using the OPAL_WRAPPER_FLAGS_ADD macro. MCA components should either let <framework>_<component>_{LIBS,LDFLAGS} be copied over OR set <framework>_<component>_WRAPPER_EXTRA_{LIBS,LDFLAGS}. The situations in which WRAPPER CPPFLAGS can be set by MCA components were made very small to match the one use case where it makes sense. This commit was SVN r27950.	2013-01-29 00:00:43 +00:00
Ralph Castain	e4673f3283	Add new job state This commit was SVN r27878.	2013-01-20 00:30:27 +00:00
Nathan Hjelm	6a9ab9b221	Change orte_startup_timeout to be in seconds and remove the 10 second maximum This commit was SVN r27741.	2013-01-03 23:56:34 +00:00
Ralph Castain	f2ec35536e	Fix a bug that prevented MCA params from being forwarded to daemons upon launch cmr:v1.7 This commit was SVN r27621.	2012-11-18 17:55:26 +00:00
Ralph Castain	e11f32038a	Add an MCA param to retain all aliases based on IP addrs for node names so that procs can look them up by interface, if desired. If the param is set, pass aliases around to all daemons and procs for local use This commit was SVN r27619.	2012-11-16 04:04:29 +00:00
Ralph Castain	a6325e4546	Silence compiler warning This commit was SVN r27590.	2012-11-12 02:51:29 +00:00
Nathan Hjelm	bdedd8b0d3	Per RFC modify the behavior of mca_base_components_close to NOT close the output. Modify frameworks to always close their output and set to -1. Reasoning: The old behavior was a little confusing. mca_base_components_open does not open an output stream so it is a little unexpected that mca_base_components_close does. To add to this, several frameworks (that don't use mca_base_components_close) failed to close their output in the framework close function and others closed their output a second time. This change is an improvement to the semantics of mca_base_components_open/close as they are now symmetric in their functionality. This commit was SVN r27570.	2012-11-06 19:09:26 +00:00
Brian Barrett	e61c00212d	Add files found in svn but not tarball This commit was SVN r27549.	2012-11-01 02:27:03 +00:00
Nathan Hjelm	2acd0f83de	Revert "Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter". It appears the problem was not with the command line parser but the rsh plm. I don't know why this problem was not occurring before the command line parser changes but it appears to be resolved now. This commit was SVN r27527. The following SVN revision numbers were found above: r27451 --> open-mpi/ompi@d59034e6ef r27456 --> open-mpi/ompi@ecdbf34937	2012-10-30 19:45:18 +00:00
Nathan Hjelm	df9bd0ed59	fix bug in plm/rsh that could add extraneous mca options to the orted argv cmr:v1.7 This commit was SVN r27526.	2012-10-30 19:40:04 +00:00
Ralph Castain	e6014bf2e1	Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter This commit was SVN r27477. The following SVN revision numbers were found above: r27451 --> open-mpi/ompi@d59034e6ef r27456 --> open-mpi/ompi@ecdbf34937	2012-10-24 18:38:44 +00:00
Ralph Castain	7574d6673b	If someone provides the launch_agent cmd, then don't prefix it cmr:v1.7 This commit was SVN r27473.	2012-10-24 16:14:04 +00:00
Nathan Hjelm	d59034e6ef	MCA: remove deprecated mca_base_param functions (mca_base_param_register_int, mca_base_param_register_string, mca_base_param_environ_variable). Remove all uses of deprecated functions. cmr:v1.7 This commit was SVN r27451.	2012-10-17 20:17:37 +00:00
Ralph Castain	285a3b168d	Add an ability to specify the max number of simultaneous procs/node for an application when operating in staged mode. Change some debug statements from OPAL_OUTPUT_VERBOSE to opal_output_verbose so they are available in optimized builds. This commit was SVN r27445.	2012-10-14 03:31:32 +00:00
Ralph Castain	f592967685	Add missing retain to maintain correct accounting on nodes This commit was SVN r27352.	2012-09-20 02:30:53 +00:00
Ralph Castain	78ccb097f0	Fix vm setup in unmanaged environments - needs to construct a node list in the same way we now do for mapping This commit was SVN r27256.	2012-09-07 01:53:19 +00:00
Ralph Castain	bae5dab916	If (and only if) a user requests, set the default number of slots on any node to the number of objects of the specified type. This only takes effect in an unmanaged environment - i.e., if an external resource manager assigns us a number of slots, then that is what we use. However, if we are using a hostfile, then the user may or may not have given us a value for the number of slots on each node. For those nodes (and only those nodes) where the user does not specify a slot count, we will set the number of slots according to their direction: either to the number of cores, numas, sockets, or hwthreads. Otherwise, the slot count is set to 1. Note that the default behavior remains unchanged: in the absence of any value for #slots, and in the absence of any directive to set #slots, we will set #slots=1. This commit was SVN r27236.	2012-09-04 20:58:26 +00:00
Ralph Castain	a3b08f5800	Fix a few things relating to comm_spawn that cause new daemons to be launched. Ensure that all new daemons receive a full pidmap. Properly mark the daemon job as "updated" when daemons are added This commit was SVN r27177.	2012-08-29 03:11:37 +00:00
Ralph Castain	98580c117b	Introduce staged execution. If you don't have adequate resources to run everything without oversubscribing, don't want to oversubscribe, and aren't using MPI, then staged execution lets you (a) run as many procs as there are available resources, and (b) start additional procs as others complete and free up resources. Adds a new mapper as well as a new state machine. Remove some stale configure.m4's we no longer need. Optimize the nidmaps a bit by only sending info that has changed each time, instead of sending a complete copy of everything. Makes no difference for the typical MPI job - only impacts things like staged execution where we are sending multiple (possibly many) launch messages. This commit was SVN r27165.	2012-08-28 21:20:17 +00:00
Ralph Castain	6e8c97c77c	Per Sam's eagle-eyed review, free the malloc'd memory if getcwd fails for some strange reason. This commit was SVN r27150.	2012-08-27 19:15:16 +00:00
Ralph Castain	b4a544ad2a	Per discussion with Josh, use the --preload-xxx cmd line options to broadcast files to all nodes. Add --set-cwd-to-session-dir option to start procs in their session directories. Add OMPI_FILE_LOCATION envar to tell procs where their prepositioned files went. This commit was SVN r27125.	2012-08-23 21:28:05 +00:00
Ralph Castain	ed4b354846	Ensure we pass along user-specified mca params from the cmd line when doing a tree spawn, but don't extend the cmd line with duplicates or things that shouldn't be there This commit was SVN r27117.	2012-08-22 21:41:50 +00:00
Ralph Castain	49a757e0bd	Silly me - now that all daemons are stripping their prefix on the backend, we no longer need to do it as they report This commit was SVN r27023.	2012-08-13 20:48:13 +00:00
Ralph Castain	b9b41d8662	For cases where the alpha+non-zero prefix must be removed from a node name, be sure to do it everywhere we access node names - otherwise, modex methods such as pmi will fail to correctly identify procs on the same node This commit was SVN r27022.	2012-08-13 20:44:56 +00:00
Ralph Castain	e3e9b7345d	First cut at updating the ccp launcher to use the state machine This commit was SVN r26986.	2012-08-10 17:09:33 +00:00
Ralph Castain	431d5361ed	For those who really preferred our prior mode of operation that mapped procs and only launched daemons on the nodes that had procs on them, introduce the "novm" state machine component. This recreates the old mode of operation by re-ordering the launch sequence so that we allocate, then map, and then launch daemons only on the reqd nodes (instead of across the entire allocation). This commit was SVN r26946.	2012-08-03 16:30:05 +00:00
Ralph Castain	6285f7d8c0	Per request of Shiqing, restore the ccp components This commit was SVN r26904.	2012-07-29 23:49:59 +00:00
Ralph Castain	94d11e04fd	Add an intermediate state when the VM is ready so that third party tools can take action prior to mapping/launching apps This commit was SVN r26902.	2012-07-28 15:33:09 +00:00
Ralph Castain	8bc6694a62	Ensure the daemons don't incorrectly declare a failed launch This commit was SVN r26875.	2012-07-26 19:05:06 +00:00
Ralph Castain	07846f12ae	Reconnect the rsh/ssh error reporting code for remote spawns to report failure to launch. Ensure the HNP correctly reports non-zero exit status when ssh encounters a problem. Thanks to Terry for spotting it! This commit was SVN r26868.	2012-07-25 21:46:45 +00:00
Jeff Squyres	e5cfad0c1a	This variable is only used in FT builds. This commit was SVN r26854.	2012-07-24 12:48:47 +00:00
Abhishek Kulkarni	5c58a1c9c1	Fix C/R support in the trunk. Among other things, this patch deals with the following issues: * fix ompi-checkpoint argument parsing * ompi-restart -showme prints an extraneous "Restarted child with PID" message. Move around the debug statement to avoid this. * fixes for the state machine changes This commit was SVN r26770.	2012-07-09 23:34:13 +00:00
Ralph Castain	b83fc41d54	Add a state that allows mpirun or other tools to be notified of a job completion prior to terminating so that alternative actions can be performed. This commit was SVN r26716.	2012-07-02 22:16:32 +00:00
Ralph Castain	0dfe29b1a6	Roll in the rest of the modex change. Eliminate all non-modex API access of RTE info from the MPI layer - in some cases, the info was already present (either in the ompi_proc_t or in the orte_process_info struct) and no call was necessary. This removes all calls to orte_ess from the MPI layer. Calls to orte_grpcomm remain required. Update all the orte ess components to remove their associated APIs for retrieving proc data. Update the grpcomm API to reflect transfer of set/get modex info to the db framework. Note that this doesn't recreate the old GPR. This is strictly a local db storage that may (at some point) obtain any missing data from the local daemon as part of an async methodology. The framework allows us to experiment with such methods without perturbing the default one. This commit was SVN r26678.	2012-06-27 14:53:55 +00:00
Ralph Castain	a34f09e67a	Ensure common port is off when not being used This commit was SVN r26666.	2012-06-26 16:09:58 +00:00
Ralph Castain	0103f82918	Turn off the common port for slurm for now This commit was SVN r26656.	2012-06-25 21:55:51 +00:00
Ralph Castain	e6f3586415	Remove the orte notifier framework, per discussion at the devel meeting and follow-up with Jeff (who took the action item) This commit was SVN r26637.	2012-06-22 18:09:23 +00:00
Ralph Castain	e9591f2563	Fix tree spawn in the rsh/qrsh environment This commit was SVN r26631.	2012-06-21 21:29:28 +00:00
Ralph Castain	96c778656a	Improve launch performance on clusters that use dedicated nodes by instructing the orteds to use the same port as the HNP, thus allowing them to "rollup" their initial callback via the routed network. This substantially reduces the HNP bottleneck and the number of ports opened by the HNP. Restore enable-static-ports option by default - the Cray will have to disable it to get around their library issues, but that's just a warning problem as opposed to blocking the build. This commit was SVN r26606.	2012-06-15 10:15:07 +00:00
Ralph Castain	ecc51d8583	Add missing endif This commit was SVN r26596.	2012-06-12 15:07:09 +00:00
Ralph Castain	269cb2b8d9	Some cleanup to remove calls to opal_progress when running with orte progress threads, and to ensure that all orte-related events are in the orte event base. This commit was SVN r26591.	2012-06-11 19:59:53 +00:00
Ralph Castain	0442a807c0	Default the OOB to the "ud" component IFF the HNP finds itself on a node with a supported Infiniband device. Ensure that the daemons all pick the matching component by dictating the selection via mca param on the orted cmd line. This commit was SVN r26582.	2012-06-08 01:23:08 +00:00
Ralph Castain	9bedb25dda	Cleanup some compiler warnings, some of which are actual logic errors This commit was SVN r26519.	2012-05-29 20:11:51 +00:00
Ralph Castain	be6ed9c2df	Allow partial use of allocations by specifying the max number of daemons (i.e., max VM size) for the job This commit was SVN r26499.	2012-05-27 16:48:19 +00:00
Ralph Castain	96bfeb591c	Ensure flag is passed to remote daemons This commit was SVN r26383.	2012-05-03 22:31:25 +00:00
Ralph Castain	45fee2b491	Resolve the case where only the HNP is in the system (i.e., single-node operation) This commit was SVN r26382.	2012-05-03 18:00:01 +00:00
Ralph Castain	b2f77bf08f	Extend the iof by adding two new components to support map-reduce IO chaining. Add a mapreduce tool for running such applications. Fix the state machine to support multiple jobs being simultaneously launched as this is not only required for mapreduce, but can happen under comm-spawn applications as well. This commit was SVN r26380.	2012-05-02 21:00:22 +00:00
Ralph Castain	3461809341	Fix reporting of launch progress so the numbers are correct and appear when they should This commit was SVN r26342.	2012-04-26 00:10:09 +00:00
Ralph Castain	71805bf7e4	Clear out the startup_timeout event if the job did in fact start. Have ORTE_TERMINATE use the job state macro so debug will show where it was called This commit was SVN r26334.	2012-04-25 01:05:17 +00:00
Ralph Castain	4d16790836	Fix collectives for jobs running across partial allocations This commit was SVN r26267.	2012-04-13 00:38:47 +00:00
Ralph Castain	19630ca28d	Remove stale code This commit was SVN r26252.	2012-04-07 13:33:40 +00:00
Ralph Castain	93bbeabc55	Remove stale code This commit was SVN r26251.	2012-04-07 13:33:30 +00:00
Ralph Castain	b6cde9a8d1	Remove stale code This commit was SVN r26250.	2012-04-07 13:33:18 +00:00
George Bosilca	319f76d66a	Low hanging fruit. Remove a declared but not defined function. This commit was SVN r26245.	2012-04-06 15:43:28 +00:00
Ralph Castain	ed197acaa2	Eliminate stale code This commit was SVN r26244.	2012-04-06 15:31:13 +00:00
Ralph Castain	bd8b4f7f1e	Sorry for mid-day commit, but I had promised on the call to do this upon my return. Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code. Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch. This commit was SVN r26242.	2012-04-06 14:23:13 +00:00
Ralph Castain	ceb34ed0c9	Fix typo This commit was SVN r26079.	2012-03-02 09:58:09 +00:00
Ralph Castain	b2f1bade37	Fix the -H localhost issue This commit was SVN r26071.	2012-02-29 16:56:00 +00:00
Ralph Castain	b3aabf1565	Cleanup the --without-hwloc build. Thanks to Paul Hargrove for reporting it broken. This commit was SVN r25931.	2012-02-15 11:08:57 +00:00
Ralph Castain	bba6508b4b	Handle the default hostfile case a little better... This commit was SVN r25928.	2012-02-15 03:33:49 +00:00
Ralph Castain	3f31feee6f	Handle the case where a user's rankfile specifies only cpus, and not socket:cpu pairs. This commit was SVN r25803.	2012-01-27 12:21:45 +00:00
Ralph Castain	07f3a91075	Okay, get srun to play nice. Problem was that everything worked fine so long as the user did "salloc" with an argument requesting a specific number of nodes. However, if the user specified instead a number of processes, then we launched that number of daemons - resulting in multiple daemons/node. Not good. So force things to behave correctly either way. This commit was SVN r25792.	2012-01-26 19:58:57 +00:00
Ralph Castain	1449b27e9f	Ensure that slurm only launches one orted/node, regardless of how the allocation was obtained. This commit was SVN r25790.	2012-01-26 19:23:15 +00:00
Jeff Squyres	3751495443	Add missing arguments for the new DYLD_LIBRARY_PATH stuff. This commit was SVN r25780.	2012-01-26 00:35:48 +00:00
Ralph Castain	079e4d9156	Per George's comment, just duplicate the lib path envars to provide both Linux and Mac compatible values This commit was SVN r25776.	2012-01-25 14:37:36 +00:00
Ralph Castain	8b115754e6	Fix typo This commit was SVN r25763.	2012-01-21 23:50:39 +00:00
Ralph Castain	469e40ace2	Expand the coverage a little when looking at remote shells for rsh. Prior patch (r25758) works only if both ends of the rsh/ssh connection are Mac. What we really want is to use the Mac version of ld_library_path when the remote end is Mac, regardless of the OS where mpirun is executing. So add a test for system type to the remote_shell test, and set the ld_library_path name to match the remote system type. This commit was SVN r25762. The following SVN revision numbers were found above: r25758 --> open-mpi/ompi@1afb77e603	2012-01-21 23:48:42 +00:00
Ralph Castain	1afb77e603	Mac requires setting DYLD_LIBRARY_PATH instead of the Linux standard LD_LIBRARY_PATH, so ensure we set that when using rsh to launch in Mac environments. Thanks to Teng Lin for the patch! This commit was SVN r25758.	2012-01-20 19:14:32 +00:00
Ralph Castain	9d556e2f17	Allow daemons to use PMI to get their name where PMI support is available while using the standard grpcomm and other capabilities. Remove the GNI code from the alps ess component as that component should only be for alps/cnos installations. This commit was SVN r25737.	2012-01-18 20:56:53 +00:00
Ralph Castain	6235a355de	Correctly handle co-spawning of daemons when attaching to a running job. We cannot use the general process mappers as we only want debugger daemons spawned on nodes where application procs already exist. So custom build the map for the debugger daemon job, and have the plm just launch that job without doing its usual vm-spawn step. This commit was SVN r25736.	2012-01-18 00:19:49 +00:00
Ralph Castain	bf103de66c	My apologies for doing this outside of the usual time restrictions, but we need to get this in so we can make progress. Move the ORTE-level debugger code back into orterun and out of the ORTE library to resolve symbol conflicts. This commit was SVN r25713.	2012-01-11 15:53:09 +00:00
Shiqing Fan	e3dfc49ced	make correct use of the newly updated structures in the Windows module. This commit was SVN r25699.	2012-01-09 11:08:34 +00:00
Ralph Castain	840841bb8f	Missed a couple This commit was SVN r25686.	2011-12-29 23:30:19 +00:00
Ralph Castain	af7fb68cfb	If we forward envars in rsh, then we have to be very careful about both duplicate entries and disallowed characters on the cmd line. To aid with detecting duplicates, make all cmd line options be given in their mca variant. Check anything we might add for semi-colons and protect those values with quotes. This commit was SVN r25685.	2011-12-29 23:25:25 +00:00
Ralph Castain	2dd2694f25	Fix comm_spawn in oversubscribed conditions. If oversubscription is allowed, let nodes flow into the mapper even if they are oversubscribed, constrained by the slots_max absolute ceiling. Clean up error messages when comm_spawn fails so it correctly and succinctly reports the error. This commit was SVN r25659.	2011-12-15 18:04:48 +00:00
Ralph Castain	912abe8a6c	Catch one more use-case This commit was SVN r25649.	2011-12-14 21:03:19 +00:00
Ralph Castain	f531b09a8d	Correctly handle -host and -hostfile options. Ensure the initial vm launch constrains itself to the union of specified hosts if those options are given. Get oversubscribe set correctly for that case. This commit was SVN r25648.	2011-12-14 20:01:15 +00:00
Ralph Castain	3f1ae5d89b	No longer need this include This commit was SVN r25606.	2011-12-09 00:40:07 +00:00

752 commits