openmpi

Автор	SHA1	Сообщение	Дата
Nathan Hjelm	ebbb32120a	MCA/base: variable system updates - Use an enumerator to handle bool values. - Fix a leak in the variable enumerator. - Fix a leak in an orte parameter. This commit was SVN r28949.	2013-07-25 15:42:01 +00:00
Ralph Castain	6c1a140e99	Per request from Nathan, add a "commit" API to the opal db framework. This allows him to aggregate keys to work around the Cray's severe PMI limitations This commit was SVN r28917.	2013-07-22 22:57:16 +00:00
Ralph Castain	5d12ab3873	Ensure we always set num_local_peers for both PMI2 and PMI1 This commit was SVN r28860.	2013-07-19 04:34:58 +00:00
Ralph Castain	b033a6b6d6	One last Cray-inspired fix... Refs trac:3685 This commit was SVN r28857. The following Trac tickets were found above: Ticket 3685 --> https://svn.open-mpi.org/trac/ompi/ticket/3685	2013-07-19 03:04:00 +00:00
Ralph Castain	92cb93b21e	Remove set-but-unused variable Refs trac:3685 This commit was SVN r28855. The following Trac tickets were found above: Ticket 3685 --> https://svn.open-mpi.org/trac/ompi/ticket/3685	2013-07-19 01:42:35 +00:00
Ralph Castain	bc2586cf3c	Refs trac:3685. Check error code returned by PMI2_Info_GetJobAttr. This commit was SVN r28854. The following Trac tickets were found above: Ticket 3685 --> https://svn.open-mpi.org/trac/ompi/ticket/3685	2013-07-19 01:24:51 +00:00
Ralph Castain	a10546d5c1	Cleanup and rename of platform files This commit was SVN r28853.	2013-07-19 01:18:41 +00:00
Ralph Castain	e4e678e234	Per the RFC and discussion on the devel list, update the RTE-MPI error handling interface. There are a few differences in the code from the original RFC that came out of the discussion - I've captured those in the following writeup George and I were talking about ORTE's error handling the other day in regards to the right way to deal with errors in the updated OOB. Specifically, it seemed a bad idea for a library such as ORTE to be aborting the job on its own prerogative. If we lose a connection or cannot send a message, then we really should just report it upwards and let the application and/or upper layers decide what to do about it. The current code base only allows a single error callback to exist, which seemed unduly limiting. So, based on the conversation, I've modified the errmgr interface to provide a mechanism for registering any number of error handlers (this replaces the current "set_fault_callback" API). When an error occurs, these handlers will be called in order until one responds that the error has been "resolved" - i.e., no further action is required - by returning OMPI_SUCCESS. The default MPI layer error handler is specified to go "last" and calls mpi_abort, so the current "abort" behavior is preserved unless other error handlers are registered. In the register_callback function, I provide an "order" param so you can specify "this callback must come first" or "this callback must come last". Seemed to me that we will probably have different code areas registering callbacks, and one might require it go first (the default "abort" will always require it go last). So you can append and prepend, or go first. Note that only one registration can declare itself "first" or "last", and since the default "abort" callback automatically takes "last", that one isn't available. :-) The errhandler callback function passes an opal_pointer_array of structs, each of which contains the name of the proc involved (which can be yourself for internal errors) and the error code. This is a change from the current fault callback which returned an opal_pointer_array of just process names. Rationale is that you might need to see the cause of the error to decide what action to take. I realize that isn't a requirement for remote procs, but remember that we will use the SAME interface to report RTE errors internal to the proc itself. In those cases, you really do need to see the error code. It is legal to pass a NULL for the pointer array (e.g., when reporting an internal failure without error code), so handlers must be prepared for that possibility. If people find that too burdensome, we can remove it. Should we ever decide to create a separate callback path for internal errors vs remote process failures, or if we decide to do something different based on experience, then we can adjust this API. This commit was SVN r28852.	2013-07-19 01:08:53 +00:00
Ralph Castain	6c50c8167c	Fix pmi-1 compile when no pmi2 is present This commit was SVN r28849.	2013-07-18 22:45:08 +00:00
Ralph Castain	256034a3dc	Sigh - fix a couple of spots I missed Refs trac:3683 This commit was SVN r28843. The following Trac tickets were found above: Ticket 3683 --> https://svn.open-mpi.org/trac/ompi/ticket/3683	2013-07-18 19:07:16 +00:00
Ralph Castain	fc3b777ef5	Cleanup a variable that isn't used if pmi2 support is available Refs trac:3683 This commit was SVN r28841. The following Trac tickets were found above: Ticket 3683 --> https://svn.open-mpi.org/trac/ompi/ticket/3683	2013-07-18 17:19:13 +00:00
Ralph Castain	92c6b806b9	Based on a patch submitted by Piotr Lesnicki of Bull, cleanup the PMI2 support. This has not been tested yet on multiple environments (e.g., Cray), so it needs more evaluation prior to moving to the 1.7 branch. cmr:v1.7.3:reviewer=rhc This commit was SVN r28837.	2013-07-18 14:46:07 +00:00
Ralph Castain	10ca1c1b04	Turns out that there was exactly ONE place in all of the OMPI code base that still referred to OPAL_TRACE, though a few places retained the include file for no reason. So no point in letting this sit as it is clearly an unused "feature". This commit was SVN r28789.	2013-07-14 18:57:20 +00:00
Ralph Castain	563bf60fb8	Fix an ordering problem that crept in due to the change in MCA param system. One MCA param would set a value, and then we did a hard reset of that value before testing another MCA param, thus removing a critical value for proper operation of the first param. cmr:v1.7.3:reviewer=brbarret This commit was SVN r28788.	2013-07-14 16:22:13 +00:00
Ralph Castain	7a21661785	Silence a warning when --without-hwloc is used This commit was SVN r28783.	2013-07-13 17:17:17 +00:00
Dave Goodell	3741d62308	fix --without-hwloc build failure All builds since r28682 configured with '--without-hwloc' fail at "make" time without this fix. Reviewed by rhc@ This commit was SVN r28769. The following SVN revision numbers were found above: r28682 --> open-mpi/ompi@446e33a5d8	2013-07-12 17:21:14 +00:00
Jeff Squyres	baa3182794	Per RFC (http://www.open-mpi.org/community/lists/devel/2013/07/12534.php), remove a bunch of dead code. This commit was SVN r28756.	2013-07-11 17:34:28 +00:00
Ralph Castain	028f5ee7a6	Cleanup some bitrot from moving the db framework to opal and from the new mca param system This commit was SVN r28741.	2013-07-09 14:37:08 +00:00
Ralph Castain	62378209f0	Even if we don't find the default hostfile, and nothing else was provided, then use all the known nodes. cmr:v1.7.3:#3653:reviewer=jsquyres cmr:v1.6.6:#3654:reviewer=jsquyres This commit was SVN r28718.	2013-07-03 22:31:32 +00:00
Ralph Castain	443a6802b9	If the default hostfile is empty, we need to pickup all the known nodes, not just the head node. cmr:v1.7.3:reviewer=jsquyres cmr:v1.6.6:reviewer=jsquyres This commit was SVN r28717.	2013-07-03 22:25:51 +00:00
Ralph Castain	446e33a5d8	There are cases where we want to use the novm state machine, but the backend node topology differs from that where mpirun is executing. In those cases, we can wind up thinking we are oversubscribed because the head node has fewer cores than the compute nodes. To resolve this situation, add the ability to specify a backend topology file that mpirun shall use for its mapping operations. Create a new "set_topology" function in opal hwloc to support it. This commit was SVN r28682.	2013-06-27 03:04:50 +00:00
Joshua Ladd	0b5c1f2ea8	Add 'generic' support for PMI2 (previously, we checked for PMI2 only on Cray systems.) If your resource manager (e.g. SLURM) has support for PMI2, then the --with-pmi configure flag will enable its usage. If you don't have PMI2, then you will fallback to regular old PMI1. This patch was submitted by Ralph Castain and reviewed and pushed by Josh Ladd. This should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28666.	2013-06-21 15:28:14 +00:00
Nathan Hjelm	299d5b3dd7	Fix two debugger attach bugs. - orte_debugger_init_after_spawn was not being called for debuggers that use the MPIR_attach_fifo to co-locate debugger daemons. - MPIR_Breakpoint was not getting called if a debugger reattached. Add a job state (ORTE_JOB_STATE_DEBUGGER_DETACH) to reset mpir_breakpoint_fired to false when a debugger detaches to ensure MPIR_Breakpoint is called if another debugger attaches. Tested with STAT 2.0/launchmon 1.0. cmr:v1.7 This commit was SVN r28665.	2013-06-20 16:18:05 +00:00
Jeff Squyres	b9ca8e3cd1	Tweaked the help message a bit (this is the end result of iterating on the message in email between Mike, Ralph, Jeff). Add this to CMR #3642 and #3643. This commit was SVN r28662.	2013-06-20 13:19:23 +00:00
Ralph Castain	13665bffe8	Per an off-list discussion, it appears possible for a system to report failure when executing getpwuid. There are several reasons for this error to occur, most notably if the system uses a network-based authentication protocol (e.g., NIS) and that sytem gets overwhelmed when we launch on a lot of nodes. There is no good way to recover from this scenario, and from past experience, using the user's name in the session directory (as opposed to the uid) is very helpful when things go wrong. So print a help message when this happens (it is extremely rare, but has happened at least once now) and return an error. cmr:v1.7.3,reviewer=jsquyres cmr:v1.6.5,reviewer=jsquyres This commit was SVN r28658.	2013-06-20 04:30:42 +00:00
Ralph Castain	a51a0a8c48	Fix uninitialized var This commit was SVN r28652.	2013-06-18 22:41:47 +00:00
George Bosilca	b4ebc417a1	Correctly register the component MCA parameters. Few cleanups in the includes. This commit was SVN r28645.	2013-06-15 16:05:09 +00:00
Nathan Hjelm	518d1fe200	Fix two typos that prevented alps direct launch from working This commit was SVN r28628.	2013-06-13 17:04:08 +00:00
Joshua Ladd	61ffb47573	Minor fix for the min-dist mapping algorithm: we need to call 'get_nbobjs_by_type' first, before we get the sorted list of nodes - we need to add node objects and fill them in the summary object for the current topology. This patch was submitted by Elena Elkina and pushed by Josh Ladd. This should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28578.	2013-05-31 15:19:59 +00:00
Jeff Squyres	6d173af329	This commit introduces a new "mindist" ORTE RMAPS mapper, as well as some relevant updates/new functionality in the opal/mca/hwloc and orte/mca/rmaps bases. This work was mainly developed by Mellanox, with a bunch of advice from Ralph Castain, and some minor advice from Brice Goglin and Jeff Squyres. Even though this is mainly Mellanox's work, Jeff is committing only for logistical reasons (he holds the hg+svn combo tree, and can therefore commit it directly back to SVN). ----- Implemented distance-based mapping algorithm as a new "mindist" component in the rmaps framework. It allows mapping processes by NUMA due to PCI locality information as reported by the BIOS - from the closest to device to furthest. To use this algorithm, specify: {{{mpirun --map-by dist:<device_name>}}} where <device_name> can be mlx5_0, ib0, etc. There are two modes provided: 1. bynode: load-balancing across nodes 1. byslot: go through slots sequentially (i.e., the first nodes are more loaded) These options are regulated by the optional ''span'' modifier; the command line parameter looks like: {{{mpirun --map-by dist:<device_name>,span}}} So, for example, if there are 2 nodes, each with 8 cores, and we'd like to run 10 processes, the mindist algorithm will place 8 processes to the first node and 2 to the second by default. But if you want to place 5 processes to each node, you can add a span modifier in your command line to do that. If there are two NUMA nodes on the node, each with 4 cores, and we run 6 processes, the mindist algorithm will try to find the NUMA closest to the specified device, and if successful, it will place 4 processes on that NUMA but leaving the remaining two to the next NUMA node. You can also specify the number of cpus per MPI process. This option is handled so that we map as many processes to the closest NUMA as we can (number of available processors at the NUMA divided by number of cpus per rank) and then go on with the next closest NUMA. The default binding option for this mapping is bind-to-numa. It works if you don't specify any binding policy. But if you specified binding level that was "lower" than NUMA (i.e hwthread, core, socket) it would bind to whatever level you specify. This commit was SVN r28552.	2013-05-22 13:04:40 +00:00
Jeff Squyres	089c632cce	Remove a bunch of dead code: gcc 4.7 warns of set-but-unused variables. So get rid of them. This commit was SVN r28538.	2013-05-17 21:45:49 +00:00
Ralph Castain	e100b8d165	don't need the return value, but should check for error This commit was SVN r28534.	2013-05-16 15:15:02 +00:00
Jeff Squyres	128cc27417	Minor type fix (they're both enums/ints, so the compiler previously silently cast them). This commit was SVN r28532.	2013-05-16 00:47:37 +00:00
Ralph Castain	93ba4247f8	remove extra paren when --without-hwloc This commit was SVN r28530.	2013-05-15 21:31:45 +00:00
Ralph Castain	3a372a65b8	Mapping policies must be tested as equalities as they are values, not bitmasks This commit was SVN r28526.	2013-05-15 13:45:00 +00:00
Ralph Castain	29e4b0cc50	Cannot test equality on mapping directives as it is a bitmask This commit was SVN r28525.	2013-05-15 13:41:49 +00:00
Ralph Castain	04b11accd3	Silience a few warnings This commit was SVN r28515.	2013-05-14 21:58:40 +00:00
Ralph Castain	5296099ecb	Fix the cpus-per-rank when binding to hwthreads. Add cpus-per-rank to diag printout Thanks to Elena for reporting the problem This commit was SVN r28508.	2013-05-14 20:17:50 +00:00
Ralph Castain	427b6b0b47	Fix the verbosity of yet another framework...sigh. This commit was SVN r28481.	2013-05-13 14:36:32 +00:00
Jeff Squyres	456df1c9f7	Remove redundant opal_output() messages from the module; the called functions will now show_help() their own error messages if something goes wrong (per r28470). This commit was SVN r28471. The following SVN revision numbers were found above: r28470 --> open-mpi/ompi@2ff95a7739	2013-05-10 15:12:07 +00:00
Jeff Squyres	2ff95a7739	Proper show_help error messages for LAMA. This commit was SVN r28470.	2013-05-10 15:06:25 +00:00
Ralph Castain	f15fe5045e	Ensure that debugger connect can occur by getting the rml contact info updated before calling init_after_spawn cmr:v1.7.3,reviewer=jsquyres This commit was SVN r28455.	2013-05-06 22:00:45 +00:00
Ralph Castain	c52b94af8b	Revert r28453 and r28452 - wrong fix This commit was SVN r28454. The following SVN revision numbers were found above: r28452 --> open-mpi/ompi@756ee4b5e0 r28453 --> open-mpi/ompi@6da24143a2	2013-05-06 21:52:17 +00:00
Ralph Castain	6da24143a2	Minor performance improvement This commit was SVN r28453.	2013-05-06 20:27:16 +00:00
Ralph Castain	756ee4b5e0	Update the rml_uri for each proc so debuggers can attach This commit was SVN r28452.	2013-05-06 20:18:14 +00:00
Ralph Castain	707d0e653a	Must use equal and not & comparison for mapping directives This commit was SVN r28451.	2013-05-06 15:07:12 +00:00
Ralph Castain	a0a6412545	Do a little cleanup on abnormal termination procedure - don't keep submitting forced exit events (one will do), no need to reset the abnormal termination pipe event in orterun, etc. This commit was SVN r28450.	2013-05-05 17:39:45 +00:00
Ralph Castain	fb2a694587	Fix print This commit was SVN r28446.	2013-05-04 22:37:34 +00:00
Ralph Castain	27e3e382d5	No need for ORTE tools to use orte progress thread This commit was SVN r28445.	2013-05-04 21:13:20 +00:00
Jeff Squyres	42a9a4c62c	After examining a '''lot''' of MTT output with Ralph, fix the cause of many, many MTT timeouts when running jobs under SLURM: send the right command at the end to cause remote orteds to shut down. This commit was SVN r28438.	2013-05-02 00:23:53 +00:00

1 2 3 4 5 ...

3955 Коммитов