Jeff Squyres
089c632cce
Remove a bunch of dead code: gcc 4.7 warns of set-but-unused
...
variables. So get rid of them.
This commit was SVN r28538.
2013-05-17 21:45:49 +00:00
Ralph Castain
e100b8d165
don't need the return value, but should check for error
...
This commit was SVN r28534.
2013-05-16 15:15:02 +00:00
Jeff Squyres
128cc27417
Minor type fix (they're both enums/ints, so the compiler previously
...
silently cast them).
This commit was SVN r28532.
2013-05-16 00:47:37 +00:00
Ralph Castain
3a372a65b8
Mapping policies must be tested as equalities as they are values, not bitmasks
...
This commit was SVN r28526.
2013-05-15 13:45:00 +00:00
Ralph Castain
29e4b0cc50
Cannot test equality on mapping directives as it is a bitmask
...
This commit was SVN r28525.
2013-05-15 13:41:49 +00:00
Ralph Castain
5296099ecb
Fix the cpus-per-rank when binding to hwthreads. Add cpus-per-rank to diag printout
...
Thanks to Elena for reporting the problem
This commit was SVN r28508.
2013-05-14 20:17:50 +00:00
Ralph Castain
427b6b0b47
Fix the verbosity of yet another framework...sigh.
...
This commit was SVN r28481.
2013-05-13 14:36:32 +00:00
Jeff Squyres
456df1c9f7
Remove redundant opal_output() messages from the module; the called
...
functions will now show_help() their own error messages if something
goes wrong (per r28470).
This commit was SVN r28471.
The following SVN revision numbers were found above:
r28470 --> open-mpi/ompi@2ff95a7739
2013-05-10 15:12:07 +00:00
Jeff Squyres
2ff95a7739
Proper show_help error messages for LAMA.
...
This commit was SVN r28470.
2013-05-10 15:06:25 +00:00
Ralph Castain
707d0e653a
Must use equal and not & comparison for mapping directives
...
This commit was SVN r28451.
2013-05-06 15:07:12 +00:00
Ralph Castain
5d7a93c032
Add the ability to use an external version of libevent. Clearly not recommended at this time. I've verified that it works in limited scenarios, but more thorough testing and performance impacts need to be assessed.
...
Interesting how many includes had to be fixed here and there to fill in missing dependencies :-)
This commit was SVN r28411.
2013-04-29 17:02:37 +00:00
Ralph Castain
252147fba6
Cleanup error message if unknown host is given in -host and -hostfile options
...
This commit was SVN r28262.
2013-03-28 16:52:10 +00:00
Nathan Hjelm
c041156f60
Update ORTE frameworks to use the MCA framework system.
...
This commit was SVN r28240.
2013-03-27 21:14:43 +00:00
Nathan Hjelm
cf377db823
MCA/base: Add new MCA variable system
...
Features:
- Support for an override parameter file (openmpi-mca-param-override.conf).
Variable values in this file can not be overridden by any file or environment
value.
- Support for boolean, unsigned, and unsigned long long variables.
- Support for true/false values.
- Support for enumerations on integer variables.
- Support for MPIT scope, verbosity, and binding.
- Support for command line source.
- Support for setting variable source via the environment using
OMPI_MCA_SOURCE_<var name>=source (either command or file:filename)
- Cleaner API.
- Support for variable groups (equivalent to MPIT categories).
Notes:
- Variables must be created with a backing store (char **, int *, or bool *)
that must live at least as long as the variable.
- Creating a variable with the MCA_BASE_VAR_FLAG_SETTABLE enables the use of
mca_base_var_set_value() to change the value.
- String values are duplicated when the variable is registered. It is up to
the caller to free the original value if necessary. The new value will be
freed by the mca_base_var system and must not be freed by the user.
- Variables with constant scope may not be settable.
- Variable groups (and all associated variables) are deregistered when the
component is closed or the component repository item is freed. This
prevents a segmentation fault from accessing a variable after its component
is unloaded.
- After some discussion we decided we should remove the automatic registration
of component priority variables. Few component actually made use of this
feature.
- The enumerator interface was updated to be general enough to handle
future uses of the interface.
- The code to generate ompi_info output has been moved into the MCA variable
system. See mca_base_var_dump().
opal: update core and components to mca_base_var system
orte: update core and components to mca_base_var system
ompi: update core and components to mca_base_var system
This commit also modifies the rmaps framework. The following variables were
moved from ppr and lama: rmaps_base_pernode, rmaps_base_n_pernode,
rmaps_base_n_persocket. Both lama and ppr create synonyms for these variables.
This commit was SVN r28236.
2013-03-27 21:09:41 +00:00
Ralph Castain
e7ac6c9bde
Don't build rank_file if you can't use it anyway
...
This commit was SVN r28233.
2013-03-27 15:12:40 +00:00
Ralph Castain
256414121e
Protect the cpus-per-rank MCA param registration so that --without-hwloc will build
...
This commit was SVN r28232.
2013-03-27 14:53:30 +00:00
Ralph Castain
317915225c
Finish the binding cleanup by removing the no-longer-used binding level scheme. This proved to be fallible as there is no guarantee that the hierarchy it used matched physical reality of the machine (e.g., is L3 "above" the socket or not). Still have to complete the ppr update, but get the rest of it correct.
...
This commit was SVN r28223.
2013-03-26 20:09:49 +00:00
Ralph Castain
6ee32767d4
Restore the cpus-per-proc option for byslot and bynode mapping. Remove the bind_idx (which recorded the index of the hwloc object where the proc was bound) as this would no longer be unique, and just use the bitmap as the standard reference for location. Update the relative locality computation to take bitmaps as its argument.
...
This commit was SVN r28219.
2013-03-26 18:27:50 +00:00
Ralph Castain
2f43989d22
Add debug and handle the use-case where someone (a) uses a hostfile while in a managed allocation to sub-allocate runs, and (b) includes the HNP's node in one of those hostfiles.
...
cmr:v1.7
This commit was SVN r28203.
2013-03-22 00:53:33 +00:00
Ralph Castain
cf9796accd
Remove the old configure option for disabling full rte support - we now use the OMPI rte framework for such purposes
...
This commit was SVN r28134.
2013-02-28 01:35:55 +00:00
Ralph Castain
8d2fa3693b
First cut at removing the native Windows support. Remove all the Windows-specific components, and the .windows files sprinkled around. Remove the Windows platform files and MTT scripts. Update the NEWS to point Windows users to the cygwin package.
...
This commit was SVN r28116.
2013-02-26 20:44:56 +00:00
Jeff Squyres
8e25b927ab
Clean some minor warnings: remove variables that were set but never
...
used.
This commit was SVN r27974.
2013-01-29 23:35:42 +00:00
Ralph Castain
112f8eedb1
Handle the case where rankfile is providing the allocation
...
This commit was SVN r27971.
2013-01-29 20:37:58 +00:00
Ralph Castain
f6b4db0b79
Fix rank_file operations. We changed the syntax to use semi-colons between multiple slot assignments so that we could use the comma to separate specific cores, but somehow the flex definitions didn't get updated to accept that character. We also incorrectly zero'd the bitmap between slot assignment sections, and so multiple slot assignments only wound up making the last one in the list.
...
This commit was SVN r27908.
2013-01-25 18:33:25 +00:00
Nathan Hjelm
3e1b13b13a
Re-add support for old flex (2.5.4a and earlier) while still cleaning up properly in new flex.
...
This commit was SVN r27657.
2012-12-07 00:12:43 +00:00
Nathan Hjelm
e0f5137e46
add prototypes for lex destroy functions
...
This commit was SVN r27580.
2012-11-09 22:00:27 +00:00
Nathan Hjelm
8658bbc902
instead of relying on yyterminate to clean up the lex context call the destroy functions directly (after closing the file)
...
This commit was SVN r27577.
2012-11-09 16:10:55 +00:00
Ralph Castain
9b729794f2
A prior commit apparently broke the trunk when something was inadvertently left behind - so remove a reference to a no-longer-existing function
...
This commit was SVN r27574.
2012-11-07 11:11:05 +00:00
Nathan Hjelm
7fb5caea92
Remove the finish_parsing function from various .l files. The function is incomplete (doesn't clean up the lex state) and should be replaced by *_yylex_destroy which correctly cleans up the state.
...
Checked with the flex 2.5.35. Verified with valgrind that this fixes several "still reachable" leaks.
cmr:v1.7
This commit was SVN r27571.
2012-11-06 19:26:14 +00:00
Nathan Hjelm
bdedd8b0d3
Per RFC modify the behavior of mca_base_components_close to NOT close the output. Modify frameworks to always close their output and set to -1.
...
Reasoning: The old behavior was a little confusing. mca_base_components_open does not open an output stream so it is a little unexpected that mca_base_components_close does. To add to this several frameworks (that don't use mca_base_components_close) failed to close their output in the framework close function and others closed their output a second time. This change is an improvement to the symantics of mca_base_components_open/close as they are now symetric in their functionality.
This commit was SVN r27570.
2012-11-06 19:09:26 +00:00
Ralph Castain
094d6f3143
Add a new "distributed file system" capability to support file access operations across nodes that do not have a network file system attached to them.
...
Add a set of URI create/parse utilities
This commit was SVN r27483.
2012-10-25 17:15:17 +00:00
Ralph Castain
4028ce7a5d
Silence warnings by making types match
...
This commit was SVN r27446.
2012-10-14 03:45:28 +00:00
Ralph Castain
285a3b168d
Add an ability to specify the max number of simultaneous procs/node for an application when operating in staged mode. Change some debug statements from OPAL_OUTPUT_VERBOSE to opal_output_verbose so they are available in optimized builds.
...
This commit was SVN r27445.
2012-10-14 03:31:32 +00:00
Ralph Castain
54db4c35eb
Get the trunk to build again when --without-hwloc is specified. Move a couple of key type definitions and utilities out from under the HAVE_HWLOC test so they are always available as they don't really depend on hwloc's presence. Tell two compnents not to build if hwloc is disabled:
...
ompi/mca/sbgp/basesmsocket
orte/mca/rmaps/lama
Remove stale configure.params files from the sbgp framework as the OMPI build system no longer looks at those files.
This commit was SVN r27377.
2012-09-26 23:24:27 +00:00
Ralph Castain
d95025f53a
Ensure we clear the usage numbers when binding on multiple nodes so we don't "carry over" info from one node to the next. Use the same tracking mechanism for binding upwards and in-place to avoid doing a bunch of mallocs.
...
Refs trac:3322
This commit was SVN r27356.
The following Trac tickets were found above:
Ticket 3322 --> https://svn.open-mpi.org/trac/ompi/ticket/3322
2012-09-20 15:16:06 +00:00
Ralph Castain
a3060cdd15
Fix the bind_downward code - it was incorrectly looking across the entire node instead of only looking below the locale to which the proc had been assigned. In other words, if the proc was mapped to a core, then the only hwthreads that should be considered for binding are those directly below that core. The binding algo was incorrectly looking at ALL hwthreads in that scenario, causing the proc to be bound to an HT outside of the mapped location.
...
This now results in the procs being bound within their assigned location. It also causes us to use only the 0th HT on a core unless --use-hwthread-cpus has been specified (in which case, we use all the HTs in a core). Bind to core binds you to all HTs regardless - the --use-hwthread-cpus only impacts the oversubscribed determination and when binding to HT.
cmr:v1.7
This commit was SVN r27342.
2012-09-14 22:01:19 +00:00
Ralph Castain
6b5f9d7767
Some cleanups for staged execution
...
This commit was SVN r27317.
2012-09-12 09:15:33 +00:00
Ralph Castain
ca40cb5f1c
Fix comm_spawn by mpirun
...
This commit was SVN r27285.
2012-09-10 17:09:25 +00:00
Ralph Castain
2110fb7f95
Add some debug
...
This commit was SVN r27257.
2012-09-07 04:06:37 +00:00
Ralph Castain
e9e52fc78f
Gain some efficiency in the staged mapper - if soft locations are in use and get_nodes returns busy, then no need to continue cycling thru the remaining apps as all nodes are occupied
...
This commit was SVN r27253.
2012-09-06 22:01:18 +00:00
Ralph Castain
efa50346c8
Error out if we are filtering a hostfile and encounter a node that is not in the resource-managed allocation, giving an error message identifying the file and the node. Don't filter managed allocations thru a default hostfile as this can lead to "hidden" errors.
...
Don't use dash-host info on managed allocations if we using soft locations
This commit was SVN r27245.
2012-09-05 19:42:00 +00:00
Ralph Castain
d772e0fc3d
Add an option to treat dash-host specifications as "requested, but not required". So-called "soft" location requests can allow an application to execute even if the ideal allocation isn't available.
...
This commit was SVN r27242.
2012-09-05 18:42:09 +00:00
Ralph Castain
fde83a44ab
This confusion has been around for awhile, caused by a long-ago decision to track slots allocated to a specific job as opposed to allocated to the overall mpirun instance. We eliminated that quite a while ago, but never consolidated the "slots_alloc" and "slots" fields in orte_node_t. As a result, confusion has grown in the code base as to which field to look at and/or update.
...
So (finally) consolidate these two fields into one "slots" field. Add a field in orte_job_t to indicate when all the procs for a job will be launched together, so that staged operations can know when MPI operations are allowed.
This commit was SVN r27239.
2012-09-05 01:30:39 +00:00
Ralph Castain
11de735e8a
Complete the revamp of hostfile support in non-managed environments. Working at the app level, ensure that we utilize only those nodes specified for that app, but fall back to the default hostfile (if available) for those with no specification, further falling back to the local host if the default hostfile is not present or is empty.
...
This commit was SVN r27230.
2012-09-04 16:34:05 +00:00
Jeff Squyres
b23a6b8eda
Shiqing removed this file in r27217 (but neglected to remove it from
...
the Makefile.am).
This commit was SVN r27226.
The following SVN revision numbers were found above:
r27217 --> open-mpi/ompi@ddbd542732
2012-09-04 13:06:39 +00:00
Ralph Castain
3894179e2f
Add missing file
...
This commit was SVN r27222.
2012-09-04 01:16:58 +00:00
Shiqing Fan
ddbd542732
Remove one .windows file.
...
Add a macro definition for isblank function.
This commit was SVN r27217.
2012-09-03 09:51:44 +00:00
Ralph Castain
66c3f5d18d
When getting target nodes for mapping, there is a difference between not finding any nodes that match the required constraints (either in hostfile or dash-host filtering) and finding at least one such node, but all its slots are busy. Make the return code reflect this difference so the caller can take appropriate action.
...
This commit was SVN r27213.
2012-09-01 10:30:40 +00:00
Ralph Castain
95019cc310
Fix a few places where we weren't completely identifying hostfile-based operations against "localhost" entries. Tell the mapper base to be silent when we don't want errors announced because nodes aren't available for mapping (something it is okay if they are fully used). Fix an infinite loop in the file prepositioning code.
...
This commit was SVN r27210.
2012-08-31 21:28:49 +00:00
Jeff Squyres
da00d281e6
Oops -- we want the priority to be low, not high (for now). :-)
...
This commit was SVN r27208.
2012-08-31 21:08:35 +00:00