Ralph Castain
43f883cb42
Add some more detailed error output to the db_hash component and nidmap code. Ensure the local nodename is included in the HNP's aliases
...
This commit was SVN r27622.
2012-11-18 17:57:19 +00:00
Ralph Castain
f2ec35536e
Fix a bug that prevented MCA params from being forwarded to daemons upon launch
...
cmr:v1.7
This commit was SVN r27621.
2012-11-18 17:55:26 +00:00
Ralph Castain
e11f32038a
Add an MCA param to retain all aliases based on IP addrs for node names so that procs can look them up by interface, if desired. If the param is set, pass aliases around to all daemons and procs for local use
...
This commit was SVN r27619.
2012-11-16 04:04:29 +00:00
Ralph Castain
da6428a822
Add an MCA param to help debug the ORTE progress thread
...
This commit was SVN r27614.
2012-11-15 15:54:38 +00:00
Ralph Castain
5241925b62
Add the dfs to ompi_info
...
This commit was SVN r27613.
2012-11-15 15:54:07 +00:00
Ralph Castain
3cecc1569b
Fix segfault if no file_maps were pushed
...
This commit was SVN r27612.
2012-11-15 15:39:17 +00:00
Ralph Castain
fe6dfad625
Update DFS to support multi-node operations
...
This commit was SVN r27594.
2012-11-12 02:54:53 +00:00
Ralph Castain
fefec03e78
Enable all ORTE tools to use progress threads if they are enabled
...
This commit was SVN r27593.
2012-11-12 02:54:09 +00:00
Ralph Castain
a6325e4546
Silence compiler warning
...
This commit was SVN r27590.
2012-11-12 02:51:29 +00:00
Ralph Castain
26f1cd0909
Fix compiler warnings
...
This commit was SVN r27588.
2012-11-12 02:50:45 +00:00
Ralph Castain
bd887f7f56
Add a new "test" component to the DFS that treats all files as remote in order to test the app-to-daemon interactions on a single machine. Set a global param to indicate we are using staged execution. Add a param to indicate it is okay for non-MPI processes to execute without finalizing. Clean up file map load and fetch operations.
...
This commit was SVN r27587.
2012-11-10 14:09:12 +00:00
Ralph Castain
81d0b06842
Strip the domain info from the hostname if that option is specified, protecting IP address-based names
...
This commit was SVN r27586.
2012-11-10 14:05:27 +00:00
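The hostname handling described in this entry can be sketched as follows. This is a minimal illustration, not the ORTE code: `strip_domain`, `looks_like_ip`, and the `keep_fqdn` argument are hypothetical names, and the IP check is assumed to be done with the POSIX `inet_pton` call.

```c
#include <arpa/inet.h>
#include <netinet/in.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical helper: true if name parses as an IPv4 or IPv6 literal. */
static bool looks_like_ip(const char *name)
{
    unsigned char buf[sizeof(struct in6_addr)];
    return inet_pton(AF_INET, name, buf) == 1 ||
           inet_pton(AF_INET6, name, buf) == 1;
}

/* Hypothetical helper: truncate hostname at the first '.' in place,
 * unless the caller wants the FQDN kept or the name is an IP address
 * (stripping "192.168.1.10" at the first dot would leave "192"). */
void strip_domain(char *hostname, bool keep_fqdn)
{
    if (keep_fqdn || looks_like_ip(hostname)) {
        return;
    }
    char *dot = strchr(hostname, '.');
    if (dot != NULL) {
        *dot = '\0';
    }
}
```

With this sketch, "node01.example.com" becomes "node01", while "192.168.1.10" is left untouched.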
Ralph Castain
615cc66b44
Protect the HNP cleanup in cases where no session dirs are created
...
This commit was SVN r27585.
2012-11-10 14:03:07 +00:00
Ralph Castain
fd632147df
Per patch from Nathan, with a few fixes, clean up the orte-info tool
...
This commit was SVN r27581.
2012-11-10 04:11:40 +00:00
Nathan Hjelm
e0f5137e46
add prototypes for lex destroy functions
...
This commit was SVN r27580.
2012-11-09 22:00:27 +00:00
Nathan Hjelm
a754674fd7
Per the specification for putenv ( http://pubs.opengroup.org/onlinepubs/009604599/functions/putenv.html ) the string given to putenv becomes part of the environment. The string must not be changed or freed.
...
cmr:v1.7
This commit was SVN r27578.
2012-11-09 16:33:14 +00:00
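The putenv rule cited in this entry can be illustrated with a short sketch. The helper name `set_env_with_putenv` is hypothetical and is not taken from the commit; the point is only that the buffer handed to `putenv` must stay allocated and unmodified afterwards.

```c
#ifndef _XOPEN_SOURCE
#define _XOPEN_SOURCE 700   /* expose putenv() under strict -std= modes */
#endif
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: add NAME=VALUE to the environment via putenv.
 * Returns 0 on success, -1 on failure. */
int set_env_with_putenv(const char *name, const char *value)
{
    size_t len = strlen(name) + strlen(value) + 2;  /* '=' and '\0' */
    char *entry = malloc(len);
    if (entry == NULL) {
        return -1;
    }
    snprintf(entry, len, "%s=%s", name, value);

    if (putenv(entry) != 0) {
        free(entry);   /* putenv failed, so the string was not adopted */
        return -1;
    }
    /* Do NOT free(entry) here: the string is now part of the
     * environment and must not be changed or freed. */
    return 0;
}
```

When ownership of the buffer is inconvenient, `setenv` is the usual alternative, since it copies its arguments.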
Nathan Hjelm
8658bbc902
instead of relying on yyterminate to clean up the lex context call the destroy functions directly (after closing the file)
...
This commit was SVN r27577.
2012-11-09 16:10:55 +00:00
Nathan Hjelm
842caae4c7
Fix a small leak in orte/util/name_fns.c
...
cmr:v1.7
This commit was SVN r27576.
2012-11-07 23:59:49 +00:00
Ralph Castain
9b729794f2
A prior commit apparently broke the trunk when something was inadvertently left behind - so remove a reference to a no-longer-existing function
...
This commit was SVN r27574.
2012-11-07 11:11:05 +00:00
Nathan Hjelm
7fb5caea92
Remove the finish_parsing function from various .l files. The function is incomplete (doesn't clean up the lex state) and should be replaced by *_yylex_destroy which correctly cleans up the state.
...
Checked with flex 2.5.35. Verified with valgrind that this fixes several "still reachable" leaks.
cmr:v1.7
This commit was SVN r27571.
2012-11-06 19:26:14 +00:00
Nathan Hjelm
bdedd8b0d3
Per RFC modify the behavior of mca_base_components_close to NOT close the output. Modify frameworks to always close their output and set to -1.
...
Reasoning: The old behavior was a little confusing. mca_base_components_open does not open an output stream, so it is unexpected that mca_base_components_close closes one. In addition, several frameworks (that don't use mca_base_components_close) failed to close their output in the framework close function, and others closed their output a second time. This change improves the semantics of mca_base_components_open/close, as they are now symmetric in their functionality.
This commit was SVN r27570.
2012-11-06 19:09:26 +00:00
Ralph Castain
27b41a7db4
If the nodename is an IP address, we need to retain the full name (even if keep_fqdn is false) so that the ssh tree spawn can proceed.
...
cmr:v1.7
This commit was SVN r27561.
2012-11-05 16:59:53 +00:00
Ralph Castain
3812979315
Remove stale file - we removed the size functions a while back
...
This commit was SVN r27551.
2012-11-01 03:36:51 +00:00
Brian Barrett
e61c00212d
Add files found in svn but not tarball
...
This commit was SVN r27549.
2012-11-01 02:27:03 +00:00
Brian Barrett
dd907e8d4c
Remove unused macro
...
This commit was SVN r27547.
2012-11-01 02:04:57 +00:00
Nathan Hjelm
2acd0f83de
Revert "Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter".
...
It appears the problem was not with the command line parser but with the rsh plm. I don't know why this problem was not occurring before the command line parser changes, but it appears to be resolved now.
This commit was SVN r27527.
The following SVN revision numbers were found above:
r27451 --> open-mpi/ompi@d59034e6ef
r27456 --> open-mpi/ompi@ecdbf34937
2012-10-30 19:45:18 +00:00
Nathan Hjelm
df9bd0ed59
fix bug in plm/rsh that could add extraneous mca options to the orted argv
...
cmr:v1.7
This commit was SVN r27526.
2012-10-30 19:40:04 +00:00
Ralph Castain
7c3a2c3c92
Allow PMI support to find subdirs named lib64 instead of lib - thanks to Guillaume.Papaure of Bull for the patch
...
This commit was SVN r27519.
2012-10-30 17:38:57 +00:00
Ralph Castain
a080de188f
Enable orterun to directly support staged execution, treating each app as a separate job. Support transfer of file maps when support exists.
...
This commit was SVN r27516.
2012-10-29 23:11:30 +00:00
Ralph Castain
e5e72c3137
Expand the dfs API to support retrieval, loading and purging of file maps.
...
This commit was SVN r27515.
2012-10-29 23:05:45 +00:00
Ralph Castain
4e52a15e70
Provide for sync on seek and close DFS operations. Eliminate an unnecessary wake-up timer when using ORTE progress thread
...
This commit was SVN r27500.
2012-10-26 15:49:04 +00:00
Ralph Castain
4ef30c016b
Remove stale windows references
...
This commit was SVN r27491.
2012-10-26 01:19:14 +00:00
Ralph Castain
35e5e5b512
Set the orte_event_base to the opal_event_base in ompi_info - we aren't doing anything with progress threads anyway
...
This commit was SVN r27488.
2012-10-25 22:36:08 +00:00
Ralph Castain
df642f1508
Add an API to get a remote file's size. Separate dfs cmds from returned data messages so daemons don't get confused.
...
This commit was SVN r27487.
2012-10-25 22:23:08 +00:00
Ralph Castain
79e36413c2
There was some discussion of this at an earlier time, but we never got around to doing it - so make orte behave more like a regular library, counting the number of times init is called, and executing finalize when all those are exhausted.
...
This commit was SVN r27484.
2012-10-25 18:39:37 +00:00
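The counted init/finalize behavior described in this entry can be sketched as below. This is a minimal illustration under stated assumptions: `lib_init`, `lib_finalize`, and `lib_is_initialized` are hypothetical names rather than the ORTE functions, and the sketch is not thread-safe.

```c
#include <stdbool.h>

/* Hypothetical library state: number of outstanding init calls and
 * whether the one-time setup is currently in effect. */
static int init_count = 0;
static bool initialized = false;

/* The first call performs the real setup; later calls only bump the count. */
int lib_init(void)
{
    if (init_count++ > 0) {
        return 0;
    }
    initialized = true;   /* stands in for the one-time setup work */
    return 0;
}

/* The real teardown runs only when the last outstanding init is released. */
int lib_finalize(void)
{
    if (init_count == 0) {
        return -1;        /* finalize without a matching init */
    }
    if (--init_count > 0) {
        return 0;
    }
    initialized = false;  /* stands in for the actual teardown work */
    return 0;
}

bool lib_is_initialized(void)
{
    return initialized;
}
```

Two callers that each call `lib_init` can each call `lib_finalize` without tearing the library down underneath the other.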
Ralph Castain
094d6f3143
Add a new "distributed file system" capability to support file access operations across nodes that do not have a network file system attached to them.
...
Add a set of URI create/parse utilities
This commit was SVN r27483.
2012-10-25 17:15:17 +00:00
Ralph Castain
32c185f730
Set a priority for output of forwarded IO so it can effectively compete against inbound messages
...
This commit was SVN r27480.
2012-10-24 23:34:50 +00:00
Ralph Castain
e06c330635
Add the ability to set a backlog limit on forwarded output waiting at mpirun - helps to avoid crashing systems during debug. Note that we default to "unlimited" to maintain current behavior.
...
This commit was SVN r27479.
2012-10-24 23:21:40 +00:00
Ralph Castain
e6014bf2e1
Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter
...
This commit was SVN r27477.
The following SVN revision numbers were found above:
r27451 --> open-mpi/ompi@d59034e6ef
r27456 --> open-mpi/ompi@ecdbf34937
2012-10-24 18:38:44 +00:00
Ralph Castain
7574d6673b
If someone provides the launch_agent cmd, then don't prefix it
...
cmr:v1.7
This commit was SVN r27473.
2012-10-24 16:14:04 +00:00
Ralph Castain
5c0534a7ad
Ensure that comm_spawn launches procs on the nodes specified by add-host and add-hostfile
...
This commit was SVN r27452.
2012-10-18 00:40:44 +00:00
Nathan Hjelm
d59034e6ef
MCA: remove deprecated mca_base_param functions (mca_base_param_register_int, mca_base_param_register_string, mca_base_param_environ_variable). Remove all uses of deprecated functions.
...
cmr:v1.7
This commit was SVN r27451.
2012-10-17 20:17:37 +00:00
Ralph Castain
4028ce7a5d
Silence warnings by making types match
...
This commit was SVN r27446.
2012-10-14 03:45:28 +00:00
Ralph Castain
285a3b168d
Add an ability to specify the max number of simultaneous procs/node for an application when operating in staged mode. Change some debug statements from OPAL_OUTPUT_VERBOSE to opal_output_verbose so they are available in optimized builds.
...
This commit was SVN r27445.
2012-10-14 03:31:32 +00:00
Ralph Castain
04304c186f
Remove the setup_hadoop configure script as it is no longer required - the hadoop support components can build without accessing hadoop itself.
...
This commit was SVN r27385.
2012-09-29 18:30:35 +00:00
Ralph Castain
9daaa001d9
Remove tools that are no longer required
...
This commit was SVN r27383.
2012-09-29 17:33:16 +00:00
Ralph Castain
54db4c35eb
Get the trunk to build again when --without-hwloc is specified. Move a couple of key type definitions and utilities out from under the HAVE_HWLOC test so they are always available, as they don't really depend on hwloc's presence. Tell two components not to build if hwloc is disabled:
...
ompi/mca/sbgp/basesmsocket
orte/mca/rmaps/lama
Remove stale configure.params files from the sbgp framework as the OMPI build system no longer looks at those files.
This commit was SVN r27377.
2012-09-26 23:24:27 +00:00
Samuel Gutierrez
42280e2af5
Temporarily make routed binomial the default. We are experiencing issues with debruijn when launching fewer processes than are actually available within an allocation. When this is fixed, please revert this change.
...
This commit was SVN r27376.
2012-09-26 16:08:12 +00:00
Jeff Squyres
cb65a44c6c
Fix the component priority assignment. Thanks to Alex Margolin for the patch.
...
This commit was SVN r27363.
2012-09-25 07:13:23 +00:00
George Bosilca
6ec41400b3
Fix the error message in case a daemon does not succeed at killing the local offspring.
...
This commit was SVN r27362.
2012-09-24 15:25:21 +00:00