Jeff Squyres
798202c424
Allow the mca_component_path to change over time.
...
This commit was SVN r22957.
2010-04-12 22:02:34 +00:00
Jeff Squyres
f77257d931
These don't belong in this file.
...
This commit was SVN r22956.
2010-04-12 20:50:23 +00:00
Rainer Keller
a48a11821b
- mca_base_param_reg_string_name allocates default_pml.
...
As it is strdup, just free(default_pml).
cmr:v1.5
This commit was SVN r22955.
2010-04-12 19:54:07 +00:00
Jeff Squyres
1919ba225d
Allow static_components to be NULL for cases where we ''know'' there
...
will be no static components to be searched.
This commit was SVN r22954.
2010-04-12 14:51:47 +00:00
Pavel Shamis
fc077a2102
Fix a minor bug in the error flow of check_if_device_support_modify_srq
...
Signed-off-by: Ishai Rabinovitz <ishai@mellanox.co.il>
This commit was SVN r22953.
2010-04-12 11:28:44 +00:00
Shiqing Fan
96b20a29b5
An easy solution to make singleton work on Windows.
...
This commit was SVN r22952.
2010-04-10 16:30:59 +00:00
Ralph Castain
d3ed4e68b7
Utilize a non-used mapping policy bit to define a policy that uses only existing alive daemons to support virtual machines and restarting processes on already-active nodes
...
This commit was SVN r22951.
2010-04-10 05:02:47 +00:00
Ralph Castain
4f8279df3d
Enable substitution of the communication calls in the orted when sending messages back to the HNP by creating a function for this purpose and saving the pointer to it in orte_odls_base. Higher level libraries can then override the default function to use their own method.
...
This commit was SVN r22950.
2010-04-09 18:50:10 +00:00
Ralph Castain
c32f046d7c
Tiny cleanup - when the user kills us with a ctrl-c, there really isn't a need to tell him "your procs died and we don't know why". Just shaddup and die.
...
This commit was SVN r22949.
2010-04-09 18:47:35 +00:00
Terry Dontje
282a537cf7
This commit fixes 2370, by having the solaris paffinity module return error codes for get_physical_processor_id and having odls_default_fork_local_proc check get_physical_processor_id for OPAL_ERROR
...
This commit was SVN r22948.
2010-04-09 15:10:46 +00:00
Brad Benton
101b896f2e
IBM has approved the release of the LoadLeveler sample code under the
...
BSD license. Consequently, a more restrictive licensing clause that was
originally associated with the LoadLeveler sample code documentation and
replicated in a comment block in this file has been removed.
This commit was SVN r22947.
2010-04-08 19:41:44 +00:00
Terry Dontje
929c58e38d
This commit fixes trac:2073
...
This commit was SVN r22946.
The following Trac tickets were found above:
Ticket 2073 --> https://svn.open-mpi.org/trac/ompi/ticket/2073
2010-04-08 18:17:44 +00:00
Ralph Castain
75e99e6118
Do a better job of selecting cm ess component, handle tool and daemon issues
...
This commit was SVN r22942.
2010-04-07 18:59:21 +00:00
Ralph Castain
f1fc344336
Add some diagnostics
...
This commit was SVN r22941.
2010-04-07 18:58:17 +00:00
Ralph Castain
bdb62a3e4e
Update cisco platform files
...
This commit was SVN r22940.
2010-04-07 18:57:54 +00:00
Rolf vandeVaart
0adb570693
Add pml_ob1_verbose flag. Fix the current location it is being used
...
This commit was SVN r22939.
2010-04-07 13:51:42 +00:00
Ralph Castain
8e29a6858a
Properly handle the case when a daemon is given both parts of its name
...
This commit was SVN r22935.
2010-04-06 22:41:18 +00:00
Ralph Castain
2b8ab61328
Add another helpful macro
...
This commit was SVN r22934.
2010-04-06 22:40:45 +00:00
Brad Benton
7fe33ec90b
add ibm platform files.
...
This commit was SVN r22933.
2010-04-06 21:47:12 +00:00
Brad Benton
62ecc9a7ae
Add IBM platform files.
...
This commit was SVN r22932.
2010-04-06 21:04:44 +00:00
Ralph Castain
a1e82e9d05
Per discussion with Josh, cleanup the errmgr API by creating separate modules for the public vs internal APIs. This mirrors the architecture used in other frameworks that had similar requirements.
...
Remove the orcm errmgr module - moving to the orcm code base so it can utilize orcm communications and not interfere with ompi-related operations.
This commit was SVN r22931.
2010-04-05 22:59:21 +00:00
Ralph Castain
1caba7af2f
Fix a bunch of compiler warnings reported by Jeff
...
This commit was SVN r22930.
2010-04-03 00:20:19 +00:00
Ralph Castain
84c7973df8
Update the #procs in the job prior to assigning vpids for each app_context.
...
This commit was SVN r22929.
2010-04-03 00:03:35 +00:00
Ralph Castain
6b43b76f9d
Some updates required for generating a LAM-style virtual machine. Retain the local node if requested. Properly setup the daemon job map for a VM launch.
...
This commit was SVN r22928.
2010-04-03 00:03:01 +00:00
Brad Benton
58a9aeff5a
================================================================================
...
modify the OPAL_PAFFINITY_PROCESS_IS_BOUND macro to search the cpuset for
the maximum possible number of cpus rather than just the number of cpus
currently online. This corrects a problem where mpi_paffinity_alone was
not working properly on systems in which there can be cpu namespaces with
holes, such as on ppc64 with smt off (as discussed in #2365 ).
This commit was SVN r22927.
2010-04-02 18:24:12 +00:00
Ralph Castain
de6679dbd3
Truly respect the -quiet option. Make it an mca param so someone doesn't have to put it solely on the cmd line. Tell show_help to shaddup as well.
...
This commit was SVN r22926.
2010-04-02 14:19:38 +00:00
Ralph Castain
25a9a195b0
When the user requests "quiet", mpirun isn't supposed to output "helpful error messages" - so shaddup!
...
This commit was SVN r22925.
2010-04-02 07:13:11 +00:00
Ralph Castain
ed0f42fa49
Fix a bug courtesy of Jeff - since check_job_complete removes the child object and releases it, preserve the pointer to the next item on the list prior to working with it
...
This commit was SVN r22924.
2010-04-02 07:08:34 +00:00
Jeff Squyres
8a85c4617f
Fixes trac:2366: dragonboy noticed that the PGI compiler is picky about
...
#if directives -- had to change a pair of #if conditionals in
opal/util/stacktrace.c to make the PGI compiler accept it.
This commit was SVN r22923.
The following Trac tickets were found above:
Ticket 2366 --> https://svn.open-mpi.org/trac/ompi/ticket/2366
2010-04-01 17:04:06 +00:00
Jeff Squyres
c57a8fba5a
Improve the help message when mpirun cannot find an executable.
...
Refs trac:2035
This commit was SVN r22922.
The following Trac tickets were found above:
Ticket 2035 --> https://svn.open-mpi.org/trac/ompi/ticket/2035
2010-04-01 13:26:29 +00:00
Ralph Castain
871a9e0df4
Track process heartbeats with time_t, be a little less restrictive on who can retrieve an orte_job_t object
...
This commit was SVN r22921.
2010-03-31 19:20:06 +00:00
Rainer Keller
3111b2debf
- Update platform script for Jaguar to latest version
...
This should be cmr'd to v1.5
This commit was SVN r22920.
2010-03-31 14:39:33 +00:00
Rainer Keller
66ce72201f
- Convert shell script to perl ;-) Only (XML) svn log, instead of
...
one PER eligible revision speeds this up 24x...
Now should be worthwhile to run for the v1.4 branch as well.
As before creates HTML output by default:
perl ompi_branch_check_revisions.pl > revisions-v1.5.html
By default now does the parsing of the branch's svn log -- should
then work hand in hand with Jeff's gkcommit.pl
This commit was SVN r22917.
2010-03-31 03:00:58 +00:00
Josh Hursey
62f8d3c471
r22885 missed a few symbol updates when it changed ompi_want_ft to opal_want_ft
...
This commit was SVN r22916.
The following SVN revision numbers were found above:
r22885 --> open-mpi/ompi@522a23d6a3
2010-03-30 16:47:39 +00:00
Rainer Keller
9a8b794eb4
- Allow -r=r<NUM>, and -r=<NUM> for easier copy-pasting...
...
This commit was SVN r22915.
2010-03-30 14:02:35 +00:00
Ralph Castain
f6bfaa76ba
Add some debug output to job_complete. If no session dirs were created, then cannot check for abort file - which wouldn't be created anyway
...
This commit was SVN r22903.
2010-03-29 23:21:03 +00:00
Jeff Squyres
eaed49594c
Fix typo (I'm assuming this was a copy-n-paste error :-) ).
...
This commit was SVN r22902.
2010-03-29 21:54:02 +00:00
Ralph Castain
24c3b4f849
Add the sysinfo framework to the "info" tools, especially since the odls_base_open function calls it!
...
This commit was SVN r22901.
2010-03-29 20:47:29 +00:00
Ralph Castain
2603bd8a47
Eliminate a race condition (first reported by Josh) when deliberately killing procs. Need to cancel the waitpid callback for the proc, then properly flag it as dead (both not-alive and waitpid-fired) so that the system cleans up properly.
...
This commit was SVN r22900.
2010-03-28 16:08:05 +00:00
Ralph Castain
4f9db20d94
Couple of minor cleanups
...
This commit was SVN r22899.
2010-03-28 15:41:27 +00:00
Ralph Castain
1a100812a9
Add some new cisco platform files
...
This commit was SVN r22898.
2010-03-28 15:40:51 +00:00
Jeff Squyres
3449c34bc9
Remove no-longer-existing patchfiles from the Makefile.am.
...
This commit was SVN r22897.
2010-03-27 11:36:31 +00:00
Jeff Squyres
319fb12504
Per RFC initially started here:
...
http://www.open-mpi.org/community/lists/devel/2010/02/7496.php
Increase the required versions of AM, AC, and LT:
* Autoconf: 2.65
* Automake: 1.11.1
* Libtool: 2.2.6b
And therefore removed a bunch of patches that we used to apply to make
older versions of these tools work.
Also updated the HACKING document to match these version numbers,
specifically mentioned Mercurial in a few places, and removed some
outdated language about running autogen.sh in subdirectories.
This commit was SVN r22896.
2010-03-26 21:03:50 +00:00
Ralph Castain
b2e6c02e22
Add critical mca param to cisco platform files
...
This commit was SVN r22895.
2010-03-26 16:26:35 +00:00
Shiqing Fan
9d3613c259
Only check the necessary headers and others on Windows, so that to speed up the configuration a lot.
...
Set up the integer kind family for f77 build.
This commit was SVN r22894.
2010-03-26 16:01:36 +00:00
Jeff Squyres
e307df8a85
* Fix help message
...
* Fix to use SVN log messages relative to the trunk
* Fix error message printing when LWP fails
This commit was SVN r22892.
2010-03-26 14:13:56 +00:00
Jeff Squyres
f9f85692f2
Add a dry-run mode.
...
This commit was SVN r22891.
2010-03-26 12:36:05 +00:00
Ralph Castain
d9e9c6114d
Update cisco platform files
...
This commit was SVN r22887.
2010-03-25 23:05:05 +00:00
Ralph Castain
1bf9684ebb
Don't include jobs in the nidmap if they aren't mapped jobs
...
This commit was SVN r22886.
2010-03-25 22:54:57 +00:00
Ralph Castain
522a23d6a3
A few changes to the FT-related configure options:
...
1. fix a bug that caused an infinite loop in configure when specifying want-ft but not want-ft-thread by removing a stale reference to the opal-progress-thread option
2. add want-ft=orcm so we can build the orcm errmgr component
3. cleanup the use of "ompi_want_ft_xxx" and replace it with "opal_want_ft_xxx" so that naming conventions are preserved
This commit was SVN r22885.
2010-03-25 22:53:48 +00:00