Terry Dontje
282a537cf7
This commit fixes 2370, by having the solaris paffinity module return error codes for get_physical_processor_id and having odls_default_fork_local_proc check get_physical_processor_id for OPAL_ERROR
This commit was SVN r22948.
2010-04-09 15:10:46 +00:00
Brad Benton
101b896f2e
IBM has approved the release of the LoadLeveler sample code under the
BSD license. Consequently, a more restrictive licensing clause that was
originally associated with the LoadLeveler sample code documentation and
replicated in a comment block in this file has been removed.
This commit was SVN r22947.
2010-04-08 19:41:44 +00:00
Terry Dontje
929c58e38d
This commit fixes trac:2073
This commit was SVN r22946.
The following Trac tickets were found above:
Ticket 2073 --> https://svn.open-mpi.org/trac/ompi/ticket/2073
2010-04-08 18:17:44 +00:00
Ralph Castain
75e99e6118
Do a better job of selecting cm ess component, handle tool and daemon issues
This commit was SVN r22942.
2010-04-07 18:59:21 +00:00
Ralph Castain
f1fc344336
Add some diagnostics
This commit was SVN r22941.
2010-04-07 18:58:17 +00:00
Ralph Castain
bdb62a3e4e
Update cisco platform files
This commit was SVN r22940.
2010-04-07 18:57:54 +00:00
Rolf vandeVaart
0adb570693
Add pml_ob1_verbose flag. Fix the location where it is currently being used
This commit was SVN r22939.
2010-04-07 13:51:42 +00:00
Ralph Castain
8e29a6858a
Properly handle the case when a daemon is given both parts of its name
This commit was SVN r22935.
2010-04-06 22:41:18 +00:00
Ralph Castain
2b8ab61328
Add another helpful macro
This commit was SVN r22934.
2010-04-06 22:40:45 +00:00
Brad Benton
7fe33ec90b
Add IBM platform files.
This commit was SVN r22933.
2010-04-06 21:47:12 +00:00
Brad Benton
62ecc9a7ae
Add IBM platform files.
This commit was SVN r22932.
2010-04-06 21:04:44 +00:00
Ralph Castain
a1e82e9d05
Per discussion with Josh, clean up the errmgr API by creating separate modules for the public vs internal APIs. This mirrors the architecture used in other frameworks that had similar requirements.
Remove the orcm errmgr module - moving to the orcm code base so it can utilize orcm communications and not interfere with ompi-related operations.
This commit was SVN r22931.
2010-04-05 22:59:21 +00:00
Ralph Castain
1caba7af2f
Fix a bunch of compiler warnings reported by Jeff
This commit was SVN r22930.
2010-04-03 00:20:19 +00:00
Ralph Castain
84c7973df8
Update the #procs in the job prior to assigning vpids for each app_context.
This commit was SVN r22929.
2010-04-03 00:03:35 +00:00
Ralph Castain
6b43b76f9d
Some updates required for generating a LAM-style virtual machine. Retain the local node if requested. Properly set up the daemon job map for a VM launch.
This commit was SVN r22928.
2010-04-03 00:03:01 +00:00
Brad Benton
58a9aeff5a
Modify the OPAL_PAFFINITY_PROCESS_IS_BOUND macro to search the cpuset for
the maximum possible number of CPUs rather than just the number of CPUs
currently online. This corrects a problem where mpi_paffinity_alone was
not working properly on systems in which there can be CPU namespaces with
holes, such as on ppc64 with SMT off (as discussed in #2365).
This commit was SVN r22927.
2010-04-02 18:24:12 +00:00
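The effect of scanning only as many CPU ids as are online, when the id space has holes, can be sketched as follows. This is an illustration only, not the actual OPAL_PAFFINITY_PROCESS_IS_BOUND macro: the `cpuset_t` bitmask type, `count_cpus`, `is_bound`, and the CPU numbering (online ids 0, 4, 8, 12 out of 16 possible) are all assumptions made up for the example.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical 64-bit cpuset: bit i set means the process may run on CPU id i. */
typedef uint64_t cpuset_t;

/* Count the CPUs set in `mask` among ids [0, limit). */
static int count_cpus(cpuset_t mask, int limit)
{
    int count = 0;
    for (int i = 0; i < limit && i < 64; ++i) {
        if (mask & (UINT64_C(1) << i)) {
            ++count;
        }
    }
    return count;
}

/*
 * Treat a process as bound when it may run on at least one CPU but on
 * fewer CPUs than are online. `scan_limit` is how many CPU ids to examine:
 * passing the online count misses ids beyond it when the id space has
 * holes; passing the maximum possible id count does not.
 */
static bool is_bound(cpuset_t mask, int num_online, int scan_limit)
{
    int allowed = count_cpus(mask, scan_limit);
    return allowed > 0 && allowed < num_online;
}
```

With four CPUs online at ids 0, 4, 8, and 12, a process restricted to CPU 4 is reported as unbound when only ids 0..3 are scanned, and as bound when all 16 possible ids are scanned.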
Ralph Castain
de6679dbd3
Truly respect the -quiet option. Make it an mca param so someone doesn't have to put it solely on the cmd line. Tell show_help to shaddup as well.
This commit was SVN r22926.
2010-04-02 14:19:38 +00:00
Ralph Castain
25a9a195b0
When the user requests "quiet", mpirun isn't supposed to output "helpful error messages" - so shaddup!
This commit was SVN r22925.
2010-04-02 07:13:11 +00:00
Ralph Castain
ed0f42fa49
Fix a bug courtesy of Jeff - since check_job_complete removes the child object and releases it, preserve the pointer to the next item on the list prior to working with it
This commit was SVN r22924.
2010-04-02 07:08:34 +00:00
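The pattern behind the r22924 fix (save the pointer to the next list item before the current one may be removed and released) can be sketched on a plain singly linked list. This is not the ORTE code: `item_t`, its `done` flag, and `reap_finished` are hypothetical names, and Open MPI's own list class is not used here.

```c
#include <stdlib.h>

/* Hypothetical list item; `done` marks an item that should be removed. */
typedef struct item {
    int done;
    struct item *next;
} item_t;

/* Unlink and free every finished item; return the number removed. */
static int reap_finished(item_t **head)
{
    int removed = 0;
    item_t **link = head;          /* pointer that currently points at cur */
    item_t *cur = *head;

    while (cur != NULL) {
        item_t *next = cur->next;  /* save before cur may be freed */
        if (cur->done) {
            *link = next;          /* unlink cur from the list */
            free(cur);             /* cur must not be touched after this */
            ++removed;
        } else {
            link = &cur->next;
        }
        cur = next;                /* safe: uses the saved pointer */
    }
    return removed;
}
```

Reading `cur->next` after the `free(cur)` call instead of before it would be a use-after-free, which is the kind of bug the commit message describes.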
Jeff Squyres
8a85c4617f
Fixes trac:2366: dragonboy noticed that the PGI compiler is picky about
#if directives -- had to change a pair of #if conditionals in
opal/util/stacktrace.c to make the PGI compiler accept it.
This commit was SVN r22923.
The following Trac tickets were found above:
Ticket 2366 --> https://svn.open-mpi.org/trac/ompi/ticket/2366
2010-04-01 17:04:06 +00:00
Jeff Squyres
c57a8fba5a
Improve the help message when mpirun cannot find an executable.
Refs trac:2035
This commit was SVN r22922.
The following Trac tickets were found above:
Ticket 2035 --> https://svn.open-mpi.org/trac/ompi/ticket/2035
2010-04-01 13:26:29 +00:00
Ralph Castain
871a9e0df4
Track process heartbeats with time_t, be a little less restrictive on who can retrieve an orte_job_t object
This commit was SVN r22921.
2010-03-31 19:20:06 +00:00
Rainer Keller
3111b2debf
- Update platform script for Jaguar to latest version
This should be cmr'd to v1.5
This commit was SVN r22920.
2010-03-31 14:39:33 +00:00
Rainer Keller
66ce72201f
- Convert shell script to perl ;-) Only (XML) svn log, instead of
one PER eligible revision speeds this up 24x...
Now should be worthwhile to run for the v1.4 branch as well.
As before creates HTML output by default:
perl ompi_branch_check_revisions.pl > revisions-v1.5.html
By default now does the parsing of the branch's svn log -- should
then work hand in hand with Jeff's gkcommit.pl
This commit was SVN r22917.
2010-03-31 03:00:58 +00:00
Josh Hursey
62f8d3c471
r22885 missed a few symbol updates when it changed ompi_want_ft to opal_want_ft
This commit was SVN r22916.
The following SVN revision numbers were found above:
r22885 --> open-mpi/ompi@522a23d6a3
2010-03-30 16:47:39 +00:00
Rainer Keller
9a8b794eb4
- Allow -r=r<NUM>, and -r=<NUM> for easier copy-pasting...
This commit was SVN r22915.
2010-03-30 14:02:35 +00:00
Ralph Castain
f6bfaa76ba
Add some debug output to job_complete. If no session dirs were created, then cannot check for abort file - which wouldn't be created anyway
This commit was SVN r22903.
2010-03-29 23:21:03 +00:00
Jeff Squyres
eaed49594c
Fix typo (I'm assuming this was a copy-n-paste error :-) ).
This commit was SVN r22902.
2010-03-29 21:54:02 +00:00
Ralph Castain
24c3b4f849
Add the sysinfo framework to the "info" tools, especially since the odls_base_open function calls it!
This commit was SVN r22901.
2010-03-29 20:47:29 +00:00
Ralph Castain
2603bd8a47
Eliminate a race condition (first reported by Josh) when deliberately killing procs. Need to cancel the waitpid callback for the proc, then properly flag it as dead (both not-alive and waitpid-fired) so that the system cleans up properly.
This commit was SVN r22900.
2010-03-28 16:08:05 +00:00
Ralph Castain
4f9db20d94
Couple of minor cleanups
This commit was SVN r22899.
2010-03-28 15:41:27 +00:00
Ralph Castain
1a100812a9
Add some new cisco platform files
This commit was SVN r22898.
2010-03-28 15:40:51 +00:00
Jeff Squyres
3449c34bc9
Remove no-longer-existing patchfiles from the Makefile.am.
This commit was SVN r22897.
2010-03-27 11:36:31 +00:00
Jeff Squyres
319fb12504
Per RFC initially started here:
http://www.open-mpi.org/community/lists/devel/2010/02/7496.php
Increase the required versions of AM, AC, and LT:
* Autoconf: 2.65
* Automake: 1.11.1
* Libtool: 2.2.6b
And therefore removed a bunch of patches that we used to apply to make
older versions of these tools work.
Also updated the HACKING document to match these version numbers,
specifically mentioned Mercurial in a few places, and removed some
outdated language about running autogen.sh in subdirectories.
This commit was SVN r22896.
2010-03-26 21:03:50 +00:00
Ralph Castain
b2e6c02e22
Add critical mca param to cisco platform files
This commit was SVN r22895.
2010-03-26 16:26:35 +00:00
Shiqing Fan
9d3613c259
Only check the necessary headers and other items on Windows, which speeds up configuration considerably.
Set up the integer kind family for f77 build.
This commit was SVN r22894.
2010-03-26 16:01:36 +00:00
Jeff Squyres
e307df8a85
* Fix help message
* Fix to use SVN log messages relative to the trunk
* Fix error message printing when LWP fails
This commit was SVN r22892.
2010-03-26 14:13:56 +00:00
Jeff Squyres
f9f85692f2
Add a dry-run mode.
This commit was SVN r22891.
2010-03-26 12:36:05 +00:00
Ralph Castain
d9e9c6114d
Update cisco platform files
This commit was SVN r22887.
2010-03-25 23:05:05 +00:00
Ralph Castain
1bf9684ebb
Don't include jobs in the nidmap if they aren't mapped jobs
This commit was SVN r22886.
2010-03-25 22:54:57 +00:00
Ralph Castain
522a23d6a3
A few changes to the FT-related configure options:
1. fix a bug that caused an infinite loop in configure when specifying want-ft but not want-ft-thread by removing a stale reference to the opal-progress-thread option
2. add want-ft=orcm so we can build the orcm errmgr component
3. clean up the use of "ompi_want_ft_xxx" and replace it with "opal_want_ft_xxx" so that naming conventions are preserved
This commit was SVN r22885.
2010-03-25 22:53:48 +00:00
Jeff Squyres
3179daa5e0
Add in Ralph's suggestion of running "svn up". Also add a command
line option to ''not'' run it if you don't want to. Also add a --help
output so that you can see the command line options.
This commit was SVN r22883.
2010-03-25 15:00:17 +00:00
Jeff Squyres
370c987486
Add a helper script for the gatekeepers -- automatically create a
suggested SVN commit message for when closing a CMR (containing CMR
#s, CMR subject lines, and SVN commit log messages).
This commit was SVN r22882.
2010-03-25 14:38:21 +00:00
Christopher Yeoh
a6175bbefc
Adds copyright notice that should have gone in with r22700
This commit was SVN r22881.
The following SVN revision numbers were found above:
r22700 --> open-mpi/ompi@774a7a58b0
2010-03-25 04:03:52 +00:00
Christopher Yeoh
cd5294944b
fixes trac:2355 - race in opal_atomic_lifo
Adds memory barriers to remove race condition which can
occur on PowerPC architectures (and probably others)
This commit was SVN r22880.
The following Trac tickets were found above:
Ticket 2355 --> https://svn.open-mpi.org/trac/ompi/ticket/2355
2010-03-25 03:44:38 +00:00
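Why a lock-free LIFO needs memory barriers on weakly ordered architectures such as PowerPC can be illustrated with C11 atomics. This is a sketch under stated assumptions, not the opal_atomic_lifo implementation: `node_t`, `lifo_t`, `lifo_push`, and `lifo_pop` are hypothetical names, and the barriers are expressed as C11 release/acquire orderings rather than whatever primitives the actual fix uses.

```c
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical node and LIFO types for illustration. */
typedef struct node {
    int value;
    struct node *next;
} node_t;

typedef struct {
    _Atomic(node_t *) head;
} lifo_t;

/*
 * Push: the release ordering on the successful compare-and-swap makes the
 * writes to n->value and n->next visible before the new head can be
 * observed by another thread.
 */
static void lifo_push(lifo_t *l, node_t *n)
{
    node_t *old = atomic_load_explicit(&l->head, memory_order_relaxed);
    do {
        n->next = old;
    } while (!atomic_compare_exchange_weak_explicit(
                 &l->head, &old, n,
                 memory_order_release, memory_order_relaxed));
}

/*
 * Pop: the acquire ordering pairs with the release in lifo_push, so the
 * read of old->next sees the value stored by the pushing thread.
 * Note: this simple pop does not address the ABA problem, which a
 * production lock-free LIFO must also handle.
 */
static node_t *lifo_pop(lifo_t *l)
{
    node_t *old = atomic_load_explicit(&l->head, memory_order_acquire);
    while (old != NULL &&
           !atomic_compare_exchange_weak_explicit(
               &l->head, &old, old->next,
               memory_order_acquire, memory_order_acquire)) {
        /* a failed compare-and-swap reloaded old; retry with the new head */
    }
    return old;
}
```

Without the release/acquire pairing, a weakly ordered CPU may let a popping thread observe the new head before the pushed node's `next` field is visible, which is the general shape of race that missing barriers allow.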
Christopher Yeoh
768ea2bab0
fixes trac:2351 - race in use of ompi free lists
Adds memory barriers which are definitely needed on powerpc
This commit was SVN r22879.
The following Trac tickets were found above:
Ticket 2351 --> https://svn.open-mpi.org/trac/ompi/ticket/2351
2010-03-25 03:38:14 +00:00
Christopher Yeoh
81e06a2baf
fixes trac:2340 - race in mca_mpool_base_free
This commit was SVN r22878.
The following Trac tickets were found above:
Ticket 2340 --> https://svn.open-mpi.org/trac/ompi/ticket/2340
2010-03-25 03:29:27 +00:00
Christopher Yeoh
0b93c87c2c
Correct year for copyright notices
This commit was SVN r22877.
2010-03-25 03:14:21 +00:00
Josh Hursey
e4f2d03d28
ErrMgr Framework redesign to better support fault tolerance development activities.
Explained in more detail in the following RFC:
http://www.open-mpi.org/community/lists/devel/2010/03/7589.php
This commit was SVN r22872.
2010-03-23 21:28:02 +00:00
Ralph Castain
0b9552cd4e
Expand the ESS framework's API to include a new function "query_sys_info" that allows the caller to retrieve key-value pairs of info on the local system capabilities (e.g., cpu type/model). Have each daemon and the HNP "sense" that information and provide it to their local procs to avoid having every proc querying the system directly.
This commit was SVN r22870.
2010-03-23 20:47:41 +00:00