Ralph Castain
628936a99f
Provide a convenience option to disable fault recovery (as opposed to setting three separate, long-named mca params)
...
This commit was SVN r23327.
2010-07-01 19:31:11 +00:00
Ralph Castain
09acea1ccc
Update platform file
...
This commit was SVN r23326.
2010-07-01 19:30:15 +00:00
Jeff Squyres
222c4c8dd8
Reformat the verbatim sections of these man pages for narrower (80
...
char) displays.
This commit was SVN r23325.
2010-07-01 18:52:45 +00:00
Ralph Castain
1102f0c171
Replace old platform file with newer ones
...
This commit was SVN r23322.
2010-06-29 15:00:10 +00:00
Ralph Castain
73eabc83d6
Add new platform files
...
This commit was SVN r23321.
2010-06-29 14:58:40 +00:00
Jeff Squyres
ad95e00b42
Remove an extraneous/misleading comment.
...
This commit was SVN r23320.
2010-06-29 14:42:03 +00:00
Jeff Squyres
9ac56c8674
Add "-j4" into the flags passed when we "make distcheck" (these flags
...
don't help when just running "make dist"). On my (somewhat older)
machines, it cut the wall clock time of make_dist_tarball down from
~55 minutes to ~40 minutes.
This commit was SVN r23318.
2010-06-29 14:32:20 +00:00
Jeff Squyres
e82e7f896e
These compile warnings have been forever; I finally got inspired to
...
fix them.
This commit was SVN r23316.
2010-06-28 17:26:38 +00:00
Ralph Castain
3237b9ec87
Print a nice error message when a daemon fails, and exit with a non-zero status
...
This commit was SVN r23314.
2010-06-28 16:38:54 +00:00
Jeff Squyres
1fad51776d
Also add <stdlib.h> for exit().
...
This commit was SVN r23308.
2010-06-28 15:17:42 +00:00
Jeff Squyres
f9d4426c19
OS X / Absoft needs <string.h>
...
This commit was SVN r23307.
2010-06-28 15:15:06 +00:00
Ralph Castain
a1ea6bc130
Ignore debugger daemon termination status - we don't care how they died.
...
This commit was SVN r23306.
2010-06-26 03:08:50 +00:00
Jeff Squyres
6d07a1cc0b
Per comments in this commit, hwloc isn't able to find cores on all
...
platforms (e.g., PPC64 running RHEL 5.4) -- sometimes it only finds
PUs. So in that case, just run the same calculation, but with PUs
instead of cores.
This commit was SVN r23305.
2010-06-25 21:36:53 +00:00
Ralph Castain
f325ac030a
Add a function to prepend a string to the beginning of an argv array - useful when building app_contexts from user input
...
This commit was SVN r23303.
2010-06-24 15:52:36 +00:00
Nadia Derbey
c22e6b3613
openib btl unsafe in case of extremely low srq settings
...
This commit was SVN r23301.
2010-06-24 09:59:45 +00:00
Ralph Castain
099c3aad97
Fix a major foopah that broke debugger attach. With the revisions in updating proc state, we dropped the recording of each proc's pid. Thus, attaching debuggers would find a proctable whose pids all equal 0.
...
This required modification of the errmgr.update_state API so the pid could be passed in to the function that could update the proper data record(s). All calls to that API have been updated as well, but I obviously couldn't test them all.
Thanks to Dong Ahn (LLNL) for catching this problem!
Also fixed debugger daemon cospawn, both for initial launch and attach-while-running modes. Tested and verified on rsh and slurm.
This commit was SVN r23300.
2010-06-24 05:13:53 +00:00
Ralph Castain
e9f4c84d7e
Add another name field to the job object
...
This commit was SVN r23299.
2010-06-24 01:57:27 +00:00
Jeff Squyres
5cdd79ef13
Oops -- set the bits one at a time via _set. Using _cpu effectively
...
zeroed out the cpuset before setting the bit (i.e., we always had a
cpuset of 1).
This commit was SVN r23298.
2010-06-23 20:56:59 +00:00
Shiqing Fan
681df0089b
Add a few new files into the tarball.
...
This commit was SVN r23297.
2010-06-22 16:45:56 +00:00
Ralph Castain
8b2a682fba
Return a silent error when -do-not-launch is given
...
This commit was SVN r23291.
2010-06-22 01:06:10 +00:00
Shiqing Fan
2e5e9f0a03
Fix a wrong windows path in hpn_contack, which causes problems when looking up in the session directories. Add two more ess module for Windows.
...
This commit was SVN r23286.
2010-06-21 09:47:33 +00:00
Ralph Castain
ae746a390f
Debugger daemons spawned upon attachment to a running job need to be treated just like a regular job - they are not "piggybacking" onto an existing launch, and so the orte daemons need to report them just like a regular job launch in order to release from spawn.
...
Modify the debugger control flag to include "do not monitor" so orterun will not take debugger daemon termination into account when deciding that all jobs are done.
This commit was SVN r23282.
2010-06-19 15:22:36 +00:00
Jeff Squyres
ea05c73cfc
Use the right number of characters for the strncmp. Thanks to Brad
...
for catching that!
This commit was SVN r23281.
2010-06-18 15:45:38 +00:00
Jeff Squyres
cdc5541cb0
Search for "dlname", not "dlopen". This value will be filled in if
...
there is a DSO to open.
This commit was SVN r23280.
2010-06-18 15:13:34 +00:00
Shiqing Fan
e32159d118
Updates and fixes for Fortran bindings on Windows, including two missing feature tests and CMake scripts improvements.
...
This commit was SVN r23279.
2010-06-18 13:03:16 +00:00
Ethan Mallove
fc37e408c2
Avoid SEGV in case rsh/ssh is not in PATH
(refs trac:1490)
...
This commit was SVN r23278.
The following Trac tickets were found above:
Ticket 1490 --> https://svn.open-mpi.org/trac/ompi/ticket/1490
2010-06-17 14:58:09 +00:00
Ralph Castain
1e90b91b84
Unset envars we set during initialization so we leave environ intact after orte_finalize.
...
Thanks to Damien Gunter for pointing it out.
This commit was SVN r23277.
2010-06-17 13:42:21 +00:00
Ralph Castain
9ba3459135
Use the correct command to revert VERSION when making tarballs
...
This commit was SVN r23276.
2010-06-17 04:19:42 +00:00
Ralph Castain
628ffd1d6e
Make the mcast channel assignments unsigned ints so they can be used as array indices. Assign input/output channels for apps. Cleanup some bugs in open_channel
...
This commit was SVN r23275.
2010-06-16 19:40:59 +00:00
Matthias Jurenz
1467f2db52
Added workaround for PGI compiler bug (see http://www.pgroup.com/support/release_tprs_90.htm TPR 4337):
...
Disable OpenMP if compiler version is less than 9.0-3.
This commit was SVN r23274.
2010-06-15 07:16:13 +00:00
Ralph Castain
6cbe947810
Modify the multicast scheme so that applications have separate input and output channels to avoid cross-talk. Update the multicast test to conform.
...
This commit was SVN r23271.
2010-06-15 03:50:31 +00:00
Jeff Squyres
b620e63bdc
Add in 2 cases for where this test may be skipped:
...
1. If opal wasn't built with libltdl support
1. If opal was built statically (i.e., dlopen='' in the .la file)
This commit was SVN r23270.
2010-06-14 16:06:43 +00:00
Shiqing Fan
d391c57b0f
A more proper fix for the HANDLE definition.
...
This commit was SVN r23269.
2010-06-14 14:17:07 +00:00
Ralph Castain
fdf9e5f92d
Update cisco platform files
...
This commit was SVN r23268.
2010-06-12 16:05:39 +00:00
Jeff Squyres
6bcdadbf0e
If we're not building project_ompi, don't do anything with C++. Also
...
rename OMPI_CHECK_ATTRIBUTES -> OPAL_CHECK_ATTRIBUTES, because it's in
OPAL (somehow that name must have gotten missed in the Great M4 split
of '10...?)
This commit was SVN r23267.
2010-06-12 03:15:47 +00:00
Jeff Squyres
8ce59bb3e3
Use HWLOC_EMBEDDED_LIBS properly (new variable as of 1.0.2a12214).
...
Should fix some Solaris build issues.
This commit was SVN r23266.
2010-06-09 19:58:42 +00:00
Jeff Squyres
2887fe77c5
Refresh hwloc to an as-yet unreleased tarball from the hwloc 1.0
...
release branch in order to fix some Solaris bugs.
This commit was SVN r23265.
2010-06-09 19:56:18 +00:00
Ralph Castain
bb602694e6
Add a new example program, update cisco platform file
...
This commit was SVN r23262.
2010-06-09 18:21:06 +00:00
Samuel Gutierrez
2fb7c344fc
Added a new System V (sysv) shared memory component for Open MPI.
...
Configure Option:
--enable-sysv
MCA Parameter:
mpi_common_sm
mpi_common_sm accepts a comma delimited list of: [sysv],mmap (order
dependent). The first component that is successfully selected is used. For
example, -mca mpi_common_sm sysv,mmap will first try sysv. If sysv is not
successfully selected, then mmap will be used. mmap will be used if
mpi_common_sm is not provided.
Notes:
Please make certain that your system's shmmax limit, or equivalent, is larger
than mpool_sm_min_size. Otherwise, shmget may fail.
This commit was SVN r23260.
2010-06-09 16:58:52 +00:00
Ralph Castain
da43547983
Don't define the active_jobid until -after- the job has been setup.
...
Cleanup references to pointer_array objects
This commit was SVN r23250.
2010-06-09 02:16:05 +00:00
Jeff Squyres
f1a7b5cc33
Make "processor affinity not supported" error message a little better:
...
* Remove OPAL_ERR_PAFFINITY_NOT_SUPPORTED; fit it into the generic
OPAL_ERR_NOT_SUPPORTED case.
* When odls_default detects that processor affinity is not supported,
it prints a specific message about it, and then it suppressed a
generic HNP help message that would normally follow it (i.e., it's
easier to have the "processor affinity is not supported" show_help
message last).
* Use some symbolic names in odls_default instead of fixed int's,
just for slight readability improvements in the code.
* Introduce orte_show_help_suppress(), which gives the ability to
suppress any future showings of any arbitrary show_help() message.
This is useful if you display message X and want to suppress
message Y. This suppression *only* works in environments where
orte_show_help() does coalescing.
This commit was SVN r23249.
2010-06-08 20:16:07 +00:00
Ralph Castain
e52a54183f
Let max restarts be associated with an app_context instead of a job so that individual apps can have different values. Default to a single job-level value
...
This commit was SVN r23248.
2010-06-07 14:21:08 +00:00
Ralph Castain
799a77a187
Some updates to the routed-cm module so it properly supports the tcp rmcast module
...
This commit was SVN r23247.
2010-06-07 14:19:32 +00:00
Ralph Castain
bd045468e5
Let apps use the ess cm module too...
...
This commit was SVN r23246.
2010-06-07 14:16:34 +00:00
Ralph Castain
ec7b5dae2b
Add missing include file
...
This commit was SVN r23245.
2010-06-07 14:15:25 +00:00
Ralph Castain
d80c90c7b9
Include missing tests
...
This commit was SVN r23244.
2010-06-07 14:15:00 +00:00
Ralph Castain
17fd8b3607
Update cisco platform files
...
This commit was SVN r23243.
2010-06-07 14:14:41 +00:00
George Bosilca
c8ee150c95
If we fail to correctly initialize the MX device, don't mark it as initialized.
...
This commit was SVN r23238.
2010-06-02 15:00:42 +00:00
Jeff Squyres
e45be29f0d
This function shouldn't have an ibv_ prefix -- it's not part of
...
verbs (it's just a static convenience function here in this file).
This commit was SVN r23237.
2010-06-02 12:54:56 +00:00
Jeff Squyres
7676d5adda
Change "intra-communicator" to "inter-communicator". Thanks to
...
Simon/Number Cruncher for reporting the typo.
This commit was SVN r23236.
2010-06-02 12:35:53 +00:00