Ralph Castain
93bbeabc55
Remove stale code
...
This commit was SVN r26251.
2012-04-07 13:33:30 +00:00
Ralph Castain
b6cde9a8d1
Remove stale code
...
This commit was SVN r26250.
2012-04-07 13:33:18 +00:00
George Bosilca
319f76d66a
Low hanging fruit. Remove a declared but not defined function.
...
This commit was SVN r26245.
2012-04-06 15:43:28 +00:00
Ralph Castain
ed197acaa2
Eliminate stale code
...
This commit was SVN r26244.
2012-04-06 15:31:13 +00:00
Ralph Castain
bd8b4f7f1e
Sorry for mid-day commit, but I had promised on the call to do this upon my return.
...
Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code.
Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch.
This commit was SVN r26242.
2012-04-06 14:23:13 +00:00
Ralph Castain
ca3ff58c76
Ensure we get a non-zero exit status when we can't find the specified fork agent. Output a better error message, and ensure we don't multiply report the problem.
...
This commit was SVN r26191.
2012-03-24 00:49:38 +00:00
Ralph Castain
46b040c79f
Fix typo
...
This commit was SVN r26189.
2012-03-24 00:31:05 +00:00
Ralph Castain
2bd75ec7e3
Fix Cray XE builds - the priority here needs to equal that of the HNP component so that both build. Otherwise, mpirun tries to use PMI for its basis, and that doesn't work!
...
This commit was SVN r26188.
2012-03-23 20:06:34 +00:00
Ralph Castain
811413e9bc
Correctly handle multiple cpu-set ranges. Correctly support optional binding directives combined with cpu-set.
...
This commit was SVN r26187.
2012-03-23 14:50:41 +00:00
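The multi-range handling mentioned in r26187 can be pictured with a small sketch. This is not the Open MPI code. It assumes a cpu-set is written as a comma-separated list of single ids and inclusive `lo-hi` ranges, and `parse_cpu_set` is a hypothetical name used only for illustration.

```python
def parse_cpu_set(spec):
    """Expand a cpu-set string such as "0-3,8,10-11" into a sorted list of CPU ids.

    Assumed syntax (not taken from the Open MPI source): comma-separated
    entries, each either a single id or an inclusive "lo-hi" range.
    """
    cpus = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            lo, hi = (int(x) for x in part.split("-", 1))
            if lo > hi:
                raise ValueError(f"bad range: {part!r}")
            cpus.update(range(lo, hi + 1))  # ranges are inclusive
        else:
            cpus.add(int(part))
    return sorted(cpus)
```

Collecting ids into a set means overlapping ranges such as `0-3,2-5` collapse into one list without duplicates.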
Ralph Castain
ce0caf7567
Support -cpu-set by binding to the specified cpus in the absence of any other binding directive. Allows users to subdivide nodes for multiple parallel mpirun invocations.
...
This commit was SVN r26186.
2012-03-23 14:05:52 +00:00
Ralph Castain
33ed3cda07
Update the gridengine allocator to support data from multiple queues by checking for duplicate node entries
...
This commit was SVN r26148.
2012-03-15 17:45:50 +00:00
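A rough sketch of the duplicate-node check described in r26148 follows. The commit does not say what happens when a duplicate is found, so the sketch assumes the slot counts are summed. It also assumes each allocation line starts with a host name followed by a slot count, and `merge_node_slots` is a hypothetical helper, not an Open MPI function.

```python
def merge_node_slots(lines):
    """Combine allocation lines that name the same host.

    Assumptions for illustration only: each line is
    "<host> <slots> <anything else>", and slots for a host that
    appears under several queues are added together.
    """
    slots = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        host, count = fields[0], int(fields[1])
        # A repeated host is a duplicate entry from another queue.
        slots[host] = slots.get(host, 0) + count
    return slots
```

Because a dict keeps insertion order, hosts come back in the order they were first seen.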
Josh Hursey
4dd9f89a99
Create an MCA parameter (ess_base_stream_buffering) that allows the user to override the system default for buffering of stdout/stderr streams. See 'man setvbuf' for more information.
...
Note: I am working on a system that buffered all output until the application finished, because the default was 'fully buffered.' This makes debugging painful. This switch fixed the problem by allowing me to adjust the buffering.
This commit was SVN r26119.
2012-03-08 22:02:28 +00:00
Ralph Castain
e71e871bae
Initialize sink location when stdin is forwarded to all ranks
...
This commit was SVN r26107.
2012-03-06 15:47:04 +00:00
Ralph Castain
366f9d1518
Add some missing localities to the hwloc pretty-print, fix pmi modex
...
This commit was SVN r26105.
2012-03-06 06:21:10 +00:00
Ralph Castain
834a86420b
Ensure we use the slurm module for slurm environments, and correct init order in pmi module when used by daemons
...
This commit was SVN r26089.
2012-03-02 23:10:48 +00:00
Ralph Castain
ceb34ed0c9
Fix typo
...
This commit was SVN r26079.
2012-03-02 09:58:09 +00:00
Ralph Castain
b2f1bade37
Fix the -H localhost issue
...
This commit was SVN r26071.
2012-02-29 16:56:00 +00:00
Jeff Squyres
81dc6a11ee
Fix typo in copyright notice, found by Paul Hargrove
...
This commit was SVN r26070.
2012-02-29 02:02:54 +00:00
Ralph Castain
a83da303c5
When using PMI, we know the ranks that share our node and their relative local/node ranks. Save that info in the pidmap array so that BTLs that require early knowledge of local ranks can access it.
...
This commit was SVN r25992.
2012-02-21 16:43:17 +00:00
Jeff Squyres
b6a90434e4
Fix some include file header ordering issues for some BSDs, suggested by Paul Hargrove.
This commit was SVN r25984.
2012-02-21 13:32:14 +00:00
Jeff Squyres
b295a01d8e
Fix another configury error found by Paul Hargrove. Thanks, Paul!
...
This commit was SVN r25971.
2012-02-20 21:38:27 +00:00
Jeff Squyres
cdc783925e
(Re-)Add oob_tcp_if_(in|ex)clude functionality to allow CIDR notation, just like the btl_tcp_if_(in|ex)clude MCA param.
This commit was SVN r25953.
2012-02-17 15:38:42 +00:00
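To illustrate what CIDR matching for an include/exclude list means in r25953, here is a minimal sketch using Python's standard `ipaddress` module. It is not the OOB TCP implementation. `interface_selected` is a hypothetical name, and letting the include list take precedence when both lists are given is an assumption of the sketch.

```python
import ipaddress


def interface_selected(addr, include=(), exclude=()):
    """Decide whether an interface address passes CIDR include/exclude lists.

    Illustrative assumptions: if an include list is given, only addresses
    inside one of its networks pass; otherwise every address passes unless
    it falls inside an excluded network.
    """
    ip = ipaddress.ip_address(addr)

    def matches(specs):
        # strict=False accepts specs with host bits set, e.g. "10.1.2.3/16"
        return any(ip in ipaddress.ip_network(s, strict=False) for s in specs)

    if include:
        return matches(include)
    return not matches(exclude)
```

For example, `interface_selected("10.1.2.3", include=["10.1.0.0/16"])` passes, while the same call with `"192.168.1.5"` does not.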
Jeff Squyres
3e22450345
Fix the oob_tcp_verbose MCA param; make it actually apply to the OOB TCP verbose handle (not the generic/0 handle).
This commit was SVN r25942.
2012-02-16 22:28:11 +00:00
Ralph Castain
b3aabf1565
Cleanup the --without-hwloc build. Thanks to Paul Hargrove for reporting it broken.
...
This commit was SVN r25931.
2012-02-15 11:08:57 +00:00
Ralph Castain
91977444af
Silence warnings
...
This commit was SVN r25929.
2012-02-15 03:42:27 +00:00
Ralph Castain
bba6508b4b
Handle the default hostfile case a little better...
...
This commit was SVN r25928.
2012-02-15 03:33:49 +00:00
Ralph Castain
f14c4be580
Correct the ordering logic so the list gets correctly built in daemon vpid order
...
This commit was SVN r25818.
2012-01-30 16:25:07 +00:00
Shiqing Fan
bfbd3c67a5
Add a windows file into the tarball.
...
This commit was SVN r25811.
2012-01-29 10:12:02 +00:00
Ralph Castain
a0edae52f2
Ensure the wrapper flags get entered in the right order, with -lpmi coming before the alps util libs
...
This commit was SVN r25809.
2012-01-27 20:56:21 +00:00
Ralph Castain
3f31feee6f
Handle the case where a user's rankfile specifies only cpus, and not socket:cpu pairs.
...
This commit was SVN r25803.
2012-01-27 12:21:45 +00:00
Ralph Castain
07f3a91075
Okay, get srun to play nice. Problem was that everything worked fine so long as the user did "salloc" with an argument requesting a specific number of nodes. However, if the user specified instead a number of processes, then we launched that number of daemons - resulting in multiple daemons/node. Not good.
...
So force things to behave correctly either way.
This commit was SVN r25792.
2012-01-26 19:58:57 +00:00
Ralph Castain
ef94e606c7
Add some debug
...
This commit was SVN r25791.
2012-01-26 19:23:32 +00:00
Ralph Castain
1449b27e9f
Ensure that slurm only launches one orted/node, regardless of how the allocation was obtained.
...
This commit was SVN r25790.
2012-01-26 19:23:15 +00:00
Jeff Squyres
64165ce758
r25775 removed the .windows from this directory, but left it in the Makefile.am.
This commit was SVN r25782.
The following SVN revision numbers were found above:
r25775 --> open-mpi/ompi@2c9a4beffd
2012-01-26 10:45:06 +00:00
Jeff Squyres
3751495443
Add missing arguments for the new DYLD_LIBRARY_PATH stuff.
...
This commit was SVN r25780.
2012-01-26 00:35:48 +00:00
Ralph Castain
079e4d9156
Per George's comment, just duplicate the lib path envars to provide both Linux and Mac compatible values
...
This commit was SVN r25776.
2012-01-25 14:37:36 +00:00
Shiqing Fan
2c9a4beffd
Add and remove a few components for windows build.
...
This commit was SVN r25775.
2012-01-25 09:01:27 +00:00
Ralph Castain
8b115754e6
Fix typo
...
This commit was SVN r25763.
2012-01-21 23:50:39 +00:00
Ralph Castain
469e40ace2
Expand the coverage a little when looking at remote shells for rsh. Prior patch (r25758) works only if both ends of the rsh/ssh connection are Mac. What we really want is to use the Mac version of ld_library_path when the remote end is Mac, regardless of the OS where mpirun is executing. So add a test for system type to the remote_shell test, and set the ld_library_path name to match the remote system type.
...
This commit was SVN r25762.
The following SVN revision numbers were found above:
r25758 --> open-mpi/ompi@1afb77e603
2012-01-21 23:48:42 +00:00
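The idea in r25762, choosing the library path variable from the remote system type and not from the local one, can be sketched as below. This is only an illustration, not the rsh launcher code. It assumes the remote type is available as the output of `uname` (which is `Darwin` on Mac), and both function names are hypothetical.

```python
def lib_path_var(remote_system):
    """Pick the library path variable for the remote end of the rsh/ssh link.

    Assumption: remote_system is the remote `uname` output. Mac reports
    "Darwin"; everything else gets the LD_LIBRARY_PATH name here.
    """
    if remote_system.strip() == "Darwin":
        return "DYLD_LIBRARY_PATH"
    return "LD_LIBRARY_PATH"


def export_prefix(remote_system, libdir):
    """Build a Bourne-shell prefix that prepends libdir to the chosen variable."""
    var = lib_path_var(remote_system)
    return f"{var}={libdir}:${var} ; export {var} ;"
```

The decision depends only on the remote side, so launching from Linux to a Mac node still yields the `DYLD_LIBRARY_PATH` form.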
Ralph Castain
1afb77e603
Mac requires setting DYLD_LIBRARY_PATH instead of the Linux standard LD_LIBRARY_PATH, so ensure we set that when using rsh to launch in Mac environments.
...
Thanks to Teng Lin for the patch!
This commit was SVN r25758.
2012-01-20 19:14:32 +00:00
Ralph Castain
be3dfb6a1a
Ensure that we only add -lpmi once to the wrapper compilers, no matter how many components might use it.
...
This commit was SVN r25753.
2012-01-20 04:56:38 +00:00
Ralph Castain
d7fe1615b6
Add missing dollar sign on variable
...
This commit was SVN r25745.
2012-01-19 20:45:22 +00:00
Ralph Castain
0d20f745e2
Remove stale function def
...
This commit was SVN r25744.
2012-01-19 20:40:48 +00:00
Nathan Hjelm
6d0e7a0a0e
don't enable ess/alps unless cnos is available
...
This commit was SVN r25743.
2012-01-19 19:36:00 +00:00
Ralph Castain
9d556e2f17
Allow daemons to use PMI to get their name where PMI support is available while using the standard grpcomm and other capabilities. Remove the GNI code from the alps ess component as that component should only be for alps/cnos installations.
...
This commit was SVN r25737.
2012-01-18 20:56:53 +00:00
Ralph Castain
6235a355de
Correctly handle co-spawning of daemons when attaching to a running job. We cannot use the general process mappers as we only want debugger daemons spawned on nodes where application procs already exist. So custom build the map for the debugger daemon job, and have the plm just launch that job without doing its usual vm-spawn step.
...
This commit was SVN r25736.
2012-01-18 00:19:49 +00:00
Nathan Hjelm
a2437feba7
removed debug message
...
This commit was SVN r25722.
2012-01-12 20:23:59 +00:00
Nathan Hjelm
5ab1674138
fixed de bruijn copyrights
...
This commit was SVN r25720.
2012-01-12 17:18:08 +00:00
Nathan Hjelm
c57f18999d
added Debruijn routed component
...
This commit was SVN r25717.
2012-01-12 17:11:03 +00:00
Ralph Castain
477582abef
Grrrr....fix ALL the cases where the membind warning occurs.
...
This commit was SVN r25715.
2012-01-11 23:51:18 +00:00