Ralph Castain
|
a5e4dc6803
|
In accordance with prior releases, we are supposed to default to looking at the openmpi-default-hostfile as a default hostfile. Restore that behavior, but ignore the file if it is empty. Allow the user to ignore any MCA param setting pointing to a default hostfile by setting the param to "none" (via cmd line or whatever) - this allows them to override a setting in the system default MCA param file.
This commit was SVN r25851.
|
2012-02-01 17:40:44 +00:00 |
|
Ralph Castain
|
f14c4be580
|
Correct the ordering logic so the list gets correctly built in daemon vpid order
This commit was SVN r25818.
|
2012-01-30 16:25:07 +00:00 |
|
Shiqing Fan
|
bfbd3c67a5
|
Add a windows file into the tarball.
This commit was SVN r25811.
|
2012-01-29 10:12:02 +00:00 |
|
Ralph Castain
|
a0edae52f2
|
Ensure the wrapper flags get entered in the right order, with -lpmi coming before the alps util libs
This commit was SVN r25809.
|
2012-01-27 20:56:21 +00:00 |
|
Ralph Castain
|
3f31feee6f
|
Handle the case where a user's rankfile specifies only cpus, and not socket:cpu pairs.
This commit was SVN r25803.
|
2012-01-27 12:21:45 +00:00 |
|
Ralph Castain
|
61ac2bb11b
|
If no session directories are being created, then we cannot create the debugger attachment fifo - so don't complain about it.
This commit was SVN r25802.
|
2012-01-27 04:05:23 +00:00 |
|
Ralph Castain
|
07f3a91075
|
Okay, get srun to play nice. Problem was that everything worked fine so long as the user did "salloc" with an argument requesting a specific number of nodes. However, if the user specified instead a number of processes, then we launched that number of daemons - resulting in multiple daemons/node. Not good.
So force things to behave correctly either way.
This commit was SVN r25792.
|
2012-01-26 19:58:57 +00:00 |
|
Ralph Castain
|
ef94e606c7
|
Add some debug
This commit was SVN r25791.
|
2012-01-26 19:23:32 +00:00 |
|
Ralph Castain
|
1449b27e9f
|
Ensure that slurm only launches one orted/node, regardless of how the allocation was obtained.
This commit was SVN r25790.
|
2012-01-26 19:23:15 +00:00 |
|
Jeff Squyres
|
64165ce758
|
r25775 removed the .windows from this directory, but left it in the
Makefile.am.
This commit was SVN r25782.
The following SVN revision numbers were found above:
r25775 --> open-mpi/ompi@2c9a4beffd
|
2012-01-26 10:45:06 +00:00 |
|
Jeff Squyres
|
3751495443
|
Add missing arguments for the new DYLD_LIBRARY_PATH stuff.
This commit was SVN r25780.
|
2012-01-26 00:35:48 +00:00 |
|
Ralph Castain
|
079e4d9156
|
Per George's comment, just duplicate the lib path envars to provide both Linux and Mac compatible values
This commit was SVN r25776.
|
2012-01-25 14:37:36 +00:00 |
|
Shiqing Fan
|
2c9a4beffd
|
Add and remove a few components for windows build.
This commit was SVN r25775.
|
2012-01-25 09:01:27 +00:00 |
|
Ralph Castain
|
6db8c56cd4
|
Add local and node ranks to debugger daemon procs so the odls properly launches them
This commit was SVN r25774.
|
2012-01-25 03:17:10 +00:00 |
|
Ralph Castain
|
8b115754e6
|
Fix typo
This commit was SVN r25763.
|
2012-01-21 23:50:39 +00:00 |
|
Ralph Castain
|
469e40ace2
|
Expand the coverage a little when looking at remote shells for rsh. Prior patch (r25758) works only if both ends of the rsh/ssh connection are Mac. What we really want is to use the Mac version of ld_library_path when the remote end is Mac, regardless of the OS where mpirun is executing. So add a test for system type to the remote_shell test, and set the ld_library_path name to match the remote system type.
This commit was SVN r25762.
The following SVN revision numbers were found above:
r25758 --> open-mpi/ompi@1afb77e603
|
2012-01-21 23:48:42 +00:00 |
|
Ralph Castain
|
1afb77e603
|
Mac requires setting DYLD_LIBRARY_PATH instead of the Linux standard LD_LIBRARY_PATH, so ensure we set that when using rsh to launch in Mac environments.
Thanks to Teng Lin for the patch!
This commit was SVN r25758.
|
2012-01-20 19:14:32 +00:00 |
|
Ralph Castain
|
be3dfb6a1a
|
Ensure that we only add -lpmi once to the wrapper compilers, no matter how many components might use it.
This commit was SVN r25753.
|
2012-01-20 04:56:38 +00:00 |
|
Ralph Castain
|
d643882f89
|
Cleanup the alps configure logic so we only add the pmi support libs once
This commit was SVN r25747.
|
2012-01-19 22:10:03 +00:00 |
|
Ralph Castain
|
d7fe1615b6
|
Add missing dollar sign on variable
This commit was SVN r25745.
|
2012-01-19 20:45:22 +00:00 |
|
Ralph Castain
|
0d20f745e2
|
Remove stale function def
This commit was SVN r25744.
|
2012-01-19 20:40:48 +00:00 |
|
Nathan Hjelm
|
6d0e7a0a0e
|
don't enable ess/alps unless cnos is available
This commit was SVN r25743.
|
2012-01-19 19:36:00 +00:00 |
|
Ralph Castain
|
bf09133631
|
Correctly track the number of debugger daemons being spawned
This commit was SVN r25741.
|
2012-01-19 18:17:07 +00:00 |
|
Ralph Castain
|
9d556e2f17
|
Allow daemons to use PMI to get their name where PMI support is available while using the standard grpcomm and other capabilities. Remove the GNI code from the alps ess component as that component should only be for alps/cnos installations.
This commit was SVN r25737.
|
2012-01-18 20:56:53 +00:00 |
|
Ralph Castain
|
6235a355de
|
Correctly handle co-spawning of daemons when attaching to a running job. We cannot use the general process mappers as we only want debugger daemons spawned on nodes where application procs already exist. So custom build the map for the debugger daemon job, and have the plm just launch that job without doing its usual vm-spawn step.
This commit was SVN r25736.
|
2012-01-18 00:19:49 +00:00 |
|
Ralph Castain
|
11a37d3978
|
Fix the default
This commit was SVN r25733.
|
2012-01-17 21:09:27 +00:00 |
|
Ralph Castain
|
12d163293b
|
Yeah, I know it's the middle of the afternoon. I'm bound to forget and commit this in with something else if I don't. Per request from LANL, if PMI support is requested on an ALPS machine, add a couple of libs in the right ordering so that static builds will work correctly.
This commit was SVN r25732.
|
2012-01-17 20:41:50 +00:00 |
|
Ralph Castain
|
fd0d9f73c6
|
Make preload_binaries an MCA param so it can be set in the default MCA parameters for a system
This commit was SVN r25728.
|
2012-01-17 17:16:05 +00:00 |
|
Shiqing Fan
|
f57f873404
|
Disable the debugger support for Windows.
This commit was SVN r25725.
|
2012-01-17 16:21:33 +00:00 |
|
Nathan Hjelm
|
a2437feba7
|
removed debug message
This commit was SVN r25722.
|
2012-01-12 20:23:59 +00:00 |
|
Nathan Hjelm
|
5ab1674138
|
fixed de bruijn copyrights
This commit was SVN r25720.
|
2012-01-12 17:18:08 +00:00 |
|
Nathan Hjelm
|
c57f18999d
|
added Debruijn routed component
This commit was SVN r25717.
|
2012-01-12 17:11:03 +00:00 |
|
Ralph Castain
|
477582abef
|
Grrrr....fix ALL the cases where the membind warning occurs.
This commit was SVN r25715.
|
2012-01-11 23:51:18 +00:00 |
|
Ralph Castain
|
ce7ddd0e10
|
Create the debugger attach fifo unless the user requests that we periodically poll insteaad.
This commit was SVN r25714.
|
2012-01-11 19:44:22 +00:00 |
|
Ralph Castain
|
bf103de66c
|
My apologies for doing this outside of the usual time restrictions, but we need to get this in so we can make progress.
Move the ORTE-level debugger code back into orterun and out of the ORTE library to resolve symbol conflicts.
This commit was SVN r25713.
|
2012-01-11 15:53:09 +00:00 |
|
Ralph Castain
|
167ad944c4
|
Surprise, surprise - hwloc treats memory binding as at the thread, not process, level. Thus, hwloc always sets the membind proc-level support flag to false, and indicates actual memory binding support via the thread-level flag. So...just to be safe, test -both- flags and issue the "no support" warning ONLY if both are false.
This commit was SVN r25709.
|
2012-01-11 01:12:57 +00:00 |
|
Shiqing Fan
|
e3dfc49ced
|
make correct use of the newly updated structures in the Windows module.
This commit was SVN r25699.
|
2012-01-09 11:08:34 +00:00 |
|
Ralph Castain
|
840841bb8f
|
Missed a couple
This commit was SVN r25686.
|
2011-12-29 23:30:19 +00:00 |
|
Ralph Castain
|
af7fb68cfb
|
If we forward envars in rsh, then we have to be very careful about both duplicate entries and disallowed characters on the cmd line. To aid with detecting duplicates, make all cmd line options be given in their mca variant. Check anything we might add for semi-colons and protect those values with quotes.
This commit was SVN r25685.
|
2011-12-29 23:25:25 +00:00 |
|
Jeff Squyres
|
a4c8bb27fa
|
Pull in the MPIR_Breakpoint symbol via a dummy function in
debuggers_base_fns.c: orte_debugger_base_pull_mpir_breakpoint().
This commit was SVN r25660.
|
2011-12-15 18:39:34 +00:00 |
|
Ralph Castain
|
2dd2694f25
|
Fix comm_spawn in oversubscribed conditions. IF oversubscription is allowed, let nodes flow into the mapper even if they are oversubscribed, constrained by the slots_max absolute ceiling. Cleanup error messages when comm_spawn fails so it correctly and succintly reports the ereror.
This commit was SVN r25659.
|
2011-12-15 18:04:48 +00:00 |
|
Ralph Castain
|
437c52d2bf
|
Routing must be enabled by default
This commit was SVN r25657.
|
2011-12-15 17:13:52 +00:00 |
|
Ralph Castain
|
1adefcc176
|
When routing is not enabled, all routes must go direct
This commit was SVN r25656.
|
2011-12-15 15:32:09 +00:00 |
|
Ralph Castain
|
a309c53bf2
|
Set the lifeline when we are tree spawning under rsh so that the orted can self-terminate when its parent dies
This commit was SVN r25655.
|
2011-12-15 15:29:53 +00:00 |
|
Nathan Hjelm
|
9dec101043
|
fix totalview launch through --debug
This commit was SVN r25654.
|
2011-12-15 15:19:13 +00:00 |
|
Ralph Castain
|
e683b2f9c7
|
Minor touchup - reset the pointer to the end of the list each time to ensure we get the nodes in correct daemon order
This commit was SVN r25651.
|
2011-12-14 22:16:52 +00:00 |
|
Ralph Castain
|
912abe8a6c
|
Catch one more use-case
This commit was SVN r25649.
|
2011-12-14 21:03:19 +00:00 |
|
Ralph Castain
|
f531b09a8d
|
Correctly handle -host and -hostfile options. Ensure the initial vm launch constrains itself to the union of specified hosts if those options are given. Get oversubscribe set correctly for that case.
This commit was SVN r25648.
|
2011-12-14 20:01:15 +00:00 |
|
George Bosilca
|
ac26f58bd7
|
I guess this wasn't yet ready for prime time.
This commit was SVN r25624.
|
2011-12-12 23:55:11 +00:00 |
|
Nathan Hjelm
|
885d5cbcf8
|
enable ptmalloc with using uGNI
This commit was SVN r25621.
|
2011-12-12 20:52:51 +00:00 |
|