1
1
openmpi/orte/mca/plm
Ralph Castain 3f3b46495e Add some error checking to the tm launcher
This commit was SVN r21818.
2009-08-14 03:13:02 +00:00
..
alps - Based and updated from Ken's patch: since CLE-2.1 does not offer 2009-08-07 01:15:09 +00:00
base - Fix reference counting of daemons killed. 2009-08-12 14:04:50 +00:00
bproc The system headers are supposed to be protected by #ifdef and not by #if. 2009-07-16 18:27:33 +00:00
ccp Update related .windows files with proper compiling properties, in order to have a successful DSO build. 2009-08-12 08:55:58 +00:00
lsf Restore the original API to terminate individual processes instead of the entire job. This was originally removed as we didn't at that time know how to take advantage of it. Some of us are now working on proactive resilience methods that move procs prior to node failure, so this is now a required API. Modify the odls, plm, and orted functions to support this new functionality. 2009-07-13 02:29:17 +00:00
process Update related .windows files with proper compiling properties, in order to have a successful DSO build. 2009-08-12 08:55:58 +00:00
rsh Use a conditioned wait to serialize launches when they come from multiple sources (e.g., an orte application that spawns multiple jobs). 2009-07-20 01:51:29 +00:00
rshd Restore the original API to terminate individual processes instead of the entire job. This was originally removed as we didn't at that time know how to take advantage of it. Some of us are now working on proactive resilience methods that move procs prior to node failure, so this is now a required API. Modify the odls, plm, and orted functions to support this new functionality. 2009-07-13 02:29:17 +00:00
slurm The system headers are supposed to be protected by #ifdef and not by #if. 2009-07-16 18:27:33 +00:00
submit Restore the original API to terminate individual processes instead of the entire job. This was originally removed as we didn't at that time know how to take advantage of it. Some of us are now working on proactive resilience methods that move procs prior to node failure, so this is now a required API. Modify the odls, plm, and orted functions to support this new functionality. 2009-07-13 02:29:17 +00:00
tm Add some error checking to the tm launcher 2009-08-14 03:13:02 +00:00
tmd The system headers are supposed to be protected by #ifdef and not by #if. 2009-07-16 18:27:33 +00:00
xgrid Restore the original API to terminate individual processes instead of the entire job. This was originally removed as we didn't at that time know how to take advantage of it. Some of us are now working on proactive resilience methods that move procs prior to node failure, so this is now a required API. Modify the odls, plm, and orted functions to support this new functionality. 2009-07-13 02:29:17 +00:00
Makefile.am Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. 2008-02-28 01:57:57 +00:00
plm_types.h Continue implementation of resilient operations by supporting reuse of jobids for restarted procs. Ensure that restarted processes have valid node and local ranks, and that node rank values are passed to direct-launched processes. 2009-06-06 01:08:47 +00:00
plm.h Restore the original API to terminate individual processes instead of the entire job. This was originally removed as we didn't at that time know how to take advantage of it. Some of us are now working on proactive resilience methods that move procs prior to node failure, so this is now a required API. Modify the odls, plm, and orted functions to support this new functionality. 2009-07-13 02:29:17 +00:00