Ralph Castain
a0ef517550
Fix some errors in the bproc components that prevented compiling. Thought I had already done this, but either those changes were lost when I did the merge, or my old man's memory is fading....
...
Whaz-at??? :-)
This commit was SVN r12874.
2006-12-15 19:40:04 +00:00
Ralph Castain
64ec238b7b
Repair support for Bproc 4 on 64-bit systems. Update the SMR framework to actually support the begin_monitoring API. Implement the get/set_node_state APIs.
...
This commit was SVN r12864.
2006-12-15 02:34:14 +00:00
Ralph Castain
9bc25f0bec
Fix a potential bug in the registry where it didn't fully check a segment's name when searching for it. Will have to verify that this doesn't break other things.
...
Bring the bproc system close to being back online....
This commit was SVN r12659.
2006-11-23 04:17:37 +00:00
Ralph Castain
b1ff5fe868
Move the name of the bproc common segment to the central schema location - avoids conflicts when bproc 3 components try to build
...
This commit was SVN r12654.
2006-11-22 20:23:17 +00:00
Ralph Castain
428c1f14c3
Modify the bproc components to resolve the current allocation problem
...
This commit was SVN r12652.
2006-11-22 19:10:58 +00:00
Ralph Castain
050e401671
Simplify - the cellid is simply a field in the process name. Recently, we decided to just directly access it and get rid of extraneous function calls.
...
This commit was SVN r12638.
2006-11-21 09:59:24 +00:00
Galen Shipman
c06b740220
a few more opps to get the cellid...
...
This fixes a compilation error on bproc machines..
This commit was SVN r12631.
2006-11-20 19:45:01 +00:00
Ralph Castain
37dfdb76eb
Here is the major MAD-cure commit. I have written plenty about it, so I refer you here to those messages for a description of everything that was done.
...
This commit was SVN r11661.
2006-09-14 21:29:51 +00:00
Ralph Castain
73a7916946
For Ollie...fix a few names. Should help the Bproc SMR component compile.
...
This commit was SVN r11284.
2006-08-21 15:11:20 +00:00
Ralph Castain
8c7f0ed9ae
Change the SOH to the new State Monitoring and Reporting (SMR) framework. New API's will be appearing in the new framework shortly - this just gets the name change into the system.
...
Other changes:
1. Remove the old xcpu components as they are not functional.
2. Fix a "bug" in orterun whereby we called dump_aborted_procs even when we normally terminated. There is still some kind of bug in this procedure, however, as we appear to be calling the orterun job_state_callback function every time a process terminates (instead of only once when they have all terminated). I'll continue digging into that one.
This will require an autogen/configure, I'm afraid.
This commit was SVN r11228.
2006-08-16 16:35:09 +00:00