Edgar Gabriel
7d34770456
further bugfixes. The hierarchy detection works now as far as I can see (even in unsymmetric sitations). Bcast and reduce work as well. Still to test: the code which generates new local leader communicators, in case the root of the operation is not yet part of the lleader comm.
...
This commit was SVN r7772.
2005-10-15 19:36:54 +00:00
Edgar Gabriel
63554d245f
further bugfixes
...
This commit was SVN r7771.
2005-10-15 18:44:57 +00:00
Edgar Gabriel
92c7b77cbc
minor bug fixes
...
This commit was SVN r7770.
2005-10-15 18:32:40 +00:00
Edgar Gabriel
ba163c611c
checkpoint before moving to a real cluster. Most of the recoding should be
...
done. This version also doesn't break ompi (at least if its not chosen :-) ).
New features compared to the version from last Thursday (where bcast and
reduce seemed to work in most scenarios):
- clearer internal infrastructure
- ability to handle all root processes with a (hopefully) minimal number of
local leader communicators.
This commit was SVN r7769.
2005-10-15 17:04:01 +00:00
Jeff Squyres
e097ee635a
Silence compiler warnings.
...
This commit was SVN r7768.
2005-10-14 22:06:25 +00:00
Jeff Squyres
237bd4c6cd
Fix ompi_info -- cxx:bindings was somehow hard-coded to "yes" instead
...
of reflecting whether the C++ bindings were supported or not.
This commit was SVN r7766.
2005-10-14 20:07:05 +00:00
Jeff Squyres
f47c272986
Fix for the max-31-F90-symbol-limit problem: keep the interface names
...
the same (since those are both mandated by MPI and <31 characters),
but change some of the back-end subroutine names so that they are <31
characters and therefore obey the F90 standard. Remove an outdated /
useless (and confusing) script.
This commit was SVN r7764.
2005-10-14 19:50:30 +00:00
Edgar Gabriel
2c909383bb
abstracting the group_free operation into an internal routine (required
...
by some other components on ompi).
This commit was SVN r7763.
2005-10-14 18:51:20 +00:00
Edgar Gabriel
84c070fc0f
get rid of the different modes how to store the colorarray for now. Might be
...
reintroduced later as an optimization.
This commit was SVN r7762.
2005-10-14 18:11:21 +00:00
Edgar Gabriel
6d14440972
checkpoint for moving again to another machine. major rewrite to clean
...
up internal interfaces in progress.
This commit was SVN r7761.
2005-10-14 17:41:44 +00:00
Edgar Gabriel
770aeaf97b
modifications towards adding new local-leader communicators.
...
This commit was SVN r7760.
2005-10-14 12:18:29 +00:00
Graham Fagg
636b42afff
handle non existant recv buf in reduce for non root processes
...
(basic allreduce does this for mpi_in_place case)
This commit was SVN r7759.
2005-10-14 00:00:37 +00:00
Graham Fagg
61b8218d76
MPI_IN_PLACE fix for reduce.
...
(actually a work around for an optimisation in the reduce for not saving ops on the first recv of each segment)
Minor change in topo.
This commit was SVN r7758.
2005-10-13 23:38:21 +00:00
Edgar Gabriel
48f2563b4c
checkpoint. Moving to another machine.
...
This commit was SVN r7757.
2005-10-13 20:04:26 +00:00
Edgar Gabriel
4b05359b16
minor fixes when freeing the component
...
This commit was SVN r7756.
2005-10-13 18:22:16 +00:00
Edgar Gabriel
0a5a346bbb
first cut on the reduce operation.
...
This commit was SVN r7755.
2005-10-13 17:58:13 +00:00
Edgar Gabriel
30af775d40
further fixes. The first hierarchical MPI_Bcast works! Its just ~ 100 times slower then basic at the moment :-)
...
This commit was SVN r7754.
2005-10-13 17:34:42 +00:00
Edgar Gabriel
460b5cb840
further corrections to the hierarchy detection algorithms. It seems to work now as far as my tests show...
...
This commit was SVN r7753.
2005-10-13 16:21:13 +00:00
Jeff Squyres
9a25554559
Patch from Brooks Davis for some BSD compatibility issues.
...
This commit was SVN r7751.
2005-10-13 15:41:25 +00:00
Edgar Gabriel
f5d16419b2
fix in the logic regarding protocol detection.
...
This commit was SVN r7749.
2005-10-13 15:07:35 +00:00
Thara Angskun
8b59de0f37
Import RAS for POE
...
This commit was SVN r7748.
2005-10-13 14:08:17 +00:00
Brian Barrett
2a44ac2c99
* fix typo found by by Ken Mighell that kept buildpackage from working
...
properly if not root
* remove the --with-rsh LAM-ism
This commit was SVN r7747.
2005-10-13 13:14:12 +00:00
Thara Angskun
73fff4ea2c
- change from mca_base_param_register_* to mca_base_param_reg_*
...
- update include files / fix minor bugs
This commit was SVN r7746.
2005-10-13 12:58:31 +00:00
Jeff Squyres
1d2035cd85
Minor updates to HACKING -- update the list of GNU Auto tools to the
...
latest versions.
This commit was SVN r7744.
2005-10-13 11:16:34 +00:00
Josh Hursey
92429dc90f
Fix for a problem Edgar and Jeff identified WRT PLS determining if we are
...
oversubscribed on a node. And thus whether to call sched_yield or not.
The value of node->node_slots_inuse does not currently represent the number of
slots actually in use, at the moment. This is actually a bug in the RAS/RMAPS
base components, but the fix for that specific bug is bigger than we want to
address at the moment (but will certianly do so in the near future).
Since we cannot trust this value, use the total number of mapped processes
(which was properly set by the RMAPS component upon mapping -- Just not
properly propagated back to the registry's node segment) from the process
mapping.
In addition to this change I cleaned up a couple of the debug messages. It
seems that TM and RSH are the only two directly effected by this. SLURM
would be if that section of code wasn't currently inactive, but put the fix
in for prosparity.
This commit was SVN r7743.
2005-10-13 03:26:48 +00:00
Jeff Squyres
43b9b2c7da
Allow mca-params.conf files to not have a \n at the end of the file
...
This commit was SVN r7741.
2005-10-12 22:49:23 +00:00
Brian Barrett
c7d50fea54
* make sure all the compiler environment variables aren't set when building
...
so that a different compiler isn't accidently used.
If someone is really bored, this could go to the 1.0 branch...
This commit was SVN r7740.
2005-10-12 21:58:05 +00:00
Edgar Gabriel
5d7fbd9d2e
minor change in bml_r2_add_procs: the memory for the bml_endpoints structure
...
has to be allocated outside of the routine. Thus, the update version of pml/ob1/oml_ob1.c
This commit was SVN r7739.
2005-10-12 20:59:25 +00:00
Edgar Gabriel
3e5ad3e681
Updates
...
This commit was SVN r7738.
2005-10-12 20:56:29 +00:00
Tim Woodall
22f460bdc5
merge in changes from release branch
...
This commit was SVN r7737.
2005-10-12 20:24:43 +00:00
Brian Barrett
f899c59102
* don't error in open if we can't find a timebase. Just let the caller
...
know what's up (by returning 0 everywhere) and succeed.
This commit was SVN r7734.
2005-10-12 19:53:20 +00:00
Brian Barrett
cd96ffa41f
* back out Josh's change to linux timer. PowerPC gives you a reference base
...
timer, not a cpu timer, so looking at CPU frequency isn't what you want
to do.
This commit was SVN r7733.
2005-10-12 19:10:07 +00:00
Tim Woodall
6da9561ea8
merge in correction from v1.0
...
This commit was SVN r7732.
2005-10-12 16:40:52 +00:00
Jeff Squyres
c760d51cc2
Fixes for INTEGER and int max values; original problem report from
...
Brooks Davis. Now handle everything as hex, and we don't have to
worry about signed/unsigned implementations of integers in shell and
expr.
This commit was SVN r7729.
2005-10-12 16:08:17 +00:00
Tim Woodall
d859855dea
merge in changes from 1.0
...
This commit was SVN r7728.
2005-10-12 15:54:35 +00:00
Jeff Squyres
727a2cf8b2
Correct a few #if issues that George identified in a code review
...
This commit was SVN r7724.
2005-10-12 13:19:46 +00:00
Josh Hursey
62bd7849f4
Handle the timer for YellowDog's /proc/cpuinfo format
...
clock : 999MHz
This commit was SVN r7723.
2005-10-12 13:13:46 +00:00
Brian Barrett
6ee4116b22
* properly restore CPPFLAGS so that they don't accidently disappear down
...
the line
This commit was SVN r7718.
2005-10-12 03:59:48 +00:00
Brian Barrett
baa4c1eb58
* commit quick XGrid release note
...
This commit was SVN r7717.
2005-10-12 03:27:15 +00:00
Jeff Squyres
c568936a7c
Add support for MPI_OP_SUM, PROD, REPLACE with MPI_DOUBLE_COMPLEX.
...
Need to consult with George -- might also need to add support for
complex types on floats or long doubles...
This commit was SVN r7716.
2005-10-12 02:31:28 +00:00
Josh Hursey
0f08e87a1f
Fixed a max_slots off by one problem that Brian highlighted.
...
Also cleaned up the error message when allocating over the number of
slots available.
This commit was SVN r7715.
2005-10-12 02:09:56 +00:00
Jeff Squyres
01fa036645
Sync README with 1.0 branch
...
This commit was SVN r7714.
2005-10-12 01:14:49 +00:00
Ralph Castain
70779fa2ab
Cleanup some old logic - nothing major.
...
This commit was SVN r7712.
2005-10-12 01:12:27 +00:00
Brian Barrett
128389758f
* fix compile error in XGrid PLS that got introduced sometime in the not
...
too distant past
* work around apparently broken handling of max_slots somewhere along
the line by just setting it to 0
Both changes should go to the trunk.
This commit was SVN r7710.
2005-10-12 00:41:14 +00:00
George Bosilca
cb7b401ca8
Correct the send-recv operation.
...
This commit was SVN r7709.
2005-10-11 22:41:08 +00:00
Edgar Gabriel
25518b63c5
first version of coll_hierarch which does not crash the rest of the
...
library as long as its not selected :-)
This commit was SVN r7707.
2005-10-11 22:05:24 +00:00
Josh Hursey
ef51608a81
fix compiler warning
...
This commit was SVN r7706.
2005-10-11 22:03:21 +00:00
Edgar Gabriel
0675c22dab
updating with Jeff's help to the recent autogen/configure system
...
This commit was SVN r7705.
2005-10-11 21:50:16 +00:00
Josh Hursey
af9ccdf04a
need to use get_first instead of get_begin since we don't want to execute
...
this loop if "nodes" is an empty list. get_first, in this loop context,
allows us to do just that, while get_begin doesn't.
This fixes a --host problem that appeared on the Linux PPC64 build.
This commit was SVN r7703.
2005-10-11 21:33:04 +00:00
Edgar Gabriel
7b07dbc163
another round of fixes. Unfortunatly, I also have to provide a trivial
...
version of reduce and gather to make all this work....
This commit was SVN r7702.
2005-10-11 21:26:07 +00:00