Rich Graham
a7c836a2b0
fix location of the restrict key word.
...
Make the tag in the fan-in/fan-out algorithm be fragment based.
This commit was SVN r17903.
2008-03-21 01:40:36 +00:00
Rich Graham
2c66d396b7
take care of some bit-rot with the fanin-fanout method.
...
This commit was SVN r17902.
2008-03-21 01:08:49 +00:00
Rich Graham
b9520e61dc
get the sm optimized allreduce working for all but user defined
...
operations. Added to the reduction operations a set of reduction
functions that take 2 input buffers and one output buffer to avoid
some extra memory copies. These can't be used with user defined
operations. The intel c collective suite passes both original, and
new (new, not the user defined operations).
This commit was SVN r17901.
2008-03-20 23:51:16 +00:00
Edgar Gabriel
570bbea5e0
fixing the allgather problem reported on the mailing list. The problem was
...
that at one locatin we had the local-size instead of the remote size as a
receive argument.
This commit was SVN r17849.
2008-03-17 19:42:18 +00:00
Rich Graham
27182afb67
get the timers in correctly.
...
This commit was SVN r17832.
2008-03-16 03:25:16 +00:00
Rich Graham
afcd1016fd
move temp buffer allocation out of the iteration loop - i.e. always use the
...
same temp loop. The algorithm is rather synchronous already...
This commit was SVN r17831.
2008-03-16 03:20:46 +00:00
Rich Graham
a1766b29f6
fix some barrier addressing errors.
...
This commit was SVN r17830.
2008-03-15 22:46:19 +00:00
Rich Graham
0453e7d2f4
bug in management memory allocation - too much memory allocated.
...
This commit was SVN r17829.
2008-03-15 18:12:20 +00:00
Rich Graham
3c2f1eb8bf
reduce the number of temp buffers used.
...
This commit was SVN r17828.
2008-03-15 17:23:04 +00:00
Rich Graham
0f9d642d51
temp buffer pointers are computed when they are set up. A bit more
...
efficient, but more important, it is much easier to play around with
memory layout now.
This commit was SVN r17827.
2008-03-15 16:36:35 +00:00
Rich Graham
e3e336b5ab
check point
...
This commit was SVN r17826.
2008-03-15 13:31:21 +00:00
Rich Graham
ebcf928c24
add some diagnostics.
...
This commit was SVN r17789.
2008-03-07 22:27:41 +00:00
Rich Graham
9131461511
move some test code to another machine.
...
This commit was SVN r17785.
2008-03-07 19:18:02 +00:00
Rich Graham
c230b65543
fix a couple of bugs. Recursive doubling seems to be working.
...
This commit was SVN r17777.
2008-03-07 02:51:38 +00:00
Rich Graham
70157166f9
checkpoint - compiles, now neeed to debug.
...
This commit was SVN r17775.
2008-03-07 00:39:59 +00:00
Rich Graham
4eace9d020
starting to implement recursive doubling algorithm.
...
This commit was SVN r17765.
2008-03-06 18:38:58 +00:00
Rich Graham
67ad9b6d6b
increase max data segments size.
...
This commit was SVN r17677.
2008-03-02 19:11:09 +00:00
Rich Graham
53126fa7bd
add calls to opal_progress()
...
This commit was SVN r17673.
2008-02-29 23:25:09 +00:00
Rich Graham
d37db14901
get the shared memory collectives working again with the new
...
version of orte.
This commit was SVN r17672.
2008-02-29 22:28:57 +00:00
Rich Graham
c253a7bda1
simplify the code abit.
...
This commit was SVN r17664.
2008-02-29 03:55:12 +00:00
Rich Graham
1632d8b299
revert to an older (not previosly checked in) version to get around a
...
regression.
This commit was SVN r17663.
2008-02-29 03:12:12 +00:00
Rich Graham
827e8d877e
fix bug in node type, and some memory copy optimizations.
...
This commit was SVN r17661.
2008-02-29 01:20:11 +00:00
Rich Graham
940d6732c9
remove compiler warnings.
...
This commit was SVN r17656.
2008-02-28 22:01:19 +00:00
Rich Graham
2b5fab9d51
avoid 0 byte malloc.
...
This commit was SVN r17653.
2008-02-28 21:11:42 +00:00
Rich Graham
4b26adef00
remove some debug output.
...
This commit was SVN r17650.
2008-02-28 20:54:35 +00:00
Rich Graham
5df6c6d043
fix several race conditions.
...
This commit was SVN r17645.
2008-02-28 19:40:19 +00:00
Ralph Castain
d70e2e8c2b
Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately.
...
Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer
This commit was SVN r17632.
2008-02-28 01:57:57 +00:00
Rich Graham
68aa691171
checkpoint work.
...
This commit was SVN r17620.
2008-02-27 14:56:36 +00:00
Rich Graham
b4bbb70bb7
got it all, but for the mem copies. Also, need to make sure volatile declarations are all inplace, as well as memory barriers.
...
This commit was SVN r17572.
2008-02-25 00:16:21 +00:00
Rich Graham
2d8c2420e8
checkpoint.
...
This commit was SVN r17571.
2008-02-24 20:54:16 +00:00
Rich Graham
771584bff5
generate reduction tree.
...
This commit was SVN r17569.
2008-02-24 03:25:40 +00:00
Rich Graham
b9bb78484d
a bit of omptimization.
...
This commit was SVN r17528.
2008-02-20 16:19:49 +00:00
Rich Graham
09afc36f5f
correct addressing.
...
This commit was SVN r17519.
2008-02-20 01:12:43 +00:00
Rich Graham
b87b15580c
fix memory allocation error. Initialize pointer.
...
This commit was SVN r17514.
2008-02-19 20:01:42 +00:00
Rich Graham
1cd8a2e578
checkpoint - works for 2 procs, but not more.
...
This commit was SVN r17477.
2008-02-17 05:21:58 +00:00
Rich Graham
8006927ae8
free buffer, rather than ask for another one, when done with the memory.
...
This commit was SVN r17468.
2008-02-15 04:21:58 +00:00
Rich Graham
2277b47ab9
register mca_coll_sm2_allreduce_intra - function still does not do any
...
reduction operations.
This commit was SVN r17467.
2008-02-15 04:13:00 +00:00
Rich Graham
9b0687e6df
add buffer allocation and deallocation calls to the allreduce routine, so
...
I can start debugging the memory management code. The allreduce fucntion
does nothing at this stage.
This commit was SVN r17466.
2008-02-15 03:59:14 +00:00
Rich Graham
41943dbd76
adding missing files.
...
This commit was SVN r17462.
2008-02-15 00:59:28 +00:00
Rich Graham
41f4b06b39
buffer allocate/release code is fully written, and compiles. Now need to debug.
...
This commit was SVN r17461.
2008-02-15 00:57:44 +00:00
Rich Graham
7cc58768cd
checkpoint something that compiles
...
This commit was SVN r17460.
2008-02-15 00:33:14 +00:00
Rich Graham
292d930eea
check point.
...
This commit was SVN r17457.
2008-02-14 20:00:26 +00:00
Edgar Gabriel
77057a50a3
- adding the two-level hierarchy detection algorithm
...
- minor fix in the temporary collectives
- removing the symmetric parameter, since it didn't really make sense.
This commit was SVN r17359.
2008-02-01 17:11:36 +00:00
Rich Graham
fda485ff9c
backing file is allocated and deallocated.
...
This commit was SVN r17358.
2008-02-01 15:26:20 +00:00
Rich Graham
165fc3f8cc
memory allocation implemented and debugged. Still need to finish
...
file allocation/dealocation and control information initialization.
This commit was SVN r17291.
2008-01-29 03:09:12 +00:00
Rich Graham
e24c2ebbc0
have a working skeleton for the SM-V2 component. It does nothing at this stage.
...
This commit was SVN r17241.
2008-01-25 21:16:36 +00:00
Rich Graham
1d0334f4f2
skeleton for new shared memory collective component.
...
This commit was SVN r17235.
2008-01-25 19:35:26 +00:00
Rich Graham
432ba0cecd
add comments about the life-cycle of a collective module.
...
This commit was SVN r17223.
2008-01-25 03:46:31 +00:00
George Bosilca
31390c0074
We should take in account the extent of the datatype when we compute
...
the initial displacement in bytes. Thanks to Daniel G. Hyams for the fix.
This commit was SVN r17165.
2008-01-19 05:34:53 +00:00
George Bosilca
3fca3973d3
The PTLs are now long gone !!!
...
This commit was SVN r17104.
2008-01-10 00:18:45 +00:00