Rich Graham b9520e61dc Get the sm optimized allreduce working for all but user-defined
operations.  Added to the reduction operations a set of reduction
functions that take two input buffers and one output buffer, to avoid
some extra memory copies.  These can't be used with user-defined
operations.  The Intel C collective suite passes with both the original
and the new functions (the new ones excluding user-defined operations).
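
To illustrate the idea behind the three-buffer reduction functions, here is a minimal sketch in C, assuming an integer sum. The function names (sum_int_2buf, sum_int_3buf, combine_with_copy) are hypothetical and are not the identifiers used in the commit. The two-buffer form mirrors the shape of an MPI user-defined operation, which receives only an input vector and an in/out vector; that is presumably why the three-buffer variants cannot be applied to user-defined operations.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical names for illustration; not the commit's actual functions. */

/* Two-buffer form: inout[i] = in[i] + inout[i].
 * The result overwrites one of the operands. */
static void sum_int_2buf(const int *in, int *inout, size_t count)
{
    for (size_t i = 0; i < count; ++i) {
        inout[i] += in[i];
    }
}

/* Three-buffer form: out[i] = in1[i] + in2[i].
 * Both operands are left untouched and the result lands
 * directly in the destination buffer. */
static void sum_int_3buf(const int *in1, const int *in2, int *out,
                         size_t count)
{
    for (size_t i = 0; i < count; ++i) {
        out[i] = in1[i] + in2[i];
    }
}

/* Combining a and b into result with only the two-buffer form
 * needs an extra copy of one operand into the destination first. */
static void combine_with_copy(const int *a, const int *b, int *result,
                              size_t count)
{
    memcpy(result, b, count * sizeof(int)); /* the extra memory copy */
    sum_int_2buf(a, result, count);
}
```

With the three-buffer form, a single call such as sum_int_3buf(a, b, result, count) produces the same values as combine_with_copy(a, b, result, count) without the intermediate memcpy.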

This commit was SVN r17901.
2008-03-20 23:51:16 +00:00
..
2008-03-05 00:55:43 +00:00
2008-03-18 03:03:33 +00:00
2008-03-05 12:22:34 +00:00