627533fe4a
The algorithm allows the user to specify the segment size used to overlap computation and communication. Its additional memory requirement is 2 x segment size. It performed well for very large message sizes over MX, and it passed the Intel Allreduce_c and Allreduce_loc_c tests. This commit was SVN r13832.

| Name |
|---|
| base |
| basic |
| demo |
| hierarch |
| libnbc |
| self |
| sm |
| tuned |
| coll.h |
| Makefile.am |
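
The commit message says the algorithm needs extra memory equal to 2 x segment size. One common way to get that footprint is double buffering: one segment is reduced while the next one arrives in the second buffer. The sketch below is a single-process illustration of that idea only, not the Open MPI implementation. The function name `segmented_sum`, the `int` element type, the sum operation, and the use of `memcpy` as a stand-in for a receive are all assumptions made for this example.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, not Open MPI code.
 * Adds `peer` into `acc` one segment at a time, using two temporary
 * buffers of `seg_count` elements each (2 x segment size in total).
 * memcpy stands in for receiving a segment from a neighbor; in a real
 * implementation that transfer would be nonblocking and would overlap
 * with the reduction of the previous segment.
 * Returns the number of extra bytes allocated, or 0 on invalid
 * arguments or allocation failure. */
size_t segmented_sum(int *acc, const int *peer, size_t count, size_t seg_count)
{
    if (count == 0 || seg_count == 0) {
        return 0;
    }

    size_t extra = 2 * seg_count * sizeof(int);
    int *tmp = malloc(extra);
    if (tmp == NULL) {
        return 0;
    }
    int *buf[2] = { tmp, tmp + seg_count };

    size_t nseg = (count + seg_count - 1) / seg_count;

    /* "Receive" the first segment before entering the loop. */
    size_t first = (count < seg_count) ? count : seg_count;
    memcpy(buf[0], peer, first * sizeof(int));

    for (size_t s = 0; s < nseg; s++) {
        size_t off = s * seg_count;
        size_t len = (count - off < seg_count) ? count - off : seg_count;
        int cur = (int)(s % 2);

        /* Start "receiving" the next segment into the other buffer. */
        if (s + 1 < nseg) {
            size_t noff = off + seg_count;
            size_t nlen = (count - noff < seg_count) ? count - noff : seg_count;
            memcpy(buf[1 - cur], peer + noff, nlen * sizeof(int));
        }

        /* Reduce the current segment into the accumulator. */
        for (size_t i = 0; i < len; i++) {
            acc[off + i] += buf[cur][i];
        }
    }

    free(tmp);
    return extra;
}
```

With a segment of 4 `int` elements, the call allocates `2 * 4 * sizeof(int)` bytes regardless of how long the message is. That is the trade-off the commit message describes: a smaller segment size means less extra memory but more, shorter transfers.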