d2921a9d42
- utilizing coll_tuned_util functions - setting line length to 80. This implementation uses standard send messages (instead of synchronous ones). The change improved our performance over MX multiple number of times, however, there exists a small potential that last message to be sent can be delayed (until next mpi call, which means potentially infinitely). If this shows to be a problem, I will modify the algorithms to use synchronous send as last operation (which will incur performance penalty again). This commit was SVN r13071. |
||
---|---|---|
.. | ||
base | ||
basic | ||
demo | ||
hierarch | ||
libnbc | ||
self | ||
sm | ||
tuned | ||
coll.h | ||
Makefile.am |