7bab4ed269
- finally added "in use" flags -- one flag protects a set of segments - these flags now used in bcast to protect (for example) when a message is so long that the root loops around the segments and has to re-use old segments -- now it knows that it has to wait until the non-root processes have finished with that set of segments before it can start using them - implement allreduce as a reduce followed by a bcast (per discussion with rich) - removed some redundant data on various data structures - implemented query MCA param ("coll_sm_shared_mem_used_data") that tells you how much shared memory will be used for a given set of MCA params (e.g., number of segments, etc.). For example: ompi_info --mca coll_sm_info_num_procs 4 --param coll sm | \ grep shared_mem_used_data tells you that for the default MCA param values (as of r7172), for 4 processes, sm will use 548864 bytes of shared memory for its data transfer section - remove a bunch of .c files from the Makefile.am that aren't implemented yet (i.e., all they do is return ERR_NOT_IMPLEMENTED) Now on to the big Altix to test that this stuff really works... This commit was SVN r7205. The following SVN revision numbers were found above: r7172 --> open-mpi/ompi@bc72a7722b |
||
---|---|---|
.. | ||
.ompi_ignore | ||
.ompi_unignore | ||
coll_sm_allgather.c | ||
coll_sm_allgatherv.c | ||
coll_sm_allreduce.c | ||
coll_sm_alltoall.c | ||
coll_sm_alltoallv.c | ||
coll_sm_alltoallw.c | ||
coll_sm_barrier.c | ||
coll_sm_bcast.c | ||
coll_sm_component.c | ||
coll_sm_exscan.c | ||
coll_sm_gather.c | ||
coll_sm_gatherv.c | ||
coll_sm_module.c | ||
coll_sm_reduce_scatter.c | ||
coll_sm_reduce.c | ||
coll_sm_scan.c | ||
coll_sm_scatter.c | ||
coll_sm_scatterv.c | ||
coll_sm.h | ||
configure.params | ||
help-coll-sm.txt | ||
Makefile.am | ||
memory-layout.ppt |