openmpi/ompi/mca/coll
Nathan Hjelm 9c788ff940 coll/basic: fix segmentation fault in neighborhood collectives if the degree
of the topology is higher than the communicator size

It is possible for the degree of a topology to be higher than the size of the
communicator, for example with a periodic Cartesian communicator created on
MPI_COMM_SELF. This leaves the neighborhood collectives with a request buffer
that is too small. This commit adds a call that dynamically increases the size
of the request buffer when it is too small.
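
The grow-on-demand idea described above can be illustrated with a minimal, self-contained sketch. This is not the code from the commit: `req_buffer_t`, `req_buffer_reserve`, and `cart_degree` are hypothetical names, `int` stands in for a real request handle, and the "two requests per neighbor" sizing used in the note after the block is an assumption made for illustration.

```c
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a per-communicator request buffer.
 * A real implementation would store request handles, not ints. */
typedef struct {
    int    *reqs;   /* allocated slots */
    size_t  count;  /* number of slots currently allocated */
} req_buffer_t;

/* Make sure the buffer holds at least `needed` slots, growing it if
 * it is too small. Returns the slot array, or NULL if the allocation
 * fails (the old buffer is left untouched in that case). */
static int *req_buffer_reserve(req_buffer_t *buf, size_t needed)
{
    if (needed > buf->count) {
        int *tmp = realloc(buf->reqs, needed * sizeof *tmp);
        if (NULL == tmp) {
            return NULL;
        }
        buf->reqs  = tmp;
        buf->count = needed;
    }
    return buf->reqs;
}

/* Number of neighbors of a process in a fully periodic Cartesian
 * topology with `ndims` dimensions: one in each direction along
 * every dimension, regardless of the communicator size. */
static size_t cart_degree(int ndims)
{
    return 2 * (size_t) ndims;
}
```

Under these assumptions, a buffer first sized for a one-process communicator (say, 2 slots) is too small for a one-dimensional periodic topology on that same communicator: `cart_degree(1)` is 2, so one send and one receive per neighbor needs 4 slots. Calling `req_buffer_reserve(&buf, 2 * cart_degree(1))` before posting the requests grows the buffer instead of writing past its end.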

A better fix would be to create the topology *before* calling the coll_select
routine on a communicator. This will take some discussion, and the solution is
unlikely to be ready anytime soon.

Thanks to Lisandro Dalcin for reporting this.

Original thread: http://www.open-mpi.org/community/lists/devel/2014/08/15713.php

cmr=v1.8.3:reviewer=jsquyres

This commit was SVN r32796.
2014-09-25 17:43:29 +00:00
..
base check-help-strings cleanup 2014-08-11 03:27:45 +00:00
basic coll/basic: fix segmentation fault in neighborhood collectives if the degree 2014-09-25 17:43:29 +00:00
cuda Fix missing help file. 2014-08-18 21:52:31 +00:00
demo The bulk of the remaining renaming changes, in one final glorious "blob". Thanks to Jeff for some help chasing down a few spots. Per chat with Jeff, we decided to cleanup a few things that were historical in nature: 2014-05-07 21:48:53 +00:00
fca fca: adopt opal API refactoring, fix warning. 2014-08-09 15:50:51 +00:00
hcoll HCOLL: fix misplaced hcoll_init return value check. 2014-07-22 18:47:34 +00:00
hierarch Some more cleanups. Remove direct references to ORTE by changing OMPI_CAST_ORTE_NAME -> OMPI_CAST_RTE_NAME. Ensure that ORTE tools (mpirun, orted, tools) set the OPAL proc structure fields so OPAL knows what is going on and uses the correct print functions (still need to fix the problem for non-MPI apps). Properly return uint32_t from the opal utilities instead of int32_t as that is what the ORTE process name fields contain. 2014-08-01 14:44:11 +00:00
inter This commit fixes some of the Coverity reported warnings. I addressed 2014-04-06 18:23:49 +00:00
libnbc Fix typo. 2014-09-18 13:58:54 +00:00
ml coll/ml: better error handling 2014-09-01 10:00:49 +00:00
portals4 Portals fixes after latest move 2014-07-28 19:25:03 +00:00
self Fix longstanding issue with our multi-project support. Rather than using 2014-01-07 22:11:15 +00:00
sm Some more cleanups. Remove direct references to ORTE by changing OMPI_CAST_ORTE_NAME -> OMPI_CAST_RTE_NAME. Ensure that ORTE tools (mpirun, orted, tools) set the OPAL proc structure fields so OPAL knows what is going on and uses the correct print functions (still need to fix the problem for non-MPI apps). Properly return uint32_t from the opal utilities instead of int32_t as that is what the ORTE process name fields contain. 2014-08-01 14:44:11 +00:00
tuned COLL/TUNED: alltoall - return previous default values of algorithm choosing decision thresholds (were changed by r32735) 2014-09-18 08:07:51 +00:00
coll.h MPI-3: Add support for neighborhood collectives 2013-09-26 21:55:08 +00:00
Makefile.am Fix longstanding issue with our multi-project support. Rather than using 2014-01-07 22:11:15 +00:00