Nathan Hjelm
f68c315188
pml/ob1: add missing ompi_request_wait_completion for buffered sends
...
This commit adds a call to ompi_request_wait_completion for buffered
sends. Without this line it is possible to get into a state where the
data is never sent.
Fixes open-mpi/ompi#1185
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-12-07 22:28:07 -07:00
Gilles Gouaillardet
bfe8e03d9d
fcoll/two_phase: use ompi_mpi_abort instead of PMPI_Abort
...
Thanks Jeff for the review
2015-12-07 11:34:36 +09:00
Gilles Gouaillardet
ef03bc726c
ompi: fix comment in ompi/mpi/c/Makefile.am
...
Thanks Jeff for the review
2015-12-07 11:34:01 +09:00
Gilles Gouaillardet
37c978f5e9
coll/libnbc: correctly handle changed types.
...
this fixes open-mpi/ompi@d816d1c194
thanks Jeff for the review
2015-12-07 10:13:43 +09:00
George Bosilca
3a9664ac9d
Fix Coverity CIDs 1341584-1341589.
2015-12-06 14:06:36 -05:00
Jeff Squyres
ad35a363fa
Merge pull request #1179 from jsquyres/pr/mpi-testsome-man-page-update
...
Pr/mpi testsome man page update
2015-12-04 05:55:33 -05:00
bosilca
8fee96c086
Merge pull request #1091 from bosilca/topic/datatype_span
...
Cleanup the temporary memory allocation in collectives
2015-12-03 19:25:04 -05:00
Jeff Squyres
0adcb5b5cd
MPI_Testsome.3in: wrap some long lines
...
Wrap some long lines; no other text or semantics changes.
2015-12-03 17:06:43 -05:00
Jeff Squyres
11c571b568
MPI_Testsome.3in: add explicit verbiage about return values
...
Instead of solely relying on the out value definitions in
MPI_Waitsome.3, explicitly copy this text here.
Note that the original text in this man page was copied verbatim from
the MPI spec; we've now added a bit more text (copied from
MPI_Waitsome.3in) that explains the out values so that users don't
have to cross-reference to another man page.
Thanks to Eric Schnetter for the suggestion.
Fixes open-mpi/ompi#1153
2015-12-03 17:06:22 -05:00
Gilles Gouaillardet
a5440ade5f
topo/treematch: do not invoke hwloc_topology_{init,load}
...
* this is not necessary
* this overwrites existing topology, that could be different if hwloc_base_topo_file is used
2015-12-03 11:24:32 +09:00
George Bosilca
688108cf7f
Patch submitted by @ggouaillardet on ticket #1091.
2015-12-02 20:42:18 -05:00
George Bosilca
4d00c59b2e
Cleanup the memory handling for temporary buffers in
...
some of the collective modules. Added a new function
opal_datatype_span, to compute the memory span of
count number of datatype, excluding the gaps in the
beginning and at the end. If a memory allocation is
made using the returned value, the gap (also returned)
should be removed from the allocated pointer.
2015-12-02 20:42:18 -05:00
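The span computation described in the commit above can be sketched as follows. This is a minimal illustration, not the Open MPI implementation: `sketch_datatype_t` and `sketch_datatype_span` are hypothetical names, and the struct keeps only the four bounds the calculation needs (the full extent, which includes padding, and the "true" extent, which covers only the bytes that hold data).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified datatype description (not opal_datatype_t). */
typedef struct {
    ptrdiff_t lb, ub;           /* bounds including any leading/trailing gaps */
    ptrdiff_t true_lb, true_ub; /* bounds of the bytes that actually hold data */
} sketch_datatype_t;

/*
 * Return the number of bytes needed to hold `count` consecutive elements,
 * excluding the gap before the first element's data and after the last
 * element's data. The leading gap is returned through `gap`.
 */
ptrdiff_t sketch_datatype_span(const sketch_datatype_t *dt, size_t count,
                               ptrdiff_t *gap)
{
    assert(NULL != dt && NULL != gap);
    if (0 == count) {
        *gap = 0;
        return 0;
    }
    ptrdiff_t extent = dt->ub - dt->lb;                /* stride between elements */
    ptrdiff_t true_extent = dt->true_ub - dt->true_lb; /* data bytes of one element */
    *gap = dt->true_lb;
    /* count - 1 full strides, plus the data bytes of the last element */
    return true_extent + (ptrdiff_t)(count - 1) * extent;
}
```

Under these assumptions, a caller would allocate the returned number of bytes and, as the commit message says, subtract the gap from the allocated pointer before using it as the buffer address, so that the first data byte lands at the start of the allocation.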
Gilles Gouaillardet
351bd03249
ompi_proc_sentinel_to_name: clear the top left bit
2015-12-02 17:18:56 +09:00
Jeff Squyres
15325c8094
op/x86: change the owner to Ralph
...
Cisco no longer cares about this component, but Intel might.
Transferring ownership to Ralph.
2015-12-01 15:08:07 -08:00
igor-ivanov
d8c85738ab
Merge pull request #1151 from igor-ivanov/pr/opal-abort-vars
...
Add new mca variables opal_abort_delay and opal_abort_print_stack
2015-12-01 16:27:11 +04:00
Ryan Grant
324534b191
Merge pull request #1161 from tkordenbrock/topic/add.triggered.scatter
...
coll-portals4: add scatter and iscatter implementations that use Portals4 triggered operations
2015-11-30 16:53:47 -07:00
Todd Kordenbrock
4721b70dd5
coll-portals4: add scatter and iscatter implementations that use Portals4 triggered operations
...
This commit adds implementations of scatter and iscatter using
Portals4 triggered operations. Currently, the only algorithm
is linear.
2015-11-30 15:07:18 -06:00
Todd Kordenbrock
f6f525e0d8
coll-portals4: remove unneeded code from gather
...
This commit removes two pieces of unneeded code from gather. First
it removes destroy_tree() calls from linear_top(), because the
linear algorithm does not create a tree, so there is no need to
destroy it. Second it removes unpack_bytes from the gather request
because it was calculated but never used.
2015-11-30 10:38:51 -06:00
Gilles Gouaillardet
80f02518ff
topo/base: correctly free the topo object in mca_topo_base_dist_graph_create_adjacent
2015-11-30 15:33:59 +09:00
Gilles Gouaillardet
8227bc6320
ompi_proc_find_and_add: use ompi_proc_allocate in order to update *both* ompi_proc_list and ompi_proc_hash
2015-11-30 14:00:59 +09:00
igor.ivanov@itseez.com
c15bf147bf
opal: Add opal_abort_print_stack mca variable with aliases for ompi/oshmem
...
This commit allows control of the output during abnormal oshmem/ompi
application termination.
Fixed an issue in backtrace output: HAVE_BACKTRACE was never set, so the user
had limited control over this variable.
Two related mca variables are moved to the opal layer. Corresponding aliases
are added for ompi and oshmem.
2015-11-25 18:18:33 +02:00
Ryan Grant
81d482dca6
Merge pull request #1137 from francois-wellenreiter/trig_mtl_rdv
...
MTL portals4 : improve the rendez-vous protocol using PtlTriggeredGet…
2015-11-24 17:31:31 -07:00
Ryan Grant
219581e87e
Merge pull request #1090 from tkordenbrock/topic/check.for.invalid.handles.in.finalize
...
mtl-portals4: test for valid handle before releasing resources
2015-11-20 07:54:44 -06:00
Mike Dubman
c544620a7c
Merge pull request #1138 from igor-ivanov/pr/yalla-valgrind
...
yalla: fix valgrind error due to uninitialized status field.
2015-11-20 07:19:11 -05:00
Gilles Gouaillardet
002c7b8b3a
fcoll/two_phase: use PMPI_* instead of MPI_*
2015-11-20 13:46:19 +09:00
Gilles Gouaillardet
561e7f6647
vprotocol/pessimist: use internal ompi_* instead of MPI_*
2015-11-20 13:46:19 +09:00
Gilles Gouaillardet
025fd8a9fc
osc: use PMPI_* instead of MPI_*
2015-11-20 13:46:19 +09:00
Gilles Gouaillardet
d816d1c194
coll/libnbc: use PMPI_* and internal ompi_* instead of MPI_*
2015-11-20 13:46:19 +09:00
yosefe
3bb1270715
yalla: fix valgrind error due to uninitialized status field.
2015-11-19 10:59:31 +02:00
Francois WELLENREITER
9126ea5e82
MTL portals4 : improve the rendez-vous protocol using PtlTriggeredGet operation
2015-11-19 09:52:53 +01:00
Edgar Gabriel
9e5ade4e8b
argh, a debugging sleep statement got into the source code.
2015-11-16 13:26:57 -06:00
Edgar Gabriel
dbfbcdecd5
make adjustments for the default settings of grouping parameters and the default contiguous group size option.
...
minor bug fix in the simple grouping strategy.
2015-11-16 08:17:27 -06:00
Edgar Gabriel
27628774c7
add a new option for a simple aggregator selection which has zero communication costs.
2015-11-16 08:17:26 -06:00
Edgar Gabriel
66c1ea5fcb
change the default value of the grouping option. Also add new grouping option which skips the refinement step in the aggregator selection.
2015-11-16 08:17:23 -06:00
Edgar Gabriel
e8e117503d
reduce the communication volume during MPI_File_set_view
2015-11-16 08:17:22 -06:00
yohann
005400a937
mtl/ofi: Make sure the resources are managed by the provider.
2015-11-13 16:16:58 -08:00
Nathan Hjelm
9ef0821856
osc/rdma: fix some threading bugs
...
There were two bugs in osc/rdma when using threads:
- Deadlock in ompi_osc_rdma_start_atomic. This occurs because
ompi_osc_rdma_frag_alloc is called with the module lock. To fix the
issue the module lock is now recursive. In the future I will add a
new lock to protect just the current rdma fragment.
- Do not drop the lock in ompi_osc_rdma_frag_alloc when calling
ompi_osc_rdma_frag_complete. Not only is it not needed but dropping
the lock at this point can cause a competing thread to mess up the
state.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-11-12 20:25:57 -07:00
Yossi
b750b72a81
Merge pull request #1127 from yosefe/topic/pml-ucx-implement-cancel
...
pml_ucx: implement cancel, and add small optimizations.
2015-11-12 10:50:48 +02:00
yosefe
7becc54d67
pml_ucx: fix typo.
2015-11-12 09:57:41 +02:00
Todd Kordenbrock
b9630f802b
Merge pull request #1120 from francois-wellenreiter/mtl_min_mdbind
...
mtl-portals4 : remove useless PtlMDBind PtlMDRelease calls for rendez-vous messages
2015-11-10 14:34:19 -06:00
yosefe
d66b01d380
pml_ucx: implement cancel, and add small optimizations.
2015-11-10 17:40:06 +02:00
Gilles Gouaillardet
d6ff25b9a2
pml/monitoring: initialize common symbols
2015-11-10 13:58:54 +09:00
Jeff Squyres
e89ecac83c
bml r2: fix exclusivity comparison
...
Fixes open-mpi/ompi#1106
2015-11-06 13:26:32 -08:00
Francois WELLENREITER
b301b49a40
MTL portals4 : remove useless PtlMDBind PtlMDRelease calls for rendez-vous messages
2015-11-06 15:55:44 +01:00
Ralph Castain
bfdf08ae86
Fix intercomm_create by ensuring that both sides know how to translate jobid to/from nspace
...
Return something just to ensure that pack is happy
2015-11-06 02:19:45 -08:00
Nathan Hjelm
fda5daf453
Merge pull request #1096 from kawashima-fj/pr/fortran-var-type-fix
...
Fix Fortran variable types
2015-11-05 14:27:40 -07:00
Nathan Hjelm
acf3cb9b9b
Merge pull request #1095 from kawashima-fj/pr/trivial-fixes
...
Some trivial fixes
2015-11-04 09:45:59 -07:00
yosefe
45c3d04857
pml_ucx: fix request construct/destruct.
...
We should invoke OBJ_CONSTRUCT/OBJ_DESTRUCT only on regular requests
(which are embedded inside UCX requests) and for the completed request.
Persistent requests are already constructed/destructed by the free list.
This fixes an assertion in ompi_request_destruct.
2015-11-04 11:03:46 +02:00
KAWASHIMA Takahiro
c09f9f05d3
mpi/tool: Fix an incorrect type cast.
...
This bug caused an invalid result value on `MPI_T_cvar_read`
on big-endian machines or for large (>=2Gi) cvar values.
2015-11-04 11:28:43 +09:00
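The commit above does not show the offending cast, so the following is a generic illustration of how a width-mismatched access can produce both symptoms it mentions, not a reconstruction of the actual code. The `sketch_*` functions are hypothetical, and the sketch assumes `int` is narrower than `unsigned long long`. Reading only the first `sizeof(int)` bytes of a wider value picks up the low-order bytes on little-endian machines and the high-order bytes on big-endian ones, and a value of 2Gi or more does not fit in a signed 32-bit `int` in either case.

```c
#include <assert.h>
#include <string.h>

/*
 * Buggy pattern: the storage holds an unsigned long long, but it is read
 * as if it held an int, so only the first sizeof(int) bytes are used.
 */
int sketch_read_as_int(const void *storage)
{
    int out;
    assert(sizeof(int) < sizeof(unsigned long long));
    memcpy(&out, storage, sizeof(out));
    return out;
}

/* Correct pattern: read the value with the type it was stored as. */
unsigned long long sketch_read_as_ull(const void *storage)
{
    unsigned long long out;
    memcpy(&out, storage, sizeof(out));
    return out;
}
```

For a stored value of `1ULL << 31`, `sketch_read_as_ull` returns the value unchanged, while widening the result of `sketch_read_as_int` back to `unsigned long long` yields a different number on both byte orders.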
KAWASHIMA Takahiro
384f4b51d1
fortran: Fix: missing dimension(*) in (I)NEIGHBOR_ALLTOALLW.
2015-11-04 10:38:25 +09:00