Ralph Castain
bfdf08ae86
Fix intercomm_create by ensuring that both sides know how to translate jobid to/from nspace
...
Return something just to ensure that pack is happy
2015-11-06 02:19:45 -08:00
Nathan Hjelm
fda5daf453
Merge pull request #1096 from kawashima-fj/pr/fortran-var-type-fix
...
Fix Fortran variable types
2015-11-05 14:27:40 -07:00
Nathan Hjelm
acf3cb9b9b
Merge pull request #1095 from kawashima-fj/pr/trivial-fixes
...
Some trivial fixes
2015-11-04 09:45:59 -07:00
yosefe
45c3d04857
pml_ucx: fix request construct/destruct.
...
We should invoke OBJ_CONTRUCT/OBJ_DESTRUCT only on regular requests
(which are embedded inside UCX requests) and for the completed request.
Persistent requests are already constructed/destructed by the free list.
This fixes an assertion in ompi_request_destruct.
2015-11-04 11:03:46 +02:00
KAWASHIMA Takahiro
c09f9f05d3
mpi/tool: Fix an incorrect type cast.
...
This bug caused an invalid result value on `MPI_T_cvar_read`
on big-endian machines or for large (>=2Gi) cvar values.
2015-11-04 11:28:43 +09:00
KAWASHIMA Takahiro
384f4b51d1
fortran: Fix: missing dimension(*)
in (I)NEIGHBOR_ALLTOALLW
.
2015-11-04 10:38:25 +09:00
KAWASHIMA Takahiro
1092eabfab
fortran: Update comment.
...
The structure was changed in commit 9c77c6b.
2015-11-04 10:38:25 +09:00
KAWASHIMA Takahiro
107c0073dd
fortran: Fix: MPI_UNWEIGHTED
and MPI_WEIGHTS_EMPTY
should be arrays.
...
Without this modification, gfortran throw the following error
if these variables are used for `MPI_DIST_GRAPH_CREATE_ADJACENT` or
`MPI_DIST_GRAPH_CREATE_ADJACENT`.
Error: There is no specific subroutine for the generic
'mpi_dist_graph_create_adjacent' at (1)
2015-11-04 10:38:25 +09:00
KAWASHIMA Takahiro
d5e1f40a1e
fortran: Fix: info
should be an integer parameter.
2015-11-04 10:38:24 +09:00
KAWASHIMA Takahiro
9bf93810d7
fortran: Fix: array dimension of MPI_ARGVS_NULL
.
...
`MPI_ARGVS_NULL` should be a two-dimensional array.
Without this modification, gfortran throw the following error
if `MPI_ARGVS_NULL` is used for `MPI_COMM_SPAWN_MULTIPLE`.
Error: There is no specific subroutine for the generic
'mpi_comm_spawn_multiple' at (1)
2015-11-04 10:38:24 +09:00
George Bosilca
b14212f142
Fix Coverity issue 1338059.
2015-11-02 22:51:52 -05:00
Todd Kordenbrock
cefe50cf54
mtl-portals4: test for valid handle before releasing resources
...
During component finalize, mtl-portals4 would blindly release
resources without testing if the handle was valid. This was OK,
but resource allocation is now delayed until add_procs(). If
mtl-portals4 is deselected, it will be finalized without
add_procs() ever being called. This commit ensures that invalid
handles are not released.
2015-11-02 21:01:14 -06:00
George Bosilca
5c60e76669
Fix Coverity CIDs 1338021, 1338020, 1338019, 1338018.
2015-11-02 17:38:51 -05:00
bosilca
f1a5362f94
Merge pull request #1072 from bosilca/topic/resized
...
Fix for the subarray and darray type creation issue.
2015-11-01 21:17:03 -05:00
George Bosilca
b77c203068
Add more comments and restore the progress, flags, max tag, and max
...
context_id from the original PML.
2015-10-31 17:13:35 -04:00
George Bosilca
3efd494972
Make sure the monitoring infrastructure works well with the
...
new dynamic add_procs.
2015-10-31 17:13:35 -04:00
Guillaume Papauré
86714ad91e
change pml_monitoring_messages_count and pml_monitoring_messages_size pvars to use the start/stop features
2015-10-31 17:13:35 -04:00
George Bosilca
a43c2ce529
Fully integrate the monitoring with the MPI_T PVAR.
...
Writing to the pml_monitoring_flush variable will set the filename of
the output file.
Stopping a session for the pml_monitoring_flush will force the
generation of the nobitoring output file (as long as the filename
is not NULL).
To reset the monitoring, une has to bind the pml_monitoring_flush to a
session.
2015-10-31 17:13:35 -04:00
George Bosilca
646a662721
Use the new group interface and add const to the PML send functions.
2015-10-31 17:13:35 -04:00
George Bosilca
5224a7ce4d
Allow the pvar to be written by invoking the associated callback.
...
Use a PVAR to generate the monitoring dump of the information into a
file.
Use the PVAR to instruct the PML monitoring when to do the dump.
2015-10-31 17:13:35 -04:00
George Bosilca
df167f4177
Rewrite the close logic to be more clean and cleaner.
2015-10-31 17:13:35 -04:00
George Bosilca
c801ffde86
Use MPI_T variables to handle the flush in a more MPI-blessed way.
...
Code cleanup.
Update the monitoring test to use MPI_T variables.
2015-10-31 17:13:35 -04:00
George Bosilca
4f88c82500
Fix a convertion problem and add a comment about the lack of component
...
retain in the new component infrastructure.
Clean Makefile.am to fix "make distcheck".
Update the gitignore rules.
2015-10-31 17:13:35 -04:00
George Bosilca
80343a0d39
add ability to querry pml monitorinting results with MPI Tools interface
...
using performance variables "pml_monitoring_messages_count" and
"pml_monitoring_messages_size"
Per Brice suggestion make all data count and message length be
uint64_t.
2015-10-31 17:13:35 -04:00
George Bosilca
a47d69202f
Add a monitoring PML. This PML track all data exchanges by the processes
...
counting or not the collective traffic as a separate entity. The need
for such a PML is simply because the PMPI interface doesn't allow us to
identify the collective generated traffic.
2015-10-31 17:13:35 -04:00
Rolf vandeVaart
578385ca78
Merge pull request #1079 from rolfv/pr/cuda-require-41
...
Make CUDA 4.1 a requirement for CUDA-aware support
2015-10-29 12:56:22 -04:00
Nathan Hjelm
b1e3936261
Merge pull request #1078 from rolfv/pr/disable-osc-rdma-for-cuda
...
Disable the use of osc rdma when we detect a GPU buffer
2015-10-29 10:03:28 -06:00
Rolf vandeVaart
f2ff6e03ab
Make CUDA 4.1 a requirement for CUDA-aware support.
...
Remove all related preprocessor conditionals.
2015-10-29 11:24:02 -04:00
Matias Cabral
8ebcac1b2c
Merge pull request #1075 from matcabral/psm2_symbol_rename
...
Updated psm2 mtl with new externally exposed symbols of psm2.so
Fixes open-mpi/ompi#1018
Fixes open-mpi/ompi#1021
2015-10-28 13:55:45 -07:00
Rolf vandeVaart
87a4cc6118
Disable the use of osc rdma when we detect a GPU buffer as it is not supported in that component.
...
This forces a failover to the osc pt2pt component. Fixes #1012
2015-10-28 14:47:45 -04:00
yosefe
ae738d0434
pml_ucx: add pmi fence in del_procs
2015-10-28 18:34:36 +02:00
Matias A Cabral
ed16d8e1cc
Updated psm2 mtl with new externally exposed symbols of psm2.so
...
Fixes open-mpi/ompi#1018
Fixes open-mpi/ompi#1021
2015-10-28 09:12:33 -07:00
yosefe
41b6230be3
pml_ucx: fix debug macros, and initialize mpi request properly.
2015-10-28 10:59:25 +02:00
Ralph Castain
267ca8fcd3
Cleanup the PMIx direct modex support. Add an MCA parameter pmix_base_async_modex that will cause the async modex to be used when set to 1. Default it to 0 for now
...
to continue current default behavior.
Also add an MCA param pmix_base_collect_data to direct that the blocking fence shall return all data to each process. Obviously, this param has no effect if async_
modex is used.
2015-10-27 17:31:56 -07:00
yohann
8bf1c95cdc
mtl/ofi: Remove unused help messages.
2015-10-27 09:38:04 -07:00
Nathan Hjelm
69d403d42b
Merge pull request #1054 from hjelmn/add_procs_threading
...
add_procs: add threading protection for dynamic add_procs
2015-10-27 09:27:13 -06:00
George Bosilca
679dc9b437
Fix the subarray and darray type creation. Include a
...
small patch provided by Gilles.
2015-10-26 23:44:26 -04:00
yohann
a111d66f0f
mtl/ofi: Change hints to FI_PROGRESS_MANUAL.
2015-10-26 15:32:30 -07:00
yohann
fde8b89ceb
mtl/ofi: Use OFI's representation of ANY_SRC instead of NULL.
2015-10-26 14:38:41 -07:00
yohann
4246de4508
mtl/ofi: Treat error correctly.
2015-10-26 14:38:33 -07:00
George Bosilca
2622b9d3a1
Fix minor issues in the treematch topo
...
based on a patch provided by Guillaume.
2015-10-25 21:38:59 -04:00
Gilles Gouaillardet
1105634ca1
mpi_f08: fix MPI_WIN_{ATTACH,DETACH} bindings
...
fixes INTENT from open-mpi/ompi@9600e2bc63
Thanks Jeff for pointing this !
2015-10-26 10:02:45 +09:00
George Bosilca
4ac247b1da
Minor updated on the validity checks for the alltoall collectives.
2015-10-24 15:25:28 -04:00
Jeff Squyres
140cf90e3e
osc_rdma: minor compiler warning stomp
2015-10-23 06:21:56 -07:00
Ralph Castain
4c12022a50
Silence a couple of warnings from valgrind and compilers. Since some pmix components may return success with a NULL value from a "get", check for that situation before attempting to unload the data. Preset the hostname before calling modex_recv to get it so unload properly checks for NULL. Cast a returned value to the correct ompi_proc_t pointer
2015-10-22 20:56:02 -07:00
Nathan Hjelm
9dad35b467
Merge pull request #1061 from hjelmn/osc_fixes
...
one-sided fixes
2015-10-22 18:23:19 -06:00
Nathan Hjelm
63e744ffc6
osc/rdma: use only a single btl registration for local state
...
This commit fixes a bug that can occur on Cray Gemini networks. If
multiple registrations are used for the local state then we looks the
atomicity guarantees. To avoid issues like this use only a single
registration handle for all local state on a node.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-10-22 15:51:19 -06:00
Nathan Hjelm
f690fc8fd5
osc/pt2pt: fix warnings
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-10-22 15:50:40 -06:00
Nathan Hjelm
e716866e0c
Merge pull request #1057 from hjelmn/binding_fix
...
win: fix erroneous argument check
2015-10-22 15:15:43 -06:00
Jeff Squyres
86270e7613
MPI_File_open: add note about allowable chars in filenames
...
Thanks to @nasailja for the original text suggestion.
2015-10-22 11:56:53 -07:00