Pascal Deveze
9e58b4842f
mtl-portals4: Correct how the request_status._ucount is set
2016-07-26 08:42:48 +02:00
Pascal Deveze
3ca194f10a
mtl-portals4: Store ptl_process_id (from PtlGetPhysId) and display it.
2016-07-26 08:42:08 +02:00
Pascal Deveze
bd3b1cf7be
mtl-portals4: Check that flowctl_idx is equal to REQ_FLOWCTL_TABLE_ID
...
and use OPAL_ATOMIC_CMPSET_32 to test and set flowctl_active flag to true
2016-07-26 08:41:31 +02:00
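The test-and-set on the flowctl_active flag mentioned in commit bd3b1cf7be can be illustrated with a minimal sketch. It uses C11 atomic_compare_exchange_strong as a stand-in for OPAL_ATOMIC_CMPSET_32 (an assumption; the real macro's signature is not shown in this log), and the function names are hypothetical.

```c
#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

/* Stand-in for the flowctl_active flag; 0 means inactive, 1 means active. */
static _Atomic int32_t flowctl_active = 0;

/* Hypothetical helper: atomically flips the flag from 0 to 1.
 * Returns true only for the single caller that performed the flip,
 * so at most one caller starts flow-control recovery. */
static bool sketch_flowctl_try_activate(void)
{
    int32_t expected = 0;
    return atomic_compare_exchange_strong(&flowctl_active, &expected, 1);
}

/* Hypothetical helper: clears the flag once recovery has finished. */
static void sketch_flowctl_deactivate(void)
{
    atomic_store(&flowctl_active, 0);
}
```

The point of a compare-and-set over a plain read followed by a write is that the check and the update happen as one step, so two callers cannot both observe the flag as false.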
Ralph Castain
9ab20cafe3
Pass the nodeid for each proc in the job. Fix a mistaken error output message
2016-07-25 15:41:15 -07:00
Gilles Gouaillardet
bbc6d4b3d4
ompi/communicator: remove another debug print statement in ompi_comm_allreduce_intra_pmix_nb()
2016-07-22 15:42:56 +09:00
Edgar Gabriel
b0fa1fd2a1
move the internal file_open/close functions to common/ompio
2016-07-21 13:08:32 -05:00
Edgar Gabriel
ccf76b7791
moving the internal read/write functions to common/ompio
...
and update all fs/fcoll/sharedfp components to use these functions.
2016-07-21 13:08:32 -05:00
Edgar Gabriel
688710d408
make common/ompio compile
2016-07-21 13:08:32 -05:00
Edgar Gabriel
39ae93b87b
modify the fcoll components to use the common/ompio print queues
2016-07-21 13:08:32 -05:00
Edgar Gabriel
fe17410943
next step in making the print_queue functionality move to common/ompio
2016-07-21 13:08:32 -05:00
Edgar Gabriel
af67c8f239
first cut on moving some ompio functionality to common/ompio
2016-07-21 13:08:32 -05:00
Edgar Gabriel
a899c0fb38
fcoll/static: fix Coverity warnings
...
fix Coverity warnings CID 72144, CID 710677, CID 1364164
2016-07-21 13:08:15 -05:00
Pascal Deveze
a7e3de6c4f
coll-portals4: No more messages passed to Portals4 bigger than the limit given by PtlNIInit
2016-07-21 15:58:20 +02:00
Pascal Deveze
175e6aa385
coll-portals4: Before calling PtlCTWait, call PtlTriggeredInc twice to be sure all pending PtlTriggeredPut operations are triggered
2016-07-21 15:58:20 +02:00
Pascal Deveze
df59d6cdd4
coll-portals4: Correct and simplify how the data is cut into segment_nb segments (bcast)
2016-07-21 15:58:09 +02:00
Pascal Deveze
274f8d608c
coll-portals4: Change output format and change variable names (minor changes).
2016-07-21 11:06:45 +02:00
Todd Kordenbrock
37ad6aa711
Merge pull request #1853 from PDeveze/Patchs-on-osc-portals4
...
Patches on osc portals4
2016-07-20 09:22:19 -05:00
Todd Kordenbrock
210534adb3
Merge pull request #1850 from PDeveze/Patchs-on-mtl-portals4
...
Patches on mtl portals4
2016-07-20 08:21:03 -05:00
rhc54
4bc5048608
Merge pull request #1888 from rhc54/topic/pmixup
...
Update pmix2 component
2016-07-20 06:14:05 -07:00
Ralph Castain
01a653d50a
Remove a debug print in comm_cid.c. Update PMIx2 to include the revised PMIx_Get logic for higher performance by reducing the number of hash table lookups. Fix a bug where requests for data from a proc in another nspace could hang, or result in "not found".
...
Remove stale file reference
Restore autogen pass thru pmix
Remove generated file
2016-07-20 00:58:19 -07:00
Gilles Gouaillardet
252fadf099
ompi: fix #if vs #ifdef HAVE___MALLOC_INITIALIZE_HOOK usage
2016-07-20 13:18:11 +09:00
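The difference between #if and #ifdef that commit 252fadf099 refers to can be shown with a small sketch. The macro name below is hypothetical, and the log does not say which of the two forms was wrong for HAVE___MALLOC_INITIALIZE_HOOK; the sketch only shows that they disagree when a macro is defined to 0.

```c
/* Hypothetical configure result: the macro is defined, but to a false value. */
#define SKETCH_HAVE_FEATURE 0

/* #ifdef only asks whether the macro exists, so this returns 1. */
static int sketch_seen_by_ifdef(void)
{
#ifdef SKETCH_HAVE_FEATURE
    return 1;
#else
    return 0;
#endif
}

/* #if evaluates the macro's value, so this returns 0. */
static int sketch_seen_by_if(void)
{
#if SKETCH_HAVE_FEATURE
    return 1;
#else
    return 0;
#endif
}
```

Which form is correct depends on whether the build system leaves the macro undefined or defines it to 0 when the feature is missing.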
Ralph Castain
36a9063466
Silence warnings
2016-07-19 17:36:13 -07:00
Nathan Hjelm
40f71f2d7a
Merge pull request #1873 from hjelmn/comm_split_update
...
Improve MPI_Comm_split_type scalability
2016-07-19 14:36:44 -06:00
Nathan Hjelm
5edab9cb22
Merge pull request #1855 from hjelmn/comm_rework
...
ompi/comm: refactor communicator cid code
2016-07-19 10:04:17 -06:00
Pascal Deveze
9cac32ba6a
mtl/portals4: Modifications concerning the short message management
2016-07-19 11:21:50 +02:00
Pascal Deveze
49e9936914
mtl/portals4: Some little patches
2016-07-19 11:18:55 +02:00
Nathan Hjelm
ced853476f
Merge pull request #1878 from hjelmn/f_rops
...
ompi/fortran: fix typos in request RMA bindings
2016-07-18 13:48:41 -06:00
Nathan Hjelm
8bdcb40dc4
ompi/fortran: fix typos in request RMA bindings
...
This commit fixes typos on the C side of the request-based RMA bindings. We
were returning the request on failure instead of on success. Thanks to
@alazzaro for reporting and to @ggouaillardet and @vondele for tracking
this down.
Fixes part of open-mpi/ompi#1869
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2016-07-18 13:46:28 -06:00
Nathan Hjelm
4c49c42dd0
ompi/comm: improve comm_split_type scalability
...
This commit introduces a new algorithm for MPI_Comm_split_type. The
old algorithm performed an allgather on the communicator to decide
which processes were part of the new communicators. This does not
scale well in either time or memory.
The new algorithm performs a couple of all reductions to determine the
global parameters of the MPI_Comm_split_type call. If any rank gives
an inconsistent split_type (as defined by the standard) an error is
returned without proceeding further. The algorithm then creates a
communicator with all the ranks that match the split_type (no
communication required) in the same order as the original
communicator. It then does an allgather on the new communicator (which
should be much smaller) to determine 1) if the new communicator is in
the correct order, and 2) if any ranks in the new communicator
supplied MPI_UNDEFINED as the split_type. If either of these
conditions is detected, the new communicator is split using
ompi_comm_split and the intermediate communicator is freed.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-18 12:47:05 -06:00
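The final check described in commit 4c49c42dd0 (deciding whether the intermediate communicator must still be split with ompi_comm_split) might look roughly like the sketch below. It works on the data gathered from the intermediate communicator, with no MPI calls. The struct, the SKETCH_UNDEFINED constant (standing in for MPI_UNDEFINED), and the reading of "correct order" as "keys are non-decreasing" are all assumptions made for illustration.

```c
#include <stdbool.h>
#include <stddef.h>

/* Stand-in for MPI_UNDEFINED so the sketch needs no MPI headers. */
#define SKETCH_UNDEFINED (-1)

/* Hypothetical per-rank record, as it might look after the allgather. */
typedef struct {
    int split_type;  /* split_type supplied by this rank */
    int key;         /* key supplied by this rank */
} sketch_split_info_t;

/* Returns true when the intermediate communicator cannot be used as-is:
 * either some rank passed the undefined split_type, or the ranks are not
 * already ordered by key (assumed meaning of "correct order"). */
static bool sketch_needs_full_split(const sketch_split_info_t *info, size_t n)
{
    for (size_t i = 0; i < n; ++i) {
        if (info[i].split_type == SKETCH_UNDEFINED) {
            return true;
        }
        if (i > 0 && info[i].key < info[i - 1].key) {
            return true;
        }
    }
    return false;
}
```

When the function returns false, the intermediate communicator would already be the result and the more expensive split is avoided.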
Nathan Hjelm
035c2e2e2a
ompi/comm: refactor communicator cid code
...
This commit simplifies the communicator context ID generation by
removing the blocking code. The high-level calls ompi_comm_nextcid
and ompi_comm_activate remain but now call the non-blocking variants
and wait on the resulting request. This was done to remove the
parallel paths for context ID generation in preparation for further
improvements to the CID generation code.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-18 12:47:05 -06:00
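The pattern described in commit 035c2e2e2a, a blocking call implemented as "start the non-blocking variant, then wait on the request", can be sketched as follows. The request type, the progress function, and the simulated three-round agreement are hypothetical and are not Open MPI's ompi_request_t or its CID algorithm.

```c
#include <stdbool.h>

/* Hypothetical request object tracking a simulated multi-round operation. */
typedef struct {
    int  rounds_left;  /* simulated communication rounds still pending */
    int  candidate;    /* value being agreed on */
    bool complete;     /* set once the last round has finished */
} sketch_request_t;

/* Non-blocking variant: only initializes the request and returns. */
static void sketch_nextcid_nb(sketch_request_t *req, int start)
{
    req->rounds_left = 3;
    req->candidate = start;
    req->complete = false;
}

/* Advances the request by one simulated round. */
static void sketch_progress(sketch_request_t *req)
{
    if (req->complete) {
        return;
    }
    req->candidate++;  /* pretend each round raises the candidate by one */
    if (--req->rounds_left == 0) {
        req->complete = true;
    }
}

/* Blocking variant: a thin wrapper that starts the non-blocking
 * operation and drives progress until the request completes. */
static int sketch_nextcid(int start)
{
    sketch_request_t req;
    sketch_nextcid_nb(&req, start);
    while (!req.complete) {
        sketch_progress(&req);
    }
    return req.candidate;
}
```

With this structure there is a single code path for the actual work, which is the motivation the commit message gives for removing the separate blocking implementation.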
Pascal Deveze
f19a2b961c
osc/portals4: Correct an error in an if statement
2016-07-18 13:16:12 +02:00
Pascal Deveze
81823d7a63
osc/portals4: Store the no_locks parameter in osc_portals4_component.no_locks
2016-07-18 11:51:52 +02:00
Pascal Deveze
76b38651da
osc/portals4: For the contiguous datatype, take into account the lower bound before calling portals4
2016-07-18 11:20:50 +02:00
Pascal Deveze
7aaf16e7fe
osc/portals4: Put/Get splitting because Portals4 may restrict sizes
2016-07-18 10:49:28 +02:00
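The Put/Get splitting from commit 7aaf16e7fe amounts to cutting one large transfer into fragments no larger than the limit reported by Portals4. A minimal sketch of the arithmetic, with hypothetical helper names and no Portals4 calls:

```c
#include <stddef.h>

/* Number of fragments needed to move `total` bytes when a single
 * operation may carry at most `max_size` bytes. Returns 0 when there
 * is nothing to send or the limit is invalid. */
static size_t sketch_fragment_count(size_t total, size_t max_size)
{
    if (0 == max_size) {
        return 0;
    }
    /* Written without (total + max_size - 1) to avoid overflow. */
    return total / max_size + (total % max_size != 0 ? 1 : 0);
}

/* Length of fragment number `index` (0-based); 0 if out of range. */
static size_t sketch_fragment_length(size_t total, size_t max_size, size_t index)
{
    if (0 == max_size || index >= sketch_fragment_count(total, max_size)) {
        return 0;
    }
    size_t remaining = total - index * max_size;
    return remaining < max_size ? remaining : max_size;
}
```

A caller would loop over the fragment indices and issue one put or get per fragment at offset index * max_size; only the last fragment can be shorter than the limit.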
Pascal Deveze
025201b459
osc/portals4: set the initial value of req_status.MPI_ERROR to MPI_SUCCESS
2016-07-18 09:52:56 +02:00
Pascal Deveze
aa0d687a0a
osc/portals4: Display an output message if ompi_osc_portals4_get_dt() or ompi_osc_portals4_get_op() returns an error
2016-07-18 09:52:56 +02:00
Pascal Deveze
c4181909a4
osc/portals4: Be sure that the MEs are operational (wait for the PTL_EVENT_LINK)
2016-07-18 09:52:56 +02:00
Pascal Deveze
e99e7d08ed
osc/portals4: For the ME, use the uid from PtlGetUid instead of PTL_UID_ANY
2016-07-18 09:52:56 +02:00
Pascal Deveze
56b36eeb7e
osc/portals4: Format of "target_disp" is OPAL_PTRDIFF_TYPE and %lu is the appropriate format to display it.
2016-07-18 09:52:55 +02:00
Pascal Deveze
a76566c754
osc/portals4: To allocate a PT, use REQ_OSC_TABLE_ID and test that the right ID is allocated
2016-07-18 09:52:55 +02:00
Edgar Gabriel
195ec89732
fcoll/base: move coll_array functions to fcoll base
...
The coll_array functions are only used by the fcoll modules, so move
them to fcoll/base. There is currently one exception to that rule (the number-of-aggregators
logic), but that function will also be moved to fcoll/base in the long term.
2016-07-14 08:41:14 -05:00
Edgar Gabriel
1f1504ebbb
remove some unused code
2016-07-14 08:41:14 -05:00
Joshua Ladd
06930a0423
Merge pull request #1840 from artpol84/yalla_perf_fix
...
pml/yalla: fix yalla performance regression
2016-07-14 10:55:30 +03:00
Gilles Gouaillardet
c3c262b3a8
ompi/group: get rid of malloc(0) in ompi_group_intersection(...)
...
Thanks Lisandro Dalcin for the report
Fixes open-mpi/ompi#1866
2016-07-14 11:19:46 +09:00
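Avoiding malloc(0), as in commit c3c262b3a8, usually means handling the empty result before allocating. The sketch below is a hypothetical intersection over plain int arrays (assumed free of duplicates), not the code of ompi_group_intersection.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

/* Returns true when `value` occurs in `set`. */
static bool sketch_contains(const int *set, size_t n, int value)
{
    for (size_t i = 0; i < n; ++i) {
        if (set[i] == value) {
            return true;
        }
    }
    return false;
}

/* Stores the elements of `a` that also occur in `b` in a newly
 * allocated array (*out, caller frees) and their number in *nout.
 * An empty intersection yields *out == NULL without calling malloc.
 * Returns 0 on success, -1 on allocation failure. */
static int sketch_intersection(const int *a, size_t na,
                               const int *b, size_t nb,
                               int **out, size_t *nout)
{
    size_t count = 0;
    *out = NULL;
    *nout = 0;
    for (size_t i = 0; i < na; ++i) {
        if (sketch_contains(b, nb, a[i])) {
            ++count;
        }
    }
    if (0 == count) {
        return 0;  /* nothing to allocate: skip malloc(0) entirely */
    }
    int *ranks = malloc(count * sizeof(*ranks));
    if (NULL == ranks) {
        return -1;
    }
    size_t k = 0;
    for (size_t i = 0; i < na; ++i) {
        if (sketch_contains(b, nb, a[i])) {
            ranks[k++] = a[i];
        }
    }
    *out = ranks;
    *nout = count;
    return 0;
}
```

The guard matters because malloc(0) may return either NULL or a unique pointer, so a NULL check after it cannot distinguish an empty result from an allocation failure.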
Jeff Squyres
1bea2b2575
mpi.h: fix types of MPI_UNWEIGHTED and MPI_WEIGHTS_EMPTY
...
Thanks to Lisandro Dalcin for reporting.
Fixes open-mpi/ompi#1865 .
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-13 09:36:24 -04:00
Pascal Deveze
b87ed1ad4a
mtl/portals4: Display actual limits given by the portals4 PtlNIInit function
2016-07-12 15:07:31 +02:00
Pascal Deveze
f666b0d9aa
mtl/portals4: Allocate a PT with the PTL_PT_FLOWCTRL flag only if OMPI_MTL_PORTALS4_FLOW_CONTROL is set
2016-07-12 15:07:31 +02:00
Pascal Deveze
bed572cd6c
mtl/portals4: Unlink the ME first, then free the CT and at the end free the PT
2016-07-12 15:07:30 +02:00
Ralph Castain
0e433eaa78
Silence warning
2016-07-11 19:43:02 -07:00
Nathan Hjelm
b47208e909
osc/rdma: fix bug in CAS
...
This commit fixes a bug in the RDMA compare-and-swap implementation
that caused the origin value to always be written even if the compare
should have failed.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-11 09:54:23 -06:00
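The compare-and-swap semantics that commit b47208e909 restores can be stated in a few lines. This is a local, non-atomic illustration with a hypothetical function name; the real osc/rdma code operates on remote memory.

```c
#include <stdint.h>

/* Writes `origin` to *target only when the current value equals
 * `compare`. The previous value is returned in every case, so the
 * caller can tell whether the swap took place. */
static int64_t sketch_compare_and_swap(int64_t *target, int64_t compare, int64_t origin)
{
    int64_t old = *target;
    if (old == compare) {
        *target = origin;  /* the write must be skipped on a failed compare */
    }
    return old;
}
```

The bug described above corresponds to performing the assignment unconditionally, which would overwrite the target even when the comparison fails.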