Nathan Hjelm
03bce91de8
pmix/pmix2x: add missing increment in loop
...
This commit fixes a bug in the pmix2x client code where a loop
variable is not correctly incremented. This was leading to hangs and
crashes when creating intercommunicators. Also fixed two double
increments in other loops.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-18 10:35:05 -06:00
Pascal Deveze
f19a2b961c
osc/portals4: Correct an error in an if statement
2016-07-18 13:16:12 +02:00
Pascal Deveze
81823d7a63
osc/portals4: Store the no_locks parameter in osc_portals4_component.no_locks
2016-07-18 11:51:52 +02:00
Pascal Deveze
76b38651da
osc/portals4: For the contiguous datatype, take into account the lower bound before calling portals4
2016-07-18 11:20:50 +02:00
Pascal Deveze
7aaf16e7fe
osc/portals4: Put/Get splitting because Portals4 may restrict sizes
2016-07-18 10:49:28 +02:00
Pascal Deveze
025201b459
osc/portals4: set the initial value of req_status.MPI_ERROR to MPI_SUCCESS
2016-07-18 09:52:56 +02:00
Pascal Deveze
aa0d687a0a
osc/portals4: Display an ouput message if ompi_osc_portals4_get_dt() or ompi_osc_portals4_get_op() returns an error
2016-07-18 09:52:56 +02:00
Pascal Deveze
c4181909a4
osc/portals4: Be sure that the ME are operationnal (wait for the PTL_EVENT_LINK)
2016-07-18 09:52:56 +02:00
Pascal Deveze
e99e7d08ed
osc/portals4: For the ME, use the uid from PtlGetUid instead of PTL_UID_ANY
2016-07-18 09:52:56 +02:00
Pascal Deveze
56b36eeb7e
osc/portals4: Format of "target_disp" is OPAL_PTRDIFF_TYPE and %lu is the appropriate format to display it.
2016-07-18 09:52:55 +02:00
Pascal Deveze
a76566c754
osc/portals4: To allocate a PT, use REQ_OSC_TABLE_ID and test that the right ID is allocated
2016-07-18 09:52:55 +02:00
rhc54
739c5803f3
Merge pull request #1880 from jsquyres/pr/pmix-remove-all-tabs
...
pmix: replace all tabs with spaces
2016-07-17 20:24:54 -07:00
Jeff Squyres
72f41d4490
pmix: replace all tabs with spaces
...
No code or logic changes
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-17 15:08:33 -04:00
Jeff Squyres
1c32742c66
pmix_ext20: fix syntax error
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-17 15:04:12 -04:00
Ralph Castain
99f7096031
Fix permissions
2016-07-16 21:03:55 -07:00
rhc54
339235bee0
Merge pull request #1879 from rhc54/topic/fixdyn
...
Fix dynamic operations by ensuring that we only fire the debugger rel…
2016-07-16 14:28:12 -07:00
Ralph Castain
d4071fbd1c
Fix dynamic operations by ensuring that we only fire the debugger release if the debugger is attached, and that the OPAL pmix key for directing events to non-default handlers matches the PMIx spelling
2016-07-16 13:20:41 -07:00
Edgar Gabriel
195ec89732
fcoll/base: mv coll_array functionis to fcoll base
...
the coll_array functions are truly only used by the fcoll modules, so move
them to fcoll/base. There is currently one exception to that rule (number of aggreagtors
logic), but that function will be moved in a long term also to fcoll/base.
2016-07-14 08:41:14 -05:00
Edgar Gabriel
1f1504ebbb
remove some unused code
2016-07-14 08:41:14 -05:00
Joshua Ladd
06930a0423
Merge pull request #1840 from artpol84/yalla_perf_fix
...
pml/yalla: fix yalla performance regression
2016-07-14 10:55:30 +03:00
Gilles Gouaillardet
2a98f9fcc3
oshmem: replace header files in include/mpp with symlinks
...
This is a work around to avoit what looks like a CMake bug
Thanks Paul Kapinos for the report
Fixes open-mpi/ompi#1868
2016-07-14 14:32:25 +09:00
Gilles Gouaillardet
c3c262b3a8
ompi/group: get rid of malloc(0) in ompi_group_intersection(...)
...
Thanks Lisandro Dalcin for the report
Fixes open-mpi/ompi#1866
2016-07-14 11:19:46 +09:00
Ralph Castain
1ceb35ba5c
Fix singletons - do not include the PMIx tool URI in the environment provided to child processes
2016-07-13 17:33:34 -07:00
rhc54
2414244171
Merge pull request #1872 from rhc54/topic/continuous
...
Add support for continuously operating applications
2016-07-13 15:29:31 -07:00
Ralph Castain
20a91c2baf
Add a new --continuous flag to mpirun that directs ORTE to let a job continue running as app procs terminate. Don't attempt to restart them. Add event notification of abnormally terminating procs, and demonstrate that in the mpi_spin test program.
...
Cleanup debug message
2016-07-13 15:28:33 -07:00
Jeff Squyres
0d1afba640
Merge pull request #1867 from hpcraink/pr/shmem_fixes
...
Fixes to shmem
2016-07-13 08:52:25 -04:00
Rainer Keller
3ec1b868d1
Fix missing include and missing MCA_SPML_CALL.
2016-07-13 11:23:47 +02:00
Rainer Keller
997a00c06f
Correct "configure --help" output and amend the default setting if user
...
provides a wrong input value, like "runtime"
(which works for MPI, but not for OSHMEM)
2016-07-13 11:23:07 +02:00
Jeff Squyres
e28951e738
Merge pull request #1863 from jsquyres/pr/specfile-fortran-flags-fix
...
openmpi.spec: don't export FFLAGS
2016-07-12 17:23:42 -04:00
Jeff Squyres
48938f542c
openmpi.spec: don't export FFLAGS
...
The Open MPI configure script has long-since only paid attention to
FCFLAGS. Indeed, it will warn if you set FFLAGS or F77FLAGS. So
remove them from the spec file.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-12 14:22:20 -07:00
rhc54
cc2a648124
Merge pull request #1862 from rhc54/topic/mapping
...
Fix a bug in the handling of nper<foo> when -host or -hostfile was gi…
2016-07-12 10:40:28 -07:00
Ralph Castain
aa78f902f2
Add some missing info to the job map so remote procs get their app_rank
2016-07-12 09:50:12 -07:00
Ralph Castain
ddd0d05de3
Fix a bug in the handling of nper<foo> when -host or -hostfile was given. Correctly mark slots as "given" when we auto-assign them. Ensure we don't set the number of procs when using nper<foo> so the PPR mapper can correctly assing them.
2016-07-12 09:27:02 -07:00
Pascal Deveze
b87ed1ad4a
mtl/portals4: Display actual limits given by the portals4 PtlNIInit function
2016-07-12 15:07:31 +02:00
Pascal Deveze
f666b0d9aa
mtl/portals4: Allocate a PT with the PTL_PT_FLOWCTRL flag only if OMPI_MTL_PORTALS4_FLOW_CONTROL is set
2016-07-12 15:07:31 +02:00
Pascal Deveze
bed572cd6c
mtl/portals4: Unlink the ME first, then free the CT and at the end free the PT
2016-07-12 15:07:30 +02:00
Ralph Castain
0e433eaa78
Silence warning
2016-07-11 19:43:02 -07:00
Gilles Gouaillardet
a27ced0966
configury: fix typos in {ompi,opal}_setup_cxx
2016-07-12 10:09:32 +09:00
Nathan Hjelm
34b0a4fc78
Merge pull request #1859 from hjelmn/rdma_fixes
...
osc/rdma: fix bug in CAS
2016-07-11 11:43:39 -06:00
Nathan Hjelm
b47208e909
osc/rdma: fix bug in CAS
...
This commit fixes a bug in the RDMA compare-and-swap implementation
that caused the origin value to always be written even if the compare
should have failed.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-11 09:54:23 -06:00
Artem Polyakov
44f01b437d
Merge pull request #1858 from artpol84/fix_pmix_slurm
...
opal/pmix: add blocking fence to SLURM components
2016-07-11 21:28:25 +06:00
Edgar Gabriel
c8b1c6cae1
Merge pull request #1856 from edgargabriel/pr/zero-size-iread-iwrite
...
io/ompio: fix the request in case of a zero size write/read operation
2016-07-11 08:19:02 -05:00
Gilles Gouaillardet
14624506df
coll/libnbc: do not exchange data between roots in ompi_coll_libnbc_ireduce_scatter_inter()
...
this is now useless since the scatter is done via the local communicator
2016-07-11 17:18:30 +09:00
Artem Polyakov
72585a905f
opal/pmix: add blocking Fence to SLURM components.
...
Blocking fence is used in yalla del proc. Native pmix exposes this functionality.
We need to expose it for SLURM's s1/s2 components as well.
Also this commit fixes uninitialized `rc` in fencenb's of both
components.
2016-07-11 09:43:15 +03:00
Edgar Gabriel
3dd81e9e09
io/ompio: fix the request in case of a zero size write/read operation
2016-07-08 14:11:22 -05:00
Gilles Gouaillardet
a55d57406b
coll/base: fix non zero lower bound datatype handling in mca_coll_base_alltoallv_intra_basic_inplace()
2016-07-08 16:55:26 +09:00
Gilles Gouaillardet
7b8094aac1
coll/base: silence misc warning
...
as reported by Coverity with CIDs 1363349-1363362
Offset temporary buffer when a non zero lower bound datatype is used.
Thanks Hristo Iliev for the report
(cherry picked from commit 0e393195d9
)
2016-07-08 13:06:26 +09:00
Gilles Gouaillardet
678d08647b
coll/libnbc: various fixes
...
- correctly handle non commutative operators
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
- revamp NBC_Sched_op so it takes two buffers and matches ompi_op_reduce semantic
- various fix for inter communicators
Thanks Yuki Matsumoto for the report
2016-07-07 15:55:49 +09:00
Gilles Gouaillardet
3e559a14a9
coll/inter: fix non standard ddt handling
...
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
Thanks Yuki Matsumoto for the report
2016-07-07 15:49:59 +09:00
Gilles Gouaillardet
488d037d51
coll/basic: fix non standard ddt handling
...
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
Thanks Yuki Matsumoto for the report
2016-07-07 15:49:53 +09:00