Ralph Castain
d4071fbd1c
Fix dynamic operations by ensuring that we only fire the debugger release if the debugger is attached, and that the OPAL pmix key for directing events to non-default handlers matches the PMIx spelling
2016-07-16 13:20:41 -07:00
Edgar Gabriel
195ec89732
fcoll/base: mv coll_array functionis to fcoll base
...
the coll_array functions are truly only used by the fcoll modules, so move
them to fcoll/base. There is currently one exception to that rule (number of aggreagtors
logic), but that function will be moved in a long term also to fcoll/base.
2016-07-14 08:41:14 -05:00
Edgar Gabriel
1f1504ebbb
remove some unused code
2016-07-14 08:41:14 -05:00
Joshua Ladd
06930a0423
Merge pull request #1840 from artpol84/yalla_perf_fix
...
pml/yalla: fix yalla performance regression
2016-07-14 10:55:30 +03:00
Gilles Gouaillardet
2a98f9fcc3
oshmem: replace header files in include/mpp with symlinks
...
This is a work around to avoit what looks like a CMake bug
Thanks Paul Kapinos for the report
Fixes open-mpi/ompi#1868
2016-07-14 14:32:25 +09:00
Gilles Gouaillardet
c3c262b3a8
ompi/group: get rid of malloc(0) in ompi_group_intersection(...)
...
Thanks Lisandro Dalcin for the report
Fixes open-mpi/ompi#1866
2016-07-14 11:19:46 +09:00
Ralph Castain
1ceb35ba5c
Fix singletons - do not include the PMIx tool URI in the environment provided to child processes
2016-07-13 17:33:34 -07:00
rhc54
2414244171
Merge pull request #1872 from rhc54/topic/continuous
...
Add support for continuously operating applications
2016-07-13 15:29:31 -07:00
Ralph Castain
20a91c2baf
Add a new --continuous flag to mpirun that directs ORTE to let a job continue running as app procs terminate. Don't attempt to restart them. Add event notification of abnormally terminating procs, and demonstrate that in the mpi_spin test program.
...
Cleanup debug message
2016-07-13 15:28:33 -07:00
Jeff Squyres
0d1afba640
Merge pull request #1867 from hpcraink/pr/shmem_fixes
...
Fixes to shmem
2016-07-13 08:52:25 -04:00
Rainer Keller
3ec1b868d1
Fix missing include and missing MCA_SPML_CALL.
2016-07-13 11:23:47 +02:00
Rainer Keller
997a00c06f
Correct "configure --help" output and amend the default setting if user
...
provides a wrong input value, like "runtime"
(which works for MPI, but not for OSHMEM)
2016-07-13 11:23:07 +02:00
Jeff Squyres
e28951e738
Merge pull request #1863 from jsquyres/pr/specfile-fortran-flags-fix
...
openmpi.spec: don't export FFLAGS
2016-07-12 17:23:42 -04:00
Jeff Squyres
48938f542c
openmpi.spec: don't export FFLAGS
...
The Open MPI configure script has long-since only paid attention to
FCFLAGS. Indeed, it will warn if you set FFLAGS or F77FLAGS. So
remove them from the spec file.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-07-12 14:22:20 -07:00
rhc54
cc2a648124
Merge pull request #1862 from rhc54/topic/mapping
...
Fix a bug in the handling of nper<foo> when -host or -hostfile was gi…
2016-07-12 10:40:28 -07:00
Ralph Castain
aa78f902f2
Add some missing info to the job map so remote procs get their app_rank
2016-07-12 09:50:12 -07:00
Ralph Castain
ddd0d05de3
Fix a bug in the handling of nper<foo> when -host or -hostfile was given. Correctly mark slots as "given" when we auto-assign them. Ensure we don't set the number of procs when using nper<foo> so the PPR mapper can correctly assing them.
2016-07-12 09:27:02 -07:00
Pascal Deveze
b87ed1ad4a
mtl/portals4: Display actual limits given by the portals4 PtlNIInit function
2016-07-12 15:07:31 +02:00
Pascal Deveze
f666b0d9aa
mtl/portals4: Allocate a PT with the PTL_PT_FLOWCTRL flag only if OMPI_MTL_PORTALS4_FLOW_CONTROL is set
2016-07-12 15:07:31 +02:00
Pascal Deveze
bed572cd6c
mtl/portals4: Unlink the ME first, then free the CT and at the end free the PT
2016-07-12 15:07:30 +02:00
Ralph Castain
0e433eaa78
Silence warning
2016-07-11 19:43:02 -07:00
Gilles Gouaillardet
a27ced0966
configury: fix typos in {ompi,opal}_setup_cxx
2016-07-12 10:09:32 +09:00
Nathan Hjelm
34b0a4fc78
Merge pull request #1859 from hjelmn/rdma_fixes
...
osc/rdma: fix bug in CAS
2016-07-11 11:43:39 -06:00
Nathan Hjelm
b47208e909
osc/rdma: fix bug in CAS
...
This commit fixes a bug in the RDMA compare-and-swap implementation
that caused the origin value to always be written even if the compare
should have failed.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-07-11 09:54:23 -06:00
Artem Polyakov
44f01b437d
Merge pull request #1858 from artpol84/fix_pmix_slurm
...
opal/pmix: add blocking fence to SLURM components
2016-07-11 21:28:25 +06:00
Edgar Gabriel
c8b1c6cae1
Merge pull request #1856 from edgargabriel/pr/zero-size-iread-iwrite
...
io/ompio: fix the request in case of a zero size write/read operation
2016-07-11 08:19:02 -05:00
Gilles Gouaillardet
14624506df
coll/libnbc: do not exchange data between roots in ompi_coll_libnbc_ireduce_scatter_inter()
...
this is now useless since the scatter is done via the local communicator
2016-07-11 17:18:30 +09:00
Artem Polyakov
72585a905f
opal/pmix: add blocking Fence to SLURM components.
...
Blocking fence is used in yalla del proc. Native pmix exposes this functionality.
We need to expose it for SLURM's s1/s2 components as well.
Also this commit fixes uninitialized `rc` in fencenb's of both
components.
2016-07-11 09:43:15 +03:00
Edgar Gabriel
3dd81e9e09
io/ompio: fix the request in case of a zero size write/read operation
2016-07-08 14:11:22 -05:00
Gilles Gouaillardet
a55d57406b
coll/base: fix non zero lower bound datatype handling in mca_coll_base_alltoallv_intra_basic_inplace()
2016-07-08 16:55:26 +09:00
Gilles Gouaillardet
7b8094aac1
coll/base: silence misc warning
...
as reported by Coverity with CIDs 1363349-1363362
Offset temporary buffer when a non zero lower bound datatype is used.
Thanks Hristo Iliev for the report
(cherry picked from commit 0e393195d9
)
2016-07-08 13:06:26 +09:00
Gilles Gouaillardet
678d08647b
coll/libnbc: various fixes
...
- correctly handle non commutative operators
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
- revamp NBC_Sched_op so it takes two buffers and matches ompi_op_reduce semantic
- various fix for inter communicators
Thanks Yuki Matsumoto for the report
2016-07-07 15:55:49 +09:00
Gilles Gouaillardet
3e559a14a9
coll/inter: fix non standard ddt handling
...
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
Thanks Yuki Matsumoto for the report
2016-07-07 15:49:59 +09:00
Gilles Gouaillardet
488d037d51
coll/basic: fix non standard ddt handling
...
- correctly handle non zero lower bound ddt
- correctly handle ddt with size > extent
Thanks Yuki Matsumoto for the report
2016-07-07 15:49:53 +09:00
Gilles Gouaillardet
c06fb04a9a
coll/base: fix non zero lower bound ddt handling in ompi_coll_base_reduce_intra_basic_linear()
...
Thanks Yuki Matsumoto for the report
2016-07-07 15:49:48 +09:00
Artem Polyakov
8e16f47492
Merge pull request #1688 from artpol84/fix_base64
...
Fix base64 implementation in pmix framework.
2016-07-07 10:47:50 +06:00
Jeff Squyres
b8e93e9738
Merge pull request #1687 from ggouaillardet/topic/distclean_external
...
configury: correctly perform make distclean when {libevent,hwloc,pmix…
2016-07-06 12:57:02 -04:00
Ralph Castain
ae8444682f
Remove stale variable
2016-07-05 20:07:16 -07:00
Gilles Gouaillardet
acda07472a
configury: revamp and re-ident sub configure.m4 after open-mpi/ompi@846360fd4c
2016-07-06 11:59:51 +09:00
Gilles Gouaillardet
846360fd4c
configury: correctly perform make distclean when {libevent,hwloc,pmix} are external components
...
Thanks Jeff for the guidance
Fixes open-mpi/ompi#1683
note:
in order to keep this commit easy to review, some AS_IF([...]) were replaced with
AS_IF([false], ...) or AS_IF_([true], ...)
these will be removed and re-idented in a subsequent commit
2016-07-06 11:57:24 +09:00
rhc54
4bc08d64e9
Merge pull request #1849 from rhc54/topic/session
...
Shorten the session directory name as some OS's are now providing unu…
2016-07-05 15:38:24 -07:00
Ralph Castain
ee56d9dc1a
Shorten the session directory name as some OS's are now providing unusually long temp directory names, causing us to overflow the sockaddr field
2016-07-05 14:59:50 -07:00
rhc54
b7122963e1
Merge pull request #1847 from rhc54/topic/updatepmix
...
Update pmix2x to track upstream changes
2016-07-05 12:37:36 -07:00
Ralph Castain
7e0af3f4f0
Update pmix2x to track upstream changes
2016-07-05 11:54:22 -07:00
George Bosilca
eac5b3c668
Various cleanups in the monitoring PML.
2016-07-05 18:31:25 +02:00
George Bosilca
73972768f8
Remove an apparently useless function.
2016-07-05 18:30:11 +02:00
Artem Polyakov
a4ff9bef6d
fix #2
2016-07-05 14:38:35 +03:00
Artem Polyakov
bc973cad30
fix
2016-07-05 14:33:31 +03:00
Artem Polyakov
7d96f12fec
pml/yalla: fix yalla performance regression
...
It was introduced in PR https://github.com/open-mpi/ompi/pull/1228
in particular in commit 041a6a9f53
.
Original solution was using "flexible array member" called "mxm_base"
to "fall-through" to the "mxm" send/recv member that located in the
outer structure.
After changing number of elements in "mxm_base" from 0 to 1 we actually
allocating 2 mxm_req_base_t elements which leads to increased overal
size and harms cache performance.
It also brakes "mca_pml_yalla_check_request_state" function.
2016-07-05 10:52:48 +03:00
Gilles Gouaillardet
267821f0dd
pmix2x/pmix: fix a typo in PMIx_tool_init()
...
and remove now useless local variable i
2016-07-05 13:47:50 +09:00