Alex Mikheev
5c2f807ef8
OSHMEM: fixes verbosity log level cal
...
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:42 +02:00
Alex Mikheev
7caa736533
OSHMEM: fixes potential deadlock in shmem_lock()
...
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:42 +02:00
Alex Mikheev
defcc3ddc1
OSHMEM: spml ikrit: get/put request cleanup
...
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:41 +02:00
Alex Mikheev
61bd59a369
OSHMEM: fixes addr_accessible()
...
check every possible transport
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:41 +02:00
Alex Mikheev
23c3dc8345
OSHMEM: mxm: optimize mxm_peer layout.
...
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:40 +02:00
Alex Mikheev
df74d549dc
OSHMEM: spml ikrit: changes mxm_peers layout
...
use single array instead of array of pointers
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:39 +02:00
Alex Mikheev
b5c7c7de78
OSHMEM: memheap: disable oob if allgather mkey exchange is used
...
In this case there is no point in adding another progress callback
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:39 +02:00
Alex Mikheev
0826e63363
OSHMEM: spml_ikrit: makes quiet wait for get_nbi requests
...
shmem_quiet() shall complete all outstanding get_nbi() requests
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:38 +02:00
Alex Mikheev
2f91ce7281
OSHMEM: mxm versions less than 2.0 are no longer supported
...
Signed-off-by: Alex Mikheev <alexm@mellanox.com>
2016-11-06 11:56:38 +02:00
Jeff Squyres
f11b0c7edf
Merge pull request #2330 from jjhursey/topic/ibcast-non-uniform-dt-wa
...
coll/libnbc: Work around for non-uniform data types in ibcast
2016-11-05 10:26:04 -04:00
Mike Dubman
8c07a7fcdc
Merge pull request #2357 from shamisp/topic/shmem_nbi_bugfix
...
SPML/UCX: Fixing incorrect handling of UCX return codes
2016-11-04 20:41:05 +02:00
rhc54
d221a3567b
Merge pull request #2364 from rhc54/topic/static
...
Since static ports are only used by ORTE if the runtime option is given,
2016-11-04 07:57:18 -07:00
Ralph Castain
188880be3f
Since static ports are only used by ORTE if the runtime option is given,
...
there is no need for a configure option as well - so remove the
--enable-orte-static-ports configure option. When decoding the daemon
nidmap, mark new daemons as ALIVE by default - we will discover dead
ones as we go.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-11-04 05:01:42 -07:00
Gilles Gouaillardet
da0c873e14
oob/tcp: enhance debugging output
...
display the hop node used to send a message
(if the message is sent directly, then the hop is the destination)
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-11-04 14:16:06 +09:00
Gilles Gouaillardet
6f7ed1f552
event/libevent2022: add missing dependencies to the embedded libevent lib
...
force the libevent2022 component rebuild if the embedded libevent is updated
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-11-04 11:13:44 +09:00
Pavel Shamis (Pasha)
92b0ebd7c3
For UCX it is legal to return UCS_INPROGRESS (1) code for non-blocking function
...
calls, which means that the operation was successfully started but not
immediately completed. This is a "good" return code that should not be handled
as an error.
Signed-off-by: Pavel Shamis (Pasha) <pasharesearch@gmail.com>
2016-11-03 15:36:13 -05:00
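The return-code rule described in commit 92b0ebd7c3 can be sketched as follows. This is only an illustration in Python, not the actual SPML/UCX code: the value 1 for UCS_INPROGRESS comes from the commit message, while UCS_OK = 0, the convention that errors are negative, and the names `UCS_ERR_EXAMPLE`, `is_failure`, and `describe` are assumptions made for this sketch.

```python
# Illustrative constants only. UCS_INPROGRESS = 1 is taken from the commit
# message; UCS_OK = 0 and negative error codes are assumptions of this sketch.
UCS_OK = 0
UCS_INPROGRESS = 1
UCS_ERR_EXAMPLE = -1  # hypothetical stand-in for any error status


def is_failure(status: int) -> bool:
    """Treat only negative statuses as errors.

    A positive status such as UCS_INPROGRESS means the non-blocking call
    was started successfully but has not completed yet.
    """
    return status < 0


def describe(status: int) -> str:
    """Map a status code to a short human-readable label."""
    if status == UCS_OK:
        return "completed"
    if status == UCS_INPROGRESS:
        return "in progress"
    return "error" if is_failure(status) else "unknown"
```

Under these assumptions, a caller that checked `status != UCS_OK` would wrongly report an in-progress operation as a failure, whereas a check equivalent to `is_failure(status)` would not.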
Jeff Squyres
06e75d65c3
nightly-tarball: update Coverity configure params
...
* Point to local libfabric v1.4 install
* Add MPI C++ bindings
* Remove PSM support (if someone can install PSM/PSM2 libraries on the
build server, let's re-enable this)
Also change from -j8 to -j4 (the new AWS build instance only has 1
core / 2 hyperthreads).
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-11-03 12:27:34 -04:00
Jeff Squyres
7ccf253063
Remove old/now-useless SVN integration scripts
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-11-03 12:18:14 -04:00
Jeff Squyres
c54dc87f71
README: add --enable-mca-no-build explanation
...
Also update the configure.ac help message for --enable-mca-no-build to
avoid using a framework name that does not exist any more.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-11-03 06:11:51 -07:00
Jeff Squyres
a47ad865d3
create_tarball.sh: make sure to just get the git hash
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-11-02 08:57:32 -07:00
Josh Hursey
b18598f6c7
Merge pull request #2329 from jjhursey/topic/short-hostname-lsf-fix
...
ras/*: Fix !orte_keep_fqdn_hostnames for RAS components
2016-11-02 10:49:08 -05:00
Jeff Squyres
a4ffa590c8
Merge pull request #2308 from hjelmn/vader_mem
...
btl/vader: reduce memory footprint when using xpmem
2016-11-02 10:28:26 -04:00
Jeff Squyres
27fcd2d6ba
Merge pull request #2336 from larrystevenwise/master
...
openib btl: remove BTL_OPENIB_FAILOVER_ENABLED code
2016-11-02 10:27:26 -04:00
Jeff Squyres
479fe59d9e
Merge pull request #2343 from karasevb/fix_spml_yoda
...
oshmem/spml/yoda: fixed the btl operations
2016-11-02 10:25:47 -04:00
Boris Karasev
68b5acd9f4
oshmem/spml/yoda: fixed the btl operations
...
Fixed the shmem OOM error which is referenced on #2028
Signed-off-by: Boris Karasev <karasev.b@gmail.com>
2016-11-02 13:38:35 +02:00
rhc54
4c9f7af4cb
Merge pull request #2340 from rhc54/topic/dvm
...
Fix the radix routed component to correctly handle connected tools - …
2016-11-01 20:28:02 -07:00
Ralph Castain
435d771e76
Fix the radix routed component to correctly handle connected tools - in such cases, the route must be direct to the tool.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-11-01 19:03:26 -07:00
Jeff Squyres
5e7d8636f6
Merge pull request #2338 from jsquyres/pr/rename-nightly-tarball
...
create_tarball.sh: update snapshot filename
2016-11-01 20:11:32 -04:00
Jeff Squyres
78d1e4ebff
create_tarball.sh: update snapshot filename
...
Nightly snapshots will now be named:
openmpi-${BRANCHNAME}-${YYYYMMDDHHMM}-${SHORTHASH}.tar.${COMPRESSION}.
Fixes #2337
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-11-01 17:09:17 -07:00
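As a rough illustration of the naming scheme in commit 78d1e4ebff, the snippet below assembles a snapshot name from its four parts. The real logic lives in create_tarball.sh; the function name `snapshot_filename` and the sample values in the usage note are hypothetical.

```python
from datetime import datetime


def snapshot_filename(branch: str, when: datetime,
                      short_hash: str, compression: str) -> str:
    """Build openmpi-<branch>-<YYYYMMDDHHMM>-<shorthash>.tar.<compression>."""
    stamp = when.strftime("%Y%m%d%H%M")  # e.g. 201611011709
    return f"openmpi-{branch}-{stamp}-{short_hash}.tar.{compression}"
```

For example, `snapshot_filename("master", datetime(2016, 11, 1, 17, 9), "78d1e4e", "bz2")` yields `openmpi-master-201611011709-78d1e4e.tar.bz2`.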
Steve Wise
7050969d47
openib btl: remove BTL_OPENIB_FAILOVER_ENABLED code
...
Remove BTL_OPENIB_FAILOVER_ENABLED code in the openib btl source.
Remove the failover-specific files from the openib btl.
Update the openib/Makefile.am accordingly.
Remove the --enable-openib-failover config logic.
Signed-off-by: Steve Wise <swise@opengridcomputing.com>
2016-11-01 14:45:36 -07:00
rhc54
0cd200da80
Merge pull request #2332 from rhc54/topic/routed2
...
Remove the max_connections parameter from the radix component as it i…
2016-11-01 13:49:51 -07:00
Ralph Castain
64873487b4
Remove the max_connections parameter from the radix component as it is confusing. Modify PMIx client init so that it simply returns the nspace/rank if called by a server - this allows the server to retrieve its assigned ID. Register the server's nspace so client-side operations can succeed
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-11-01 12:17:11 -07:00
Joshua Hursey
350ef67fe0
coll/libnbc: Work around for non-uniform data types in ibcast
...
* If (legal) non-uniform data type signatures are used in ibcast
then the chosen algorithm may fail on the request, and worst case
it could produce wrong answers.
* Add an MCA parameter that, by default, protects the user from this
scenario. If the user really wants to use it then they have to
'opt-in' by setting the following parameter to false:
- `-mca coll_libnbc_ibcast_skip_dt_decision f`
* Once the following Issues are resolved then this parameter can
be removed.
- https://github.com/open-mpi/ompi/issues/2256
- https://github.com/open-mpi/ompi/issues/1763
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:33:23 -05:00
Joshua Hursey
ed5268a96a
ras/slurm: Fix !orte_keep_fqdn_hostnames for Slurm
...
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:21:30 -05:00
Joshua Hursey
5a4c52d9cb
ras/loadleveler: Fix !orte_keep_fqdn_hostnames for Loadleveler
...
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:21:30 -05:00
Joshua Hursey
8230201ad1
ras/gridengine: Fix !orte_keep_fqdn_hostnames for GridEngine
...
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:21:30 -05:00
Joshua Hursey
9643175e40
ras/tm: Fix !orte_keep_fqdn_hostnames for TORQUE
...
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:21:24 -05:00
Mike Dubman
0b14928b45
Merge pull request #2327 from yosefe/topic/pml-ucx-status-cancelled
...
pml_ucx: fix uninitialized field req_status->_cancelled.
2016-11-01 20:06:25 +02:00
Joshua Hursey
8d02a33639
ras/lsf: Fix !orte_keep_fqdn_hostnames for LSF
...
* By default, make sure that we are using the short hostnames and not
the fully qualified hostnames when running under LSF.
* Related to commit open-mpi/ompi@d26dd2c20e
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2016-11-01 13:04:52 -05:00
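The short-hostname behaviour described in commit 8d02a33639 might look roughly like the sketch below. It is not the RAS component code: the function names, the `keep_fqdn` argument (standing in for the orte_keep_fqdn_hostnames setting), and the guard that leaves IP addresses untouched are all assumptions made for illustration.

```python
import ipaddress


def _is_ip_address(name: str) -> bool:
    """Return True if the string parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(name)
        return True
    except ValueError:
        return False


def node_name(hostname: str, keep_fqdn: bool = False) -> str:
    """Return the short hostname unless fully qualified names are requested.

    IP addresses are returned unchanged, since cutting them at the first
    dot would produce a meaningless name (an assumption of this sketch).
    """
    if keep_fqdn or _is_ip_address(hostname):
        return hostname
    return hostname.split(".", 1)[0]
```

With the default setting, `node_name("node01.cluster.example.com")` gives `node01`, while passing `keep_fqdn=True` returns the name unchanged.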
Yossi Itigin
17c8f76411
pml_ucx: fix uninitialized field req_status->_cancelled.
...
Signed-off-by: Yossi Itigin <yosefe@mellanox.com>
2016-11-01 17:02:22 +02:00
rhc54
6074c2a2a9
Merge pull request #2322 from rhc54/topic/routed
...
Update the routed components as we no longer need to init_routes.
2016-10-31 13:37:07 -07:00
Ralph Castain
b8c5d1ad88
Update the routed components as we no longer need to init_routes. Fixes case of direct launch via srun
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2016-10-31 12:38:13 -07:00
Jeff Squyres
773d6039e7
Merge pull request #2306 from hjelmn/alps_cores
...
ras/alps: use cpuCnt if using hwthreads as cores
2016-10-31 15:22:13 -04:00
Gilles Gouaillardet
30298cc83c
oob/tcp: remove debug that should have never been committed
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-10-31 16:41:14 +09:00
Gilles Gouaillardet
75e96004a4
oob/tcp: fix a typo in mca_oob_tcp_component_no_route()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-10-31 16:30:24 +09:00
Jeff Squyres
25c3edb6d8
Merge pull request #2312 from jsquyres/pr/usnic-warning-fix
...
btl/usnic: fix compiler warning
2016-10-28 12:08:13 -04:00
Joshua Ladd
d27b680de2
Merge pull request #2305 from vspetrov/hcoll_fortran_pair_types
...
coll/hcoll fortran pair types
2016-10-28 12:05:00 -04:00
Jeff Squyres
149b660666
btl/usnic: fix compiler warning
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-10-28 07:36:20 -07:00
Gilles Gouaillardet
52a1f96726
fortran/use-mpi-tkr: update .gitignore
...
ignore automatically generated mpi-tkr-sizeof.*
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-10-28 10:41:59 +09:00
Gilles Gouaillardet
efac15e9a1
ompi: use opal_setenv instead of putenv
...
this fixes a memory leak at finalize
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2016-10-28 09:32:30 +09:00