Mohan
e3dfe11da9
Btl tcp: Improving verbose around tcp
...
As part of the ongoing improvements to the tcp btl, improve
verbose output in general.
Signed-off-by: Mohan Gandhi <mohgan@amazon.com>
2017-08-17 17:22:16 -07:00
Mohan
4bc7b214dc
Btl tcp: Improving verbose around IPV6
...
As part of the improvements to tcp btl debugging and verbose
output, improve the verbose output around IPv6.
Signed-off-by: Mohan Gandhi <mohgan@amazon.com>
2017-08-17 16:45:14 -07:00
Mohan
0741fad479
Btl tcp: BTL_ERROR to show_help & update func behaviour
...
As part of the improvements to tcp debugging, move a few
BTL_ERROR calls to show_help and update the behaviour of
mca_btl_tcp_endpoint_complete_connect so that it returns
SUCCESS or ERROR.
Signed-off-by: Mohan Gandhi <mohgan@amazon.com>
2017-08-17 16:45:14 -07:00
Mohan
368f9f0dfc
Btl tcp: Using magic string to verify mpi connection
...
To improve failure handling in the tcp btl, use a magic
string to verify that a connection comes from an MPI
process. If the magic string is missing or does not match,
we can tell that we are trying to connect to some other
process.
Signed-off-by: Mohan Gandhi <mohgan@amazon.com>
2017-08-17 16:45:13 -07:00
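The magic-string check described in the commit above can be illustrated with a minimal sketch. This is not the btl/tcp code: the string value and the names `example_magic_id` and `handshake_has_magic` are hypothetical, and the only assumption taken from the log is that a missing or mismatched magic string means the peer is not an MPI process.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical magic identifier; the real string used by btl/tcp
 * is not given in this log. */
static const char example_magic_id[] = "EXAMPLE-MPI-MAGIC";

/* Return true only if the handshake buffer starts with the full magic
 * string (including its terminating NUL). A short or mismatched buffer
 * is treated as a connection from some other process. */
static bool handshake_has_magic(const void *buf, size_t len)
{
    const size_t magic_len = sizeof(example_magic_id);

    if (buf == NULL || len < magic_len) {
        return false; /* magic string missing or truncated */
    }
    return memcmp(buf, example_magic_id, magic_len) == 0;
}
```

A receiver would presumably run a check like this on the first bytes read from a newly accepted socket and drop the connection if it fails.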
Mohan
c30a42917c
Btl tcp: Refactoring non-blocking send/receive function
...
Moving the non-blocking send/receive functions into btl_tcp
makes them reusable wherever they are needed.
In this case we plan to reuse the receive function to
retrieve the magic string and validate that an established
connection is from an MPI process.
Signed-off-by: Mohan Gandhi <mohgan@amazon.com>
2017-08-17 16:45:13 -07:00
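As a rough illustration of what a reusable non-blocking receive helper might look like, here is a sketch built on plain POSIX sockets. The name `example_recv_nb` and its return convention are assumptions made for this example; the actual btl_tcp function and its signature are not shown in this log.

```c
#include <errno.h>
#include <stddef.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Hypothetical helper: read up to `size` bytes without blocking.
 * Returns the number of bytes read so far (possibly 0 if nothing is
 * available yet), or -1 on a socket error or if the peer closed. */
static ssize_t example_recv_nb(int sd, void *data, size_t size)
{
    unsigned char *ptr = data;
    size_t cnt = 0;

    while (cnt < size) {
        ssize_t rc = recv(sd, ptr + cnt, size - cnt, MSG_DONTWAIT);
        if (rc > 0) {
            cnt += (size_t) rc;
            continue;
        }
        if (rc == 0) {
            return -1; /* peer closed the connection */
        }
        if (errno == EINTR) {
            continue; /* interrupted by a signal, retry */
        }
        if (errno == EAGAIN || errno == EWOULDBLOCK) {
            break; /* no more data right now */
        }
        return -1; /* any other error */
    }
    return (ssize_t) cnt;
}
```

Keeping the helper free of endpoint state is what makes it reusable, for example to read a handshake string before an endpoint has been fully set up.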
Ralph Castain
af85e48dd7
Silence Coverity warning, silence pmix_error_log of success
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-21 15:33:16 -07:00
Ralph Castain
b8d3999da2
Merge pull request #3950 from rhc54/topic/pmixup
...
Update to latest PMIx v2.1.0a.
2017-07-21 16:00:06 -05:00
Ralph Castain
492f98f8a5
Update to latest PMIx v2.1.0a
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-21 12:58:09 -07:00
Ralph Castain
325778e667
Merge pull request #3836 from rhc54/topic/ofi
...
Bring the ofi/rml component online
2017-07-21 07:46:52 -05:00
Ralph Castain
f7e8780a42
Remove fortran support from platform file
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 21:02:30 -07:00
Ralph Castain
b225366012
Bring the ofi/rml component online by completing the wireup protocol for the daemons. Cleanup the current confusion over how connection info gets created and
...
passed to make it all flow through the opal/pmix "put/get" operations. Update the PMIx code to latest master to pick up some required behaviors.
Remove the no-longer-required get_contact_info and set_contact_info from the RML layer.
Add an MCA param to allow the ofi/rml component to route messages if desired. This is mainly for experimentation at this point as we aren't sure if routing will be beneficial at large scales. Leave it "off" by default.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 21:01:57 -07:00
Ralph Castain
855b430632
Merge pull request #3948 from rhc54/topic/crud
...
Remove problem installation of hwloc 2.0
2017-07-20 22:54:59 -05:00
Ralph Castain
0e4e3af1db
Remove problem installation of hwloc 2.0
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 18:18:08 -07:00
Ralph Castain
4d4dec60b4
Merge pull request #3787 from rhc54/topic/pmix210
...
Update to cross-version for PMIx
2017-07-20 14:22:49 -05:00
Ralph Castain
7d8d877837
Remove build product and update .gitignore to avoid picking it up again
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 11:49:48 -07:00
Howard Pritchard
566c3263cd
Merge pull request #3862 from hppritcha/readme_update_no_aarch64
...
README: update ARM processor support (v3.0.x and higher)
2017-07-20 12:31:40 -06:00
Ralph Castain
8c30958879
Update to PMIx v2.1.0alpha
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 11:12:06 -07:00
Ralph Castain
d34dbf184a
Merge pull request #3942 from artpol84/dmdx_err_resp/master
...
orte/pmix/server: Fix direct modex response with error status
2017-07-20 12:59:22 -05:00
Artem Polyakov
79c10c884d
orte/pmix/server: Fix direct modex response with error status
...
`send_error()` only packs the status and peer info in the reply,
while the remote counterpart in `pmix_server_dmdx_resp()` also expects
the "hotel room number" in order to proceed correctly.
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-20 23:50:57 +07:00
Howard Pritchard
0386c26177
Merge pull request #3860 from hppritcha/readme_update_pgi-os-x
...
README: say we don't support PGI on OS-X
2017-07-20 08:10:21 -06:00
Ralph Castain
7c6bd7d48d
Merge pull request #3302 from ggouaillardet/poc/hwloc2
...
POC: add support for hwloc v2
2017-07-20 07:57:13 -05:00
Gilles Gouaillardet
593e4ce63f
hwloc: add hwloc2x
...
internal hwloc 2x is used with --with-hwloc=future
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:51 +09:00
Gilles Gouaillardet
60aa9cfcb6
hwloc: add support for hwloc v2 API
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:44 +09:00
Gilles Gouaillardet
9f29f3bff4
hwloc: since WHOLE_SYSTEM is no more used, remove useless
...
checks related to offline and disallowed elements
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:21 +09:00
Gilles Gouaillardet
1a34224948
hwloc: do not set the HWLOC_TOPOLOGY_FLAG_WHOLE_SYSTEM flag
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-20 17:39:16 +09:00
Ralph Castain
fca68b070b
Merge pull request #3934 from rhc54/topic/singleton
...
Fix the isolated pmix component. Cleanup the ess/singleton component …
2017-07-19 16:02:37 -05:00
Ralph Castain
543c16b28d
Fix the isolated pmix component. Cleanup the ess/singleton component - we shouldn't be automatically discovering the local topology as that is now done on-demand.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-19 12:14:29 -07:00
Artem Polyakov
6cbea90209
Merge pull request #3932 from artpol84/rm_yoda_news/master
...
NEWS: fix removed legacy support section for v3.0.0
2017-07-19 08:48:49 -07:00
Howard Pritchard
503de6ca7d
Merge pull request #3933 from hppritcha/topic/fix_s1_bug
...
pmix/s1: fix problems with ref counting in s1
2017-07-19 07:03:38 -06:00
Geoffrey Paulsen
71333a4b14
Transitioning ownership of rmaps/seq and rmaps/rank_file from Intel to IBM.
2017-07-18 21:31:01 -04:00
Gilles Gouaillardet
da34e2f109
ess/base: silence a warning
...
by fixing a static initializer
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-19 09:30:53 +09:00
Howard Pritchard
2fa0c4c6ec
pmix/s1: fix problems with ref counting in s1
...
s1 pmix component wasn't doing proper ref counting
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-18 15:59:28 -06:00
Artem Polyakov
e34362de0a
NEWS: fix removed legacy support section for v3.0.0
...
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-19 02:14:15 +07:00
Ralph Castain
83d21e3130
Merge pull request #3930 from rhc54/topic/signal
...
Fix signal forwarding on ORTE daemons
2017-07-18 13:44:37 -05:00
Ralph Castain
8a98aab6cc
Fix signal forwarding on ORTE daemons so that _all_ daemons do it, regardless of environment. Add missing support for SIGTSTP and a few others.
...
Thanks to Eugene Dedits for reporting the problem.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-18 09:58:55 -07:00
Josh Hursey
8688219091
Merge pull request #3775 from jjhursey/fix/mca_base_verbose-file
...
opal/mca: Fix mca_base_verbose file suffix processing
2017-07-18 10:14:42 -05:00
Edgar Gabriel
b363a0f4db
Merge pull request #3908 from edgargabriel/pr/lustre-header-update
...
fs/lustre: update lustre header file used in the component
2017-07-18 09:57:11 -05:00
Gilles Gouaillardet
6d6f5fdfb8
Merge pull request #3894 from ggouaillardet/topic/pml_ob1_progress
...
pml/ob1: fix mca_pml_ob1_progress_needed usage
2017-07-18 10:43:44 +09:00
Artem Polyakov
bdb698a9b6
Merge pull request #3923 from artpol84/rm_yoda_news/master
...
Yoda removal (update docs)
2017-07-17 17:51:36 -07:00
Gilles Gouaillardet
1b46fe2d9a
pml/ob1: fix mca_pml_ob1_progress_needed usage
...
correctly use OPAL_ATOMIC_ADD32() that returns the *new* value
and *not* the previous one.
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-07-18 09:30:57 +09:00
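The distinction this fix relies on (an atomic add that returns the new value rather than the previous one) can be sketched with C11 atomics. The helpers below are hypothetical stand-ins, not the OPAL macro or the ob1 code; the only fact taken from the commit is that `OPAL_ATOMIC_ADD32()` returns the new value.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical stand-in for an add that returns the *new* value.
 * C11 atomic_fetch_add() returns the previous value, so the delta
 * is added back to get add-and-fetch semantics. */
static int32_t example_add_fetch32(_Atomic int32_t *addr, int32_t delta)
{
    return atomic_fetch_add(addr, delta) + delta;
}

/* Returns nonzero only for the caller that moved the counter 0 -> 1.
 * With new-value semantics the comparison must be against 1; comparing
 * against 0 would only be right for a fetch-and-add. */
static int example_first_to_increment(_Atomic int32_t *counter)
{
    return example_add_fetch32(counter, 1) == 1;
}
```

Mixing up the two conventions shifts every threshold comparison by the delta, which is the kind of off-by-one this commit message describes.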
Howard Pritchard
771f51af12
Merge pull request #3917 from hppritcha/topic/remove_cr_config_master
...
configure: remove CR/FT related options
2017-07-17 16:12:07 -06:00
Artem Polyakov
cc156a332b
Sync NEWS with v3.0.x
...
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-18 05:05:40 +07:00
Artem Polyakov
a70d28cbb0
oshmem: Update README file to reflect available SPML options.
...
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-18 04:39:03 +07:00
Nathan Hjelm
5a5edfce88
Merge pull request #3876 from hppritcha/topic/add_psm2_get_stats
...
mtl/psm2: add pvar support for PSM2 MQ stats
2017-07-17 16:02:51 -05:00
Nathan Hjelm
2060fcf8bb
mca/base: use the project name when registering pvars
...
References #3918 . Close when applied to v2.0.x, v2.x, and v3.0.x.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-17 15:57:50 -05:00
Howard Pritchard
45e2771162
configure: remove CR/FT related options
...
As part of the process for addressing removal of CR/FT related
code from master (and hence from the 3.0.0 release), it was agreed
at the OMPI devel F2F on 7/13/17 that we'd break this into two
pieces:
1) remove the configure arguments (fewer changes)
2) remove all the CR/FT code, etc. in a subsequent bigger commit
that may not make it into 3.0.0 in time.
By doing 1), the available configure options would not change
in a subsequent 3.0.x release if we end up not being able to do 2)
before 3.0.0 is released.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-07-17 13:48:59 -06:00
Nathan Hjelm
e5343c16c0
btl/vader: remove debug code that should not be in a release
...
References #3902 . Close when in master, v3.0.x, and v2.x.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-07-17 11:58:47 -05:00
Edgar Gabriel
13b14f5efe
Merge pull request #3906 from edgargabriel/pr/lazy_open_fix
...
common/ompio: fix the lazy_open flag
2017-07-17 10:58:13 -05:00
Edgar Gabriel
931c2021c7
Merge pull request #3905 from edgargabriel/pr/fix-grouping-options
...
io/ompio: fix grouping option
2017-07-17 10:55:57 -05:00
Edgar Gabriel
bc8f642211
fs/lustre: update lustre header file used in the component
...
liblustreapi.h is now deprecated; switch to lustreapi.h instead.
Fixes issue #3223.
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-07-17 10:33:52 -05:00