Ralph Castain
cde80bbf47
Merge pull request #3638 from anandhis/ofi-coll-conduit-fail-dbg
...
Minor clean up to rml-ofi send message
2017-06-01 20:51:22 -07:00
KAWASHIMA Takahiro
c8d38d31c6
Merge pull request #3618 from kawashima-fj/pr/java-doc-man
...
java: Detect `javadoc` path and improve `mpijavac` man page
2017-06-02 10:24:05 +09:00
anandhi
6ddb487744
Cleaned up the send_msg(), moved checking for send to self into the send_nb()
...
and send_buffer_nb()
modified: orte/mca/rml/ofi/rml_ofi_send.c
Signed-off-by: Anandhi Jayakumar <anandhi.s.jayakumar@intel.com>
2017-06-01 17:50:54 -07:00
George Bosilca
037a85a782
Fix the OSHMEM request padding.
...
This patch fixes a missed case by 5b670a2 (PR #3634 ).
Signed-off-by: George Bosilca <bosilca@icl.utk.edu>
2017-06-01 18:30:02 -04:00
Josh Hursey
1665d771a6
Merge pull request #3635 from wlepera/fix/ibm/155305
...
MPI_Sendreceive_replace data error with > 2k msg
2017-06-01 14:38:01 -05:00
Jeff Squyres
d520c24f3a
predefined MPI object padding: set to fixed number of bytes ( #3634 )
...
Convert the predefined MPI object padding to a fixed number of bytes
(vs. a multiple of sizeof(void*)) so that the padding is the same size
between 32 and 64 bit builds. I.e., we won't have a situation where
we've run out of padding in 32 bit builds but still have more space
available in 64 bit builds.
Fixes #3610
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-06-01 15:28:23 -04:00
William LePera
a7c9c4aef3
MPI_Sendreceive_replace data error with > 2k msg (RTC 155305)
...
Signed-off-by: William LePera <lepera@us.ibm.com>
2017-06-01 13:08:58 -04:00
Gilles Gouaillardet
5e9be7667b
Merge pull request #3600 from ggouaillardet/topic/osc_rdma_get_segment
...
osc/rdma: fix osc_rdma_get_remote_segment() length parameter
2017-06-01 13:09:14 +09:00
Ralph Castain
4c00e2ca2f
Merge pull request #3620 from rhc54/topic/grr
...
Fix uninitialized variable. Set exit codes for failed launch so we get pretty error messages
2017-05-31 13:08:42 -07:00
Nathan Hjelm
e1a997c0cb
Merge pull request #3593 from hjelmn/bug_3575
...
osc/rdma: fix typo in ompi_osc_rdma_lock_acquire_exclusive
2017-05-31 08:54:40 -06:00
Ralph Castain
9d6b929894
Fix uninitialized variable. Set exit codes for failed launch so we get pretty error messages
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-31 07:38:37 -07:00
KAWASHIMA Takahiro
76b1f80664
java: Use correct date/version in mpijava
man page
...
`mpijavac.1` should be generated at `make`-time...
Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2017-05-31 17:24:49 +09:00
Ralph Castain
9c1df3f7f0
Merge pull request #3615 from rhc54/topic/sigh
...
Roll in latest PMIx updates
2017-05-30 23:13:15 -07:00
KAWASHIMA Takahiro
63f0945dcc
java: Detect the path of javadoc
in configure
...
Without this change, the directory of `javadoc` command must be
included in the `PATH` environment variable at `make`-time.
Paths of `javac`, `javah`, and `jar` commands are detected in
`configure`. So the path of `javadoc` also should be detected.
Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>
2017-05-31 14:26:14 +09:00
Ralph Castain
26d96061aa
Roll in latest PMIx updates
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 21:35:35 -07:00
Ralph Castain
6ff2780426
Merge pull request #3614 from rhc54/topic/sync
...
Don't sweat the "sync" settings on file descriptors as those flags aren't apparently fully portable
2017-05-30 21:34:29 -07:00
Ralph Castain
26e7515a5e
Don't sweat the "sync" settings on file descriptors as those flags aren't apparently fully portable
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 20:37:26 -07:00
Howard Pritchard
de92d1b9ce
Merge pull request #3506 from hppritcha/topic/fix_issue_1089
...
configury: add option to disable enable-new-dtags
2017-05-30 19:56:14 -06:00
Ralph Castain
1fe452d371
Merge pull request #3607 from rhc54/topic/server
...
Ensure that data from a job that was stored in ompi-server is purged once that job completes. Cleanup a few typos. Silence a Coverity warning
2017-05-30 17:20:37 -07:00
Ralph Castain
5d990b557c
Reorg ordering so that bare executable names also are found
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 15:58:55 -07:00
Ralph Castain
321abfc8c6
Fix cwd and preload-binary options
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 14:07:22 -07:00
Ralph Castain
ad108ba44d
Fix the DVM
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 11:42:42 -07:00
Ralph Castain
9a8811a246
Ensure that data from a job that was stored in ompi-server is purged once that job completes. Cleanup a few typos. Silence a Coverity warning
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-30 09:43:01 -07:00
Ralph Castain
22631832ce
Merge pull request #3604 from rhc54/topic/test
...
Add minor test to ORTE test suite
2017-05-29 16:03:13 -07:00
Ralph Castain
e8759ca66b
Add minor test to ORTE test suite
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-29 15:43:52 -07:00
Ralph Castain
a8b34d975a
Merge pull request #3603 from rhc54/topic/fds
...
Add some debug code for detecting leaking file descriptors.
2017-05-29 12:20:57 -07:00
Ralph Castain
f3ab326b4a
Add some debug code for detecting leaking file descriptors. At the end of each job (and if MCA param is set), have each daemon compute the number of open fds and their characteristics and print a summary
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-29 11:25:20 -07:00
Ralph Castain
8e583dcf76
Merge pull request #3601 from rhc54/topic/fix
...
Protect against the condition where the port string is actually NULL
2017-05-28 21:36:21 -07:00
Ralph Castain
ed4078e2dd
Protect against the condition where the port string is actually NULL
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-28 20:51:09 -07:00
Ralph Castain
89bbb5f585
Merge pull request #3597 from rhc54/topic/pmix2.0
...
Update to PMIx v2.0.0rc1
2017-05-28 20:27:11 -07:00
Gilles Gouaillardet
e622ca8c1c
osc/rdma: fix osc_rdma_get_remote_segment() length parameter
...
a buffer defined by (buf, count, dt)
will have data starting at buf+offset and ending len bytes later with
len = opal_datatype_span(&dt.super, count, &offset);
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-05-29 11:08:03 +09:00
Ralph Castain
9f1f9d6606
Update to PMIx v2.0.0rc1
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-28 10:30:58 -07:00
Ralph Castain
36457cbce6
Merge pull request #3596 from rhc54/topic/cov
...
Silence coverity warnings
2017-05-27 12:57:13 -07:00
Ralph Castain
87201a80ff
Silence coverity warnings
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-27 11:45:53 -07:00
Ralph Castain
c99978a767
Merge pull request #3595 from rhc54/topic/dyn
...
Update the connect/accept support
2017-05-27 11:32:08 -07:00
Ralph Castain
9f60cd0fe7
Update the connect/accept support so we check to see if we have the proper infrastructure and RTE support, including whether we have ompi-server available if the connect/accept spans multiple applications. Print pretty help messages in all cases where we do not have support
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-27 10:47:08 -07:00
Nathan Hjelm
b83c5dbee5
osc/rdma: fix typo in ompi_osc_rdma_lock_acquire_exclusive
...
Fixes #3575
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-05-26 14:21:08 -06:00
Ralph Castain
a6f61132d7
Merge pull request #3588 from rhc54/topic/server
...
Fix ompi-server operations
2017-05-26 11:19:59 -07:00
Nathan Hjelm
c7e6294f31
Merge pull request #3589 from hjelmn/cxx_glue
...
mpi/cxx: remove nonexistent function from cxx glue
2017-05-26 11:24:05 -06:00
Nathan Hjelm
33d59886e1
Merge pull request #3587 from hjelmn/event_abstraction
...
pmix/pmix2x: fix errors in event abstration
2017-05-26 10:44:18 -06:00
Nathan Hjelm
ee9093c373
mpi/cxx: remove nonexistent function from cxx glue
...
This commit removes a nonexistent function that was causing build
problems under certain environments.
Reference #3442
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-05-26 10:40:19 -06:00
Ralph Castain
8c2a06477c
Fix ompi-server operations
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-26 08:57:55 -07:00
Nathan Hjelm
a512b8962d
pmix/pmix2x: fix errors in event abstration
...
Parts of the pmix2x component called the event_* functions directly
instead of the opal_event_* wrappers. This is fine as long as we are
using libevent but becomes a problem with other event libraries.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-05-26 09:49:11 -06:00
Josh Hursey
4bfb0fcddd
Merge pull request #3577 from markalle/pr/osc_rdma_rangecheck
...
fix for buffer length check (rdma osc w/ odd datatypes)
2017-05-26 10:44:33 -05:00
Nathan Hjelm
7d5cc8ebca
Merge pull request #3572 from ggouaillardet/topic/ompi_osc_rdma_rget_accumulate_internal
...
osc/rdma: fix datatype extent usage in ompi_osc_rdma_rget_accumulate_…
2017-05-26 09:37:51 -06:00
Ralph Castain
2f721a3366
Merge pull request #3585 from rhc54/topic/pmix20
...
Update to pmix v2.0beta
2017-05-26 06:05:44 -07:00
Gilles Gouaillardet
47ebfaa60d
Merge pull request #3451 from mkurnosov/reduce-allreduce-rebenseifner
...
coll: Add Rabenseifner's algorithm for Reduce and Allreduce
2017-05-26 21:00:30 +09:00
Ralph Castain
e1e264711a
Update to pmix v2.0beta
...
Fix atomics - again
Fix initialization of notification ring buffer
Fix wait_sync definitions
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-05-26 03:33:18 -07:00
Mikhail Kurnosov
f6e2d4ab04
coll: Add Rabenseifner's algorithm for Reduce and Allreduce
...
A component with implementation of R. Rabenseifner's algorithm for Reduce and Allreduce.
This algorithm is a combination of a reduce-scatter implemented with recursive vector halving
and recursive distance doubling, followed either by a gather or an allgather.
Current limitations:
-- count >= 2^{\floor{\log_2 p}}
-- commutative operations only
-- intra-communicators onl
Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>
coll/spacc: Modify implementation to use `ompi_coll_base_sendrecv()`
Replace irecv() + isend() + ompi_request_wait() to ompi_coll_base_sendrecv().
Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>
2017-05-26 14:33:35 +07:00
Gilles Gouaillardet
0f79259b94
osc/rdma: use extent of the appropriate datatype in ompi_osc_rdma_rget_accumulate_internal()
...
origin_datatype and target_datatype might be different and hence have different extent,
so use either origin_extent or target_extent when appropriate.
Refs open-mpi/ompi#3569
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-05-26 13:59:38 +09:00