
24492 Commits

Author SHA1 Message Date
Gilles Gouaillardet
4c43fb2a50 orte_rmaps_base_map_job: set OPAL_BIND_ALLOW_OVERLOAD when needed 2016-01-13 17:13:36 +09:00
Tomislav Janjusic
3858bc8e62 Adding support for dynamic endpoint creation
Signed-off-by: Tomislav Janjusic <tomislavj@mngx-apl-01.mtl.labs.mlnx>
Signed-off-by: Tomislavj Janjusic <tomislavj@mellanox.com>
Signed-off-by: Joshua Ladd <jladd.mlnx@gmail.com>
2016-01-12 22:17:03 +02:00
Nathan Hjelm
dd4d49cbbb Merge pull request #1278 from ggouaillardet/poc/osc_pt2pt
osc/pt2pt: use two distinct "namespaces" for tags
2016-01-12 09:49:31 -07:00
Tim Mattox
f2d4a8d266 Replace a bit counting loop with a call to an efficient population count routine 2016-01-12 10:48:56 -05:00
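The diff for this commit is not shown in the log, so the following is only a minimal sketch of the general idea: a bit-by-bit counting loop next to a population count call. The function names are hypothetical, the 32-bit width is an assumption, and `__builtin_popcount` is the GCC/Clang builtin, which may or may not be the routine the commit actually calls.

```c
#include <stdint.h>

/* Hypothetical "before": inspect one bit per iteration. */
static int count_bits_loop(uint32_t v)
{
    int n = 0;
    while (v != 0) {
        n += (int)(v & 1u);
        v >>= 1;
    }
    return n;
}

/* Hypothetical "after": delegate to a population count routine. */
static int count_bits_popcount(uint32_t v)
{
#if defined(__GNUC__) || defined(__clang__)
    /* Compiler builtin; typically lowered to a single instruction
     * when the target CPU has one. */
    return __builtin_popcount(v);
#else
    /* Portable fallback: clear the lowest set bit each iteration,
     * so the loop runs once per set bit rather than once per bit. */
    int n = 0;
    while (v != 0) {
        v &= v - 1;
        n++;
    }
    return n;
#endif
}
```

Both functions return the same value for every input; only the cost differs.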
Gilles Gouaillardet
955fe85cb6 pmix/pmix120: add missing include file 2016-01-12 11:35:32 +09:00
Nathan Hjelm
b6366e52a8 Merge pull request #1294 from hjelmn/group_fix
ompi/group: do not decrement parent group proc pointers in destruct
2016-01-11 13:49:23 -07:00
Nathan Hjelm
d26cc3fece ompi/group: do not decrement parent group proc pointers in destruct
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-01-11 12:56:11 -07:00
Ralph Castain
332019b43a Silence warning 2016-01-10 09:59:36 -08:00
Nathan Hjelm
aefc2ed3e8 Merge pull request #1283 from artpol84/udcm_race_fix
Fix race condition in UDCM
2016-01-09 08:19:20 -07:00
Gilles Gouaillardet
0fb7b07a71 opal/progress: fix non debug builds
this bug was introduced in open-mpi/ompi@64b695669a

Thanks Pavel (Pasha) Shamis for reporting this issue
2016-01-09 15:47:40 +09:00
Artem Polyakov
84e4fb308b Fix race condition in UDCM where service thread sees that
`cm_message_event_active == 1` but main thread has already stopped
processing messages and thus we will have the situation where one
message was left unhandled leading to a hang.
2016-01-08 23:56:21 +06:00
Gilles Gouaillardet
73daf58ee5 pmix: do not include automatically generated include/private/autogen/config.h into dist tarball
Thanks Siegmar Gross for the initial report of this issue
2016-01-08 13:18:15 +09:00
Edgar Gabriel
ac34c0ec51 Merge pull request #1287 from edgargabriel/posix-fbtl-update
use the actual preadv and pwritev functions if available. That's what…
2016-01-07 19:48:51 -06:00
Nathan Hjelm
faeca5663c Merge pull request #1289 from hjelmn/hwloc_fix
Update hwloc to 1.11.2 + Fix /proc/mounts issue.
2016-01-07 16:14:55 -07:00
Nathan Hjelm
15007b4e2b linux: use mntent.h instead of manually parsing /proc/mounts
setmntent() doesn't support root_fd, but manual parsing of
/proc/mounts is fragile, and actually buggy for very long mount lines
(see open-mpi/hwloc#142 (comment)).

Since we only openat("/proc/mounts") there, just manually concatenate
the fsroot_path and use setmntent().

Thanks to Nathan Hjelm for the report.

(Cherry-picked from open-mpi/hwloc@d2d07b9a22)

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-01-07 12:55:03 -07:00
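The approach described in this commit message (concatenate `fsroot_path` with `/proc/mounts`, then let `setmntent()` and `getmntent()` do the parsing) can be sketched roughly as follows. This is an illustration, not the hwloc code: the helper names, the buffer sizes, and the "look up a mount point by filesystem type" task are assumptions chosen for the example.

```c
#include <mntent.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: copy the mount point of the first entry whose
 * filesystem type equals fstype into dir. Returns 0 on success and
 * -1 if the file cannot be opened or no entry matches. */
static int find_mount_by_type(const char *mounts_path, const char *fstype,
                              char *dir, size_t dirlen)
{
    FILE *fp = setmntent(mounts_path, "r");
    struct mntent *ent;
    int rc = -1;

    if (fp == NULL)
        return -1;

    /* getmntent() splits each line into fields, so no hand-written
     * tokenizing of long mount lines is needed. */
    while ((ent = getmntent(fp)) != NULL) {
        if (strcmp(ent->mnt_type, fstype) == 0) {
            snprintf(dir, dirlen, "%s", ent->mnt_dir);
            rc = 0;
            break;
        }
    }
    endmntent(fp);
    return rc;
}

/* setmntent() takes a path rather than a root file descriptor, so the
 * root prefix is concatenated manually, as the commit message says. */
static int find_mount_under_root(const char *fsroot_path, const char *fstype,
                                 char *dir, size_t dirlen)
{
    char path[4096];
    int n = snprintf(path, sizeof path, "%s/proc/mounts", fsroot_path);

    if (n < 0 || (size_t)n >= sizeof path)
        return -1;
    return find_mount_by_type(path, fstype, dir, dirlen);
}
```

Passing an empty string as `fsroot_path` reads the live `/proc/mounts`; any other prefix reads a saved copy of a system tree.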
Nathan Hjelm
1384559fcd Update hwloc to v1.11.2
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-01-07 12:33:12 -07:00
rhc54
73ca19581e Merge pull request #1288 from hjelmn/grpcomm_fixes
grpcomm: fix bugs in grpcomm algorithms
2016-01-07 10:24:03 -08:00
Nathan Hjelm
fab1eca536 grpcomm: fix bugs in grpcomm algorithms
This commit fixes multiple issues in the bruck's and recursive
doubling grpcomm algorithms. The following changes are included:

 - Use the existing bitmap implementation instead of implementing a
   new one. There were bugs in the implementation that caused an
   overrun of the bitmap array.

 - Clean up the algorithms to eliminate errors.

 - Send as little extra data as possible in the bruck's
   algorithm.

The changes were tested with various numbers of ortes varying from 1
to 4096.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-01-07 10:12:08 -07:00
Edgar Gabriel
0a1b735eed use the actual preadv and pwritev functions if available. That's what the fbtl interfaces have been designed for. 2016-01-07 08:29:17 -06:00
Nysal Jan K.A
13f9bb9202 Use PMI2 constants for consistency 2016-01-07 11:46:22 +05:30
Gilles Gouaillardet
713e3ea2e5 configury: fix pthread_join() call in OPAL_INTL_PTHREAD_TRY_LINK_FORTRAN 2016-01-07 10:20:20 +09:00
Gilles Gouaillardet
4c1ea4a171 dpm: correctly handle procs_cutoff in ompi_dpm_connect_accept()
this commit includes missing bits from open-mpi/ompi@213b2abde4
2016-01-07 09:11:03 +09:00
Gilles Gouaillardet
213b2abde4 dpm: correctly handle procs_cutoff in ompi_dpm_connect_accept() 2016-01-06 16:21:13 +09:00
Jeff Squyres
e4bdad09c1 pmix: remove extra wrapper LIBS
These extra libs are now no longer necessary.

Fixes open-ompi/ompi#1281.
2016-01-05 12:09:53 -08:00
Jeff Squyres
335e8bd858 Merge pull request #1280 from jsquyres/pr/travis
PR for travis
2016-01-05 15:09:25 -05:00
Edgar Gabriel
d827da5736 Merge pull request #1279 from edgargabriel/pr/plfs-fixes-2
fixes for the PLFS file system
2016-01-05 12:57:24 -06:00
Jeff Squyres
f8722a96a6 travis: add a bunch of comments explaining things 2016-01-05 10:54:39 -08:00
Edgar Gabriel
1b0b849994 remove the MCA parameter setting the number of hosts in PLFS, since the plfs_setxattr function used is causing linking problems with PLFS 2.5
remove unused variables.
2016-01-05 11:13:23 -06:00
Edgar Gabriel
7861a8c357 revise the logic in the fbtl plfs avoiding the memcpy operation 2016-01-05 10:04:46 -06:00
Edgar Gabriel
da309ac962 - use a unique pid for each process as requested by the API
- sync the file before closing it
- use plfs_access() instead of access() before closing the file
2016-01-05 10:04:12 -06:00
Gilles Gouaillardet
06ecdb6aa7 osc/pt2pt: use two distinct "namespaces" for tags 2016-01-05 16:57:37 +09:00
Jeff Squyres
6d073a8da4 btl_sm: add a comment explaining why we rename(2)
Per open-mpi/ompi#1230, add a comment explaining why we write to a
temporary file and then rename(2) the file, just so that future code
maintainers don't wonder why we do this seemingly-useless step.
2016-01-04 14:51:52 -05:00
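The write-then-rename pattern this commit documents can be sketched as below. The comment actually added to btl_sm is not reproduced in the log, so the rationale given here is the usual one for this pattern and is an assumption: on POSIX systems `rename(2)` replaces the destination atomically within one filesystem, so a reader never observes a partially written file. The helper name and the `.tmp` suffix are hypothetical.

```c
#include <stdio.h>

/* Hypothetical helper: write data to "<path>.tmp", then rename it to
 * path. Returns 0 on success, -1 on any failure. */
static int write_file_atomically(const char *path, const char *data)
{
    char tmp[4096];
    FILE *fp;
    int n = snprintf(tmp, sizeof tmp, "%s.tmp", path);

    if (n < 0 || (size_t)n >= sizeof tmp)
        return -1;

    fp = fopen(tmp, "w");
    if (fp == NULL)
        return -1;

    if (fputs(data, fp) == EOF) {
        fclose(fp);
        remove(tmp);
        return -1;
    }
    /* fclose() flushes stdio buffers; a failure here means the
     * temporary file may be incomplete. */
    if (fclose(fp) != 0) {
        remove(tmp);
        return -1;
    }
    /* The destination name appears only once the content is complete. */
    if (rename(tmp, path) != 0) {
        remove(tmp);
        return -1;
    }
    return 0;
}
```

A process polling for `path` sees either no file or the complete file, which is the property a direct write to `path` cannot give.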
Gilles Gouaillardet
14fdf75944 fs/pvfs2: fix typo
Thanks Dave Love for reporting this issue.

Fixes #1272
2016-01-03 23:28:35 +09:00
Artem Polyakov
d80b98be98 Merge pull request #1268 from artpol84/fix_mlnx_copy
Fix Mellanox copyrights with respect to the following PRs
2016-01-02 09:57:12 +05:00
Ralph Castain
f53d3c7a18 Silence warning 2015-12-30 10:16:58 -08:00
rhc54
d387725395 Merge pull request #1270 from rhc54/topic/cleanup
Cleanup connection termination
2015-12-30 08:37:25 -08:00
Ralph Castain
0a6b8d2c14 Correctly handle connection terminations during finalize so mpirun doesn't hang. Cleanup some corner cases in the error notification system 2015-12-30 07:16:43 -08:00
Ralph Castain
1cdc1c121c Revert "Standardize the handling of shutdown in the OOB TCP component"
This reverts commit open-mpi/ompi@12dccaa911.
2015-12-30 07:05:40 -08:00
Ralph Castain
a04f1cd643 Silence some Coverity warnings 2015-12-29 20:37:25 -08:00
Artem Polyakov
2abb2972ac Fix Mellanox copyrights with respect to the following PRs:
* https://github.com/open-mpi/ompi/pull/1184
* https://github.com/open-mpi/ompi/pull/1188
* https://github.com/open-mpi/ompi/pull/1197
* https://github.com/open-mpi/ompi/pull/1202
* https://github.com/open-mpi/ompi/pull/1210
* https://github.com/open-mpi/ompi/pull/1216
* https://github.com/open-mpi/ompi/pull/1236
* https://github.com/open-mpi/ompi/pull/1237
* https://github.com/open-mpi/ompi/pull/1248
* https://github.com/open-mpi/ompi/pull/1260
* https://github.com/open-mpi/ompi/pull/1264
2015-12-30 00:12:19 +06:00
rhc54
dc857a9b24 Merge pull request #1267 from rhc54/topic/oob
Standardize the handling of shutdown in the OOB TCP component
2015-12-29 09:10:42 -08:00
Ralph Castain
12dccaa911 Standardize the handling of shutdown in the OOB TCP component 2015-12-29 07:57:22 -08:00
Ralph Castain
ddf0f272e1 Fix typo 2015-12-29 07:04:28 -08:00
rhc54
5dfb7ac396 Merge pull request #1266 from ggouaillardet/topic/misc_pmix_fixes
Topic/misc pmix fixes
2015-12-29 07:02:44 -08:00
Gilles Gouaillardet
b20a219ad0 hwloc/external: abort if hwloc v2 is detected since it is not yet supported 2015-12-29 09:23:27 +09:00
rhc54
b7b4231fbb Merge pull request #1261 from ggouaillardet/topic/rmaps_rr_oversubsription
rmaps/round_robin: warn if oversubscribing when manually setting the …
2015-12-28 07:03:08 -08:00
Gilles Gouaillardet
0ca1ee5156 configury: misc pmix120 fixes 2015-12-28 23:17:41 +09:00
Gilles Gouaillardet
3300d7cc00 pmix: rename pmix_munge_module 2015-12-28 23:16:27 +09:00
Ralph Castain
a5b95a0939 Continue work on error notification system 2015-12-28 23:15:59 +09:00
Ralph Castain
810f2446b7 Add pmix120 component, update the error handling functions in the PMIx API.
Update the configure logic for the new pmix120 component

ckpt

Get the pmix120 component to work - still not really registering or handling notifications, but infrastructure now operates

Cleanup some of the symbol scopes, and provide a more comprehensive rename.h file. Will pretty it up later - let's see how this works

Cleanup the rename files to use the pretty macros
2015-12-28 23:15:44 +09:00