Ralph Castain
700ad84243
Send the pmix build results to me
2016-08-20 07:32:06 -07:00
Howard Pritchard
61d62b6821
mtl/ofi: fix a botched assignment of av_type
...
Well now the av_type is being assigned correctly
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2016-08-19 17:01:02 -05:00
Nathan Hjelm
f3e9a72f1a
Merge pull request #1987 from hjelmn/cid
...
comm/cid: fix threaded CID allocation
2016-08-19 14:26:39 -06:00
Nathan Hjelm
fbbf743c36
comm/cid: fix threaded CID allocation
...
This commit should restore the pre-non-blocking behavior of the CID
allocator when threads are used. There are two primary changes: 1)
do not hold the cid allocator lock past the end of a request callback,
and 2) if a lower-id communicator is detected during CID allocation,
back off and let the lower-id communicator finish before continuing.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-08-19 11:47:19 -06:00
Jeff Squyres
bb6b87f086
Merge pull request #1972 from jsquyres/pr/rsh-robustify-default-check
...
rsh: robustify the check for plm_rsh_agent default value
2016-08-19 13:33:38 -04:00
Ralph Castain
c9dc286f25
Update the hwloc coverity submission script
2016-08-19 09:20:48 -07:00
Valentin Petrov
9790373fc6
coll/hcoll: Fixes predefined types mapping
2016-08-19 11:19:12 +03:00
Nathan Hjelm
e5c7512692
Merge pull request #1983 from hjelmn/request_cb
...
ompi/request: change semantics of ompi request callbacks
2016-08-18 08:31:56 -06:00
Jeff Squyres
917559f1c3
.mailmap / AUTHORS: auto-generate AUTHORS
...
In conjunction with a de-duplicating .mailmap file, use the
make-authors.pl script to auto-generate AUTHORS.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-18 08:47:15 -05:00
Jeff Squyres
1ba1e9e0b7
make-authors.pl: Auto-generate the entire AUTHORS file
...
Update the script to auto-generate the entire AUTHORS file from two
sources:
1. The existing AUTHORS file
2. The output from "git log --format=tformat:'%aN <%aE>'"
Merge these two together (which will preserve organization
affiliations) and warn in two cases:
1. If a person has no organization affiliation
2. If the same email address appears for more than one person
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-18 07:29:18 -05:00
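
The merge-and-warn behavior described in the make-authors.pl commit above can be sketched roughly as follows. This is a minimal illustration in Python, not the actual Perl script: the function names, the data layout (a dict of name to organization), and the warning wording are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Matches lines shaped like the "%aN <%aE>" git log output.
LINE_RE = re.compile(r"^(?P<name>.+?)\s*<(?P<email>[^>]+)>$")


def parse_git_authors(lines):
    """Turn 'Name <email>' lines into a set of (name, email) pairs."""
    authors = set()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match:
            authors.add((match.group("name"), match.group("email").lower()))
    return authors


def merge_authors(existing_orgs, git_lines):
    """Merge the existing AUTHORS data with git log output.

    existing_orgs is a hypothetical dict of name -> organization taken
    from the current AUTHORS file. Returns (merged, warnings).
    """
    merged = {}
    names_by_email = defaultdict(set)
    for name, email in parse_git_authors(git_lines):
        # Preserve the affiliation if the person is already listed.
        merged[name] = existing_orgs.get(name)
        names_by_email[email].add(name)
    # People who appear only in the existing file are kept as well.
    for name, org in existing_orgs.items():
        merged.setdefault(name, org)

    warnings = []
    for name in sorted(merged):
        if not merged[name]:
            warnings.append(f"no organization affiliation: {name}")
    for email in sorted(names_by_email):
        if len(names_by_email[email]) > 1:
            names = ", ".join(sorted(names_by_email[email]))
            warnings.append(f"email {email} used by more than one person: {names}")
    return merged, warnings
```

Keying the existing data by name is a simplification; a real script would also need to apply the .mailmap de-duplication before comparing entries.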
Jeff Squyres
7a25233f3a
AUTHORS: Preprocess in preparation for being auto-generated
...
- Add separator line for parser to find
- Remove all @users.noreply.github.com addresses
- Use proper character for Aurélien Bouteiller
- Fix spelling of Edgar Gabriel
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-18 07:15:26 -05:00
Jeff Squyres
5ac7b3c6d2
.mailmap: Remove stale SVN references
...
Also explain the true purpose of this file.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-18 07:10:27 -05:00
Artem Polyakov
6ea8cccdab
Merge pull request #1969 from artpol84/pmix_jobid_fix
...
Pmix jobid fix
2016-08-18 17:24:58 +07:00
Nathan Hjelm
6aa658ae33
ompi/request: change semantics of ompi request callbacks
...
This commit changes the semantics of ompi request callbacks. If a
request's callback has freed or re-posted (using start) a request,
the callback must return 1 instead of OMPI_SUCCESS. This indicates
to ompi_request_complete that the request should not be modified
further. This fixes a race condition in osc/pt2pt that could lead
to the req_state being inconsistent if a request is freed between
the callback and setting the request as complete.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-08-17 20:14:01 -06:00
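
The callback contract described in the commit above can be modeled in a few lines. This is a toy Python analogue, not the Open MPI C code: the Request class, its fields, and the numeric value of OMPI_SUCCESS are assumptions chosen only to show why the completion routine must stop touching a request once the callback reports that it freed or restarted it.

```python
OMPI_SUCCESS = 0  # stand-in value; only the distinction from 1 matters here


class Request:
    """Hypothetical stand-in for a request object."""

    def __init__(self, callback=None):
        self.callback = callback
        self.complete = False
        self.freed = False


def request_complete(req):
    """Toy analogue of ompi_request_complete."""
    if req.callback is not None:
        rc = req.callback(req)
        if rc == 1:
            # The callback freed or re-posted the request, so it must
            # not be modified any further.
            return
    # Only reached when the request is still owned by the caller.
    req.complete = True


def freeing_callback(req):
    """Callback that releases the request and says so by returning 1."""
    req.freed = True
    return 1


def plain_callback(req):
    """Callback that leaves the request alone."""
    return OMPI_SUCCESS
```

In the model, a request handled by freeing_callback is never marked complete afterwards, which is the write-after-free window the commit message describes closing.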
Sylvain Jeaugey
61e900eea5
Fix typo calling allreduce with the allgather module.
...
That was causing CUDA collective to crash.
2016-08-17 17:05:13 -07:00
rhc54
394e23d179
Merge pull request #1981 from rhc54/topic/timeout
...
Support the PMIX_TIMEOUT key at the PMIx server when timeout=0 - this…
2016-08-17 17:40:43 -05:00
Edgar Gabriel
e14c23ba79
Merge pull request #1980 from edgargabriel/topic/coverty-cleanup
...
io/ompio: Topic/coverty cleanup
2016-08-17 17:27:51 -05:00
Ralph Castain
7da9793fef
Support the PMIX_TIMEOUT key at the PMIx server when timeout=0 - this indicates that the user doesn't want a lookup of any data from the host RM.
2016-08-17 16:26:58 -05:00
Edgar Gabriel
2c8437ce62
fs/pvfs2: fix a common symbol
2016-08-17 13:10:32 -05:00
Edgar Gabriel
eba5293586
fix Coverity warning CID 1369021
2016-08-17 13:02:45 -05:00
Nathan Hjelm
cdbc94e34e
Merge pull request #1977 from hjelmn/osc_pt2pt_fix
...
osc/pt2pt: make receive count an unsigned int
2016-08-17 09:38:33 -06:00
Nathan Hjelm
40b70889e5
osc/pt2pt: make receive count an unsigned int
...
This receive_count MCA variable should never be negative. Change it
to an unsigned int.
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2016-08-17 08:14:24 -06:00
Jeff Squyres
ce0124603d
.gitignore: add test executable to ignore
...
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-17 04:27:51 -07:00
Gilles Gouaillardet
8faa1edafa
osc/pt2pt: silence misc warnings
2016-08-17 14:24:14 +09:00
Gilles Gouaillardet
6b7bc64101
spml/yoda: MCA_PML(add_procs) all procs from oshmem_comm_world
...
and fix oshmem_group_proc_{init,create} so they use the number of procs in oshmem_comm_world
Thanks Debendra Das for the report and Josh Ladd for the guidance
Fixes open-mpi/ompi#1966
2016-08-17 14:24:02 +09:00
LANL OMPI Bot
96c7762050
Merge pull request #1942 from hppritcha/topic/minor_ofi_fix
...
mtl/ofi: use mca param to set av type
2016-08-16 14:14:12 -06:00
Nathan Hjelm
2e1378596f
Merge pull request #1953 from hjelmn/pt2pt_fixes
...
osc/pt2pt updates
2016-08-16 08:00:24 -06:00
Jeff Squyres
71ec5cfb43
rsh: robustify the check for plm_rsh_agent default value
...
Don't strcmp against the default value -- the default value may change
over time. Instead, check to see if the MCA var source is not
DEFAULT.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2016-08-16 06:58:20 -05:00
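
The difference between the two checks in the rsh commit above can be illustrated with a small model. This is a Python sketch under stated assumptions, not the MCA variable API: VarSource, McaVar, and both helper functions are hypothetical names, and the value strings are only examples.

```python
from dataclasses import dataclass
from enum import Enum, auto


class VarSource(Enum):
    """Hypothetical set of places a variable's value can come from."""
    DEFAULT = auto()
    ENVIRONMENT = auto()
    COMMAND_LINE = auto()
    FILE = auto()


@dataclass
class McaVar:
    value: str
    source: VarSource = VarSource.DEFAULT


def user_set_fragile(var, assumed_default):
    """Old style: compare the value with a hard-coded default string."""
    return var.value != assumed_default


def user_set_robust(var):
    """New style: ask where the value came from instead."""
    return var.source is not VarSource.DEFAULT
```

If the built-in default later changes, the string comparison reports an untouched variable as user-set, and it also misses a user who explicitly passes the same string as the default. The source check handles both cases.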
rhc54
d7cd802426
Merge pull request #1971 from rhc54/topic/sesdir
...
Update the session dir structure. Restore the creation of a top-level…
2016-08-16 03:14:08 -05:00
Ralph Castain
ae2af61ee3
Update the session dir structure. Restore the creation of a top-level dir based on userid so that everything is contained under the user's top-level dir. Make the next level down (the "job family" level) be either the pid (indicated by a name of "pid.N") or the job family if not launched by mpirun. This allows for proper rendezvous by direct-launched procs.
2016-08-15 22:46:46 -05:00
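
One reading of the session directory layout described above, sketched in Python. Only the "pid.N" name comes from the commit message; the "user.<uid>" top-level name, the plain job family directory name, and both function names are hypothetical placeholders, and the sketch assumes the pid form applies when mpirun does the launch.

```python
import posixpath


def user_top_dir(tmp_base, uid):
    """Top-level directory derived from the user id (name is hypothetical)."""
    return posixpath.join(tmp_base, f"user.{uid}")


def job_family_dir(top_dir, launched_by_mpirun, pid=None, job_family=None):
    """Second level: 'pid.N' under mpirun, otherwise the job family."""
    if launched_by_mpirun:
        if pid is None:
            raise ValueError("pid is required when launched by mpirun")
        return posixpath.join(top_dir, f"pid.{pid}")
    if job_family is None:
        raise ValueError("job_family is required for direct launch")
    # Direct-launched procs share the job family, so they can all
    # compute the same path and rendezvous there.
    return posixpath.join(top_dir, str(job_family))
```

The point of the second branch is that direct-launched processes do not share a launcher pid, so a value they all know (the job family) has to name the directory.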
rhc54
dd05f085e9
Merge pull request #1968 from rhc54/topic/rsh
...
Further cleanup getpwuid usage - try it first (unless completely disa…
2016-08-15 22:11:21 -05:00
Gilles Gouaillardet
3126ff77e2
pmix2x: common syms: whitelist bison-generated common symbols
...
Bison generates some common symbols that we can't do anything about,
so whitelist them.
2016-08-16 11:29:06 +09:00
Artem Polyakov
c5a91c5c9d
opal/pmix: fix pmix jobid calculation if external PMIx server is used.
2016-08-15 21:13:51 +03:00
Ralph Castain
9f43db7303
Further clean up getpwuid usage - try it first (unless completely disabled), and then silently fall back to other methods.
2016-08-15 07:51:36 -07:00
Ralph Castain
ecbedee8bb
Fix typo
2016-08-15 07:32:00 -07:00
Artem Polyakov
f3c816b52e
opal/pmix: fix indentation in some files.
2016-08-15 18:21:50 +07:00
Gilles Gouaillardet
483685eb6a
update .gitignore
...
remove autogenerated opal/mca/pmix/pmix2x/pmix/src/include/pmix_config.h.in
2016-08-15 17:00:20 +09:00
rhc54
2228d2efc2
Merge pull request #1965 from rhc54/topic/pmixfix
...
Provide backward compatible keys so that the non-PMIx components in t…
2016-08-13 13:48:12 -07:00
Ralph Castain
be8424b691
Provide backward compatible keys so that the non-PMIx components in the opal/pmix framework don't have to adjust as we continue to work on finalizing the PMIx reference scheme. Activate and utilize the new PMIx show_help capability to provide more meaningful error output when the server cannot start.
...
Add a contrib script to clean up permissions incorrectly modified due to things like smb mounts
2016-08-13 12:13:04 -07:00
rhc54
d12e50b2d6
Merge pull request #1963 from rhc54/topic/pmixfix
...
Fix shared memory rendezvous
2016-08-13 09:59:14 -07:00
Ralph Castain
08a0644df5
Fix shared memory rendezvous
2016-08-13 08:14:50 -07:00
rhc54
ddde154d28
Merge pull request #1962 from rhc54/topic/notify
...
Ensure we properly convert pmix status to ORTE state before activatin…
2016-08-13 06:59:50 -07:00
Ralph Castain
48d35a9627
Ensure we properly convert pmix status to ORTE state before activating an error state upon notification. Clean up some conversion issues on notification info. Add a new orte_notify.c test program.
2016-08-12 21:14:29 -07:00
rhc54
9868093bef
Merge pull request #1961 from rhc54/topic/static
...
Setup the job list in the PMIx integration so that static ports can run
2016-08-12 15:17:31 -07:00
rhc54
9eed451916
Merge pull request #1960 from rhc54/topic/rsh
...
Restore the rsh template creation code
2016-08-12 13:38:43 -07:00
rhc54
8d67f753ca
Merge pull request #1959 from rhc54/topic/nodeid
...
The node index isn't normally passed with the packed node object, so …
2016-08-12 13:30:10 -07:00
Ralph Castain
4a4c9703a9
Setup the job list in the PMIx integration so that static ports can run
2016-08-12 13:27:10 -07:00
rhc54
1ef3c86d44
Merge pull request #1931 from hjelmn/ess_fix
...
ess/base: set up nidmap after pmix
2016-08-12 13:10:30 -07:00
Ralph Castain
5717b75b45
Restore the rsh template creation code
2016-08-12 12:43:40 -07:00
rhc54
ee1ee2086c
Merge pull request #1958 from rhc54/topic/path
...
Fix a bug where we were requiring that all paths in $PATH be absolute
2016-08-12 12:31:43 -07:00