Josh Hursey
c6595c2289
Merge pull request #2792 from jjhursey/topic/libevent-conf2
...
libevent2022: Fix broken configure AC_LANG_PROGRAM
2017-01-24 08:31:46 -06:00
Ralph Castain
4e9364b9a4
Merge pull request #2794 from rhc54/topic/regs
...
Next step in reducing launch time
2017-01-24 03:19:57 -08:00
Gilles Gouaillardet
682f5116aa
Merge pull request #2781 from ggouaillardet/topic/misc_fixes_and_plugs
...
fix misc bugs and plug misc memory leaks
2017-01-24 14:41:45 +09:00
Ralph Castain
86ab751c5e
Next step in reducing launch time: begin reducing the size of the launch message itself. Start by expressing the daemon map as a set of three regular expression strings. On an 8k cluster, this reduces the nidmap contribution from over 200kBytes to 21 bytes in size.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-23 19:54:47 -08:00
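The size reduction described in 86ab751c5e comes from replacing an explicit per-node list with a compact pattern. The following is a minimal sketch of that idea only, not the ORTE implementation: the function name `compress_hostnames` and the `prefix[width:start-end]` output format are assumptions made for illustration, and the commit's actual three-string encoding is not reproduced here.

```python
import re


def compress_hostnames(names):
    """Collapse runs of consecutively numbered hostnames into ranges.

    Hypothetical output format: prefix[width:start-end], where width is the
    zero-padded digit count of the numeric suffix.
    """
    ranges = []  # each entry: [prefix, width, start, end]
    for name in names:
        m = re.fullmatch(r"(.*?)(\d+)", name)
        if m is None:
            # No numeric suffix: keep the name verbatim.
            ranges.append([name, 0, None, None])
            continue
        prefix, digits = m.group(1), m.group(2)
        num, width = int(digits), len(digits)
        last = ranges[-1] if ranges else None
        if (last and last[0] == prefix and last[1] == width
                and last[3] is not None and last[3] + 1 == num):
            last[3] = num  # extend the current run
        else:
            ranges.append([prefix, width, num, num])

    parts = []
    for prefix, width, start, end in ranges:
        if start is None:
            parts.append(prefix)
        elif start == end:
            parts.append(f"{prefix}[{width}:{start}]")
        else:
            parts.append(f"{prefix}[{width}:{start}-{end}]")
    return ",".join(parts)


if __name__ == "__main__":
    nodes = [f"node{i:04d}" for i in range(1, 8193)]
    print(len(",".join(nodes)), "bytes as an explicit list")
    print(compress_hostnames(nodes))
```

With uniformly named nodes the whole cluster collapses to a single short range; irregular names fall back to being listed one by one, so the saving depends on how regular the naming is.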
Joshua Hursey
72ac812039
libevent2022: Fix broken configure AC_LANG_PROGRAM
...
* Similar to commit 029964a748
This removes an extra `int main` during configure.
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2017-01-23 21:47:59 -06:00
Josh Hursey
b9b96f13ca
Merge pull request #2788 from jjhursey/topic/libevent-conf
...
libevent2022: Fix broken configure AC_LANG_PROGRAM
2017-01-23 21:39:05 -06:00
Gilles Gouaillardet
d54e54538a
orted/orted_submit: plug a memory leak
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:30 +09:00
Gilles Gouaillardet
189da7fdab
pmix2x: plug a memory leak in _event_hdlr()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:30 +09:00
Gilles Gouaillardet
acbc32d3b2
pmix2x: plug a memory leak in opal_lkupcbfunc()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:29 +09:00
Gilles Gouaillardet
b5b21043c4
pmix2x: plug a memory leak in _reg_nspace()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:29 +09:00
Gilles Gouaillardet
0f47310a75
pmix2x/pmix2x_client: plug misc memory leaks
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:29 +09:00
Gilles Gouaillardet
f4dc7e4134
orted/orted_submit: plug misc memory leaks
...
- always invoke init_globals() before opal_cmd_line_parse(orte_cmd_line, ...)
- plug more leaks in init_globals()
- remove unused env_val and personalities fields from orte_cmd_options_t
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:29 +09:00
Gilles Gouaillardet
d5aa310884
mpiext/affinity: initialize all output variables of OMPI_Affinity_str()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:29 +09:00
Gilles Gouaillardet
501eb8dc7e
ompio: plug misc memory leaks
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:13:19 +09:00
Gilles Gouaillardet
1a6c17ec7d
opal/util: plug a memory leak
...
by using opal_setenv() instead of putenv()
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:12:47 +09:00
Gilles Gouaillardet
d0629f18c2
coll/libnbc: optimize size one communicators
...
simply "return" with ompi_request_empty if the communicator size is 1
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:12:47 +09:00
Gilles Gouaillardet
9d6e0482a6
orte/data_server: plug a memory leak in orte_data_server()
...
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:12:47 +09:00
Gilles Gouaillardet
0bdc594b2e
rml/base: plug a memory leak in orte_rml_API_recv_cancel()
...
simply return when the orte event thread has gone
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:12:47 +09:00
Gilles Gouaillardet
6f2ca5809b
man: fix a typo in MPI_Win_get_name()
...
Thanks Nicolas Joly for the report
Fixes open-mpi/ompi#2782
Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>
2017-01-24 09:08:13 +09:00
Jeff Squyres
e7588f0509
Merge pull request #2785 from edgargabriel/pr/sharedfp-append-fix
...
common/ompio: update comment based on the previous commit.
2017-01-23 14:06:13 -08:00
Joshua Hursey
029964a748
libevent2022: Fix broken configure AC_LANG_PROGRAM
...
* The AC_LANG_PROGRAM macro adds the `main()` so it is erroneous
to add it to the test program.
* This was detected with the XL compilers which will fail to
build the program in this situation. The GNU compiler does not
error out or warn, but successfully compiles the program.
Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2017-01-23 13:44:12 -06:00
Edgar Gabriel
4dc09de3b8
common/ompio: update comment based on the previous commit.
...
No source code changed.
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-01-23 13:38:05 -06:00
Ralph Castain
f3920828ed
Merge pull request #2784 from rhc54/topic/pmixup
...
Update to latest PMIx master
2017-01-23 11:01:19 -08:00
Edgar Gabriel
2215f29849
Merge pull request #2783 from edgargabriel/pr/sharedfp-append-fix
...
Pr/sharedfp append fix
2017-01-23 10:38:27 -06:00
Ralph Castain
8c960bae8d
Update to latest PMIx master
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-23 07:07:40 -08:00
Edgar Gabriel
3eae0eecd0
io/ompio: change default for sharedfp_lazy_open parameter
...
Reverse the logic of io_ompio_sharedfp_lazy_open. The user now has to explicitly
disable shared fp in order for the structures not to be allocated.
Otherwise, resetting the shared fp (e.g. when the file was opened
in append mode) will not work correctly and the code could deadlock.
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-01-23 08:59:22 -06:00
Edgar Gabriel
d3a8d38cc6
common/ompio: correctly position shared fp in append mode
...
Fixes a bug reported on the mailing list. ompio repositioned only the individual
file pointer when the file was opened in append mode. Set the shared file
pointer to point to the end of the file as well, in the same way as the
individual file pointer.
Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>
2017-01-23 08:59:05 -06:00
Ralph Castain
a61f7bdb26
Merge pull request #2780 from rhc54/topic/conn
...
Ensure we properly set the "shutting down" flag so connection drops by downstream peers are properly handled.
2017-01-23 06:40:28 -08:00
Ralph Castain
e7b12913b4
Ensure we properly set the "shutting down" flag so connection drops by downstream peers are properly handled.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-23 04:00:24 -08:00
Ralph Castain
0b4648b3a7
Merge pull request #2779 from hjelmn/oob_param
...
oob/base: fix num_threads registration type
2017-01-22 14:09:06 -08:00
Nathan Hjelm
954a4b7be3
oob/base: fix num_threads registration type
...
This commit fixes a bug in the registration of the num_threads MCA
variable. The variable is of type int and was being registered as
a boolean.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2017-01-22 14:02:34 -07:00
Ralph Castain
c549f82cdc
Merge pull request #2778 from rhc54/topic/threads
...
Ensure that oob/base level data is always accessed in the oob/base event thread. Make debruijn the default routed component
2017-01-22 11:21:34 -08:00
Ralph Castain
ac4fcd3f97
Ensure that oob/base level data is always accessed in the oob/base event thread. Make debruijn the default routed component
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-22 10:33:32 -08:00
Ralph Castain
adbcefebf8
Merge pull request #2777 from rhc54/topic/spawn
...
Fix comm_spawn and orte-dvm by resetting all used "node mapped" flags after building the child list
2017-01-22 08:07:08 -08:00
Ralph Castain
6560617c04
Fix comm_spawn and orte-dvm by resetting all used "node mapped" flags after building the child list
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-22 05:55:53 -08:00
Ralph Castain
59eafebf66
Merge pull request #2776 from rhc54/topic/fix
...
Add missing flag set to ensure nodes do not get double-added to job map.
2017-01-21 20:54:37 -08:00
Ralph Castain
639cdd4f9d
Add missing flag set to ensure nodes do not get double-added to job map.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-21 20:06:50 -08:00
Ralph Castain
164fc6436d
Merge pull request #2775 from rhc54/topic/oob3
...
More scaling efficiencies
2017-01-21 15:45:57 -08:00
Ralph Castain
e8e5f81abd
Something not quite right about the revised allocation algos, so revert them while retaining the larger initial and threshold sizes
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-21 14:37:45 -08:00
Ralph Castain
be3ef77739
Improve packing efficiency by raising the initial buffer size and modifying the extension code. Flag if a job map has had its nodes added so we don't have to loop repeatedly to check it.
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-21 14:03:19 -08:00
Ralph Castain
466cbd4d29
Rework the threading in oob/tcp so that daemons (including mpirun) use multiple progress threads to get messages out to their children, and so that the oob/base uses a separate one to setup sends. This allows the daemon cmd processor to execute in parallel with relay of messages, which significantly reduces launch times at scale
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-21 13:26:19 -08:00
Ralph Castain
917b88a2d5
Merge pull request #2771 from rhc54/topic/zlib
...
Check for zlib.h
2017-01-20 13:44:43 -08:00
Ralph Castain
08b5fe46db
Check for zlib.h
...
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-20 11:55:11 -08:00
Open MPI Team
dba106ee10
pmix nightly tarball: only save 7 days
...
We don't have infinite disk space: only save 7 days of builds, not 28.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-01-20 19:16:43 +00:00
Open MPI Team
96a90ffab3
remove-old.pl: update / fix minor bugs
...
- Ensure that $to_delete is always defined
- Re-indent to 4 spaces for readability
- Don't only delete files -- it's ok to delete directories, too
- Print the directory from which we are deleting
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-01-20 19:16:43 +00:00
Open MPI Team
e642d1d91c
nightly tarball: put the SSH target in a variable
...
Just to make the scripts a little less error-prone. Also split up the
ssh/scp lines just for readability.
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-01-20 19:16:43 +00:00
Jeff Squyres
328b654626
snapshot: fix hash comparison
...
- Don't use "-i" CLI option to perl; it's unnecessary here and causes
a warning
- Branch names may not be entirely letters (e.g., "v1.11"), so take
any character in the regexp to match the branch name
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-01-20 13:26:42 -05:00
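The branch-name point in 328b654626 can be illustrated with a small sketch. The snapshot script itself is Perl and its exact pattern is not shown in this log, so everything below is an assumption for illustration: the `<branch>-<timestamp>-<hash>` string layout, the two patterns, and the helper `snapshot_hash` are all hypothetical.

```python
import re

# Hypothetical version-string layout: <branch>-<timestamp>-<hash>
LETTERS_ONLY = re.compile(r"^([a-zA-Z]+)-(\d+)-([0-9a-f]+)$")
ANY_CHAR = re.compile(r"^(.+)-(\d+)-([0-9a-f]+)$")


def snapshot_hash(pattern, name):
    """Return the trailing hash field, or None if the pattern does not match."""
    m = pattern.match(name)
    return m.group(3) if m else None


if __name__ == "__main__":
    for name in ("master-201701200000-328b654", "v1.11-201701200000-328b654"):
        print(name, snapshot_hash(LETTERS_ONLY, name), snapshot_hash(ANY_CHAR, name))
```

A letters-only branch pattern silently fails on a name such as "v1.11", so the hash comparison never sees a value; accepting any character in the branch field avoids that.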
Ralph Castain
cfce565ce9
Merge pull request #2763 from naughtont3/tjn-ortedvm-daemonize
...
dvm: add daemonize and set-sid options
2017-01-20 08:08:21 -08:00
Thomas Naughton
39d335a277
dvm: add daemonize and set-sid options
...
Signed-off-by: Thomas Naughton <naughtont@ornl.gov>
2017-01-20 09:28:26 -05:00
Ralph Castain
33d97b22bc
Merge pull request #2766 from rhc54/topic/zlib
...
Compress the xcast message if bigger than a defined size to further improve launch performance at scale
2017-01-19 23:14:04 -08:00
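The threshold-based compression described in the pull request above can be sketched as follows. This is an illustration of the general technique using Python's standard zlib module, not the ORTE code: the threshold value, the one-byte flag, the four-byte length header, and the function names are all assumptions.

```python
import zlib

COMPRESS_THRESHOLD = 4096  # hypothetical cutoff in bytes


def pack_message(payload: bytes, threshold: int = COMPRESS_THRESHOLD) -> bytes:
    """Compress the payload only when it exceeds the threshold and it helps."""
    if len(payload) > threshold:
        compressed = zlib.compress(payload)
        if len(compressed) < len(payload):
            # Flag byte 1, then the original length so the receiver can verify.
            return b"\x01" + len(payload).to_bytes(4, "big") + compressed
    # Flag byte 0: payload is sent as-is.
    return b"\x00" + payload


def unpack_message(wire: bytes) -> bytes:
    """Reverse pack_message, checking the recorded length when compressed."""
    if wire[:1] == b"\x01":
        expected = int.from_bytes(wire[1:5], "big")
        data = zlib.decompress(wire[5:])
        if len(data) != expected:
            raise ValueError("decompressed length does not match header")
        return data
    return wire[1:]


if __name__ == "__main__":
    big = b"node" * 5000
    wire = pack_message(big)
    print(len(big), "->", len(wire), "bytes on the wire")
    assert unpack_message(wire) == big
```

Small messages skip compression entirely, since the CPU cost and header overhead would outweigh any saving; only large broadcasts pay for the extra step.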