Commit Graph

1142 Commits

Author SHA1 Message Date
Andrey Maslennikov
547b7da664 rpmbuild: fix rpmtopdir redefinition
Erasing this variable by default makes outside definition useless.

Signed-off-by: Andrey Maslennikov <andreyma@mellanox.com>
(cherry picked from commit c7d51a3a83)
2018-09-19 10:43:35 +03:00
Ralph Castain
98b4ed9a3a Fix the no-disconnect test
A race condition exists based on whether or not the userdata object attached to a hwloc_obj_t has been initialized. These objects are set up whenever we scan for resources under that location. You therefore must not set a variable to the pointer to the userdata object and then call a function that will initialize the data in it. Instead, set the variable after the function call, and protect against a NULL pointer.

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-06-19 13:52:34 -07:00
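
The ordering rule described in this commit can be illustrated with a minimal sketch. This is not the ORTE or hwloc code: `TopoObj`, `TopoData`, and `scan_resources` are hypothetical stand-ins for `hwloc_obj_t`, its userdata, and the scanning function, and the value `4` is arbitrary. It only shows why a reference captured before initialization is stale, and why the later read still needs a null check.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TopoData:
    """Hypothetical stand-in for the userdata attached to a topology object."""
    available_cpus: int


@dataclass
class TopoObj:
    """Hypothetical stand-in for hwloc_obj_t; userdata starts uninitialized."""
    userdata: Optional[TopoData] = None


def scan_resources(obj: TopoObj) -> None:
    """Hypothetical scan that creates the userdata the first time it runs."""
    if obj.userdata is None:
        obj.userdata = TopoData(available_cpus=4)


def cpus_wrong_order(obj: TopoObj) -> Optional[int]:
    data = obj.userdata   # captured before initialization, so it may be None
    scan_resources(obj)   # initializes obj.userdata, but `data` is now stale
    return data.available_cpus if data is not None else None


def cpus_right_order(obj: TopoObj) -> Optional[int]:
    scan_resources(obj)   # let the scan initialize the data first
    data = obj.userdata   # read the reference only afterwards
    if data is None:      # still guard against a missing object
        return None
    return data.available_cpus
```

On a fresh `TopoObj()`, `cpus_wrong_order` sees no data even though the scan has run, while `cpus_right_order` sees the initialized value.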
Ralph Castain
081a0d98eb Ignore the ud/oob component
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-06-18 13:25:16 -07:00
Ralph Castain
014bb3c8de Fix external hwloc builds
Remove spurious comma in header file definition. Remove unused variables

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-06-03 11:24:21 -07:00
Nathan Hjelm
85d1965a0f Merge pull request #4828 from hppritcha/topic/update_lanl_toss_platform
lanl/platform: add new toss2/3 platform files
2018-05-01 09:52:14 -06:00
Brian Barrett
fe2b6cf1d6 dist: Don't build case-conflicting html man pages
The html man page builder creates a web page for every
man page generated by our releases.  We then check all
those pages into the ompi-www repo.  Force the build
to skip case-conflicting pages (ie, mpiCC because it
conflicts with mpicc), even if building on Linux, so that
ompi-www continues to be sane when cloned onto MacOS.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
2018-03-29 21:44:58 +00:00
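
A minimal sketch of the idea in this commit, assuming page names are plain strings: group names by their lowercase form and keep one entry per group, so the set is safe on a case-insensitive filesystem. The function name and the tie-breaking policy (prefer the all-lowercase spelling, otherwise the first in sorted order) are assumptions for illustration, not the behavior of the actual build script.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def drop_case_conflicts(names: Iterable[str]) -> List[str]:
    """Keep one name from every group that differs only by letter case.

    Assumed policy: the all-lowercase spelling wins when present;
    otherwise the first name in sorted order is kept.
    """
    groups: Dict[str, List[str]] = defaultdict(list)
    for name in names:
        groups[name.lower()].append(name)

    kept = []
    for lowered, members in groups.items():
        if lowered in members:
            kept.append(lowered)
        else:
            kept.append(sorted(members)[0])
    return sorted(kept)
```

With `mpicc.1.html`, `mpiCC.1.html`, and `mpirun.1.html` as input, the sketch keeps `mpicc.1.html` and `mpirun.1.html` and skips the conflicting `mpiCC.1.html`.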
Ralph Castain
538fd18fad Update default MCA params in platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-03-25 17:14:01 -07:00
Aurélien Bouteiller
9e23d24bb4 Scaling.pl: Fix Srun options and wait for DVM launch
Flush out the DVM ready notice on stdout

Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>
2018-03-15 00:00:49 -04:00
Howard Pritchard
8eb738a9c8 lanl/platform: add new toss2/3 platform files
remove old platform files
add new platform files for toss2/toss3
OPA/MLX-IB variants.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2018-02-17 10:32:07 -07:00
Ralph Castain
4381b2c60f Add ability to multiply number of nodes when running scaling tests
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-11-12 16:38:37 -08:00
Ralph Castain
e7c6718d29 Add timeout option to scaling script
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-24 12:33:22 -07:00
Ralph Castain
3b71be4db4 Update the scaling script to avoid use of "system" command, thus ensuring that each command sees the same environment. Fix prun to pick up and propagate OMPI MCA params
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-23 16:27:41 -07:00
Ralph Castain
6ffb0d0507 Ensure that the pmix server system-level rendezvous file is only output by the HNP as (at least for slurm on cray) a daemon could be colocated with the HNP and overwrite the file. Update the scaling.pl script to only use the system-level rendezvous so it doesn't get rejected by a colocated daemon
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-14 10:16:49 -07:00
Ralph Castain
31bce4ba9c Update the scaling.pl script
* check that the command succeeds when pre-positioning the file to ensure there isn't an error somewhere in the execution

* properly define srun cmd line options

* terminate the orte-dvm only when it is actually in operation so prun doesn't generate spurious error messages

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-10-13 18:23:18 -07:00
Howard Pritchard
406c4cc126 Merge pull request #4299 from hppritcha/topic/update_lanl_toss_platform_file
LANL/platform: disable use of XRC recv bufs
2017-10-06 09:31:17 -06:00
Howard Pritchard
1a639ec477 LANL/platform: disable use of XRC recv bufs
Forgot as part of #3970 to disable use of XRC
recv bufs by default in LANL platform config
file.

related to #4300

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-10-04 11:40:20 -06:00
Brian Barrett
2ac8f75e0c contrib: remove nightly tarball scripts
The nightly tarball scripts have been moved to the ompi-scripts
repo (and rewritten), so are no longer needed in the ompi
repo.  Clean up some bits to make life a bit less confusing.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
2017-10-02 12:35:42 -07:00
Ralph Castain
1dd45e0f30 Get the scaling test to properly run a scan across the #nodes
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-09-22 21:04:36 -07:00
Ralph Castain
4f932819aa Update platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-09-22 16:05:57 -07:00
Ralph Castain
3493c43468 Sync to PMIx master
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-09-22 10:48:00 -07:00
Joshua Hursey
7a3f1ff75e contrib: Script to automate LIBADD changes for components
* This script will search for all of the `Makefile.am` files in each
   of the project-level components. Then it adds the project-level
   library to `mca_FRAMEWORK_COMPONENT_la_LIBADD`.
   - If the library is already in the LIBADD list then it's skipped.
     So it is safe to run multiple times on the same codebase.

Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>
2017-08-24 11:56:16 -04:00
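
The "safe to run multiple times" property described above can be sketched as an idempotent text edit. This is an illustration only, not the contributed script: the function name is hypothetical, `libexample.la` is a placeholder library name, and the sketch assumes the `LIBADD` assignment sits on a single line with no backslash continuations.

```python
import re


def add_to_libadd(makefile_text: str, variable: str, library: str) -> str:
    """Append `library` to `variable` unless it is already listed."""
    pattern = re.compile(
        rf"^({re.escape(variable)}[ \t]*\+?=)(.*)$", re.MULTILINE
    )
    match = pattern.search(makefile_text)

    if match is None:
        # Variable not defined yet: add a new assignment at the end.
        sep = "" if (not makefile_text or makefile_text.endswith("\n")) else "\n"
        return f"{makefile_text}{sep}{variable} = {library}\n"

    if library in match.group(2).split():
        # Already present: leave the file untouched (idempotent).
        return makefile_text

    new_line = f"{match.group(1)}{match.group(2).rstrip()} {library}"
    return makefile_text[:match.start()] + new_line + makefile_text[match.end():]
```

Running the function a second time on its own output returns the text unchanged, which is the property the commit message relies on.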
Ralph Castain
98f36711e3 Update hwloc to latest shmem branch. Correct typos in update-my-copyright.pl.
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-08-15 13:32:12 -07:00
Ralph Castain
d1b7c3d8d5 Silence some compile-time warnings. Update scripts now that AUTHORS is gone
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-08-04 20:08:31 -07:00
Brian Barrett
fe8e4a0402 dist: Autogenerate AUTHORS file
Per discussion at the Summer 2017 developers meeting, generate
the AUTHORS list at make dist time, rather than trying to
keep it up to date and merge on the branches by hand.  While
most of the data is generated from git, the organization list
was maintained by hand.  The general feeling at the meeting was
that the organization list was not adding value and there were
concrete cases where it involved much chasing by the RMs, so
it has been removed.

Signed-off-by: Brian Barrett <bbarrett@amazon.com>
2017-07-27 13:04:17 -07:00
Ralph Castain
f7e8780a42 Remove fortran support from platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 21:02:30 -07:00
Artem Polyakov
35f15a0ba5 contrib: Fix mellanox platform defaults (btl/sm -> btl/vader)
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-15 14:32:26 +07:00
Mark Allen
c5cb07c12e adding --manual-list="file" to update-my-copyright.pl
Along with using git status and related commands to find a list of
modified files to update the copyright on, this adds the option of
using a manually created list from a file (one filename per line).

Signed-off-by: Mark Allen <markalle@us.ibm.com>
2017-07-11 02:13:22 -04:00
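
As a rough sketch of the option described above, the following reads a manually created list with one filename per line. It is not the Perl script itself: the function name is hypothetical, `detected` stands in for whatever the git-based detection returns, and the sketch assumes the manual list replaces the detected list rather than extending it, which the commit message does not state.

```python
import argparse
from pathlib import Path
from typing import List, Sequence


def files_to_update(argv: Sequence[str], detected: Sequence[str]) -> List[str]:
    """Choose the files to process from --manual-list or the detected set."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--manual-list", metavar="FILE", default=None)
    args = parser.parse_args(argv)

    if args.manual_list is None:
        return list(detected)

    # One filename per line; blank lines are ignored.
    lines = Path(args.manual_list).read_text().splitlines()
    return [line.strip() for line in lines if line.strip()]
```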
Ralph Castain
6ec2ad5288 Fix the pmix_query API when it asks for something that returns an array of pmix_info_t. Protect the PMIX_INFO_FREE macro from NULL arrays. Update the mpi_memprobe scaling test
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-06-22 20:11:36 -07:00
Ralph Castain
243076dd8c Update gadget platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-04-19 21:45:13 -06:00
Howard Pritchard
3918b7a796 Merge pull request #3213 from hppritcha/topic/remove_loadleveer
orte/ras: remove loadleveler support
2017-04-18 09:18:54 -06:00
Jeff Squyres
7e57075f0d Merge pull request #3248 from jsquyres/pr/remove-macosx-pkg-support
dist: remove OS X package script
2017-03-29 18:46:14 -04:00
Jeff Squyres
f0a8a0af51 dist: remove OS X package script
We stopped supporting this long ago.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-03-29 10:13:01 -04:00
Kevin Buckley
9e23c5e3f6 openmpi.spec: also put the modulefile in /opt if install_in_opt==1
Thanks to Kevin Buckley for noticing the issue and supplying the
patch.

[skip ci]
bot:notest

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-03-28 20:45:09 -04:00
Howard Pritchard
9350aa5d71 orte/ras: remove loadleveler support
Remove loadleveler as it is obsolescent and is no longer supported.

Fixes #3167

We'll wait for final check of whether or not loadleveler even
compiles/functions before merging this.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-03-21 10:32:28 -06:00
Howard Pritchard
8e4689c2b8 v3.x:updates for branching v3.x
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-03-14 14:03:47 -06:00
Ralph Castain
24e8639826 Platform file update
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-14 11:11:48 -06:00
Ralph Castain
6d6bc9bd07 Update alps module to new APIs
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-12 09:43:07 -07:00
Ralph Castain
48fc339718 Create an alternative mapping method that pushes responsibility
onto the backend daemons. By default, let mpirun only pack the app_context
info and send that to the backend daemons where the mapping will
be done. This significantly reduces the computational time on mpirun as it isn't
running up/down the topology tree computing thousands of binding
locations, and it reduces the launch message to a very small number of
bytes.

When running -novm, fall back to the old way of doing things
where mpirun computes the entire map and binding, and then sends
the full info to the backend daemon.

Add a new cmd line option/mca param --fwd-mpirun-port that allows
mpirun to dynamically select a port, but then passes that back to
all the other daemons so they will use that port as a static port
for their own wireup. In this mode, we no longer "phone home" directly
to mpirun, but instead use the static port to wireup at daemon
start. We then use the routing tree to rollup the initial
launch report, and limit the number of open sockets on mpirun's node.

Update ras simulator to track the new nidmap code

Cleanup some bugs in the nidmap regex code, and enhance the error message for not enough slots to include the host on which the problem is found.

Update gadget platform file

Initialize the range count when starting a new range

Fix the no-np case in managed allocation

Ensure DVM node usage gets cleaned up after each job

Update scaling.pl script to use --fwd-mpirun-port. Pre-connect the daemon to its parent during launch while we are otherwise waiting for the daemon's children to send their "phone home" rollup messages

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-07 20:43:12 -08:00
Ralph Castain
a774ea73e4 Skip empty files to avoid infinite loop
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-27 06:02:54 -08:00
Ralph Castain
9f8f7f3189 Add CPPFLAGS to build of rml/ofi component.
Fix finalize to ensure we only destruct the msg queue list once.
Update platform file

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-25 09:17:41 -08:00
Ralph Castain
8ae55429bc Be a little less OMPI-centric on checking for the top-level directory
Look for .git directory

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-22 11:29:51 -08:00
Ralph Castain
665850ed69 Use regex to define the protected files
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-18 06:45:13 -08:00
Ralph Castain
2f0aec709a Protect the embedded libraries when updating copyrights - we shouldn't be overwriting their copyrights with our own
bot:notest

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-18 06:45:13 -08:00
Jeff Squyres
81e57bb7db nightly-tarball scripts: more quoting fixes
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-12 04:19:35 +00:00
Jeff Squyres
2d4fc45429 nightly-tarball scripts: fix quoting
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-11 22:40:10 +00:00
Jeff Squyres
b385ac4f09 nightly-tarball scripts: more debugging and robustness
Check the exit status of major commands, as well as (optionally)
output the pwd and command being executed (when debugging).  Also,
read the $debug variable from the environment; if it's set, go into
debugging mode (vs. requiring a modification to the script to enable
debugging mode).

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-11 21:50:10 +00:00
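
The two habits this commit describes, reading a debug switch from the environment and checking the exit status of each major command, can be sketched as follows. The original scripts are not Python, and the helper names here are hypothetical. The sketch assumes "set" means a non-empty `debug` variable.

```python
import os
import subprocess
import sys
from typing import Mapping, Optional, Sequence


def debug_enabled(environ: Optional[Mapping[str, str]] = None) -> bool:
    """Return True when the `debug` variable is set to a non-empty value."""
    env = os.environ if environ is None else environ
    return bool(env.get("debug"))


def run_checked(cmd: Sequence[str], debug: bool = False) -> None:
    """Run a command and fail loudly if it exits with a non-zero status."""
    if debug:
        # In debug mode, report where and what is about to run.
        print(f"[debug] cwd={os.getcwd()} cmd={' '.join(cmd)}", file=sys.stderr)
    result = subprocess.run(list(cmd))
    if result.returncode != 0:
        raise RuntimeError(f"command failed ({result.returncode}): {' '.join(cmd)}")
```

A caller would typically compute `debug_enabled()` once at startup and pass the result to every `run_checked` call, so debugging needs no edit to the script.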
Jeff Squyres
0178307d36 openmpi-nightly-tarball: remove spurious echo statement
Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-11 21:50:10 +00:00
Jeff Squyres
704d6a0309 create_tarball: read $debug from environment
If $debug is set in the environment, use that.  This allows enabling
debug mode without requiring an edit to the script.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-11 21:50:10 +00:00
Jeff Squyres
51def91003 nightly tarballs: compare the hashes to know if they're new
The filenames contain date/timestamps; if you compare those, the
tarball generated every night will *always* be new.  Instead, separate
out the git hash from the old and new tarballs, and compare those.

Signed-off-by: Jeff Squyres <jsquyres@cisco.com>
2017-02-09 16:56:00 +00:00
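
A minimal sketch of the comparison described above, assuming tarball names follow a `<name>-<timestamp>-<git hash>.tar.<ext>` layout. That layout, the regular expression, and the function names are assumptions for illustration. The point is only that the hash is compared, not the whole filename.

```python
import re
from typing import Optional

# Assumed layout: <name>-<12-digit timestamp>-<hex git hash>.tar.<ext>
_HASH_RE = re.compile(r"-(\d{12})-([0-9a-f]{7,40})\.tar\.\w+$")


def tarball_hash(filename: str) -> Optional[str]:
    """Extract the git hash from a tarball filename, or None if absent."""
    match = _HASH_RE.search(filename)
    return match.group(2) if match else None


def is_new_snapshot(old_name: str, new_name: str) -> bool:
    """Report whether the new tarball was built from a different commit."""
    old_hash = tarball_hash(old_name)
    new_hash = tarball_hash(new_name)
    if old_hash is None or new_hash is None:
        # Cannot compare hashes: treat the tarball as new (an assumption).
        return True
    return old_hash != new_hash
```

Two names that differ only in their timestamp compare as the same snapshot, whereas comparing the full filenames would always report a new tarball.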
Ralph Castain
28abe78f8c Add new platform files. Modify scaling.pl to support ppn option
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-29 15:55:49 -08:00