Commit graph

505 commits

Author SHA1 Message Date
Ralph Castain
081a0d98eb Ignore the ud/oob component
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-06-18 13:25:16 -07:00
Ralph Castain
014bb3c8de Fix external hwloc builds
Remove spurious comma in header file definition. Remove unused variables

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-06-03 11:24:21 -07:00
Nathan Hjelm
85d1965a0f Merge pull request #4828 from hppritcha/topic/update_lanl_toss_platform
lanl/platform: add new toss2/3 platform files
2018-05-01 09:52:14 -06:00
Ralph Castain
538fd18fad Update default MCA params in platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2018-03-25 17:14:01 -07:00
Howard Pritchard
8eb738a9c8 lanl/platform: add new toss2/3 platform files
remove old platform files
add new platform files for toss2/toss3
OPA/MLX-IB variants.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2018-02-17 10:32:07 -07:00
Howard Pritchard
406c4cc126 Merge pull request #4299 from hppritcha/topic/update_lanl_toss_platform_file
LANL/platform: disable use of XRC recv bufs
2017-10-06 09:31:17 -06:00
Howard Pritchard
1a639ec477 LANL/platform: disable use of XRC recv bufs
Forgot as part of #3970 to disable use of XRC
recv bufs by default in LANL platform config
file.

related to #4300

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-10-04 11:40:20 -06:00
Ralph Castain
4f932819aa Update platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-09-22 16:05:57 -07:00
Ralph Castain
f7e8780a42 Remove fortran support from platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-07-20 21:02:30 -07:00
Artem Polyakov
35f15a0ba5 contrib: Fix mellanox platform defaults (btl/sm -> btl/vader)
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
2017-07-15 14:32:26 +07:00
Ralph Castain
243076dd8c Update gadget platform file
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-04-19 21:45:13 -06:00
Howard Pritchard
9350aa5d71 orte/ras: remove loadleveler support
Remove loadleveler as it is obsolescent and is no longer supported.

Fixes #3167

We'll wait for final check of whether or not loadleveler even
compiles/functions before merging this.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-03-21 10:32:28 -06:00
Ralph Castain
24e8639826 Platform file update
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-14 11:11:48 -06:00
Ralph Castain
6d6bc9bd07 Update alps module to new APIs
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-12 09:43:07 -07:00
Ralph Castain
48fc339718 Create an alternative mapping method that pushes responsibility
onto the backend daemons. By default, let mpirun only pack the app_context
info and send that to the backend daemons where the mapping will
be done. This significantly reduces the computational time on mpirun as it isn't
running up/down the topology tree computing thousands of binding
locations, and it reduces the launch message to a very small number of
bytes.

When running -novm, fall back to the old way of doing things
where mpirun computes the entire map and binding, and then sends
the full info to the backend daemon.

Add a new cmd line option/mca param --fwd-mpirun-port that allows
mpirun to dynamically select a port, but then passes that back to
all the other daemons so they will use that port as a static port
for their own wireup. In this mode, we no longer "phone home" directly
to mpirun, but instead use the static port to wireup at daemon
start. We then use the routing tree to rollup the initial
launch report, and limit the number of open sockets on mpirun's node.

Update ras simulator to track the new nidmap code

Cleanup some bugs in the nidmap regex code, and enhance the error message for not enough slots to include the host on which the problem is found.

Update gadget platform file

Initialize the range count when starting a new range

Fix the no-np case in managed allocation

Ensure DVM node usage gets cleaned up after each job

Update scaling.pl script to use --fwd-mpirun-port. Pre-connect the daemon to its parent during launch while we are otherwise waiting for the daemon's children to send their "phone home" rollup messages

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-03-07 20:43:12 -08:00
Ralph Castain
9f8f7f3189 Add CPPFLAGS to build of rml/ofi component.
Fix finalize to ensure we only destruct the msg queue list once.
Update platform file

Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-02-25 09:17:41 -08:00
Ralph Castain
28abe78f8c Add new platform files. Modify scaling.pl to support ppn option
Signed-off-by: Ralph Castain <rhc@open-mpi.org>
2017-01-29 15:55:49 -08:00
Ralph Castain
649301a3a2 Revise the routed framework to be multi-select so it can support the new conduit system. Update all calls to rml.send* to the new syntax. Define an orte_mgmt_conduit for admin and IOF messages, and an orte_coll_conduit for all collective operations (e.g., xcast, modex, and barrier).
Still not completely done as we need a better way of tracking the routed module being used down in the OOB - e.g., when a peer drops connection, we want to remove that route from all conduits that (a) use the OOB and (b) are routed, but we don't want to remove it from an OFI conduit.
2016-10-23 21:52:39 -07:00
Ralph Castain
2f966bf3bf Cleanup external PMIx v3 component for copy/paste errors - component and module require unique names
2016-10-20 09:11:46 -07:00
Alina Sklarevich
a2be17ec14 Revert "mellanox/optimized: set enable_openib_rdmacm_ibaddr=yes in the mellanox/optimized file."
This reverts commit 6cd7282631.
2016-06-06 11:26:11 +03:00
Nathan Hjelm
1e6b4f2f55 Merge pull request #1495 from hjelmn/new_hooks
Add new patcher memory hooks
2016-04-13 18:19:23 -06:00
Nathan Hjelm
27f8a4e806 opal: add code patcher framework
This commit adds a framework to abstract runtime code patching.
Components in the new framework can provide functions for either
patching a named function or a function pointer. The latter
functionality is not being used but may provide a way to allow memory
hooks when dlopen functionality is disabled.

This commit adds two different flavors of code patching. The first is
provided by the overwrite component. This component overwrites the
first several instructions of the target function with code to jump to
the provided hook function. The hook is expected to provide the full
functionality of the hooked function.

The linux patcher component is based on the memory hooks in ucx. It
only works on linux and operates by overwriting function pointers in
the symbol table. In this case the hook is free to call the original
function using the function pointer returned by dlsym.

Both components restore the original functions when the patcher
framework closes.

Changes had to be made to support Power/PowerPC with the Linux
dynamic loader patcher. Some of the changes:

 - Move code necessary for powerpc/power support to the patcher
   base. The code is needed by both the overwrite and linux
   components.

 - Move patch structure down to base and move the patch list to
   mca_patcher_base_module_t. The structure has been modified to
   include a function pointer to the function that will unapply the
   patch. This allows the mixing of multiple different types of
   patches in the patch_list.

 - Update linux patching code to keep track of the matching between
   got entry and original (unpatched) address. This allows us to
   completely clean up the patch on finalize.

All patchers keep track of the changes they made so that they can be
reversed when the patcher framework is closed.

At this time there are bugs in the Linux dynamic loader patcher so
its priority is lower than the overwrite patcher.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-04-13 17:16:13 -06:00
Nathan Hjelm
b1670f844d contrib/platform: don't disable dlopen
The --enable-static gives us what we want: statically linked components.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-04-13 17:16:12 -06:00
Alina Sklarevich
6cd7282631 mellanox/optimized: set enable_openib_rdmacm_ibaddr=yes in the mellanox/optimized file.
2016-04-11 18:01:16 +03:00
Nathan Hjelm
147e780fa5 contrib/lanl: update platform files for TOSS2
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-03-17 14:30:50 -06:00
Mike Dubman
cdffe4f92d BUILD: update mellanox platform file
add support for UCX
2015-10-21 11:39:30 +03:00
Howard Pritchard
89b9be3732 lanl/platform: fixes to pick up lustre
Fixes to lanl platform files to pick up lustre header
files, etc. for romio and ompi i/o.

Fixes #1033

Thanks to Jerome Vienne for spotting this.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-10-15 14:32:21 -05:00
Ralph Castain
c1bbbb5e2f Remove the last involvement of the OOB system from the MPI layer, remove the no-longer-needed usock/oob component, and have procs no longer open the RML, OOB, ROUTED, and GRPCOMM frameworks as PMIx now provides all required app-mpirun cmds
2015-09-15 13:08:35 -07:00
Howard Pritchard
5eccba17af lanl: help out lanl admins
LANL admins want platform files and *.conf
files so oblige them.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-07-24 08:03:52 -07:00
Ralph Castain
75ceec663a Now that it has been officially released, update the embedded HWLOC to 1.11.0
2015-06-28 14:07:45 -07:00
Ralph Castain
869041f770 Purge whitespace from the repo
2015-06-23 20:59:57 -07:00
Ralph Castain
9a70765f27 Silence malloc(0) warnings reported by Lisandro
2015-05-12 12:38:58 -07:00
Mike Dubman
dede6fa1fb build: new options
- enable/disable knob for threads support
- disable rpath by default
2015-04-30 14:46:15 +03:00
Bert Wesarg
d01c5160df Remove any reference to VampirTrace in the platform files.
2015-01-22 08:08:08 +01:00
Howard Pritchard
328a408dd0 comment out alps select in cray_xe6 platform
This alps selection stuff in the platform file is no longer required.
2014-12-05 13:22:59 -07:00
Aurelien Bouteiller
08d93bd4b9 Adding a platform file for Darter NICS CrayXC30
2014-10-16 14:22:02 -04:00
Howard Pritchard
bae3837121 Title: Comment out all mpi_abort_print_stack in lanl platform files
Description:
setting mpi_abort_print_stack in mca params file now makes openmpi
unhappy. Comment these out in all the LANL platform files.
Requested by TOSS OpenMPI support person.

cmr=v1.8.3

This commit was SVN r32782.
2014-09-24 18:25:58 +00:00
Ralph Castain
a90f12ad1d Save the clang settings for detecting alignment issues - I don't want to have to remember the cmd line jango
This commit was SVN r32705.
2014-09-10 18:56:19 +00:00
Howard Pritchard
51c73f116b switch check for ugni to use pkg-config
deprecate with-ugni in lanl/cray_xe6 platform file

This commit was SVN r32625.
2014-08-28 22:03:41 +00:00
Howard Pritchard
4d6d4f46b0 switch udreg config macro to use pkg-config
This commit was SVN r32516.
2014-08-12 21:30:06 +00:00
Ralph Castain
c5b2f9c8a5 Ignore Java on Mac for me
This commit was SVN r32464.
2014-08-08 16:14:03 +00:00
Mike Dubman
142f7290bc BUILD: update platform file with debug caps
cmr=v1.8.2:reviewer=ompi-rm1.8

This commit was SVN r32102.
2014-06-28 18:55:31 +00:00
Ralph Castain
1949f485ac Update platform file
cmr=v1.8.2:reviewer=ompi-gk1.8

This commit was SVN r32069.
2014-06-24 13:53:05 +00:00
Ralph Castain
a286977eff Don't build slurm if not present - platform file change only
This commit was SVN r31571.
2014-04-30 19:27:43 +00:00
Ralph Castain
751b4325b3 Provide more general code coverage
This commit was SVN r31568.
2014-04-30 15:42:16 +00:00
Ralph Castain
0bfce321cd Remove orcm-related platform files and ignore them from now on
This commit was SVN r31405.
2014-04-15 21:58:57 +00:00
Nathan Hjelm
9db9f9724a Update LANL platform files to always use vader over sm.
cmr=v1.8:reviewer=ompi-gk1.8

This commit was SVN r31233.
2014-03-26 21:54:43 +00:00
Mike Dubman
cf9f5f9c4c enable oshmem in mlnx platform file
cmr=v1.7.5:reviewer=ompi-rm1.7

This commit was SVN r31065.
2014-03-14 08:18:55 +00:00
Mike Dubman
e630b0f47a update ignore list
disable coll-ml

cmr=v1.7.5:reviewer=ompi-rm1.7

This commit was SVN r30924.
2014-03-04 14:23:12 +00:00
Nathan Hjelm
5c35b5ba19 Fix wrapper ldflags.
cmr=v1.7.4:reviewer=jsquyres

This commit was SVN r30555.
2014-02-04 19:44:08 +00:00