Commit graph

116 commits

Author SHA1 Message Date
Howard Pritchard
01d4d52588 SCIF: remove it
KNC is effectively dead.  Remove corresponding SCIF
support in Open MPI.

cherry pick of PR #5737

+

news update

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
(cherry picked from commit b9ac3d8931)
2018-09-19 11:48:17 -06:00
Howard Pritchard
8eb738a9c8 lanl/platform: add new toss2/3 platform files
remove old platform files
add new platform files for toss2/toss3
OPA/MLX-IB variants.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2018-02-17 10:32:07 -07:00
Howard Pritchard
1a639ec477 LANL/platform: disable use of XRC recv bufs
Forgot as part of #3970 to disable use of XRC
recv bufs by default in LANL platform config
file.

related to #4300

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2017-10-04 11:40:20 -06:00
Nathan Hjelm
27f8a4e806 opal: add code patcher framework
This commit adds a framework to abstract runtime code patching.
Components in the new framework can provide functions for either
patching a named function or a function pointer. The latter
functionality is not being used but may provide a way to allow memory
hooks when dlopen functionality is disabled.

This commit adds two different flavors of code patching. The first is
provided by the overwrite component. This component overwrites the
first several instructions of the target function with code to jump to
the provided hook function. The hook is expected to provide the full
functionality of the hooked function.

The linux patcher component is based on the memory hooks in ucx. It
only works on linux and operates by overwriting function pointers in
the symbol table. In this case the hook is free to call the original
function using the function pointer returned by dlsym.

Both components restore the original functions when the patcher
framework closes.
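
The symbol-table flavor described above can be illustrated with a small Python analogue. This is only a sketch of the idea (swap a table entry, hand the original back to the hook, undo everything on close). It is not the Open MPI patcher API: `SymbolTablePatcher`, `real_munmap`, and `munmap_hook` are hypothetical names, and a Python dict stands in for the symbol table.

```python
from typing import Callable, Dict, List, Tuple


class SymbolTablePatcher:
    """Toy analogue of a patcher that swaps entries in a function table."""

    def __init__(self, table: Dict[str, Callable]) -> None:
        self._table = table
        # (name, original function) pairs, recorded so patches can be undone.
        self._applied: List[Tuple[str, Callable]] = []

    def patch(self, name: str, hook: Callable) -> Callable:
        """Install hook under name and return the original function."""
        original = self._table[name]
        self._applied.append((name, original))
        self._table[name] = hook
        return original

    def close(self) -> None:
        """Restore every patched entry, most recent patch first."""
        while self._applied:
            name, original = self._applied.pop()
            self._table[name] = original


def real_munmap(length: int) -> str:
    # Hypothetical stand-in for the function being hooked.
    return f"unmapped {length} bytes"


symbols: Dict[str, Callable] = {"munmap": real_munmap}
patcher = SymbolTablePatcher(symbols)
released: List[int] = []


def munmap_hook(length: int) -> str:
    # The hook observes the call, then forwards to the original function.
    released.append(length)
    return original_munmap(length)


original_munmap = patcher.patch("munmap", munmap_hook)
result_patched = symbols["munmap"](4096)   # goes through the hook
patcher.close()                            # original entry is restored
result_restored = symbols["munmap"](8192)  # bypasses the hook again
```

In the overwrite flavor the hook cannot forward to the original this way, which is why the message says it must provide the full functionality itself.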

Changes had to be made to support Power/PowerPC with the Linux
dynamic loader patcher. Some of the changes:

 - Move code necessary for powerpc/power support to the patcher
   base. The code is needed by both the overwrite and linux
   components.

 - Move patch structure down to base and move the patch list to
   mca_patcher_base_module_t. The structure has been modified to
   include a function pointer to the function that will unapply the
   patch. This allows the mixing of multiple different types of
   patches in the patch_list.

 - Update linux patching code to keep track of the matching between
   got entry and original (unpatched) address. This allows us to
   completely clean up the patch on finalize.

All patchers keep track of the changes they made so that they can be
reversed when the patcher framework is closed.

At this time there are bugs in the Linux dynamic loader patcher so
its priority is lower than the overwrite patcher.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-04-13 17:16:13 -06:00
Nathan Hjelm
b1670f844d contrib/platform: don't disable dlopen
The --enable-static gives us what we want: statically linked components.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-04-13 17:16:12 -06:00
Nathan Hjelm
147e780fa5 contrib/lanl: update platform files for TOSS2
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2016-03-17 14:30:50 -06:00
Howard Pritchard
89b9be3732 lanl/platform: fixes to pick up lustre
Fixes to lanl platform files to pick up lustre header
files, etc. for romio and ompi i/o.

Fixes #1033

Thanks to Jerome Vienne for spotting this.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-10-15 14:32:21 -05:00
Howard Pritchard
5eccba17af lanl: help out lanl admins
LANL admins want platform files and *.conf
files so oblige them.

Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-07-24 08:03:52 -07:00
Ralph Castain
869041f770 Purge whitespace from the repo
2015-06-23 20:59:57 -07:00
Bert Wesarg
d01c5160df Remove any reference to VampirTrace in the platform files.
2015-01-22 08:08:08 +01:00
Howard Pritchard
328a408dd0 comment out alps select in cray_xe6 platform
This alps selection stuff in the platform file is no longer required.
2014-12-05 13:22:59 -07:00
Howard Pritchard
bae3837121 Title: Comment out all mpi_abort_print_stack in lanl platform files
Description:
setting mpi_abort_print_stack in mca params file now makes openmpi
unhappy. Comment these out in all the LANL platform files.
Requested by TOSS OpenMPI support person.

cmr=v1.8.3

This commit was SVN r32782.
2014-09-24 18:25:58 +00:00
Howard Pritchard
51c73f116b switch check for ugni to use pkg-config
deprecate with-ugni in lanl/cray_xe6 platform file

This commit was SVN r32625.
2014-08-28 22:03:41 +00:00
Howard Pritchard
4d6d4f46b0 switch udreg config macro to use pkg-config
This commit was SVN r32516.
2014-08-12 21:30:06 +00:00
Nathan Hjelm
9db9f9724a Update LANL platform files to always use vader over sm.
cmr=v1.8:reviewer=ompi-gk1.8

This commit was SVN r31233.
2014-03-26 21:54:43 +00:00
Nathan Hjelm
5c35b5ba19 Fix wrapper ldflags.
cmr=v1.7.4:reviewer=jsquyres

This commit was SVN r30555.
2014-02-04 19:44:08 +00:00
Nathan Hjelm
aa3fea55b2 btl/ugni: re-add a sendi function to exploit the new optimization in
ob1.

Also update LANL platform files to use the latest version of ugni.

cmr=v1.7.5:reviewer=manjugv

This commit was SVN r30353.
2014-01-21 20:53:35 +00:00
Nathan Hjelm
91db0cd512 Remove non-existent MCA variables from LANL platform file.
No review required.

cmr=v1.7.4:reviewer=ompi-gk1.7

This commit was SVN r29810.
2013-12-05 22:43:08 +00:00
Nathan Hjelm
24a7e7aa34 Add support for the udreg registration cache and dynamics on XE/XK/XC.
To support the new mpool two changes were made to the mpool infrastructure:

 1) Added an mpool flag to indicate that an mpool does not need the memory
    hooks to use the leave pinned protocols. This flag is checked in the
    mpool lookup.

 2) Added an mpool context to the base registration. This new member is used
    by the udreg mpool to store the udreg context associated with the
    particular registration. The new member will not break the ABI
    compatibility as the new member is only currently used by the udreg
    mpool.

Dynamics support for Cray systems makes use of the global rank provided by
orte to give the ugni library a unique rank for each process. Dynamics
support is not available under direct-launch (srun).

cmr=v1.7.4

This commit was SVN r29719.
2013-11-18 04:58:37 +00:00
Nathan Hjelm
0df678f1ea Update LANL platform files.
cmr=v1.7.4:reviewer=ompi-gk1.7

This commit was SVN r29625.
2013-11-06 22:43:21 +00:00
Nathan Hjelm
68dac45a37 Add platform files for LANL MIC nodes.
No review needed.

cmr=v1.7.4:reviewer=ompi-rm1.7

This commit was SVN r29540.
2013-10-28 16:46:04 +00:00
Nathan Hjelm
ba8bfeded0 lanl: clean up tlcc platform files
No review necessary.

cmr=v1.7.3:reviewer=ompi-gk1.7

This commit was SVN r28976.
2013-08-01 19:54:29 +00:00
Ralph Castain
37db1727a2 Refs trac:3710
Simplify the whole stripping of prefix method by consolidating it into a single MCA param. Allow for multiple prefixes to be stripped, each separated in the param by a comma. If no prefix is given, or the specified prefix isn't in the nodename, then just use the hostname itself.
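
The behavior described above can be sketched as follows. This is a minimal illustration written from the commit message alone, not the ORTE implementation: the function name `strip_node_prefix` is hypothetical, and the sketch assumes that the first matching prefix wins and that nothing beyond the prefix itself (for example leading zeros) is removed.

```python
from typing import Optional


def strip_node_prefix(hostname: str, prefixes: Optional[str]) -> str:
    """Strip the first matching prefix from hostname.

    prefixes is a comma-separated list, as a single parameter value would be.
    The hostname is returned unchanged when no prefix is given or none matches.
    """
    if not prefixes:
        return hostname
    for prefix in prefixes.split(","):
        prefix = prefix.strip()
        # Assumption: never strip the whole name, so a node is not left nameless.
        if prefix and hostname.startswith(prefix) and len(hostname) > len(prefix):
            return hostname[len(prefix):]
    return hostname
```

For example, `strip_node_prefix("nid0016", "nid,c")` yields `"0016"`, while a login node named `"login1"` is left untouched by the same setting.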

This commit was SVN r28974.

The following Trac tickets were found above:
  Ticket 3710 --> https://svn.open-mpi.org/trac/ompi/ticket/3710
2013-08-01 00:32:10 +00:00
Nathan Hjelm
83a3fc2fd2 Add an option to control which hostnames orte_strip_prefix_from_node_names works
on.

This corrects a problem with Cray systems where the login node's hostname
was being stripped causing the login node to be used as a compute node by
mpirun.

cmr=v1.7.3:reviewer=rhc

This commit was SVN r28970.
2013-07-31 18:42:02 +00:00
Nathan Hjelm
278522d8e8 Update LANL platform files for changes in linux memory hook configuration.
No review necessary

cmr=v1.7.3:reviewer=ompi-gk1.7

This commit was SVN r28969.
2013-07-31 17:56:22 +00:00
Joshua Ladd
0b5c1f2ea8 Add 'generic' support for PMI2 (previously, we checked for PMI2 only on Cray systems). If your resource manager (e.g. SLURM) has support for PMI2, then the --with-pmi configure flag will enable its usage. If you don't have PMI2, then you will fall back to regular old PMI1. This patch was submitted by Ralph Castain and reviewed and pushed by Josh Ladd. This should be added to cmr:v1.7:reviewer=jladd
This commit was SVN r28666.
2013-06-21 15:28:14 +00:00
Nathan Hjelm
e61a1aa865 Update LANL XE-6 platform files
This commit was SVN r28574.
2013-05-30 18:33:27 +00:00
Nathan Hjelm
bdd6d35eeb update LANL platform files
This commit was SVN r28375.
2013-04-24 15:46:44 +00:00
Nathan Hjelm
cf377db823 MCA/base: Add new MCA variable system
Features:
 - Support for an override parameter file (openmpi-mca-param-override.conf).
   Variable values in this file can not be overridden by any file or environment
   value.
 - Support for boolean, unsigned, and unsigned long long variables.
 - Support for true/false values.
 - Support for enumerations on integer variables.
 - Support for MPIT scope, verbosity, and binding.
 - Support for command line source.
 - Support for setting variable source via the environment using
   OMPI_MCA_SOURCE_<var name>=source (either command or file:filename)
 - Cleaner API.
 - Support for variable groups (equivalent to MPIT categories).
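
One way to picture how value sources interact is a variable that only accepts a new value from a source that ranks at least as high as the one that set it. The sketch below is illustrative only: the `Source` and `Variable` names are hypothetical, and the ordering (override file above command line, environment, parameter file, and default) is an assumption consistent with the feature list, not a copy of the mca_base_var code.

```python
from enum import IntEnum


class Source(IntEnum):
    """Assumed ranking of value sources, lowest to highest."""
    DEFAULT = 0
    FILE = 1
    ENV = 2
    COMMAND_LINE = 3
    OVERRIDE = 4


class Variable:
    """Toy variable that remembers which source supplied its value."""

    def __init__(self, name: str, default: str) -> None:
        self.name = name
        self.value = default
        self.source = Source.DEFAULT

    def offer(self, value: str, source: Source) -> bool:
        """Accept value unless a higher-ranked source already set it."""
        if source < self.source:
            return False
        self.value = value
        self.source = source
        return True


var = Variable("example_var", "default-value")
var.offer("from-file", Source.FILE)
var.offer("from-env", Source.ENV)
var.offer("from-override-file", Source.OVERRIDE)
# A later command-line value cannot displace the override file's value.
rejected = not var.offer("from-command-line", Source.COMMAND_LINE)
```

After these calls `var.value` is `"from-override-file"` and `rejected` is `True`, matching the statement that values in the override file cannot be overridden by any other file or environment value.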

Notes:
 - Variables must be created with a backing store (char **, int *, or bool *)
   that must live at least as long as the variable.
 - Creating a variable with the MCA_BASE_VAR_FLAG_SETTABLE flag enables the use of
   mca_base_var_set_value() to change the value.
 - String values are duplicated when the variable is registered. It is up to
   the caller to free the original value if necessary. The new value will be
   freed by the mca_base_var system and must not be freed by the user.
 - Variables with constant scope may not be settable.
 - Variable groups (and all associated variables) are deregistered when the
   component is closed or the component repository item is freed. This
   prevents a segmentation fault from accessing a variable after its component
   is unloaded.
 - After some discussion we decided we should remove the automatic registration
   of component priority variables. Few components actually made use of this
   feature.
 - The enumerator interface was updated to be general enough to handle
   future uses of the interface.
 - The code to generate ompi_info output has been moved into the MCA variable
   system. See mca_base_var_dump().

opal: update core and components to mca_base_var system
orte: update core and components to mca_base_var system
ompi: update core and components to mca_base_var system

This commit also modifies the rmaps framework. The following variables were
moved from ppr and lama: rmaps_base_pernode, rmaps_base_n_pernode,
rmaps_base_n_persocket. Both lama and ppr create synonyms for these variables.

This commit was SVN r28236.
2013-03-27 21:09:41 +00:00
Nathan Hjelm
0a8189aa4e update lanl platform files for XE6
This commit was SVN r27964.
2013-01-29 16:06:08 +00:00
Ralph Castain
c96cc2d5a0 In order to properly connect to debuggers like STAT, we need to get the hostname in its unstripped version for the MPIR_proctab. Unfortunately, we need a stripped version for Cray's alps launcher. So when we are stripping the hostname prefix, retain alias hostnames and add the ability to specify an alias to use in the proctab.
This commit was SVN r27863.
2013-01-18 05:00:05 +00:00
Samuel Gutierrez
ba470dcec9 Add eth0 to oob_tcp_if_include
This commit was SVN r27374.
2012-09-26 14:47:00 +00:00
Ralph Castain
254f34133c Add and update platform files
This commit was SVN r27092.
2012-08-17 21:39:22 +00:00
Ralph Castain
589acf550c Improve the new MPI_INFO_ENV to better handle Java applications and to correctly report the info for singletons.
This commit was SVN r27025.
2012-08-13 22:13:49 +00:00
Ralph Castain
b9b41d8662 For cases where the alpha+non-zero prefix must be removed from a node name, be sure to do it everywhere we access node names - otherwise, modex methods such as pmi will fail to correctly identify procs on the same node
This commit was SVN r27022.
2012-08-13 20:44:56 +00:00
Nathan Hjelm
2706353a5d update lanl platform files for new verbs configury
This commit was SVN r26659.
2012-06-25 22:54:03 +00:00
Ralph Castain
f653e2e81e Update platform file
This commit was SVN r26617.
2012-06-19 13:36:16 +00:00
Ralph Castain
d1ca08041b Update LANL Cray platform files to disable static ports
This commit was SVN r26607.
2012-06-15 10:16:56 +00:00
Nathan Hjelm
b94e7af5c8 LANL: update XE-6 platform files
This commit was SVN r26394.
2012-05-07 17:22:14 +00:00
Nathan Hjelm
2f98ef894a update lanl tlcc platform files
This commit was SVN r26345.
2012-04-26 22:08:27 +00:00
Nathan Hjelm
e7ae4c3626 update LANL XE6 platform files
This commit was SVN r26305.
2012-04-19 21:51:37 +00:00
Jeff Squyres
320b7bd5df Update platform files to use enable_mpi_fortran, not enable_mpi_f77
and enable_mpi_f90

This commit was SVN r26284.
2012-04-18 16:16:33 +00:00
Nathan Hjelm
05a9727aca lanl: add -march=amdfam10 to xe6 platform files
This commit was SVN r25896.
2012-02-10 00:47:14 +00:00
Samuel Gutierrez
b88508e2b9 Platform file updates. Build with PMI support on our TLCC2 clusters.
This commit was SVN r25888.
2012-02-09 20:11:35 +00:00
Nathan Hjelm
ad6f0ac59d set default binding in lanl platform files to by core
This commit was SVN r25620.
2011-12-12 20:05:48 +00:00
Ralph Castain
07655e2945 Handle the case where the allocator "fibs" to us about the node names. In some cases (ahem...you know who you are!), the allocator will tell us a node number (e.g., "16"). However, the daemon will return a node name (e.g., "nid0016") - leaving us not recognizing its location.
So provide a new parameter (can't have too many!) that handles this situation by stripping the prefix from the returned node name. Also do a little cleanup to ensure we cleanly exit from errors, without generating too many annoying messages.
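
The mismatch described above ("16" from the allocator versus "nid0016" from the daemon) can be sketched like this. It is an assumption-laden illustration, not the ORTE code: `node_key` and `same_node` are hypothetical helpers, and the sketch assumes that both the alphabetic prefix and any leading zeros must go for the two names to compare equal.

```python
import re

# Alphabetic prefix followed by leading zeros, but only when at least one
# digit remains afterwards (so "nid0000" becomes "0", not an empty string).
_PREFIX = re.compile(r"^[A-Za-z]+0*(?=\d)")


def node_key(name: str) -> str:
    """Reduce a node name such as 'nid0016' to its bare number '16'."""
    return _PREFIX.sub("", name)


def same_node(allocated: str, reported: str) -> bool:
    """Compare an allocator entry with a daemon-reported node name."""
    return node_key(allocated) == node_key(reported)
```

With these helpers `same_node("16", "nid0016")` holds, while names without a numeric suffix, such as `"login"`, pass through `node_key` unchanged.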

This commit was SVN r25562.
2011-12-02 14:10:08 +00:00
Samuel Gutierrez
1eb97a903e update plat files to include ugni btl.
This commit was SVN r25451.
2011-11-07 05:00:46 +00:00
Samuel Gutierrez
c646c93eec remove unneeded flags from cray xe6 platform file.
This commit was SVN r25363.
2011-10-24 18:42:43 +00:00
Samuel Gutierrez
949364d2d6 update LANL Cray XE6 platform files to include PMI support.
This commit was SVN r25356.
2011-10-21 21:05:23 +00:00
Nathan Hjelm
ce29170968 update lanl xe6 platform files for vader
This commit was SVN r25344.
2011-10-20 21:50:53 +00:00