openmpi

Автор	SHA1	Сообщение	Дата
igor.ivanov@itseez.com	38c253c74c	ompi/mtl: Fix warnings in mxm component	2015-12-16 16:22:29 +02:00
Nathan Hjelm	2c89c7f47d	ompi/proc: add function to get all allocated procs This commit adds two new functions: - ompi_proc_get_allocated - Returns all procs in the current job that have already been allocated. This is used in init/finalize to determine which procs to pass to add_procs/del_procs. - ompi_proc_world_size - returns the number of processes in MPI_COMM_WORLD. This may be removed in favor of callers just looking at ompi_process_info. The behavior of ompi_proc_world has been restored to return ompi_proc_t's for all processes in the current job. The use of this function is discouraged. Code that was using ompi_proc_world() has been updated to make use of the new functions to avoid the memory overhead of ompi_comm_world (). Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-09-23 16:22:05 -06:00
Ralph Castain	cf6137b530	Integrate PMIx 1.0 with OMPI. Bring Slurm PMI-1 component online Bring the s2 component online Little cleanup - let the various PMIx modules set the process name during init, and then just raise it up to the ORTE level. Required as the different PMI environments all pass the jobid in different ways. Bring the OMPI pubsub/pmi component online Get comm_spawn working again Ensure we always provide a cpuset, even if it is NULL pmix/cray: adjust cray pmix component for pmix Make changes so cray pmix can work within the integrated ompi/pmix framework. Bring singletons back online. Implement the comm_spawn operation using pmix - not tested yet Cleanup comm_spawn - procs now starting, error in connect_accept Complete integration	2015-08-29 16:04:10 -07:00
Alina Sklarevich	28586caecf	MTL_MXM/PML_YALLA: fix coverity issues.	2015-03-12 11:49:22 +02:00
Alex Mikheev	168c83ed95	OMPI/MXM: add out of band barrier at the end of del_procs mxm shutdown requires out of band barrier	2015-03-02 12:56:02 +02:00
igor-ivanov	0f44cdd779	Merge pull request #421 from igor-ivanov/pr/fix-oshmem-coverity oshmem: Fix set of coverity issues	2015-02-24 21:40:06 +04:00
Nathan Hjelm	5f1254d710	Update code base to use the new opal_free_list_t Use of the old ompi_free_list_t and ompi_free_list_item_t is deprecated. These classes will be removed in a future commit. This commit updates the entire code base to use opal_free_list_t and opal_free_list_item_t. Notes: OMPI_FREE_LIST__MT -> opal_free_list_ (uses opal_using_threads ()) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-02-24 10:05:45 -07:00
Igor Ivanov	3e2dd782ea	oshmem: Fix set of coverity issues Signed-off-by: Igor Ivanov <Igor.Ivanov@itseez.com>	2015-02-24 19:03:10 +02:00
Mike Dubman	5b3b04b26e	mxm: revert coverity fixes mxm fails on this commit: `780c93ee57`	2015-02-23 07:52:28 +02:00
Igor Ivanov	010dce307a	Fix set of coverity issues List of CIDs (scan.coverity.com): oshmem: 1269787, 1269907, 1270161, 1270162, 1270977, 1270978 ompi: 1270170, 1270172, 1270173 Signed-off-by: Igor Ivanov <Igor.Ivanov@itseez.com>	2015-02-20 17:45:46 +04:00
Ralph Castain	780c93ee57	Per the PR and discussion on today's telecon, extend the process name definition as a two-field struct of uint32_t's down to the OPAL layer. This resolves issues created by prior commits that impacted both heterogeneous and SPARC support. This also simplifies the OMPI code base by removing the need for frequent memcpy's when transitioning between the OMPI/ORTE layers and OPAL. We recognize that this means other users of OPAL will need to "wrap" the opal_process_name_t if they desire to abstract it in some fashion. This is regrettable, and we are looking at possible alternatives that might mitigate that requirement. Meantime, however, we have to put the needs of the OMPI community first, and are taking this step to restore hetero and SPARC support.	2014-11-11 17:00:42 -08:00
Nadezhda Kogteva	2bce929330	MTL MXM cleanup: unnecessary OMPI_MTL_MXM_CONNECT_ON_FIRST_COMM variable removed	2014-10-20 10:29:47 +03:00
Ralph Castain	41c6058153	Bring over changes to MXM from pmix branch: MTL MXM: establish endpoint connection on the first communication when direct_modex used This commit was SVN r32668.	2014-09-03 18:22:11 +00:00
Ralph Castain	aec5cd08bd	Per the PMIx RFC: WHAT: Merge the PMIx branch into the devel repo, creating a new OPAL “lmix” framework to abstract PMI support for all RTEs. Replace the ORTE daemon-level collectives with a new PMIx server and update the ORTE grpcomm framework to support server-to-server collectives WHY: We’ve had problems dealing with variations in PMI implementations, and need to extend the existing PMI definitions to meet exascale requirements. WHEN: Mon, Aug 25 WHERE: https://github.com/rhc54/ompi-svn-mirror.git Several community members have been working on a refactoring of the current PMI support within OMPI. Although the APIs are common, Slurm and Cray implement a different range of capabilities, and package them differently. For example, Cray provides an integrated PMI-1/2 library, while Slurm separates the two and requires the user to specify the one to be used at runtime. In addition, several bugs in the Slurm implementations have caused problems requiring extra coding. All this has led to a slew of #if’s in the PMI code and bugs when the corner-case logic for one implementation accidentally traps the other. Extending this support to other implementations would have increased this complexity to an unacceptable level. Accordingly, we have: * created a new OPAL “pmix” framework to abstract the PMI support, with separate components for Cray, Slurm PMI-1, and Slurm PMI-2 implementations. * Replaced the current ORTE grpcomm daemon-based collective operation with an integrated PMIx server, and updated the grpcomm APIs to provide more flexible, multi-algorithm support for collective operations. At this time, only the xcast and allgather operations are supported. * Replaced the current global collective id with a signature based on the names of the participating procs. The allows an unlimited number of collectives to be executed by any group of processes, subject to the requirement that only one collective can be active at a time for a unique combination of procs. Note that a proc can be involved in any number of simultaneous collectives - it is the specific combination of procs that is subject to the constraint * removed the prior OMPI/OPAL modex code * added new macros for executing modex send/recv to simplify use of the new APIs. The send macros allow the caller to specify whether or not the BTL supports async modex operations - if so, then the non-blocking “fence” operation is used, if the active PMIx component supports it. Otherwise, the default is a full blocking modex exchange as we currently perform. * retained the current flag that directs us to use a blocking fence operation, but only to retrieve data upon demand This commit was SVN r32570.	2014-08-21 18:56:47 +00:00
Vasily Filipov	5ca2fffa44	MTL/MXM: call for ompi_proc_world instead of ompi_comm_size during del_procs. This commit was SVN r32504.	2014-08-11 11:52:23 +00:00
Mike Dubman	3c8a4d7d2d	mxm: opal refactoring voices http://www.open-mpi.org/community/lists/devel/2014/08/15590.php This commit was SVN r32486.	2014-08-10 04:35:56 +00:00
Ralph Castain	552c9ca5a0	George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-) WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic. This commit was SVN r32317.	2014-07-26 00:47:28 +00:00
Mike Dubman	da8df859b3	MXM: use builk connection establishment API fixed by Vasily, reviewed by Yossi/Miked cmr=v1.8.2:reviwer=ompi-rm1.8 This commit was SVN r32256.	2014-07-17 08:35:55 +00:00
Alina Sklarevich	f8a664f5ec	MXM: generate the jobid only for MXM versions under v2.0. reviewed by miked cmr=v1.8.2:reviewer=ompi-rm1.8 This commit was SVN r31910.	2014-06-01 13:29:24 +00:00
Yossi Etigin	6aa5680059	Revert r30966. cmr=v1.8.1:reviewer=ompi-gk1.8 This commit was SVN r31593. The following SVN revision numbers were found above: r30966 --> open-mpi/ompi@280e96c99a	2014-05-01 22:17:09 +00:00
Alina Sklarevich	5cbf085dc2	mtl mxm: silent a warning. in ompi_mtl_mxm_add_procs, define the ep_index variable only for an older version of mxm. submitted by Alina, reviewed by Mike. cmr=v1.8:reviewer=ompi-rm1.8 This commit was SVN r31245.	2014-03-27 08:39:51 +00:00
Ralph Castain	e4efd5675f	Per telecon, add comment indicating this needs to be fixed Refs trac:4354 This commit was SVN r30991. The following Trac tickets were found above: Ticket 4354 --> https://svn.open-mpi.org/trac/ompi/ticket/4354	2014-03-11 15:57:11 +00:00
Yossi Etigin	280e96c99a	In mtl_mxm, don't disconnect from a proc with refcount > 1. This will keep the connection until mxm endpoint is destroyed. cmr=v1.7.5:reviewer=ompi-rm1.7 This commit was SVN r30966.	2014-03-09 08:35:44 +00:00
Mike Dubman	05ee929832	OMPI-MXM: handle multiple calls to add_procs() in MXM - now add_procs can be called more than once (during MPI_INIT and Inter_Comm_Create) - adjust MXM to this reality fixed by Alina, reviewed by Yossi/Mike cmr=v1.7.5:reviewer=ompi-rm1.7 This commit was SVN r30907.	2014-03-03 13:50:37 +00:00
Alina Sklarevich	2869ff1782	mxm: fixes for compilation warnings. removed set but not used variables and a variable that is unused. reviewed by miked cmr=v1.7.4:reviewer=ompi-rm1.7 This commit was SVN r30176.	2014-01-09 15:15:14 +00:00
Yossi Etigin	6ab4aba9e6	Fix missing include of show_help.h in mtl mxm. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r29987.	2013-12-19 19:37:21 +00:00
Yossi Etigin	a913b00f89	mtl mxm: update configuration parsing api to mxm 2.1, drop older version support (1.0 and 1.1), and cleanup the code. reviewed by miked. cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r29797.	2013-12-04 09:11:55 +00:00
Brian Barrett	16a1166884	Remove the proc_pml and proc_bml fields from ompi_proc_t and replace with a configure-time dynamic allocation of flags. The net result for platforms which only support BTL-based communication is a reduction of 8*nprocs bytes per process. Platforms which support both MTLs and BTLs will not see a space reduction, but will now be able to safely run both the MTL and BTL side-by-side, which will prove useful. This commit was SVN r29100.	2013-08-30 16:54:55 +00:00
Ralph Castain	45e695928f	As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: * add a new MCA param orte_hostname_cutoff to specify the number of nodes at which we stop including hostnames. This defaults to INT_MAX => always include hostnames. If a value is given, then we will include hostnames for any allocation smaller than the given limit. * remove ompi_proc_get_hostname. Replace all occurrences with a direct link to ompi_proc_t's proc_hostname, protected by appropriate "if NULL" * modify the OMPI-ORTE integration component so that any call to modex_recv automatically loads the ompi_proc_t->proc_hostname field as well as returning the requested info. Thus, any process whose modex info you retrieve will automatically receive the hostname. Note that on-demand retrieval is still enabled - i.e., if we are running under direct launch with PMI, the hostname will be fetched upon first call to modex_recv, and then the ompi_proc_t->proc_hostname field will be loaded * removed a stale MCA param "mpi_keep_peer_hostnames" that was no longer used anywhere in the code base * added an envar lookup in ess/pmi for the number of nodes in the allocation. Sadly, PMI itself doesn't provide that info, so we have to get it a different way. Currently, we support PBS-based systems and SLURM - for any other, rank0 will emit a warning and we assume max number of daemons so we will always retain hostnames This commit was SVN r29052.	2013-08-20 18:59:36 +00:00
Ralph Castain	611d7f9f6b	When we direct launch an application, we rely on PMI for wireup support. In doing so, we lose the de facto data compression we get from the ORTE modex since we no longer get all the wireup info from every proc in a single blob. Instead, we have to iterate over all the procs, calling PMI_KVS_get for every value we require. This creates a really bad scaling behavior. Users have found a nearly 20% launch time differential between mpirun and PMI, with PMI being the slower method. Some of the problem is attributable to poor exchange algorithms in RM's like Slurm and Alps, but we make things worse by calling "get" so many times. Nathan (with a tad advice from me) has attempted to alleviate this problem by reducing the number of "get" calls. This required the following changes: * upon first request for data, have the OPAL db pmi component fetch and decode all the info from a given remote proc. It turned out we weren't caching the info, so we would continually request it and only decode the piece we needed for the immediate request. We now decode all the info and push it into the db hash component for local storage - and then all subsequent retrievals are fulfilled locally * reduced the amount of data by eliminating the exchange of the OMPI_ARCH value if heterogeneity is not enabled. This was used solely as a check so we would error out if the system wasn't actually homogeneous, which was fine when we thought there was no cost in doing the check. Unfortunately, at large scale and with direct launch, there is a non-zero cost of making this test. We are open to finding a compromise (perhaps turning the test off if requested?), if people feel strongly about performing the test * reduced the amount of RTE data being automatically fetched, and fetched the rest only upon request. In particular, we no longer immediately fetch the hostname (which is only used for error reporting), but instead get it when needed. Likewise for the RML uri as that info is only required for some (not all) environments. In addition, we no longer fetch the locality unless required, relying instead on the PMI clique info to tell us who is on our local node (if additional info is required, the fetch is performed when a modex_recv is issued). Again, all this only impacts direct launch - all the info is provided when launched via mpirun as there is no added cost to getting it Barring objections, we may move this (plus any required other pieces) to the 1.7 branch once it soaks for an appropriate time. This commit was SVN r29040.	2013-08-17 00:49:18 +00:00
Yossi Etigin	64d98e0438	Fix data corruption in MXM by registering to OPAL memory release hooks and removing any mappings created by mxm This commit was SVN r28489.	2013-05-14 12:27:44 +00:00
Alex Margolin	0ab7675019	Fix MXM connection establishment flow This commit was SVN r28329.	2013-04-12 16:37:42 +00:00
Brian Barrett	312f37706e	In talking about this with Jeff and Ralph, we don't actually need ompi_show_help, because opal_show_help is replaced with an aggregating version when using ORTE, so there's no reason to directly call orte_show_help. This commit was SVN r28051.	2013-02-12 21:10:11 +00:00
Vasily Filipov	21b170b43b	MTL MXM: push commit r27987 back, now with right user. r27987 - MTL MXM: ver. 2.0 interface changes. This commit was SVN r28026. The following SVN revision numbers were found above: r27987 --> open-mpi/ompi@2735658d81	2013-02-04 06:59:24 +00:00
Vasily Filipov	aa5e436479	Revert revesion -r27986, the reason is - it was submitted with wrong user name. This commit was SVN r28025. The following SVN revision numbers were found above: r27986 --> open-mpi/ompi@729caaf0cd	2013-02-04 06:54:24 +00:00
Pavel Shamis	2735658d81	MTL MXM: ver. 2.0 interface changes. This commit was SVN r27987.	2013-01-31 08:38:08 +00:00
Brian Barrett	f42783ae1a	Move the RTE framework change into the trunk. With this change, all non-CR runtime code goes through one of the rte, dpm, or pubsub frameworks. This commit was SVN r27934.	2013-01-27 23:25:10 +00:00
Mike Dubman	a454341e2b	add support for mxm 2.0 This commit was SVN r27661.	2012-12-09 22:58:37 +00:00
Aleksey Senin	33ae1fe6c7	Fix untitialized return code in ompi_mtl_mxm_add_procs function. This commit was SVN r27216.	2012-09-02 13:17:49 +00:00
Yael Dayan	79e6b9c91d	Adapt OMPI to use newer version of MXM. This commit was SVN r26974.	2012-08-08 15:29:38 +00:00
Yael Dayan	954bcdc0a5	adapt the way to find amount of local processes to OMPI trunk. This commit was SVN r26973.	2012-08-08 15:26:28 +00:00
Vasily Filipov	fc712182db	MTL MXM: make MXM use MXM_VERSION macro for MXM version checking. This commit was SVN r26952.	2012-08-06 06:35:57 +00:00
Vasily Filipov	c386847d9a	MTL MXM: Adding MXM version protect for Mprobe, Mrecv resources. This commit was SVN r26922.	2012-07-31 07:57:25 +00:00
Vasily Filipov	ef9bd8e4cb	MTL MXM: MPI_Mprobe, MPI_Mrecv implementation for MXM adding. This commit was SVN r26866.	2012-07-25 13:26:40 +00:00
Yevgeny Kliteynik	0e28fa984b	Remove dead code that was related to ticket #2971 This commit was SVN r26701.	2012-07-02 11:19:09 +00:00
Ralph Castain	0dfe29b1a6	Roll in the rest of the modex change. Eliminate all non-modex API access of RTE info from the MPI layer - in some cases, the info was already present (either in the ompi_proc_t or in the orte_process_info struct) and no call was necessary. This removes all calls to orte_ess from the MPI layer. Calls to orte_grpcomm remain required. Update all the orte ess components to remove their associated APIs for retrieving proc data. Update the grpcomm API to reflect transfer of set/get modex info to the db framework. Note that this doesn't recreate the old GPR. This is strictly a local db storage that may (at some point) obtain any missing data from the local daemon as part of an async methodology. The framework allows us to experiment with such methods without perturbing the default one. This commit was SVN r26678.	2012-06-27 14:53:55 +00:00
Josh Hursey	28681deffa	Backout the ORCA commit. :( There is a linking issue on Mac OSX that needs to be addressed before this is able to come back into the trunk. This commit was SVN r26676.	2012-06-27 01:28:28 +00:00
Josh Hursey	542330e3a7	Commit of ORCA: Open MPI Runtime Collaborative Abstraction This is a runtime interposition project that sits between the OMPI and ORTE layers in Open MPI. The project is described on the wiki: https://svn.open-mpi.org/trac/ompi/wiki/Runtime_Interposition And on this email thread: http://www.open-mpi.org/community/lists/devel/2012/06/11109.php This commit was SVN r26670.	2012-06-26 21:42:16 +00:00
Mike Dubman	10831e111a	detect num of local procs This commit was SVN r26555.	2012-06-05 09:13:16 +00:00
Yevgeny Kliteynik	1cbce83ece	Fixed wording of MXM parameters as suggested By Jeff. This commit was SVN r26545.	2012-06-03 21:48:42 +00:00

1 2

67 Коммитов