* add a new MCA param orte_hostname_cutoff to specify the number of nodes at which we stop including hostnames. This defaults to INT_MAX => always include hostnames. If a value is given, then we will include hostnames for any allocation smaller than the given limit.
* remove ompi_proc_get_hostname. Replace all occurrences with direct access to ompi_proc_t's proc_hostname field, protected by appropriate NULL checks (see the sketch after this list)
* modify the OMPI-ORTE integration component so that any call to modex_recv automatically loads the ompi_proc_t->proc_hostname field as well as returning the requested info. Thus, any process whose modex info you retrieve will automatically receive the hostname. Note that on-demand retrieval is still enabled - i.e., if we are running under direct launch with PMI, the hostname will be fetched upon first call to modex_recv, and then the ompi_proc_t->proc_hostname field will be loaded
* remove a stale MCA param "mpi_keep_peer_hostnames" that was no longer used anywhere in the code base
* add an envar lookup in ess/pmi for the number of nodes in the allocation. Sadly, PMI itself doesn't provide that info, so we have to get it another way. Currently, we support PBS-based systems and SLURM; for any other, rank 0 will emit a warning and we assume the max number of daemons, so we will always retain hostnames
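A minimal sketch of the NULL-guarded replacement for ompi_proc_get_hostname mentioned above; the reporting function here is purely illustrative:

    #include "opal/util/output.h"
    #include "ompi/proc/proc.h"

    /* Illustrative only: read ompi_proc_t's proc_hostname directly,
     * guarding against the case where it has not yet been fetched. */
    static void report_peer_error(ompi_proc_t *proc)
    {
        const char *host = (NULL == proc->proc_hostname)
                           ? "<unknown>" : proc->proc_hostname;
        opal_output(0, "error on peer host %s", host);
    }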
This commit was SVN r29052.
Direct launch under PMI currently shows really bad scaling behavior: users have found a nearly 20% launch-time differential between mpirun and PMI, with PMI being the slower method. Some of the problem is attributable to poor exchange algorithms in RMs like Slurm and ALPS, but we make things worse by calling "get" so many times.
Nathan (with a tad of advice from me) has attempted to alleviate this problem by reducing the number of "get" calls. This required the following changes:
* upon first request for data, have the OPAL db pmi component fetch and decode *all* the info from a given remote proc. It turned out we weren't caching the info, so we would continually request it and only decode the piece we needed for the immediate request. We now decode all the info and push it into the db hash component for local storage - and then all subsequent retrievals are fulfilled locally (see the sketch below)
* reduced the amount of data by eliminating the exchange of the OMPI_ARCH value if heterogeneity is not enabled. This was used solely as a check so we would error out if the system wasn't actually homogeneous, which was fine when we thought there was no cost in doing the check. Unfortunately, at large scale and with direct launch, there is a non-zero cost of making this test. We are open to finding a compromise (perhaps turning the test off if requested?), if people feel strongly about performing the test
* reduced the amount of RTE data being automatically fetched, and fetched the rest only upon request. In particular, we no longer immediately fetch the hostname (which is only used for error reporting), but instead get it when needed. Likewise for the RML uri as that info is only required for some (not all) environments. In addition, we no longer fetch the locality unless required, relying instead on the PMI clique info to tell us who is on our local node (if additional info is required, the fetch is performed when a modex_recv is issued).
Again, all this only impacts direct launch - all the info is provided when launched via mpirun as there is no added cost to getting it
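A rough sketch of the fetch-all-and-cache behavior described above. All helper names here (peer_cached, pmi_get_blob, decode_and_store_all, local_hash_lookup) are hypothetical stand-ins, not the actual OPAL db component API:

    #include <stdbool.h>
    #include <stdlib.h>

    /* Hypothetical helper API -- not the actual OPAL db component calls. */
    typedef int peer_id_t;
    extern bool  peer_cached(peer_id_t peer);
    extern void  mark_peer_cached(peer_id_t peer);
    extern char *pmi_get_blob(peer_id_t peer);
    extern void  decode_and_store_all(peer_id_t peer, const char *blob);
    extern int   local_hash_lookup(peer_id_t peer, const char *key, void **value);

    /* On the first request for a peer, fetch and decode everything that
     * peer published, cache it locally, then serve this and all later
     * lookups from the local hash -- one PMI "get" per peer, total. */
    static int fetch_peer(peer_id_t peer, const char *key, void **value)
    {
        if (!peer_cached(peer)) {
            char *blob = pmi_get_blob(peer);
            decode_and_store_all(peer, blob);
            free(blob);
            mark_peer_cached(peer);
        }
        return local_hash_lookup(peer, key, value);
    }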
Barring objections, we may move this (plus any other required pieces) to the 1.7 branch once it soaks for an appropriate time.
This commit was SVN r29040.
Call ompi_show_help, because opal_show_help is replaced with an
aggregating version when using ORTE, so there's no reason to
call orte_show_help directly.
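For illustration, the call shape involved; the file and topic names here are made up, and we assume ompi_show_help() mirrors opal_show_help()'s (filename, topic, want_error_header, ...) signature:

    /* Illustrative call shape only; names below are hypothetical. */
    ompi_show_help("help-example.txt",  /* help text file (hypothetical) */
                   "example-topic",     /* topic tag (hypothetical)      */
                   true,                /* prepend the error header      */
                   hostname, errcode);  /* varargs filled into message   */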
This commit was SVN r28051.
* btl sendi(): if the message can be sent inline, try to avoid requesting a completion signal
* a signal is requested once per 64 sends, or when there are no send WQEs left
* when the message cannot be sent inline, fall back to a BTL method other than sendi()
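A hedged sketch of the signal-coalescing idea using the verbs API; the endpoint counter and threshold handling are illustrative, not the actual openib BTL code:

    #include <infiniband/verbs.h>

    #define SIGNAL_INTERVAL 64  /* request a completion once per 64 sends */

    /* Illustrative only: decide per-send whether to ask for a signal. */
    static void post_inline_send(struct ibv_qp *qp, struct ibv_send_wr *wr,
                                 unsigned *sends_since_signal, int wqes_left)
    {
        struct ibv_send_wr *bad_wr;

        wr->send_flags = IBV_SEND_INLINE;
        if (++(*sends_since_signal) >= SIGNAL_INTERVAL || 0 == wqes_left) {
            wr->send_flags |= IBV_SEND_SIGNALED;  /* force a CQE so WQEs recycle */
            *sends_since_signal = 0;
        }
        (void) ibv_post_send(qp, wr, &bad_wr);
    }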
This commit was SVN r27724.
- OMPI_SUCCESS
- OMPI_ERROR
- OMPI_ERR_RESOURCE_BUSY
If an "OMPI_ERR_OUT_OF_RESOURCE" occurs, the request is added to the pending list, and will be handled later. An error message
should not be printed to the user in this case. This is not an error, but rather a notification of a possible valid condition.
Only in the case of "OMPI_ERROR" should it be printed to the user.
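A sketch of the intended caller-side handling; try_send and append_to_pending_list are hypothetical names used only to show the contract:

    #include "ompi/constants.h"
    #include "opal/util/output.h"

    /* Hypothetical helpers, named only for illustration. */
    extern int  try_send(void *request);
    extern void append_to_pending_list(void *request);

    static void send_or_queue(void *request)
    {
        int rc = try_send(request);
        switch (rc) {
        case OMPI_SUCCESS:
            break;                            /* sent: nothing to do         */
        case OMPI_ERR_OUT_OF_RESOURCE:
            append_to_pending_list(request);  /* valid condition: stay quiet */
            break;
        default:                              /* OMPI_ERROR and friends      */
            opal_output(0, "send failed with %d", rc);
            break;
        }
    }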
This commit was SVN r27065.
Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code.
Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch.
This commit was SVN r26242.
Extend the BTL error callback so that it allows the BTL to specify the
specific ompi_proc_t that had an error, and add an optional descriptive
string. Currently, these arguments are not used, but they will be used by the future failover PML.
Changes based on RFC. Reviewed by George Bosilca.
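The callback shape this implies, roughly; parameter names are guesses from the description above, not necessarily what appears in btl.h:

    #include <stdint.h>

    struct mca_btl_base_module_t;
    struct ompi_proc_t;

    /* Rough sketch of the extended error callback described above. */
    typedef void (*mca_btl_base_module_error_cb_fn_t)(
        struct mca_btl_base_module_t *module, /* BTL reporting the error      */
        int32_t flags,                        /* error classification flags   */
        struct ompi_proc_t *errproc,          /* peer proc that had the error */
        char *btlinfo);                       /* optional descriptive string  */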
This commit was SVN r23174.
* Change all comparisons of the form (OMPI_ERR_* == ret) to
(OMPI_ERR_* == OPAL_SOS_GET_ERR_CODE(ret)), since the return value could be a
SOS-encoded error. OPAL_SOS_GET_ERR_CODE() takes in a SOS error and returns
back the native error code (illustrated below).
* Since OPAL_SUCCESS is preserved by SOS, also change all calls of the form
(OPAL_ERROR == ret) to (OPAL_SUCCESS != ret). We thus avoid having to
decode 'ret' to get the native error code.
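The before/after pattern, for illustration; the error-code name is just an example and headers are omitted:

    /* Pattern illustration only. */
    int check(int ret)
    {
        /* before: (OMPI_ERR_OUT_OF_RESOURCE == ret) may miss a
         * SOS-encoded value; after: decode first, then compare. */
        if (OMPI_ERR_OUT_OF_RESOURCE == OPAL_SOS_GET_ERR_CODE(ret)) {
            return 1;  /* handle the specific error */
        }
        /* OPAL_SUCCESS survives SOS encoding, so test success directly
         * instead of comparing against OPAL_ERROR. */
        return (OPAL_SUCCESS != ret) ? -1 : 0;
    }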
This commit was SVN r23162.
Also includes some minor copyright header additions that were missed in previous checkins
fixes trac:2101 added cmr:v1.4
This commit was SVN r22676.
The following Trac tickets were found above:
Ticket 2101 --> https://svn.open-mpi.org/trac/ompi/ticket/2101
Rename OMPI_* to OPAL_*. This allows the opal layer to be used more independently
of the whole of ompi.
NOTE: 9 "svn mv" operations immediately follow this commit.
This commit was SVN r21180.
Adapt orte_process_info to orte_proc_info, and
change orte_proc_info() to orte_proc_info_init().
- Compiled on linux-x86-64
- Discussed with Ralph
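In caller terms, the function rename looks like this (illustrative fragment only):

    /* Illustrative call-site change implied by the rename: */
    ret = orte_proc_info();       /* before */
    ret = orte_proc_info_init();  /* after  */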
This commit was SVN r20739.
Anyway, this is blocking the move: do not include pml.h
if it is not really needed, i.e., if none of the following are used:
mca_pml
MCA_PML_CALL
OMPI_ANY_TAG
OMPI_ANY_SOURCE
OMPI_PROC_NULL
- Notable exceptions (deleting the include in one header meant adding it elsewhere):
- ompi/mca/mtl/psm/
- ompi/mca/osc/rdma/
- ompi/mca/btl/openib/btl_openib_endpoint.c depended on
pml_base_sendreq.h
- Tested on Linux/x86-64, this time including make check
(thanks Jeff and Ralph)
This commit was SVN r20725.
anyhow -- if oob functionality is needed, then orte/mca/oob/oob.h should be included.
Nevertheless, this compiles fine with -Wimplicit-function-declaration
This commit was SVN r20641.
Commit from a long-standing Mercurial tree that ended up incorporating a lot of things:
* A few fixes for CPC interface changes in all the CPCs
* Attempts (but not yet finished) to fix shutdown problems in the IB CM CPC
* #1210: add CTS support (i.e., initiator guarantees to send first message; automatically activated for iWARP over the RDMA CM CPC)
* Some variable and function renamings to make this be generic (e.g., alloc_credit_frag became alloc_control_frag)
* CPCs no longer post receive buffers; they only post a single receive buffer for the CTS if they use CTS. Instead, the main BTL now posts the main sets of receive buffers.
* CPCs allocate a CTS buffer only if they're about to make a connection
* RDMA CM improvements:
* Use threaded-mode openib fd monitoring to wait for RDMA CM events
* Synchronize endpoint finalization and disconnection between main thread and service thread to avoid/fix some race conditions
* Converted several structs to be OBJs so that we can use reference counting to know when to invoke destructors
* Make some new OBJs have opal_list_item_t as their base, thereby eliminating the need for the local list_item_t type
* Renamed many variables to be internally consistent
* Centralize the decision as to whether this process or the remote process is supposed to be the initiator in an inline function (see the sketch after this list)
* Add oodles of OPAL_OUTPUT statements for debugging (hard-wired to output stream -1; to be activated by developers if they want/need them)
* Use rdma_create_qp() instead of ibv_create_qp()
* openib fd monitoring improvements:
* Renamed a bunch of functions and variables to be a little more obvious as to their true function
* Use pipes to communicate between main thread and service thread
* Add ability for main thread to invoke a function back on the service thread
* Ensure that initiator_depth and responder_resources are set properly by putting max_qp_rd_atom and max_qp_init_rd_atom in the modex (see rdma_connect(3))
* Ensure that the source IP address is set in rdma_resolve() so that we select the correct OpenFabrics source port
* Make new MCA param: openib_btl_connect_rdmacm_resolve_timeout
* Other improvements:
* btl_openib_device_type MCA param: can be "iw" or "ib" or "all" (or "infiniband" or "iwarp")
* Somewhat improved error handling
* Bunches of spelling fixes in comments, VERBOSE, and OUTPUT statements
* Oodles of little coding style fixes
* Changed shutdown ordering of btl; the device is now an OBJ with ref counting for destruction
* Added some more show_help error messages
* Change configury to only build IBCM / RDMACM if we have threads (because we need a progress thread)
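A hedged sketch of the centralized initiator decision mentioned above; the tie-breaking inputs shown (IP address and port) are illustrative, not necessarily what the RDMA CM CPC actually compares:

    #include <stdbool.h>
    #include <stdint.h>

    /* Illustrative only: both sides must deterministically agree on who
     * initiates, so compare a totally ordered pair of peer attributes. */
    static inline bool i_am_initiator(uint32_t my_ip, uint16_t my_port,
                                      uint32_t peer_ip, uint16_t peer_port)
    {
        if (my_ip != peer_ip) {
            return my_ip > peer_ip;  /* higher address initiates        */
        }
        return my_port > peer_port;  /* same host: break the tie on port */
    }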
This commit was SVN r19686.
The following Trac tickets were found above:
Ticket 1210 --> https://svn.open-mpi.org/trac/ompi/ticket/1210