openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	8ace07efed	This commit brings in two major things: 1. Galen's fine-grain control of queue pair resources in the openib BTL. 1. Pasha's new implementation of asychronous HCA event handling. Pasha's new implementation doesn't take much explanation, but the new "multifrag" stuff does. Note that "svn merge" was not used to bring this new code from the /tmp/ib_multifrag branch -- something Bad happened in the periodic trunk pulls on that branch making an actual merge back to the trunk effectively impossible (i.e., lots and lots of arbitrary conflicts and artifical changes). :-( == Fine-grain control of queue pair resources == Galen's fine-grain control of queue pair resources to the OpenIB BTL (thanks to Gleb for fixing broken code and providing additional functionality, Pasha for finding broken code, and Jeff for doing all the svn work and regression testing). Prior to this commit, the OpenIB BTL created two queue pairs: one for eager size fragments and one for max send size fragments. When the use of the shared receive queue (SRQ) was specified (via "-mca btl_openib_use_srq 1"), these QPs would use a shared receive queue for receive buffers instead of the default per-peer (PP) receive queues and buffers. One consequence of this design is that receive buffer utilization (the size of the data received as a percentage of the receive buffer used for the data) was quite poor for a number of applications. The new design allows multiple QPs to be specified at runtime. Each QP can be setup to use PP or SRQ receive buffers as well as giving fine-grained control over receive buffer size, number of receive buffers to post, when to replenish the receive queue (low water mark) and for SRQ QPs, the number of outstanding sends can also be specified. The following is an example of the syntax to describe QPs to the OpenIB BTL using the new MCA parameter btl_openib_receive_queues: {{{ -mca btl_openib_receive_queues \ "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32" }}} Each QP description is delimited by ";" (semicolon) with individual fields of the QP description delimited by "," (comma). The above example therefore describes 4 QPs. The first QP is: P,128,16,4 Meaning: per-peer receive buffer QPs are indicated by a starting field of "P"; the first QP (shown above) is therefore a per-peer based QP. The second field indicates the size of the receive buffer in bytes (128 bytes). The third field indicates the number of receive buffers to allocate to the QP (16). The fourth field indicates the low watermark for receive buffers at which time the BTL will repost receive buffers to the QP (4). The second QP is: S,1024,256,128,32 Shared receive queue based QPs are indicated by a starting field of "S"; the second QP (shown above) is therefore a shared receive queue based QP. The second, third and fourth fields are the same as in the per-peer based QP. The fifth field is the number of outstanding sends that are allowed at a given time on the QP (32). This provides a "good enough" mechanism of flow control for some regular communication patterns. QPs MUST be specified in ascending receive buffer size order. This requirement may be removed prior to 1.3 release. This commit was SVN r15474.	2007-07-18 01:15:59 +00:00
George Bosilca	c839694fb8	Dont print anything when the user requested a specific MX interface. This commit was SVN r15426.	2007-07-14 00:04:50 +00:00
Galen Shipman	06b97cb267	fix template btl This commit was SVN r15413.	2007-07-13 20:06:22 +00:00
Josh Hursey	d4d5a351c1	Silence a compiler warning when not using IPV6. Also convert a few statements to conform to coding standard for Open MPI. This commit was SVN r15407.	2007-07-13 16:38:36 +00:00
Josh Hursey	021249fa65	Use the new MCA metadata flag instead of 'false' for the newly added components This commit was SVN r15400.	2007-07-13 14:39:17 +00:00
George Bosilca	8643f38adf	Don't allow the BTL to be closed before the end of the process. Count the number of times the BTLs are opened, and then don't remove them until close was called the same number of times. This commit was SVN r15376.	2007-07-11 22:21:04 +00:00
Brian Barrett	1f2942cf2a	* Provide flag if the BTL can do RDMA, but requires a prepare_{src,dst} that exactly describes the buffer to be used as the target of the operation * Use the above flag to disable components setting the flag from being used for real RDMA operations for the one-sided component (the BTLs will still be used for RDMA transfers for the PML and for send/receive communication for the OSC component) This commit was SVN r15375.	2007-07-11 21:21:40 +00:00
Jeff Squyres	8aa8a667da	Use the OMPI version number for the component number, like all other btl components. This commit was SVN r15363.	2007-07-11 15:45:25 +00:00
Donald Kerr	88c9dfdf9f	improve message to user when dat_ia_open fails This commit was SVN r15362.	2007-07-11 15:20:35 +00:00
Andrew Friedley	87dd4bbd47	No idea how I did this.. thanks again to Jeff. This commit was SVN r15345.	2007-07-10 20:37:42 +00:00
Brian Barrett	1d02b9e7b5	Fix a bunch of issues exposed by Ken Cain in getting Open MPI to work with VxWorks. Still some issues remaining, I'm sure. Refs trac:1010 This commit was SVN r15320. The following Trac tickets were found above: Ticket 1010 --> https://svn.open-mpi.org/trac/ompi/ticket/1010	2007-07-10 03:46:57 +00:00
George Bosilca	1200fa4ac5	The first version of the Elan BTL. This commit was SVN r15319.	2007-07-09 21:03:13 +00:00
Jeff Squyres	cee9c214c7	Update the vendor ID list to include HP (0x1708). Thanks to Peter Kjellstrom for pointing this out. This commit was SVN r15316.	2007-07-09 20:09:31 +00:00
Brian Barrett	8b9e8054fd	Move modex from pml base to general ompi runtime, sicne it's used by more than just the PML/BTLs these days. Also clean up the code so that it handles the situation where not all nodes register information for a given node (rather than just spinning until that node sends information, like we do today). Includes r15234 and r15265 from the /tmp/bwb-modex branch. This commit was SVN r15310. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r15234 r15265	2007-07-09 17:16:34 +00:00
Andrew Friedley	b212cf4dae	Fix a signedness warning reported by Jeff/MTT. This commit was SVN r15309.	2007-07-09 15:30:29 +00:00
Andrew Friedley	77038b65a8	Bring the UD BTL over to the trunk, named 'ofud'. This commit was SVN r15298.	2007-07-05 23:42:54 +00:00
Sven Stork	21f12f29f8	- fix a sm bug that causes segfaults in the case of threaded builds. The problem is that in the case of threaded builds for every fifo a head and tail lock will be allocated inside the shared memory segment and the ptr is stored inside the fifo. In the case that the sm backend file will be mapped in all processes at the same address (mostly the case for non-thread builds) this is fine, but in the cases when the processes map the file at different addresses this addresses cause big trouble in other processes than the one that allocted the locks. Therefore the send lock addresses have to be recalculated to match the local mapping of the processes that use them. This commit was SVN r15291.	2007-07-05 14:26:32 +00:00
Brian Barrett	41afd4ebee	Clean up the MX configure test a bit. Use AC macros instead of hand writing them. Better tests, less code, and caching. Update the code to match changes in configure defines. This commit was SVN r15287.	2007-07-04 22:07:30 +00:00
George Bosilca	dfa5ae34e1	Per a discussion with Kees Verstoep and Reese Faucette add one more argument to the query for the line speed. This function is still not documented, and it really look strange that we have to respecify the nic_id (it's already attached to the endpoint). This commit was SVN r15241.	2007-06-28 20:58:00 +00:00
Brian Barrett	f8fb1e9720	Fix some compile failures on Solaris 9 because it doesn't have V6ONLY. This commit was SVN r15237.	2007-06-28 18:52:15 +00:00
George Bosilca	aec0b00f29	Get some hints about the network and propagate them to the upper level. This commit was SVN r15236.	2007-06-28 18:51:48 +00:00
Sven Stork	428f697542	- addition to r15198. Update also the prepare destintation functions. This commit was SVN r15199. The following SVN revision numbers were found above: r15198 --> open-mpi/ompi@f63dd902cb	2007-06-26 12:07:30 +00:00
Sven Stork	f63dd902cb	- bring the order changes of r14768 also to the mvapi btl This commit was SVN r15198. The following SVN revision numbers were found above: r14768 --> open-mpi/ompi@3401bd2b07	2007-06-26 09:34:44 +00:00
Jeff Squyres	022bd30558	Back out r15158 because it apparently breaks with recent versions of flex (which, incidentally, emit ''more'' warnings than earlier versions). Grumble. This commit was SVN r15166. The following SVN revision numbers were found above: r15158 --> open-mpi/ompi@57d09c10f7	2007-06-21 21:14:10 +00:00
Jeff Squyres	57d09c10f7	Avoid some compiler warnings that come up ''every day'' in MTT (and have been for eons): make a symbol be used in a dumb but harmless way. This commit was SVN r15158.	2007-06-21 15:42:06 +00:00
Gleb Natapov	b88b7dedfe	Rename btl_rdma_offset to btl_pipeline_send_length. This commit was SVN r15153.	2007-06-21 07:12:40 +00:00
Jeff Squyres	84487f5c4b	Update and correct the help messages for the generic BTL MCA parameters. Hopefully, they now make more sense to the mostly naieve user... This commit was SVN r15147.	2007-06-20 16:37:50 +00:00
Jeff Squyres	930a9b7682	Make the help messages for if_include/if_exclude a little better. This commit was SVN r15134.	2007-06-19 13:38:58 +00:00
Gleb Natapov	643037907f	Convert all #ifdef OMPI_ENABLE_DEBUG to #if. This commit was SVN r15117.	2007-06-17 07:14:47 +00:00
George Bosilca	ceb8abe9c1	OMPI_ENABLE_DEBUG require an #if not an #ifdef This commit was SVN r15107.	2007-06-15 19:22:19 +00:00
Josh Hursey	6cdfefad87	Fix portals BTL and cnos RML. Both were failing due to interface changes that were never applied to them properly. This commit was SVN r15082.	2007-06-14 18:49:41 +00:00
Jeff Squyres	2399b9a535	Ensure to initialize the variable so that we don't segv. This commit was SVN r15078.	2007-06-14 13:59:28 +00:00
Gleb Natapov	7b9ae49fe1	This time correctly calculate local BTL rank among all BTLs in a subnet. This commit was SVN r15073.	2007-06-14 10:27:11 +00:00
Jeff Squyres	1e18265c16	Bring over the functionality from the /tmp/jnysal-openib-wireup branch: * Support btl_openib_if_include and btl_openib_if_exclude MCA parameters, similar to those supported by other BTLs. Each take a comma-delimited lists of identifiers. Identifiers can be HCA interface names (e.g., ipath0, mthca1, etc.) or an HCA interface name and port numbers (e.g., ipath0:1, mthca1:2, etc.). It is an error to specify both _include and _exclude. If you specify a non-existant (or non-ACTIVE) HCA and/or port, you'll get a warning unless you disable the warning by setting the MCA parameter btl_openib_warn_nonexistent_if to 0. * Start updating to use BEGIN_C_DECLS and END_C_DECLS * A few other minor fixes that were picked up along the way. This commit was SVN r15063.	2007-06-14 01:59:25 +00:00
Gleb Natapov	8164723014	Allow to configure bandwidth and latency with finer granularity. Set bandwidth for all ports of mthca0: --mca btl_openib_bandwidth_mthca0 1000 Set bandwidth for port 1 of mthca1: --mca btl_openib_bandwidth_mthca1:1 1000 Set latency for port 2 lid 123 on mthca0: --mca btl_openib_latency_mthca0:2:123 20 This commit was SVN r15041.	2007-06-13 12:47:38 +00:00
Gleb Natapov	5c3f511451	Properly determine btl's rank among all btls withing the same subnet. This commit was SVN r15038.	2007-06-13 11:15:58 +00:00
Brian Barrett	27ad954265	Fix a couple of problems with the way we were using orte_process_name_t structures in the system. Instead of using memcmp, use the ns function. This won't cause a problem as long as all three elements of the name are ints, but if they have different sizes, alignment and padding rules can cause memcmp() to compare padding space, which rarely holds a sane value. This commit was SVN r14998.	2007-06-11 19:12:11 +00:00
George Bosilca	e2dd0a50fc	A better version alowing for multi-rails or clusters of clusters. A lot of cleanups. This commit was SVN r14963.	2007-06-08 20:37:20 +00:00
George Bosilca	c66cf32ee2	Cleaning up. Removing all unused variables and fields in the MX BTL and component structures. This commit was SVN r14957.	2007-06-07 21:02:18 +00:00
Tim Prins	06bf4c3f3b	fix some printf warnings This commit was SVN r14934.	2007-06-06 22:37:26 +00:00
George Bosilca	6a5e039466	Allow smart connection to be setup. Each peer now has attached to it thea unique id based on the last half of the mapper MAC. This allow us to figure out how to connect peers. This allow the MX BTL to be used in a cluster of cluster configuration where each cluster have MX internally as well as on a multi rail MX system. This commit was SVN r14932.	2007-06-06 21:42:11 +00:00
Galen Shipman	5340f5e320	Try to cleanup the flow control logic a bit Renamed a few variables Inialize the reserve receive buffers to 1, prior to this they were initialized to zero. This commit was SVN r14919.	2007-06-06 18:51:09 +00:00
Gleb Natapov	de58336c45	Let rdma_pipeline_offset to be set to zero. This commit was SVN r14900.	2007-06-06 11:54:25 +00:00
Donald Kerr	8ecbc71ed2	add support for connection private data, off by default This commit was SVN r14878.	2007-06-05 19:29:50 +00:00
Gleb Natapov	ac1e8f81af	Lets be real. TCP latency is slightly worse then mx/openib. This commit was SVN r14865.	2007-06-05 12:22:57 +00:00
Gleb Natapov	fbd033b162	Cut&Paste error in r14795. Fix. This commit was SVN r14862. The following SVN revision numbers were found above: r14795 --> open-mpi/ompi@6b0d8c0858	2007-06-05 10:07:06 +00:00
Brian Barrett	508da4e959	OS X apparently really doesn't like shared libraries with unresolvable symbols in them and environ is defined only in the final application (probably in crt1.o). Apple provides a function for getting at the environment, so use that instead if it's available. This commit was SVN r14857.	2007-06-05 03:03:59 +00:00
Brian Barrett	a446af5b6b	* Remove unneeded SRQ test -- we no longer support OFED builds that don't have the SRQ interface. * Instead of setting AC_DEFINEs per MCA component, set per test. THe answers can never be difference, and this will speed sed just a teeny bit This commit was SVN r14856.	2007-06-05 01:49:26 +00:00
Gleb Natapov	6b0d8c0858	TCP BTL ignores btl_tcp_bandwidth parameter. Fix it. This commit was SVN r14795.	2007-05-30 14:12:05 +00:00
Donald Kerr	91c9b7b6f9	don't call dat_evd_resize if new value is less than or equal to current because ofed stack does not return DAT_INVALID_STATE This commit was SVN r14792.	2007-05-29 20:08:16 +00:00

... 4 5 6 7 8 ...

1005 Коммитов