openmpi

Author	SHA1	Message	Date
Brian Barrett	3e29949cc8	* Fix shutdown code in utcp portals code * make all sends long sends for now in Portals MTL * More optimized match check This commit was SVN r10667.	2006-07-05 21:46:45 +00:00
Galen Shipman	fe480cd003	change mask bits and don't call convertor if we received directly into the user buffer.. This commit was SVN r10665.	2006-07-05 21:10:09 +00:00
Jeff Squyres	429c25095e	Fix for bug #176 . * Fix for two problems introduced by r10661: 1. ensure to use the key ''after'' it is initialized (sigh). 1. handle the case where we free the attrkey before it is fully initialized (i.e., some other error causes us to free it). In this case, don't try to remove the key from the hash map, because it won't exist. * More accurate zeroing in the keyval constructor (ompi_attrkey_item_constructor) * Widen the scope of the alock such that the attrkey destructor does not need to acquire it. Instead, assume that the caller already has it. * Add a comment about why the keyval may get destroyed as the result of deleting an attribute (so that I don't have to figure it out again the next time I read this code :-) ) This commit was SVN r10664. The following SVN revision numbers were found above: r10661 --> open-mpi/ompi@fdba2c9df0	2006-07-05 20:23:08 +00:00
George Bosilca	6265625983	Generate the XFER_CONTINUE PERUSE event (or the receive) before unpacking the data. This commit was SVN r10663.	2006-07-05 19:45:00 +00:00
Jeff Squyres	fdba2c9df0	Per the analysis in bug #184 , move some assignments around to effect thread safety. This is likely to be only the first of multiple steps for complete thread safety in the MPI attribute code. All tests [continue to] pass the intel and ibm attribute tests. Also renamed a variable from "attr" to "attrkey" to reflect that it's a keyval, not an attribute. This commit was SVN r10661.	2006-07-05 17:37:17 +00:00
Brian Barrett	4ee4acb6a6	* ignore some Cray-only code when not on the Cray machine This commit was SVN r10660.	2006-07-05 17:16:27 +00:00
Brian Barrett	043153dad3	* fix opal_list_item_t -> ompi_free_list_item_t type change This commit was SVN r10659.	2006-07-05 17:02:16 +00:00
Rainer Keller	23d3628691	- Declare and initialize the peruse_handle_list_lock This commit was SVN r10656.	2006-07-05 13:48:25 +00:00
George Bosilca	d2bf3844e9	Include the header file which defines opal_output. This commit was SVN r10648.	2006-07-04 06:23:01 +00:00
George Bosilca	2bdb06b549	Force the request to NULL in order to avoid complaints from the compiler. This commit was SVN r10647.	2006-07-04 06:20:13 +00:00
George Bosilca	402a03d229	Add a .h dependency in order to remove a warning when we compile without --enable-debug. This commit was SVN r10646.	2006-07-04 04:53:38 +00:00
George Bosilca	9ac1a6cdb3	Remove the warnings. Now they are ompi_free_list_item not opal_list_item_t. This commit was SVN r10645.	2006-07-04 04:21:16 +00:00
Brian Barrett	7d12f9119a	* make sure to include post_configure.sh in the dist tarball, so that direct calling the ob1 pml works properly. This commit was SVN r10644.	2006-07-04 04:03:58 +00:00
Brian Barrett	47725c9b02	* Add new PML (CM) and network drivers (MTL) for high speed interconnects that provide matching logic in the library. Currently includes support for MX and some support for Portals * Fix overuse of proc_pml pointer on the ompi_proc structure, splitting into proc_pml for pml data and proc_bml for the BML endpoint data * bug fixes in bsend init code, which wasn't being used by the OB1 or DR PMLs... This commit was SVN r10642.	2006-07-04 01:20:20 +00:00
Graham Fagg	f10c21b746	corrected mca param description and algorithm count (now to find out why I have disallowed direct calling of the bm tree) This commit was SVN r10603.	2006-06-30 23:22:49 +00:00
Josh Hursey	2edf1511fd	Closes ticket #173 : Split name linking up for orte/ompi shared tools. This moves the logic to create the symbolic links for: - mpirun - mpiexec - ompi-ps - ompi-clean and their respective man pages to the ompi level from the orte layer. This is a bit pedantic, but orte shouldn't be doing the work of ompi since that is a bit of an abstraction break. Note: need to autogen.sh to get this. Sorry :( This commit was SVN r10602.	2006-06-30 22:01:56 +00:00
Graham Fagg	f64cbbe8f2	Oops: some decisions used extent rather than size for decision making. Yes, this means it WAS possible for two nodes to choose two different algorithms (discovered by Doug Gregor and figured out by George). Also changed some names like size to comsize so we know which sizes we are using where. This should be updated in all versions. This commit was SVN r10601.	2006-06-30 21:49:04 +00:00
Brian Barrett	df9273587f	* romio_cb_write should also be forced to enable when optimizations are requested This commit was SVN r10584.	2006-06-30 15:06:10 +00:00
Galen Shipman	7e079d20ab	fix for stupid casting.. addresses issue on PPC64 where sizes get set improperly and badness ensues.. This commit was SVN r10574.	2006-06-29 21:58:50 +00:00
George Bosilca	7d59a6885b	Remove all references to the MRU list. Add back the repost list checks. For some reason it decreases the latency by around 0.3 microseconds ... This commit was SVN r10571.	2006-06-29 19:25:44 +00:00
George Bosilca	78f0de127d	Typo. This commit was SVN r10567.	2006-06-29 15:16:25 +00:00
George Bosilca	4df58b5579	Latency is LATENCY as everybody understands it, not some percentage of something. Now we really order the BTLs depending on the real latency for the eager protocol. From now on, the latency one can specify for the devices will be in microseconds, while the bandwidth is in Mbs (as it was before). This commit was SVN r10566.	2006-06-29 15:13:58 +00:00
George Bosilca	238147f576	Help the compiler to optimize the code. Now the order in the enum reflects the order we use them in the switch. This commit was SVN r10565.	2006-06-29 15:10:58 +00:00
George Bosilca	9bf281bca2	Remove the gm_mru_reg list as it is never used. Cleanup the repost logic. Now we repost a receive fragment only when we're done with the message from inside and we try to add it to the list. This commit was SVN r10564.	2006-06-29 15:10:11 +00:00
George Bosilca	43b7b17033	Release the memory registration when the descriptors get freed. This commit was SVN r10540.	2006-06-28 15:24:16 +00:00
George Bosilca	d9daa34a6c	Set the registration field to NULL when we create a new fragment. This commit was SVN r10539.	2006-06-28 15:23:36 +00:00
Gleb Natapov	c8f75c472a	remove modulo op from fast path. Improvement 0.02-0.04ms. This commit was SVN r10538.	2006-06-28 12:00:47 +00:00
Gleb Natapov	e58a89ef3e	OMPI_ENABLE_DEBUG is always defined (to 0 or 1). Use #if and not #ifdef. This commit was SVN r10537.	2006-06-28 11:25:09 +00:00
Gleb Natapov	704a5eb645	Support for LMC (lid mask count) and multiple QPs per port. This commit was SVN r10536.	2006-06-28 07:23:08 +00:00
Galen Shipman	e6cd8db0e5	DR will now checksum on a per btl basis (see MCA_BTL_FLAGS_NEED_CSUM). We still always send ACK's, teasing apart completion for ACK/no ACK looks like a pain in the .. This commit was SVN r10530.	2006-06-27 20:23:47 +00:00
Brian Barrett	0031e39d72	* fix for dumb memory bug introduced in romio performance fixup code This commit was SVN r10528.	2006-06-27 19:58:18 +00:00
Brian Barrett	9a65a7ca97	* re-add -Is necessary for VPATH builds. This commit was SVN r10524.	2006-06-27 14:10:34 +00:00
Jeff Squyres	df45221a3e	Until a real fix for #142 is found, this workaround prohibits using mpi_leave_pinned when multiple OpenIB HCA ports are found. Specifically, if mpi_leave_pinned == 1 and multiple HCA ports are found, the MCA parameter btl_openib_max_btls is set to 1. If the MCA parameter btl_openib_warn_leave_pinned_multi_port is true, emit a warning that this happened (having an MCA parameter to control the warning allows users/sysadmins to turn it off instead of being nagged for every run). This commit was SVN r10521.	2006-06-27 10:43:03 +00:00
Gleb Natapov	012d95d195	If ompi_free_list_grow fails wait until resources are available instead of spinning without progress. This commit was SVN r10520.	2006-06-27 09:23:51 +00:00
Gleb Natapov	52208d7bf9	We don't need to register zero sized frags. This commit was SVN r10519.	2006-06-27 08:50:12 +00:00
Galen Shipman	8855e5b73a	Fixes for DR as well as better diagnostic.. Successfully passing the intel test suite with/without induced errors/drops. This commit was SVN r10518.	2006-06-26 22:29:29 +00:00
Brian Barrett	970d858f30	* Add performance code requested by LANL, per ticket #128 . Must be explicitly enabled at run-time with the mca parameter io_romio_enable_parallel_optimizations set to something non-zero. This will enable some magic flags in Panasas if the user didn't set them (either on or off) and do some slightly better things with strided collective writes. This commit was SVN r10516.	2006-06-26 22:26:36 +00:00
George Bosilca	940dbff0fa	Add a new PERUSE macro. This is for the CONTINUE event (the one we added to the standard). This macro allows us to specify the length of the fragment. Now we are able to know how the message is fragmented between the network devices or inside the communication protocol. This commit was SVN r10508.	2006-06-26 20:08:33 +00:00
George Bosilca	41c886399b	Don't let the user specify flags which do not make sense. If the PUT flag is specified, check that the put function is available for the BTL. Same safety check for the GET function. At the end, make sure that at least one communication protocol is specified; otherwise force the send flag. This commit was SVN r10507.	2006-06-26 20:00:18 +00:00
George Bosilca	c43b9821e7	Generate the PERUSE XFER_CONTINUE event. This commit was SVN r10501.	2006-06-26 19:01:22 +00:00
George Bosilca	53a5d3df0f	Remove useless lines. This commit was SVN r10500.	2006-06-26 19:00:37 +00:00
George Bosilca	a514cdc068	Always limit the size of the RDMA transfer to the maximum amount supported by the BTL (btl_max_rdma_size). Now the PUT protocol is pipelined even if there is just one network between the 2 peers. Unfortunately, this problem is present in the 1.1 (no pipeline for the PUT protocol). This commit was SVN r10499.	2006-06-26 19:00:07 +00:00
George Bosilca	8cd4718198	Generate the PERUSE PERUSE_COMM_REQ_XFER_BEGIN event only when there is some data to transfer. This commit was SVN r10498.	2006-06-26 18:57:55 +00:00
Gleb Natapov	b7715395cb	Return descriptor before sending credits one more time. We may need it. This commit was SVN r10495.	2006-06-26 07:05:58 +00:00
Andrew Friedley	7bfac82ce7	Change over from lazy connection setup to setting up at initialization time. UD is connectionless, and as long as peers are statically assigned to QPs, there is no reason to set up the addressing information lazily. Lots of code was axed, as endpoints no longer have state. Removed a number of other elements in the endpoint struct to make it as lightweight as possible. I was able to remove an entire function call/branch in the send path, which I believe is the main contributor to a 2us drop in NetPIPE latency. Some whitespace cleanups as well. Passes IBM test suite, and all but certain intel tests that were failing before the change, over ob1 PML. This commit was SVN r10494.	2006-06-23 16:50:50 +00:00
Andrew Friedley	046f4cd4ae	Enough cleanup for now. Moved a lot of the module-specific init from the component init to the module init. Try keeping a pointer to reduce indexing, didn't seem to help - leaving in place for now. This commit was SVN r10485.	2006-06-22 22:12:13 +00:00
Brian Barrett	7dd1112d07	* implement missing MPI::Is_finalized() function This commit was SVN r10482.	2006-06-22 19:40:54 +00:00
Andrew Friedley	8392ed4cac	A checkpoint before I really do some cleanup.. nothing pretty here. Playing around with OPAL_LIKELY/UNLIKELY, no real gains yet. Reworked progress() to process many WC's at a time, as well as immediately repost groups of receive buffers. This commit was SVN r10481.	2006-06-22 18:06:55 +00:00
George Bosilca	9eb023a5c2	OK my last commit was ... kind of wrong. It only worked if the element_size was smaller than the CACHE_LINE_SIZE. Here is the version that works. In fact this works on 2 steps. First we set the element size to something multiple of the desired alignment. Then when we allocate memory, we compute the total size, and we will align each of the elements (we allocate multiple of them every time) to the CACHE_LINE_SIZE. This commit was SVN r10479.	2006-06-22 14:47:07 +00:00
George Bosilca	c71f6c9765	All elements will be aligned to the CACHE_LINE_SIZE define (currently 128 bytes). The simplest way to make sure they are aligned is to update the size of the basic element to a multiple of the desired alignment. It will use a little bit more memory, but the improvements on the SM BTL seems quite interesting. This commit was SVN r10478.	2006-06-22 14:07:14 +00:00
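
The two alignment commits at the end of this list (r10478 and r10479) describe a two-step scheme: round the element size up to a multiple of the alignment, then align the start of each allocated chunk so that every element lands on a cache-line boundary. The following is a minimal sketch of that idea only, not the Open MPI free list code. The function names, the `stride` and `raw` out-parameters, and the use of plain `malloc` are assumptions made for illustration; the 128-byte value is the one quoted in the r10478 message.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

#define CACHE_LINE_SIZE 128 /* value quoted in the r10478 message */

/* Step 1: round an element size up to the next multiple of
 * CACHE_LINE_SIZE. The mask trick relies on it being a power of two. */
size_t round_up_to_cache_line(size_t size)
{
    return (size + CACHE_LINE_SIZE - 1) & ~((size_t)CACHE_LINE_SIZE - 1);
}

/* Step 2 (hypothetical helper): allocate `count` elements in one chunk.
 * The chunk is over-allocated by one cache line so the first element can
 * be pushed forward to an aligned address. Because the stride is a
 * multiple of CACHE_LINE_SIZE, every later element is aligned as well.
 * `*raw` receives the pointer that must eventually be passed to free(). */
unsigned char *alloc_aligned_elements(size_t elem_size, size_t count,
                                      size_t *stride, void **raw)
{
    uintptr_t addr;

    assert(elem_size > 0 && count > 0);
    *stride = round_up_to_cache_line(elem_size);
    *raw = malloc(*stride * count + CACHE_LINE_SIZE);
    if (*raw == NULL) {
        return NULL;
    }
    addr = (uintptr_t)*raw;
    addr = (addr + CACHE_LINE_SIZE - 1) & ~((uintptr_t)CACHE_LINE_SIZE - 1);
    return (unsigned char *)addr;
}
```

Element `i` then lives at `base + i * stride`. Rounding the size alone (the r10478 approach) is not sufficient when the chunk itself starts at an unaligned address, which is why the sketch also aligns the base pointer, as the r10479 message describes.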

1796 commits