openmpi

Author	SHA1	Message	Date
George Bosilca	ba3c247f2a	Big collective commit. I lightly tested it, but I think it should be quite stable. Anyway, the default decision functions (for broadcast, reduce and barrier) are based on a high performance network (not TCP). It should give good performance (really good) for any network having the following characteristics: small latency (5 microseconds) and good bandwidth (more than 1Gb/s). + Cleanup of the reduce algorithms, plus 2 new algorithms (binary and binomial). Now most of the reduce algorithms use a generic tree based function for completing the reduce. + Added macros for computing the trees (they are used for bcast and reduce right now). + Allow the usage of all 5 topologies. + Jelena's implementation of a binary tree that can be used for non commutative operations. Right now only the tree building function is there, it will get activated soon. + Some other minor cleanups. This commit was SVN r12326.	2006-10-26 22:53:05 +00:00
George Bosilca	99631ccf66	Cleanups. This commit was SVN r12272.	2006-10-23 22:29:17 +00:00
George Bosilca	d7d3f9e486	Tuned collectives work only for at least 2 processes. We have the self module for the other cases. This commit was SVN r12271.	2006-10-23 22:28:56 +00:00
George Bosilca	b848a5ad06	Remove all ompi_coll_chain_t references. This commit was SVN r12269.	2006-10-23 21:47:50 +00:00
George Bosilca	39cd8d3d17	One to rule them all. We only need one topology information: a tree. How we build it is what makes the difference. This commit was SVN r12268.	2006-10-23 21:46:30 +00:00
George Bosilca	9cf3040e5f	Allocate enough memory for the reduce operation when MPI_IN_PLACE is specified. This commit was SVN r12260.	2006-10-23 17:51:36 +00:00
George Bosilca	6b697ad3dd	If the operation is not commutative then force the basic reduce algorithm. The others cannot be used for non commutative operations ... yet ... This commit was SVN r12241.	2006-10-20 22:11:44 +00:00
George Bosilca	a7b6078b73	No more segfault. Still some wrong data around ... This commit was SVN r12238.	2006-10-20 20:17:34 +00:00
George Bosilca	02759cf515	Update the reduce chain collective. This commit was SVN r12237.	2006-10-20 19:47:52 +00:00
George Bosilca	06563b5dec	Last set of explicit conversions. We are now close to zero warnings on all platforms. The only exceptions (and I will not deal with them anytime soon) are on Windows: - the write functions which require the length to be an int when it's a size_t on all UNIX variants. - all iovec manipulation functions where the iov_len is again an int when it's a size_t on most of the UNIXes. As these only happen on Windows, I think we're set for now :) This commit was SVN r12215.	2006-10-20 03:57:44 +00:00
George Bosilca	527bb7a197	Remove a double ; This commit was SVN r12213.	2006-10-20 03:28:51 +00:00
George Bosilca	caefd6d0ee	Do not leak memory. Allocate the intermediary buffer only when we really need it (not on leaves) and release it the same way. This commit was SVN r12200.	2006-10-19 22:20:33 +00:00
George Bosilca	26b33ec2d7	If there is just one node, we don't need a decision function, just do the copy and return. This commit was SVN r12199.	2006-10-19 22:19:36 +00:00
George Bosilca	3eb2f90ceb	For the recursive doubling, correctly compute the closest power of 2 number of nodes. This commit was SVN r12191.	2006-10-19 17:14:57 +00:00
George Bosilca	041fcb8d18	Update the barrier decision function. This commit was SVN r12190.	2006-10-19 17:14:01 +00:00
George Bosilca	c9da782804	Keep only one function to get the size of a datatype. This commit was SVN r12170.	2006-10-18 17:33:01 +00:00
George Bosilca	21ade43b96	Remove an unreachable statement. This commit was SVN r12166.	2006-10-18 16:43:55 +00:00
George Bosilca	be27ee6fa0	Correct the bcast problem where we always did a bcast with segsize of 0. Activate the reduce decision function. Other small updates (mostly TAB to spaces). This commit was SVN r12161.	2006-10-18 02:00:46 +00:00
George Bosilca	8852c00c36	Looks like a big commit but in fact it addresses only one issue: the way we're working with size and displacement of data-types. After this patch all data can contain size_t bytes and the displacements are defined as ptrdiff_t. All of the files I was able to compile have been modified to match this requirement. This commit was SVN r12146.	2006-10-17 20:20:58 +00:00
Jeff Squyres	a8e9fa09da	Fix some compiler warnings introduced in r11619. I checked with George: ompi_ddt_type_size() returns a signed int only because of the MPI spec; it will never return a negative value. So casting the return value out of it to a (uint32_t) is safe, and makes the comparisons be between two unsigned values. This commit was SVN r11639. The following SVN revision numbers were found above: r11619 --> open-mpi/ompi@8667648a1b	2006-09-13 16:42:31 +00:00
Graham Fagg	8667648a1b	Simple fix (for ticket 363). We push segment size to type size. In other algorithms we switch off segmenting altogether. But really the DDT can probably handle partial types so we could really keep the segsize constant (for all but reduce ops) and treat it just as byte arrays.. todos: macroize it as we do it 10 different ways, add mca params to control handling (push up size, no change, switch off segmenting) This commit was SVN r11619.	2006-09-12 00:01:27 +00:00
Jeff Squyres	fb4d7ab268	* Fix svn:ignore * Remove files that should not be in SVN This commit was SVN r11565.	2006-09-08 10:35:45 +00:00
George Bosilca	3b39df8ae1	More protection around what we really want to get exported. This commit was SVN r11437.	2006-08-27 04:49:02 +00:00
Sami Ayyorgun	aa8cd63418	changed some barrier variables for shared-memory to volatile This commit was SVN r11403.	2006-08-24 16:53:10 +00:00
Torsten Hoefler	6b22641669	added LibNBC (http://www.unixer.de/NBC ) as collv1 (blocking) component. I know it does not make much sense but one can play around with the performance. Numbers are available at http://www.unixer.de/research/nbcoll/perf/. This is the first step towards collv2. Next step includes the addition of non-blocking functions to the MPI-Layer and the collv1 interface. It implements all MPI-1 collective algorithms in a non-blocking manner. However, the collv1 interface does not allow non-blocking collectives so that all collectives are used blocking by the ompi-glue layer. I wanted to add LibNBC as a separate subdirectory, but I could not convince the buildsystem (and did not have the time). So the component looks pretty messy. It would be great if somebody could explain to me how to move all nbc{c,h}, and {hb,dict}{c,h} to a separate subdirectory. It's .ompi_ignored because I did not test it exhaustively yet. This commit was SVN r11401.	2006-08-24 16:47:18 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
George Bosilca	6afa4c6c64	Windows friendly version. We have to split the OMPI_DECLSPEC in at least 3 different macros, one for each project. Therefore, now we have OPAL_DECLSPEC, ORTE_DECLSPEC and OMPI_DECLSPEC. Please use them based on the sub-project. This commit was SVN r11270.	2006-08-20 15:54:04 +00:00
Brian Barrett	4c101c6394	* rename the collectives sm bootstrap area to be consistent with other shared memory segments * make sure to properly unlink the collectives sm bootstrap area at shutdown * Add missing / in the path for the mpool shared memory segment * make sure to release the common_mmap structure in the SM btl after unlinking the file during shutdown This commit was SVN r10886.	2006-07-19 20:55:29 +00:00
George Bosilca	ee6fab783d	SwitchToThread is not defined by any library. Not even by the kernel32.lib as noted in the MSDN documentation. At least not on my WinXP Pro box. This commit was SVN r10719.	2006-07-11 05:36:04 +00:00
Graham Fagg	f10c21b746	corrected mca param description and algorithm count (now to find out why I have disallowed direct calling of the bm tree) This commit was SVN r10603.	2006-06-30 23:22:49 +00:00
Graham Fagg	f64cbbe8f2	Oops. Some decisions used extent rather than size for decision making; yes, this means it WAS possible for two nodes to choose two different algorithms (discovered by Doug Gregor and figured out by George). Also changed some names like size to comsize so we know which sizes we are using where. This should be updated in all versions. This commit was SVN r10601.	2006-06-30 21:49:04 +00:00
George Bosilca	29219ee57d	Thanks to Gleb now we are able to call the scheduler on Windows. Instead of using sched_yield, we use our friend SwitchToThread. This commit was SVN r9671.	2006-04-20 19:56:50 +00:00
Graham Fagg	c31a5ad4b3	A few small changes that just expanded in the name of neatness... (1) As pointed out by Torsten after Jeff comment that there are 15 collectives yesterday.. nope.. I have 16 but miscounted them in my ifdefs (I had two #11s). Replaced with an enum... (2) Added a readonly MCA param for how many backend algorithms are available per collective (used by benchmarker/STS) This allowed me to remove the tuned query internal functions and replace them with ompi_coll_tuned_forced_max_algorithms[COLL]. (3) I was reading the user forced MCA params for the collectives on each comm create (module init) but I then put the values into a global set of variables (like ompi_coll_tuned_reduce_forced_algorithm). To fix this and make the code neater: (a) The component looks up the MCA param indices on Open if dynamic_rules is set via the ompi_coll_tuned_COLLECTIVE_intra_check_forced_init () call. (b) Got rid of the ompi_coll_ompi_coll_tuned_COLLECTIVE_forced_algorithm/segmentsize/etc globals with a struct that is now cached on the module data hung off the communicator. i.e. done right. (c) On module init if dynamic rules enabled we call a general getvalues routine (in coll_tuned_forced.c) to get the CURRENT values using the MCA param indices and then put them on the modules data segment. A shorter version of getvalues exists for barrier which only needs the algorithm choice This commit was SVN r9663.	2006-04-19 23:42:06 +00:00
Tim Woodall	bd870519fd	- modified convertor copy_and_prepare routines to accept an additional flag, new flags to be included when convertor is initialized - modified pml/btl module defs and added stub functions for diagnostic output routines to dump state of queues / endpoints - updates to data reliability pml This commit was SVN r9329.	2006-03-17 18:46:48 +00:00
Jeff Squyres	8a9e76dfa3	Thanks to Sven for noticing that the increment in scatter should be per the send datatype, not the receive datatype (MPI-1:105). This commit was SVN r9312.	2006-03-16 18:18:28 +00:00
Graham Fagg	95b060c741	output the right name and stop confusing george This commit was SVN r9215.	2006-03-08 00:40:14 +00:00
George Bosilca	39252b764f	Correctly compute the size of the datatype. This commit was SVN r9127.	2006-02-23 04:30:52 +00:00
George Bosilca	805c45de29	Don't let a division by zero happen ... This commit was SVN r9109.	2006-02-22 06:34:05 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
Graham Fagg	232bb9534a	Start moving stuff out of modules that should be in the component. This commit was SVN r8874.	2006-02-01 20:50:14 +00:00
Graham Fagg	5f2d82347f	a couple of changes to make barrier synchronous.. means last communication to any possible peer must be locally completing. for now using synchronous calls until the new functionality is available. then will change the code to use the new PML send flags. This commit was SVN r8867.	2006-01-31 23:21:46 +00:00
Graham Fagg	25375759c3	arrgh. reduce could, for very small message sizes and proc counts, call a linear function. This was implemented using a chain (tree followed with pipeline) by setting the chain fanout to a factor of size etc, but the chain datastructure was fixed in length and if exceeded the topo create returned a null, which isn't helpful in cid next function of comdup... Anyway two fixes: first, we do have a real linear function so changed the decision function, and second, altered the topo chain create to force chain fanouts of less than 1 to 1 and fanouts bigger than max to max. Next check in will change chain to a dynamically allocated array (reallocable) but we shouldn't ever use a chain fanout for a linear tree anyway. (lesson: must rerun all tests for all data sizes when changing decision functions) This commit was SVN r8662.	2006-01-08 02:41:09 +00:00
George Bosilca	479d510eaf	Use the common SM component to unmap the shared memory file. This commit was SVN r8623.	2005-12-31 15:07:48 +00:00
Jeff Squyres	54c4bd3ce2	Update to have public symbols be consistent; use new prefix rule (apparently we've been doing this in opal and orte, but not in ompi yet). All public symbols begin with "ompi_coll_tuned_" (not mca_coll_tuned_) except the component struct. Now this component passes the illegal symbol report with no hits. This commit was SVN r8589.	2005-12-22 13:49:33 +00:00
Jeff Squyres	2435970cb8	Enable the new "tuned" coll component in an attempt to get wider testing. Note that this effectively replaces the "basic" component as the baseline collective component. Please report any problems with this component. If you run into problems with this component, you can disable it with: --mca coll_tuned_priority 0 This commit was SVN r8575.	2005-12-21 12:43:03 +00:00
Brian Barrett	a5af07cd6b	fixes suggested by Ralf for supporting both Libtool 1 and 2 in Open MPI... This commit was SVN r8538.	2005-12-19 03:10:23 +00:00
Graham Fagg	8651658816	minor compile warnings fix This commit was SVN r8497.	2005-12-14 19:09:46 +00:00
George Bosilca	6f45b6175a	Header protection. This commit was SVN r8441.	2005-12-10 22:11:10 +00:00
George Bosilca	79486e5922	Protect the min function on Windows as it's defined by default in windows.h This commit was SVN r8437.	2005-12-10 22:02:14 +00:00
George Bosilca	b7353c707d	Remove unprotected header files. This commit was SVN r8432.	2005-12-10 17:04:46 +00:00

189 commits