openmpi

Автор	SHA1	Сообщение	Дата
Shiqing Fan	a0660f4deb	- Just some type casts. This commit was SVN r16100.	2007-09-12 15:29:58 +00:00
Jeff Squyres	c4a38f47f6	Resolve Coverity CID 467: remove unused variable / dead code. This commit was SVN r15997.	2007-08-29 01:23:18 +00:00
Edgar Gabriel	a2f5cada1a	convert the hiearch component to the new structure. More testing required before we remove the .ompi_ignore flag again. This commit was SVN r15954.	2007-08-23 20:41:29 +00:00
Shiqing Fan	a497a3fcad	- Fix some small bugs, copy-paste mistakes. This commit was SVN r15941.	2007-08-21 19:57:28 +00:00
Sven Stork	3985a35c35	- export required symbol This commit was SVN r15939.	2007-08-21 18:46:11 +00:00
Brian Barrett	af4e86c25f	Update collectives selection logic to allow for multiple components to be used at nce (up to one unique collective module per collective function). Matches r15795:15921 of the tmp/bwb-coll-select branch This commit was SVN r15924. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r15795 r15921	2007-08-19 03:37:49 +00:00
Jelena Pjesivac-Grbovic	9bd9c92dbd	Making sure that the decision function for scatter and gather correctly computes everything for MPI_IN_PLACE case. This commit was SVN r15841.	2007-08-13 17:35:50 +00:00
Jelena Pjesivac-Grbovic	b558e820cb	removing compiler wraning This commit was SVN r15803.	2007-08-08 15:22:01 +00:00
Jelena Pjesivac-Grbovic	daa10b277e	modifying scatter decision function to use binomial algorithm for small message sizes. This commit was SVN r15798.	2007-08-07 22:16:13 +00:00
Mohamad Chaarawi	59a7bf8a9f	Merging in the Sparse Groups.. This commit includes config changes.. This commit was SVN r15764.	2007-08-04 00:41:26 +00:00
Sven Stork	855434de59	- fixes several coverty issues - add missing initialisation for variables - use strncpy instead of strcpy This commit was SVN r15683.	2007-07-30 14:44:37 +00:00
Jelena Pjesivac-Grbovic	1b66a52c50	Modifying type of binomial tree used for binomial reduce: switching: 0 0 / \ \ / \ \ 1 \ \ --> 4 \ \ / \ \ / \ \ 3 2 \ 3 2 \ 4 1 (duh). The first form is the bmtree suitable for bcast, but the latter is better for reduce. Updating default decision function accordingly. This commit was SVN r15422.	2007-07-13 21:07:51 +00:00
Jelena Pjesivac-Grbovic	d677db9b5f	cleaning up alltoall implementation: - removing MPI_* calls from bruck implementation - simplifying 2 process case - identation, etc. This commit was SVN r15301.	2007-07-07 01:06:19 +00:00
Jelena Pjesivac-Grbovic	483222085e	Fixing compiler warnings. In gather, the ptmp += incr is irrelevant, since ptmp is set within the loop. This commit was SVN r15293.	2007-07-05 20:40:50 +00:00
Jelena Pjesivac-Grbovic	3b0a52a104	adding tuned allgatherv implementation using bruck, ring, and neighbor-exchange algorithms. The implementations passed intel and imb tests up to 40 processes. This commit was SVN r15280.	2007-07-03 23:33:12 +00:00
Jelena Pjesivac-Grbovic	d55b415bb0	fixing typo This commit was SVN r15240.	2007-06-28 20:56:55 +00:00
Jelena Pjesivac-Grbovic	8fc8b44d11	Modifying reduce decision function for large, single element reduces (again). Binary algorithm without segmentation tends to outperform binomial algorithm in this case. This commit was SVN r15226.	2007-06-27 22:01:56 +00:00
Jelena Pjesivac-Grbovic	0ecef1750d	Modifying the default reduce decision function to use binomial algorithm for single-element reduce (segmented algorithms make no sense in this case and can cause performance degradation). This commit was SVN r15209.	2007-06-26 20:14:03 +00:00
Jelena Pjesivac-Grbovic	567b40b9a9	Modifying the default broadcast decision function to use binomial algorithm for single-element broadcasts (segmented algorithms make no sense in this case and can cause performance degradation). This commit was SVN r15208.	2007-06-26 20:08:31 +00:00
Jelena Pjesivac-Grbovic	3740640711	Modifying MPI_Gather in tuned module: - adding linear algorithm with synchronization for gather. This algorithm prevents congestion at root process, but introduces synchronization (serializes non-root processes, but allows messages to arrive from two processes at the same time). It performed better than binomial and linear algorithms for large message, and intermediate and large communicator sizes. - Updating MPI_Gather decision function to reflect performance results from MX. I will perform more measurements though - so this one can change. This commit was SVN r15165.	2007-06-21 20:00:36 +00:00
Sven Stork	22af6d38e6	- UNexport symbols that shouldn't be needed outside the libraries - replace #if/#endif with BEGIN/END_C_DECLS - reformating This commit was SVN r14669.	2007-05-16 15:46:52 +00:00
Brian Barrett	21e00f6f0c	Clean up a couple of configure things: * Require Autoconf 2.60 or higher and remove some cruft required for AC 2.59 or the AC 2.59 / AC 2.60 mix * Remove a bunch of now unnecessary AC_SUBST calls * Use the libtool-provided variables for the -I and library to use when compiling against ltdl Fixes trac:1000 This commit was SVN r14652. The following Trac tickets were found above: Ticket 1000 --> https://svn.open-mpi.org/trac/ompi/ticket/1000	2007-05-15 04:23:48 +00:00
Jelena Pjesivac-Grbovic	625c6739ab	Removing warning about unsed variable This commit was SVN r14579.	2007-05-03 20:26:41 +00:00
Jelena Pjesivac-Grbovic	9eff74ad4d	Modifying generalized reduce "synchronized" behavior: - Removing "small" message size limit because it really does not relate to the eager size accross the board. Now, the leaf nodes in generalized reduce will use blocking send (DEFAULT/ORIGINAL BEHAVIOR) either when the maximum number of outstanding requests is 0 or when the total number of segments is less than the maximum number of outstanding requests. Otherwise, it will send messages using non-blocking synchronized send operation. This commit was SVN r14572.	2007-05-02 21:42:45 +00:00
George Bosilca	69642a9cd4	Remove 2 warnings about ptrdiff_t to unsigned long implicit conversion. This commit was SVN r14565.	2007-05-01 19:47:33 +00:00
Jelena Pjesivac-Grbovic	3eac49aa59	Adding flow control for leaf nodes in generalized reduce structure. This "feature" is disabled by default and it should not affect the current performance. In case when the message size is large and segment size is smaller than eager size for particular interface, the leaf nodes in generalized reduce function can overflood parent nodes by sending all segments without any synchronization. This can cause the parent to have HIGH number of unexpected messages (think 16MB message with 1KB segments for example). In case of binomial algorithm root node always has at least one child which is leaf, so this can potentially affect the root's performance significantly [Especially in large communicators where root may have quite a few children (binomial tree for example)]. When the segment size is bigger than the eager size, rendezvous protocol ensures that this does not happen so it is not necessary. Originally, the problem was exposed in "infinite" bucket allocator clean up time for "small" segment sizes (which may explain some "deadlocks" on Thunderbird tests). To prevent this, we allow user to specify mca parameter "--mca coll_tuned_reduce_algorithm_max_requests NUM" this limits number of outstanding messages from a leaf node in generalized reduce to the parent to NUM. Messages are sent as non-blocking synchrnous messages, so syncronization happens at "wait" time. The synchronization actually improved performance of pipeline and binomial algorithm for large message sizes with 1KB segments over MX, but I need to test it some more to make sure it is consistent. Since there is no easy way to find out what is "the eager" size for particular btl, I set the limit to 4000B. If message/individual segment size is greater than 4000B - we will not use this feature. This variable may or may not be exposed as mca parameter later... I did not have any problems running it and both "default" and "synchronous" tests passed Intel Reduce* tests up to 80 processes (over MX). This commit was SVN r14518.	2007-04-25 20:39:53 +00:00
Jelena Pjesivac-Grbovic	53cbec7a09	Make coll/tuned dynamic rules more verbose (when promted with --mca coll_base_verbose 1) This commit was SVN r14469.	2007-04-23 16:34:52 +00:00
Jeff Squyres	51f286d737	Just like r14289 on the ORTE trunk: Per discussions with Brian and Ralph, make a slight correction in where components are installed. Use $pkglibdir, not $libdir/openmpi, so that when compiled in the orte trunk, components are installed to the right directory (because the component search patch is checking $pkglibdir). This commit was SVN r14345. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r14289	2007-04-12 11:19:42 +00:00
George Bosilca	120cf76ad8	Remove some warnings. This commit was SVN r14196.	2007-04-02 19:11:06 +00:00
George Bosilca	cc65814969	And set the message size before the first use too. This commit was SVN r14159.	2007-03-28 18:01:13 +00:00
George Bosilca	b540545fa7	Set the communicator size before using it. This commit was SVN r14158.	2007-03-28 17:59:21 +00:00
Mohamad Chaarawi	bfaf9d4a12	Added new module for intercomm collectives. This will require an autogen. This commit was SVN r14149.	2007-03-27 02:06:42 +00:00
Jelena Pjesivac-Grbovic	d6402b6898	Adding in-order binary tree algorithm for non-commutative reduce operations. I tested algorithm with intel and ibm tests and it passed again - so it should work. This commit was SVN r14068.	2007-03-19 21:03:57 +00:00
Josh Hursey	dadca7da88	Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD). This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158	2007-03-16 23:11:45 +00:00
Rolf vandeVaart	42168575fd	Fix for the special case where np=2 and the sendbuf is set to MPI_IN_PLACE. In that case, sendcount and sendtype are not valid and we need to use recvcount and recvtype. This commit fixes trac:943. Reviewed by Jelena Pjesivac-Grbovic. This commit was SVN r14022. The following Trac tickets were found above: Ticket 943 --> https://svn.open-mpi.org/trac/ompi/ticket/943	2007-03-13 19:01:20 +00:00
Jelena Pjesivac-Grbovic	9780a000ba	Cleanup of generic reduce function and possible (low probability) bug fix. - fixing line lengths and some of the comments - possible bug fix (but I do not think we exposed it in any tests so far) temporary buffers were allocated as multiples of extent instead of true_extent + (count -1) * extent. Everything is still passing Intel tests over tcp and btl mx up to 64 nodes. This commit was SVN r13956.	2007-03-08 00:54:52 +00:00
Jelena Pjesivac-Grbovic	57cbafafd5	Clean up of generic broadcast function: removing unecessary statements and improving comments. This commit was SVN r13955.	2007-03-07 21:59:53 +00:00
Jelena Pjesivac-Grbovic	0c07654c30	Updating reduce_scatter decision function based on MX results up to 64 nodes and both 1ppn and 2ppn configurations. This commit was SVN r13945.	2007-03-07 00:38:33 +00:00
Jelena Pjesivac-Grbovic	e5ed167a6e	Adding tuned version of reduce_scatter implementation. Currently 3 algorithms are available: - non-overlapping, reduce + scatterv, (works for non-commutative operations) - recursive halving algorithm (copied from basic module) - ring algorithm (similar to allreduce ring, for large messages) This commit was SVN r13929.	2007-03-05 20:40:39 +00:00
Li-Ta Lo	196e2a86bb	addes binomial tree based scatter, passed IBM and intel tests This commit was SVN r13906.	2007-03-02 23:19:02 +00:00
Li-Ta Lo	11c94cbe76	eliminated the use of MPI_Get_count This commit was SVN r13904.	2007-03-02 22:57:50 +00:00
Li-Ta Lo	3765e19d15	added ASCII graph for the topologies This commit was SVN r13892.	2007-03-02 17:17:14 +00:00
Li-Ta Lo	bd75f2f162	change ALLGATHER to GATHER This commit was SVN r13891.	2007-03-02 17:02:29 +00:00
Li-Ta Lo	c5d8c221b0	added binomial tree based Gather alogrithm, passed IBM and Intel tests This commit was SVN r13835.	2007-02-28 01:11:01 +00:00
Jelena Pjesivac-Grbovic	627533fe4a	Adding segmented ring algorithm for Allreduce for commutative operations. Algorithm allows user to specify the segment size to be used for computation/communication overlap. The additional memory requirement for the algorithm is 2 x segment size. It performed well for (really) large message sizes over MX and it passed intel Allreduce_c and Allreduce_loc_c tests. This commit was SVN r13832.	2007-02-27 20:32:30 +00:00
George Bosilca	bec20422ee	Remove the warnings about printf data-type mismatch. This commit was SVN r13804.	2007-02-26 22:20:35 +00:00
Li-Ta Lo	c860bd1be5	fixed a typo in the comment This commit was SVN r13802.	2007-02-26 19:20:46 +00:00
Li-Ta Lo	73a73b1c78	added ASCII graph on reduce_log_intra This commit was SVN r13801.	2007-02-26 19:15:37 +00:00
Bill D'Amico	db1c2a58c4	Removed cruft - unused variables causing warnings during OMPI build. This commit was SVN r13772.	2007-02-23 18:55:41 +00:00
Tim Prins	f35f67ed1c	(very) minor correction to helpfile This commit was SVN r13758.	2007-02-22 16:02:12 +00:00

1 2 3 4 5 ...

283 Коммитов