openmpi

Автор	SHA1	Сообщение	Дата
Brian Barrett	21e00f6f0c	Clean up a couple of configure things: * Require Autoconf 2.60 or higher and remove some cruft required for AC 2.59 or the AC 2.59 / AC 2.60 mix * Remove a bunch of now unnecessary AC_SUBST calls * Use the libtool-provided variables for the -I and library to use when compiling against ltdl Fixes trac:1000 This commit was SVN r14652. The following Trac tickets were found above: Ticket 1000 --> https://svn.open-mpi.org/trac/ompi/ticket/1000	2007-05-15 04:23:48 +00:00
Jeff Squyres	92090967b1	Add definitions for Hemon/ConnectX Mellanox HCA This commit was SVN r14639.	2007-05-10 12:27:51 +00:00
Donald Kerr	436d370d51	latency improvements: use ompi_free_list_init_ex, create optimal alignment parameter, remove rdma guarantee path, replace dat_lmt_sync_rdma with use of volatile This commit was SVN r14634.	2007-05-09 19:41:25 +00:00
Gleb Natapov	2562253678	Do more work at RDMA frag preparation time and less work at RDMA frag sending time. This commit was SVN r14627.	2007-05-09 12:11:51 +00:00
Gleb Natapov	78fda79630	Use size_t instead of uint64_t in call to convertor cloning. This commit was SVN r14626.	2007-05-09 10:02:06 +00:00
Pavel Shamis	e2d0e27111	Adding: * openib_finalize flow for openib btl * async event handler for openib btl This commit was SVN r14623.	2007-05-08 21:47:21 +00:00
Terry Dontje	f864348f97	Put an ifdef to conditionalize the use of memcpy for sparcv9 platforms to avoid alignmment issues. This commit fixes trac:1009. This commit was SVN r14608. The following Trac tickets were found above: Ticket 1009 --> https://svn.open-mpi.org/trac/ompi/ticket/1009	2007-05-08 17:17:34 +00:00
Jeff Squyres	ecf5a3b8dd	Fix compiler warning This commit was SVN r14604.	2007-05-08 13:12:50 +00:00
Sven Stork	d0c936ca85	- export required symbols or library is useless This commit was SVN r14595.	2007-05-07 13:59:37 +00:00
Sven Stork	32564d73da	- the define is always defined independent, therefore we need to check if it's 1 or not This commit was SVN r14594.	2007-05-07 13:18:09 +00:00
Jeff Squyres	86f48bf0d7	Fix ordering of dimensions for status arrays. Thanks to Randy Bramley for noticing the problem. This commit was SVN r14591.	2007-05-05 05:02:02 +00:00
Sven Stork	a04c8eb39a	- Bring over the visibility feature, for a finer symbol export control via the visibility feature that is provided by some compilers. Per default this feature is disabled, to enable it you need to configure with --enable-visibility and obviously you need a compiler with visibility support. Please refer to the wiki for more information. https://svn.open-mpi.org/trac/ompi/wiki/Visibility This commit was SVN r14582.	2007-05-04 09:03:37 +00:00
Jelena Pjesivac-Grbovic	625c6739ab	Removing warning about unsed variable This commit was SVN r14579.	2007-05-03 20:26:41 +00:00
Gleb Natapov	8029893489	In multithreaded application sending of initial portion of a request may overlap with RDMAing the rest of it. Also more than one RDMA writes can be performed simultaneously by different threads. To make this code thread safe this patch clones original request convertor for each RDMA fragment. This commit was SVN r14574.	2007-05-03 09:13:17 +00:00
Jelena Pjesivac-Grbovic	9eff74ad4d	Modifying generalized reduce "synchronized" behavior: - Removing "small" message size limit because it really does not relate to the eager size accross the board. Now, the leaf nodes in generalized reduce will use blocking send (DEFAULT/ORIGINAL BEHAVIOR) either when the maximum number of outstanding requests is 0 or when the total number of segments is less than the maximum number of outstanding requests. Otherwise, it will send messages using non-blocking synchronized send operation. This commit was SVN r14572.	2007-05-02 21:42:45 +00:00
George Bosilca	69642a9cd4	Remove 2 warnings about ptrdiff_t to unsigned long implicit conversion. This commit was SVN r14565.	2007-05-01 19:47:33 +00:00
Brian Barrett	a25ce44dc1	Clean up the preconnect code: * Don't need the 2 process case -- we'll send an extra message, but at very little cost and less code is better. * Use COMPLETE sends instead of STANDARD sends so that the connection is fully established before we move on to the next connection. The previous code was still causing minor connection flooding for huge numbers of processes. * mpi_preconnect_all now connects both OOB and MPI layers. There's also mpi_preconnect_mpi and mpi_preconnect_oob should you want to be more specific. * Since we're only using the MCA parameters once at the beginning of time, no need for global constants. Just do the quick param lookup right before the parameter is needed. Save some of that global variable space for the next guy. Fixes trac:963 This commit was SVN r14553. The following Trac tickets were found above: Ticket 963 --> https://svn.open-mpi.org/trac/ompi/ticket/963	2007-05-01 04:49:36 +00:00
Adrian Knoth	d63d125a88	I guess we only need this when IPv6 is enabled. This commit was SVN r14551.	2007-04-29 16:38:34 +00:00
Adrian Knoth	5765ecc22e	This patch reverts r14549 while retaining IPv6 support. Re #1008 This commit was SVN r14550. The following SVN revision numbers were found above: r14549 --> open-mpi/ompi@386baed55b	2007-04-29 16:23:11 +00:00
Adrian Knoth	386baed55b	Hotfix for IPv6 support. Closes trac:1008 This commit was SVN r14549. The following Trac tickets were found above: Ticket 1008 --> https://svn.open-mpi.org/trac/ompi/ticket/1008	2007-04-29 13:46:45 +00:00
George Bosilca	bb481273a6	Typos. This commit was SVN r14546.	2007-04-28 19:15:53 +00:00
George Bosilca	46265db0a9	Update the TCP BTL in order to bring back some of the functionalities lost during the IPv6 patch. The most important is the multi BTL support. There was a quite interesting bug. Instead of setting up the multiple connections over different physical devices, based on the time when these connections were created most of the time they were all using the same physical network. Which, of course, was not the intended goal, as we top at the maximum bandwidth available over one device instead of gathering all available bandwidth from all devices. Second, the IPv6 RFC suggest to use sockaddr_storage as a holder for the IP information, but use a sockaddr* when we pass it to functions. This is only partially corrected by this patch. Some other minor cleanups. This commit was SVN r14544.	2007-04-28 19:13:47 +00:00
Josh Hursey	4c453caab6	Make the check a bit better This commit was SVN r14542.	2007-04-27 17:38:36 +00:00
Josh Hursey	486f29eb6b	Make sure to use the new metadata flags This commit was SVN r14541.	2007-04-27 17:18:26 +00:00
Sven Stork	8d92773067	- export required symbol This commit was SVN r14536.	2007-04-27 11:38:45 +00:00
Rainer Keller	63b904ed1d	- Don't segfault, when calling PERUSE_Init before MPI_Init... This commit was SVN r14535.	2007-04-26 21:06:08 +00:00
Rainer Keller	1aceece03f	- Add a few comments for elements for structs, a few spelling fixes. No functional change. This commit was SVN r14534.	2007-04-26 21:03:38 +00:00
Rainer Keller	ce32b918da	- Fixes for for unlocking the mutex in case of error in functions mca_btl_openib_post_srr and btl_openib_endpoint_post_rr This commit was SVN r14530.	2007-04-26 13:33:02 +00:00
Sven Stork	fe3b08004e	- export symbols that are needed by the fortran libs This commit was SVN r14527.	2007-04-26 09:34:41 +00:00
Rainer Keller	6f9251ed39	- Small fixes by PGI -Minform=inform This commit was SVN r14524.	2007-04-26 08:16:07 +00:00
Josh Hursey	af38efd27c	Use more of the datatype engine supplied functions This commit was SVN r14519.	2007-04-26 00:06:22 +00:00
Jelena Pjesivac-Grbovic	3eac49aa59	Adding flow control for leaf nodes in generalized reduce structure. This "feature" is disabled by default and it should not affect the current performance. In case when the message size is large and segment size is smaller than eager size for particular interface, the leaf nodes in generalized reduce function can overflood parent nodes by sending all segments without any synchronization. This can cause the parent to have HIGH number of unexpected messages (think 16MB message with 1KB segments for example). In case of binomial algorithm root node always has at least one child which is leaf, so this can potentially affect the root's performance significantly [Especially in large communicators where root may have quite a few children (binomial tree for example)]. When the segment size is bigger than the eager size, rendezvous protocol ensures that this does not happen so it is not necessary. Originally, the problem was exposed in "infinite" bucket allocator clean up time for "small" segment sizes (which may explain some "deadlocks" on Thunderbird tests). To prevent this, we allow user to specify mca parameter "--mca coll_tuned_reduce_algorithm_max_requests NUM" this limits number of outstanding messages from a leaf node in generalized reduce to the parent to NUM. Messages are sent as non-blocking synchrnous messages, so syncronization happens at "wait" time. The synchronization actually improved performance of pipeline and binomial algorithm for large message sizes with 1KB segments over MX, but I need to test it some more to make sure it is consistent. Since there is no easy way to find out what is "the eager" size for particular btl, I set the limit to 4000B. If message/individual segment size is greater than 4000B - we will not use this feature. This variable may or may not be exposed as mca parameter later... I did not have any problems running it and both "default" and "synchronous" tests passed Intel Reduce* tests up to 80 processes (over MX). This commit was SVN r14518.	2007-04-25 20:39:53 +00:00
Adrian Knoth	e3d35258b4	Cosmetics. Brian fixes my crappy code and I fix the curly braces. That's teamwork, right? ;) This commit was SVN r14517.	2007-04-25 20:17:19 +00:00
Brian Barrett	4b8bb70afb	A couple cleanups for the IPv6 support: - make opal_sockaddr2str() take a sockaddr_storage instead of a sockaddr_in6 so that it works for IPv4 and IPv6 addresses, and remove a whole bunch of #ifs in the OOOB code. - Fix a compiler warning in the TCP BTL due to run-time determined array size by making it a dynamicly allocated array. - Fix the unpacking code of IPv4 addresses when using IPv6 support, so that the address is in the correct location (instead of in an IPv6 structure, use an IPv4 structure). Refs trac:1005. This commit was SVN r14514. The following Trac tickets were found above: Ticket 1005 --> https://svn.open-mpi.org/trac/ompi/ticket/1005	2007-04-25 19:08:07 +00:00
Adrian Knoth	d1ce39de4f	Move mca_btl_tcp_addr_isipv4public to opal_addr_isipv4public This commit was SVN r14512.	2007-04-25 18:06:06 +00:00
Donald Kerr	80d984441f	change so that we only check connection queue when expecting a connection; create a mca parameter that controls frequency at which the async queue is checked This commit was SVN r14511.	2007-04-25 17:46:25 +00:00
Ralph Castain	7d0f51e6b9	Begin setting up for a change to the OOB information passing functionality - this is totally transparent at the moment (need to change computers). This commit was SVN r14510.	2007-04-25 17:36:26 +00:00
Jeff Squyres	c4c68e666a	Merge in the ipv6 work from /tmp/ipv6-merge. This commit was SVN r14503.	2007-04-25 01:55:40 +00:00
Donald Kerr	cae24fcde1	move mca parameter registration into own .c and .h files This commit was SVN r14493.	2007-04-24 18:34:16 +00:00
Josh Hursey	8c2385416f	Per a developer request - Make sure that the wrapper selection is compiled out if not enabling FT. Before the logic would skip over it since the conditional if statements would not be satisfied, now there are no additional if statements when compiled out. With this modification the selection logic looks nearly identical to pre-r14051 with the exception of the non-FT related improvements. This commit was SVN r14491. The following SVN revision numbers were found above: r14051 --> open-mpi/ompi@dadca7da88	2007-04-24 17:08:48 +00:00
Ralph Castain	18b2dca51c	Bring in the code for routing xcast stage gate messages via the local orteds. This code is inactive unless you specifically request it via an mca param oob_xcast_mode (can be set to "linear" or "direct"). Direct mode is the old standard method where we send messages directly to each MPI process. Linear mode sends the xcast message via the orteds, with the HNP sending the message to each orted directly. There is a binomial algorithm in the code (i.e., the HNP would send to a subset of the orteds, which then relay it on according to the typical log-2 algo), but that has a bug in it so the code won't let you select it even if you tried (and the mca param doesn't show, so you'd really have to try). This also involved a slight change to the oob.xcast API, so propagated that as required. Note: this has only been tested on rsh, SLURM, and Bproc environments (now that it has been transferred to the OMPI trunk, I'll need to re-test it [only done rsh so far]). It should work fine on any environment that uses the ORTE daemons - anywhere else, you are on your own... :-) Also, correct a mistake where the orte_debug_flag was declared an int, but the mca param was set as a bool. Move the storage for that flag to the orte/runtime/params.c and orte/runtime/params.h files appropriately. This commit was SVN r14475.	2007-04-23 18:41:04 +00:00
Donald Kerr	3f428af7b8	couple of minor changes to fix #973 and seperated eager rdma fragments into structure only and data only area This commit was SVN r14470.	2007-04-23 17:41:34 +00:00
Jelena Pjesivac-Grbovic	53cbec7a09	Make coll/tuned dynamic rules more verbose (when promted with --mca coll_base_verbose 1) This commit was SVN r14469.	2007-04-23 16:34:52 +00:00
Rich Graham	ce35761683	make sure not to go out of bounds. element i+1 of bml_btls is referenced, which for i-arr_size-1 is beyond the array dimentions. This commit was SVN r14464.	2007-04-22 21:43:34 +00:00
Sharon Melamed	cf3f41288b	Add pkey value MCA parameter. if this param is used, only ports with the actual pkey value will be initiate. This commit was SVN r14463.	2007-04-22 10:22:12 +00:00
Adrian Knoth	339dbf6cd5	Cosmetics. Enforcing style guide. This commit was SVN r14459.	2007-04-21 21:47:25 +00:00
Josh Hursey	4159b72a60	Some minor updates to go along with commit r14457 This commit was SVN r14458. The following SVN revision numbers were found above: r14457 --> open-mpi/ompi@2af38229c1	2007-04-21 21:24:44 +00:00
Josh Hursey	2af38229c1	Re-worked the implementation of the LAM-like coord component. It's a bit longer, but much more clear in it's implementation I believe. Fundamentally it is the same, but is much more solid in the implementation. I created quite a few directed tests that this version of the implementation now passes. This commit was SVN r14457.	2007-04-21 20:35:01 +00:00
Jeff Squyres	401a072888	Revert r14435 -- it breaks compiling on Linux with at least the PGI compiler suite. The rule is that ompi_info.h is supposed to be the ''first'' file included so that it can affect system header files if necessary (and it is sometimes necessary, such as with the PGI compiler suite). If this breaks VC on Windows, we'll have to find another fix. More on the mailing list... This commit was SVN r14453. The following SVN revision numbers were found above: r14435 --> open-mpi/ompi@a20b43ace9	2007-04-21 00:51:31 +00:00
Jeff Squyres	5bebd24250	Bring over Brian's installdirs fixes from this afternoon (r14445). This commit was SVN r14450. The following SVN revision numbers were found above: r14445 --> open-mpi/ompi@13d366b827	2007-04-21 00:16:31 +00:00

1 2 3 4 5 ...

2590 Коммитов