openmpi

Author	SHA1	Message	Date
Gleb Natapov	30b2183314	Remove debug output from a hot path. This commit was SVN r15478.	2007-07-18 12:48:34 +00:00
Sven Stork	2ab401dc3c	- export required symbols used by OSC This commit was SVN r15476.	2007-07-18 11:51:52 +00:00
Jeff Squyres	3bc940ac27	Fix three things from r15474 (thanks to Brian for noticing): * bml.h had a change that introduced a variable named "_order" to avoid a conflict with a local variable. The namespace starting with _ belongs to the os/compiler/kernel/not us. So we can't start symbols with _. So I replaced it with arg_order, and also updated the threaded equivalent of the macro that was modified. * in btl_openib_proc.c, one opal_output accidentally had its string reverted from "ompi_modex_recv..." to "mca_pml_base_modex_recv....". This was fixed. * The change to ompi/runtime/ompi_preconnect.c was entirely reverted; it was an artifact of debugging. This commit was SVN r15475. The following SVN revision numbers were found above: r15474 --> open-mpi/ompi@8ace07efed	2007-07-18 11:38:06 +00:00
Jeff Squyres	8ace07efed	This commit brings in two major things: 1. Galen's fine-grain control of queue pair resources in the openib BTL. 2. Pasha's new implementation of asynchronous HCA event handling. Pasha's new implementation doesn't take much explanation, but the new "multifrag" stuff does. Note that "svn merge" was not used to bring this new code from the /tmp/ib_multifrag branch -- something Bad happened in the periodic trunk pulls on that branch making an actual merge back to the trunk effectively impossible (i.e., lots and lots of arbitrary conflicts and artificial changes). :-( == Fine-grain control of queue pair resources == Galen's fine-grain control of queue pair resources to the OpenIB BTL (thanks to Gleb for fixing broken code and providing additional functionality, Pasha for finding broken code, and Jeff for doing all the svn work and regression testing). Prior to this commit, the OpenIB BTL created two queue pairs: one for eager size fragments and one for max send size fragments. When the use of the shared receive queue (SRQ) was specified (via "-mca btl_openib_use_srq 1"), these QPs would use a shared receive queue for receive buffers instead of the default per-peer (PP) receive queues and buffers. One consequence of this design is that receive buffer utilization (the size of the data received as a percentage of the receive buffer used for the data) was quite poor for a number of applications. The new design allows multiple QPs to be specified at runtime. Each QP can be set up to use PP or SRQ receive buffers as well as giving fine-grained control over receive buffer size, number of receive buffers to post, when to replenish the receive queue (low water mark) and for SRQ QPs, the number of outstanding sends can also be specified.
The following is an example of the syntax to describe QPs to the OpenIB BTL using the new MCA parameter btl_openib_receive_queues: {{{ -mca btl_openib_receive_queues \ "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32" }}} Each QP description is delimited by ";" (semicolon) with individual fields of the QP description delimited by "," (comma). The above example therefore describes 4 QPs. The first QP is: P,128,16,4 Meaning: per-peer receive buffer QPs are indicated by a starting field of "P"; the first QP (shown above) is therefore a per-peer based QP. The second field indicates the size of the receive buffer in bytes (128 bytes). The third field indicates the number of receive buffers to allocate to the QP (16). The fourth field indicates the low watermark for receive buffers at which time the BTL will repost receive buffers to the QP (4). The second QP is: S,1024,256,128,32 Shared receive queue based QPs are indicated by a starting field of "S"; the second QP (shown above) is therefore a shared receive queue based QP. The second, third and fourth fields are the same as in the per-peer based QP. The fifth field is the number of outstanding sends that are allowed at a given time on the QP (32). This provides a "good enough" mechanism of flow control for some regular communication patterns. QPs MUST be specified in ascending receive buffer size order. This requirement may be removed prior to 1.3 release. This commit was SVN r15474.	2007-07-18 01:15:59 +00:00
George Bosilca	59ee366728	Remove a compilation warning. This commit was SVN r15472.	2007-07-17 22:32:59 +00:00
Rich Graham	f2a30cde5d	add table of send completion callback functions, on a per send-type basis. This commit was SVN r15471.	2007-07-17 21:26:56 +00:00
Rich Graham	0991c3d5f5	move buffered send component clean up out of the pml to ompi_mpi_finalize. This commit was SVN r15463.	2007-07-17 14:50:52 +00:00
George Bosilca	e782da00e0	Don't allow the same communicator to be used in a multi-threaded build by several threads to create new communicators. There is nothing in the standard about threading and communicator functions, but as they include collective communications I expect the same rules have to be applied. As such, on an incorrect MPI program we deadlock (!). This commit was SVN r15456.	2007-07-17 00:33:27 +00:00
Rich Graham	de5670cd79	add missing header file - Thanks Brian. This commit was SVN r15455.	2007-07-17 00:06:35 +00:00
Rich Graham	1a4ce2a961	move setting of the component used to manage buffered sends out of the pmls, and into ompi_mpi_init. This is the first of several steps to pull buffered send management out of the pmls. This commit was SVN r15451.	2007-07-16 21:52:25 +00:00
Brian Barrett	6a1f876e98	Don't inline this function so that we can access the predefined datatype array even when visibility is turned on This commit was SVN r15444.	2007-07-16 16:29:51 +00:00
Sven Stork	804f3bee41	- export symbols that are required for the fortran bindings This commit was SVN r15439.	2007-07-16 13:23:57 +00:00
George Bosilca	c839694fb8	Don't print anything when the user requested a specific MX interface. This commit was SVN r15426.	2007-07-14 00:04:50 +00:00
George Bosilca	1e825888a5	Fix the problem reported in #1087. The global send and receive request queues are now released in the base close, so there is no need for the cm PML to destroy them. This commit was SVN r15425.	2007-07-13 23:56:09 +00:00
Brian Barrett	c9ad5d1f24	ooops, need to handle case where extents are not same as type sizes This commit was SVN r15423.	2007-07-13 21:26:12 +00:00
Jelena Pjesivac-Grbovic	1b66a52c50	Modifying type of binomial tree used for binomial reduce: switching:
        0                 0
      / \ \             / \ \
     1   \ \    -->    4   \ \
    /     \ \         /     \ \
   3       2 \       3       2 \
              4                 1
(duh). The first form is the bmtree suitable for bcast, but the latter is better for reduce. Updating default decision function accordingly. This commit was SVN r15422.	2007-07-13 21:07:51 +00:00
Pak Lui	685dd6f47b	Fixed the mpool sm size specification problem at large -np, which was caused by a variable overflowing. Added a verbose MCA param for showing the actual size of the mpool sm allocation. See trac #1083 for details. This commit was SVN r15419.	2007-07-13 20:49:30 +00:00
Brian Barrett	7a9a8c7e17	Support reduction operations other than MPI_REPLACE for user-defined datatypes with MPI_ACCUMULATE This commit was SVN r15418.	2007-07-13 20:46:12 +00:00
Galen Shipman	06b97cb267	fix template btl This commit was SVN r15413.	2007-07-13 20:06:22 +00:00
Josh Hursey	d4d5a351c1	Silence a compiler warning when not using IPV6. Also convert a few statements to conform to coding standard for Open MPI. This commit was SVN r15407.	2007-07-13 16:38:36 +00:00
Josh Hursey	021249fa65	Use the new MCA metadata flag instead of 'false' for the newly added components This commit was SVN r15400.	2007-07-13 14:39:17 +00:00
George Bosilca	b9db0a4c2d	Remove a warning: ompi-trunk/ompi/runtime/ompi_mpi_init.c:221: warning: `cmd_buffer' might be used uninitialized in this function This commit was SVN r15397.	2007-07-13 06:20:44 +00:00
George Bosilca	725f776bb2	This patch was originally proposed by Brian, I just did some small optimizations. It solves the problem with the MPI_Aint alignment that showed up on Solaris Sparc and on heterogeneous environments when dealing with the data-type description. The solution is to move the displacement array from the packed array if we detect that the local architecture requires MPI_Aint to be aligned to an MPI_Aint boundary (which is not the case for x86 architectures if MPI_Aint is a 64-bit type). This commit was SVN r15395.	2007-07-13 05:45:02 +00:00
Brian Barrett	d4950c6aa1	Allow an arbitrary list of procs to be passed to the resolve function, instead of just the procs for MCW (in MCW order). Should make resolving ptl_process_id_t structures for arbitrary communicators easier for applications that need it. This commit was SVN r15393.	2007-07-12 20:55:44 +00:00
Ralph Castain	bd65f8ba88	Bring in an updated launch system for the orteds. This commit restores the ability to execute singletons and singleton comm_spawn, both in single node and multi-node environments. Short description: major changes include - 1. singletons now fork/exec a local daemon to manage their operations. 2. the orte daemon code now resides in libopen-rte 3. daemons no longer use the orte triggering system during startup. Instead, they directly call back to their parent pls component to report ready to operate. A base function to count the callbacks has been provided. I have modified all the pls components except xcpu and poe (don't understand either well enough to do it). Full functionality has been verified for rsh, SLURM, and TM systems. Compile has been verified for xgrid and gridengine. This commit was SVN r15390.	2007-07-12 19:53:18 +00:00
Jeff Squyres	cdb56d65e2	Due to the recent changes to reduce memory footprint, we need to set an override here in ompi_info to force the loading of all components. This is ok because we only call opal_init_util() (not orte_init() or ompi_mpi_init()). This commit was SVN r15386.	2007-07-12 15:23:49 +00:00
George Bosilca	752909c628	These are supposed to have a high probability of success. This commit was SVN r15377.	2007-07-11 23:02:47 +00:00
George Bosilca	8643f38adf	Don't allow the BTL to be closed before the end of the process. Count the number of times the BTLs are opened, and then don't remove them until close was called the same number of times. This commit was SVN r15376.	2007-07-11 22:21:04 +00:00
Brian Barrett	1f2942cf2a	* Provide flag if the BTL can do RDMA, but requires a prepare_{src,dst} that exactly describes the buffer to be used as the target of the operation * Use the above flag to disable components setting the flag from being used for real RDMA operations for the one-sided component (the BTLs will still be used for RDMA transfers for the PML and for send/receive communication for the OSC component) This commit was SVN r15375.	2007-07-11 21:21:40 +00:00
Brian Barrett	739fed9dc9	Don't poke at internal structure fields of communicators or groups, but instead use accessor functions This commit was SVN r15366.	2007-07-11 17:16:06 +00:00
Brian Barrett	82c8d224d6	add interface for getting an ompi_proc_t from a group, similar to the ompi_comm_peer_lookup function for communicators This commit was SVN r15365.	2007-07-11 17:15:28 +00:00
Brian Barrett	cb2bc19f07	add accessor function for getting ompi_communicator_t* -> cid mapping, since we already have a function for getting cid -> ompi_communicator_t* mapping This commit was SVN r15364.	2007-07-11 17:14:57 +00:00
Jeff Squyres	8aa8a667da	Use the OMPI version number for the component number, like all other btl components. This commit was SVN r15363.	2007-07-11 15:45:25 +00:00
Donald Kerr	88c9dfdf9f	improve message to user when dat_ia_open fails This commit was SVN r15362.	2007-07-11 15:20:35 +00:00
George Bosilca	9ed3ede73e	Correct the thin and heavy requests management for the CM PML. This commit was SVN r15361.	2007-07-11 15:10:01 +00:00
George Bosilca	ef7d17d814	Fix a copy&paste typo. This commit was SVN r15360.	2007-07-11 15:09:06 +00:00
George Bosilca	9b501eb66d	Looks like MAX is not a standard macro. Anyway, that the heavy requests are larger than the thin ones seems to be a "correct" assumption. This commit was SVN r15348.	2007-07-11 00:04:33 +00:00
George Bosilca	e19777e910	A more consistent version. As we now share the send and receive queue, we have to construct/destruct only once. Therefore, the construction will happen before digging for a PML, while the destruction happens just before finalizing the component. Add some OPAL_LIKELY/OPAL_UNLIKELY. This commit was SVN r15347.	2007-07-10 23:45:23 +00:00
George Bosilca	433f8a7694	This patch brings full support for message queues in Open MPI. Now the send and receive queues are shared among all PMLs, they are declared in the base PML, and the selected PML is in charge of initializing and releasing them. The CM PML is slightly different compared with OB1 or DR. Internally it uses 2 different types of requests: light and heavy. However, now with this patch both types of requests are stored in the same queue, and cast appropriately on the allocation macro. This means we might use less memory than we allocate, but in exchange we got full support for most of the parallel debuggers. Another thing with this patch, is that now for all PMLs (CM included) the basic PML requests start with the same fields, and they are declared in the same order in the request structure. Moreover, the fields have been moved in such a way that only one volatile/atomic will exist per line of cache (hopefully). This commit was SVN r15346.	2007-07-10 22:16:38 +00:00
Andrew Friedley	87dd4bbd47	No idea how I did this.. thanks again to Jeff. This commit was SVN r15345.	2007-07-10 20:37:42 +00:00
Christian Bell	5ae68f82b2	fix gcc 3.x compilation warnings This commit was SVN r15327.	2007-07-10 13:54:34 +00:00
Tim Prins	5b815ec94b	fix deadlock in new modex code This commit was SVN r15326.	2007-07-10 13:28:44 +00:00
Brian Barrett	1d02b9e7b5	Fix a bunch of issues exposed by Ken Cain in getting Open MPI to work with VxWorks. Still some issues remaining, I'm sure. Refs trac:1010 This commit was SVN r15320. The following Trac tickets were found above: Ticket 1010 --> https://svn.open-mpi.org/trac/ompi/ticket/1010	2007-07-10 03:46:57 +00:00
George Bosilca	1200fa4ac5	The first version of the Elan BTL. This commit was SVN r15319.	2007-07-09 21:03:13 +00:00
Jeff Squyres	cee9c214c7	Update the vendor ID list to include HP (0x1708). Thanks to Peter Kjellstrom for pointing this out. This commit was SVN r15316.	2007-07-09 20:09:31 +00:00
Brian Barrett	8b9e8054fd	Move modex from pml base to general ompi runtime, since it's used by more than just the PML/BTLs these days. Also clean up the code so that it handles the situation where not all nodes register information for a given node (rather than just spinning until that node sends information, like we do today). Includes r15234 and r15265 from the /tmp/bwb-modex branch. This commit was SVN r15310. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r15234 r15265	2007-07-09 17:16:34 +00:00
Andrew Friedley	b212cf4dae	Fix a signedness warning reported by Jeff/MTT. This commit was SVN r15309.	2007-07-09 15:30:29 +00:00
Tim Prins	f3ac4ac20e	Fix order of function arguments This commit was SVN r15304.	2007-07-08 16:37:51 +00:00
Gleb Natapov	88f4018543	Don't fail MPI_Alloc_mem() when no more memory can be registered. This commit was SVN r15303.	2007-07-08 11:44:58 +00:00
George Bosilca	11ff1b2c20	Add a few OPAL_LIKELY/OPAL_UNLIKELY to the datatype engine. This commit was SVN r15302.	2007-07-07 04:31:06 +00:00
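
The btl_openib_receive_queues syntax described in the message for r15474 (8ace07efed) can be illustrated with a small parser. This is only a sketch of the format as that commit message describes it, not the openib BTL's actual parsing code: the names QueuePairSpec and parse_receive_queues are hypothetical, and treating "ascending" as non-decreasing buffer size is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional

# Field counts per QP type, as described in the r15474 commit message:
# "P" (per-peer) has 4 fields, "S" (shared receive queue) has 5.
EXPECTED_FIELDS = {"P": 4, "S": 5}


@dataclass
class QueuePairSpec:
    """Hypothetical container for one QP description."""
    kind: str                                # "P" or "S"
    buffer_size: int                         # receive buffer size in bytes
    num_buffers: int                         # receive buffers to allocate
    low_watermark: int                       # repost buffers at this level
    max_pending_sends: Optional[int] = None  # "S" QPs only


def parse_receive_queues(spec: str) -> List[QueuePairSpec]:
    """Parse a ';'-delimited list of ','-delimited QP descriptions."""
    qps = []
    for entry in spec.split(";"):
        fields = [f.strip() for f in entry.split(",")]
        kind = fields[0]
        if kind not in EXPECTED_FIELDS:
            raise ValueError(f"unknown QP type {kind!r} in {entry!r}")
        if len(fields) != EXPECTED_FIELDS[kind]:
            raise ValueError(f"wrong number of fields in {entry!r}")
        # int() raises ValueError for non-numeric fields.
        qps.append(QueuePairSpec(kind, *(int(f) for f in fields[1:])))

    # The commit message requires ascending receive buffer size order;
    # equal sizes are accepted here, which is an assumption.
    sizes = [qp.buffer_size for qp in qps]
    if sizes != sorted(sizes):
        raise ValueError("QPs must be in ascending receive buffer size order")
    return qps


if __name__ == "__main__":
    example = "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32"
    for qp in parse_receive_queues(example):
        print(qp)
```

Run as a script, it prints the four QP descriptions from the example in the commit message: one per-peer QP followed by three shared receive queue QPs.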

3017 commits