openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	915d748d72	Initialize the convertor on _START not on _INIT. This allow us to set it up before the match when we know the peer, saving some time on the critical path. If the receive is ANY_SOURCE then we initialize the convertor on _MATCHED. Anyway, we will set it up only once per receive. This commit was SVN r12484.	2006-11-08 05:42:29 +00:00
George Bosilca	eb45a5e402	Move things around a little bit. Mainly fields from the send and receive request in the base request. Rearrange the fields to keep the data together. Remove some useless tests. This commit was SVN r12482.	2006-11-08 04:58:23 +00:00
George Bosilca	63462331c9	Reduce the number of branches. Keep the fast path as short as possible. Remove some useless error checking. Add OPAL_UNLIKELY directives. This commit was SVN r12477.	2006-11-07 23:59:32 +00:00
George Bosilca	f3de2e1a82	Keep the fast path as short as possible. This commit was SVN r12476.	2006-11-07 23:56:32 +00:00
Jeff Squyres	427c20af0d	Use a new algorithm for allgatherv. The old algorithm essentially did N gatherv's: for (i = 0 ... size) MPI_Gatherv(..., root = i, ...) The new algorithm simply does (effectively): MPI_Gatherv(..., root = 0, ...) MPI_Bcast(..., root = 0, ...) This commit was SVN r12469.	2006-11-07 18:07:55 +00:00
Galen Shipman	55db17b37c	don't try to use a dead btl.. This commit was SVN r12456.	2006-11-06 23:25:24 +00:00
George Bosilca	108ea4dbe9	When the MX MTL complete a request, force a return from the progress function. Decrease the latency by about 0.3 microseconds. This commit was SVN r12454.	2006-11-06 23:13:07 +00:00
George Bosilca	3d0df2cf29	Allow the MX BTL to finish the small sends quicker. Once the mx_isend is posted if the message size is less than 4K do a check for the message completion and if any call the callback. This commit was SVN r12453.	2006-11-06 23:12:01 +00:00
Galen Shipman	eef37430a7	failing already failed for ACK timeout.. This commit was SVN r12452.	2006-11-06 22:09:39 +00:00
Galen Shipman	813e7faea8	more fixes for failover.. and yet still more to come.. This commit was SVN r12450.	2006-11-06 21:27:17 +00:00
Gleb Natapov	b4fd2d7d50	Fix warnings from progress thread patch. This commit was SVN r12434.	2006-11-06 12:34:56 +00:00
Gleb Natapov	82f7c0dd69	Fix regression from v1.1. 1) make the code do what comment says 2) if memory is prepinned don't send multiple PUT messages. This commit was SVN r12433.	2006-11-06 12:00:17 +00:00
Galen Shipman	f7c554df65	Try to failover when we get an async error from the lower layer (BTL).. This commit was SVN r12420.	2006-11-03 15:40:26 +00:00
George Bosilca	8529238d93	Add 2 more algorithms to the dynamic list. This commit was SVN r12415.	2006-11-02 19:19:08 +00:00
George Bosilca	110d07b7d3	Small optimization or zero length messages. This commit was SVN r12414.	2006-11-02 19:10:28 +00:00
Pavel Shamis	566667ac61	Adding progress thread support to OpenIB BTL. Reviewed by Gleb. This commit was SVN r12411.	2006-11-02 16:15:21 +00:00
George Bosilca	dbec514b0f	Optimize the generation of the match_bits and the mask. This commit was SVN r12396.	2006-11-01 23:19:20 +00:00
Gleb Natapov	4c784b6403	As Andrew Friedley pointed, my previous patch may cause deadlock if mca_btl_openib_endpoint_connect_eager_rdma() is called recursively. He also noticed that orte_pointer_array_add() can't fail because we allocate max number of elements at init time. So just remove error handling and locking. No locking - no deadlocks. This commit was SVN r12388.	2006-11-01 15:53:33 +00:00
Gleb Natapov	3bf31fe4a3	Correctly determine the first element on the list. opal_list_get_prev() never returns NULL, it should be compared with opal_list_get_end() instead. This commit was SVN r12387.	2006-11-01 13:44:47 +00:00
Gleb Natapov	b5714d698a	Fix compilation with GM version smaller than 2.0. Fix compilation warnings. This commit was SVN r12386.	2006-11-01 10:26:15 +00:00
Gleb Natapov	aac695a51f	eager_rdma_buffers update is not atomic. A buffer is added to the array and if something is going wrong down in the code it is removed from the array. So add mutex to prevent concurrent access to the array from different threads. This commit was SVN r12385.	2006-11-01 07:27:32 +00:00
Andrew Friedley	48c5117476	Fix some signedness warnings on threaded builds introduced by r12369 This commit was SVN r12376. The following SVN revision numbers were found above: r12369 --> open-mpi/ompi@d7375ec102	2006-10-31 17:29:25 +00:00
Gleb Natapov	d7375ec102	Fix deadlock reported by Andrew Friedley: What's happening is that we're holding openib_btl->eager_rdma_lock when we call mca_btl_openib_endpoint_send_eager_rdma() on btl_openib_endpoint.c:1227. This in turn calls mca_btl_openib_endpoint_send() on line 1179. Then, if the endpoint state isn't MCA_BTL_IB_CONNECTED or MCA_BTL_IB_FAILED, we call opal_progress(), where we eventually try to lock openib_btl->eager_rdma_lock at btl_openib_component.c:997. The fix removes this lock altogether. Instead we atomically set local RDMA pointer to prevent other threads to create rdma buffer for the same endpoint. And we increment eager_rdma_buffers_count atomically thus polling thread doesn't need lock around it. This commit was SVN r12369.	2006-10-31 09:54:52 +00:00
Gleb Natapov	1b152dfe09	On 64 bit platform if high 32 bits of buf address is not zero they are trimmed by wrong bitwise and. Fix it by expanding mask to 64 bits. This commit was SVN r12368.	2006-10-31 07:33:35 +00:00
Gleb Natapov	7b39039cd6	Add comments to process_pending functions. This commit was SVN r12346.	2006-10-29 09:12:24 +00:00
Gleb Natapov	8ef5b6a589	Change tabs to spaces to be consistent with the rest of the file. This commit was SVN r12345.	2006-10-29 08:12:44 +00:00
George Bosilca	a9c6ae8f15	Minimize the number of branches, and orce the correct prediction for the most usual one. Most of the time we expect the functions which allocate requests to succeed. This commit was SVN r12344.	2006-10-27 23:16:13 +00:00
George Bosilca	44f3dd81b4	Update the comment to reflect what's inside the code. This commit was SVN r12343.	2006-10-27 23:09:37 +00:00
George Bosilca	3472d19d4d	Do not modify the convertor if there is no data to be send across the network. The req_bytes_packed field is initialized in the BASE_INIT macro, so it is set for all requests at this stage. This commit was SVN r12342.	2006-10-27 23:03:15 +00:00
Jeff Squyres	020efdf1f9	Refs trac:250 This commit essentially caches the invoking comm/win/file on the ompi_request_t. This, paired with the req_type field, allows us to retrieve the invoking MPI object and invoke the proper errhandler. The patch is missing most updates for the MPI-2 one-sided stuff (i.e., the patch mainly fixes comms and files); I didn't really understand that code and didn't want to hazard trying to figure it out when Brian can probably do it much more quickly. So #250 will still stay open, pending MPI-2 one-sided updates for this stuff. This commit was SVN r12339. The following Trac tickets were found above: Ticket 250 --> https://svn.open-mpi.org/trac/ompi/ticket/250	2006-10-27 12:35:27 +00:00
Jeff Squyres	e02114dcf3	Fixes trac:529. * Create a new request type: NOOP (described below) * For all MPI__INIT functions, OBJ_NEW an ompi_request_t and set its type to NOOP Ensure that the NOOP requests are OBJ_RELEASE'd when they are done * MPI_START looks at the request type; if NOOP, just return success. If not, call the PML start() function * MPI_STARTALL always pass the entire array of requests back to the PML (see next point) * Make the PMLs only process PML requests (i.e., ignore/skip anything that isn't of type PML -- such as the NOOP requests) * Add a little more param error checking in STARTALL This commit was SVN r12338. The following Trac tickets were found above: Ticket 529 --> https://svn.open-mpi.org/trac/ompi/ticket/529	2006-10-27 12:32:36 +00:00
George Bosilca	882b429f64	ompi_mtl_datatype_pack is not a data-type function (really) so it still need the free_after (which btw has a different meaning that the one removed from the data-type engine few minutes ago). This commit was SVN r12333.	2006-10-27 00:15:53 +00:00
George Bosilca	393657ee26	Initialize the sndbuf in all cases. Do not forget to initialize the tree used in each of the broadcast functions. This commit was SVN r12332.	2006-10-27 00:13:33 +00:00
George Bosilca	126a68dc9a	Big datatype commit. Remove all unused features of the datatype engine. As the memory allocation logic is completely done outside the data-type engine (in the PML) there is no need for any special case inside the data-type engine. There is less arguments for the ompi_convertor_pack and ompi_convertor_unpack as well (the last field free_after is not required anymore as there is no memory allocated in the engine itself). This change affect all components using datatypes. I test most of them, but it might happens that I miss some ... If it's the case please let me know (don't shoot the pianist!!). This commit was SVN r12331.	2006-10-26 23:11:26 +00:00
George Bosilca	a1a4f7c422	Reset the segment pointer once we release the self fragment. This commit was SVN r12330.	2006-10-26 23:07:14 +00:00
George Bosilca	be8516e0d7	Anothers indentations. This commit was SVN r12329.	2006-10-26 23:06:15 +00:00
George Bosilca	83dfd36c1f	Indentations. This commit was SVN r12328.	2006-10-26 23:05:41 +00:00
George Bosilca	91ab093e96	Cleanup. No extern required for the function prototypes. This commit was SVN r12327.	2006-10-26 23:03:12 +00:00
George Bosilca	ba3c247f2a	Big collective commit. I lightly test it, but I think it should be quite stable. Anyway, the default decision functions (for broadcast, reduce and barrier) are based on a high performance network (not TCP). It should give good performance (really good) for any network having the following caracteristics: small latency (5 microseconds) and good bandwidth (more than 1Gb/s). + Cleanup of the reduce algorithms, plus 2 new algorithms (binary and binomial). Now most of the reduce algorithms use a generic tree based function for completing the reduce. + Added macros for computing the trees (they are used for bcast and reduce right now). + Allow the usage of all 5 topologies. + Jelena's implementation of a binary tree that can be used for non commutative operations. Right now only the tree building function is there, it will get activated soon. + Some others minor cleanups. This commit was SVN r12326.	2006-10-26 22:53:05 +00:00
Andrew Friedley	c752502dee	Fix for a common race condition when running the Sandia mt_send_recv.cc test. A segfault would occur in mca_pml_ob1_recv_request_progress() when trying to prepare the convertor for unpacking, because the request's req_proc field was NULL. Turns out that we weren't setting the req_proc field in the MCA_PML_OB1_CHECK_SPECIFIC_AND_WILD_RECEIVES_FOR_MATCH macro. Instead of just setting it there I removed the other place req_proc was being set correctly, and instead took care of all the cases at once in mca_pml_ob1_recv_frag_match(). This commit was SVN r12323.	2006-10-26 19:09:39 +00:00
Gleb Natapov	90be664b9f	Some process_pending() functions get bml_btl on which resource was freed as a parameter. For optimisation purpose only this BTL is used to send packet through instead of trying to send packets through all BTLs. But actually the code was wrong. It simply used provided bml_btl and it may represent different endpoint from packet's destination. The fixed code checks if packet's destination is reachable through the BTL, finds appropriate bml_btl and only then tries to send it through correct bml_btl. This commit was SVN r12319.	2006-10-26 13:21:47 +00:00
Terry Dontje	7259d1b512	Adjust allocation size to be a quantity divisible by sizeof(size_t). This is done to assure alignment so strictly aligned CPUs (like SPARC) do not sigbus. This also may benefit other platforms too. This commit fixes trac:494. This commit was SVN r12312. The following Trac tickets were found above: Ticket 494 --> https://svn.open-mpi.org/trac/ompi/ticket/494	2006-10-25 18:22:38 +00:00
Sven Stork	f3f39e003e	- Increment the pipeline depth before we trigger the send function. As mentioned in the comment the completion/callback of the triggered send operation can happen before the call returns. If this happens and if the pipeline depth is 0 before we triggered the send operation and this is the last send operation of the request then the completion detection code will decrement the pipeline depth and check it for equality to 0. Because (0-1) != 0 the pml completion function for this request will not be called. This part 2 of the fix for ticket #246. This commit was SVN r12292.	2006-10-25 08:52:39 +00:00
Sven Stork	3563f15fde	- Fix a bug in descriptor handling code. The self BTL was mixing the different kinds of descriptors (e.g. put rdma descriptor in the eager free-list). This part 1 of the fix for ticket #246. This commit was SVN r12291.	2006-10-25 08:45:29 +00:00
George Bosilca	99631ccf66	Cleanups. This commit was SVN r12272.	2006-10-23 22:29:17 +00:00
George Bosilca	d7d3f9e486	Tuned collectives works only for at least 2 processes. We have the self module for the other cases. This commit was SVN r12271.	2006-10-23 22:28:56 +00:00
George Bosilca	b848a5ad06	Remove all ompi_coll_chain_t references. This commit was SVN r12269.	2006-10-23 21:47:50 +00:00
George Bosilca	39cd8d3d17	One to rule them all. We only need one topology information: a tree. How we build it it's hat make the difference. This commit was SVN r12268.	2006-10-23 21:46:30 +00:00
George Bosilca	9cf3040e5f	Allocate enough memory for the reduce operation when MPI_IN_PLACE is specified. This commit was SVN r12260.	2006-10-23 17:51:36 +00:00
George Bosilca	6b697ad3dd	If the operation is not commutative then force the basic reducve algorithm. The others cannot be used for non commutative operations ... yet ... This commit was SVN r12241.	2006-10-20 22:11:44 +00:00

... 3 4 5 6 7 ...

1534 Коммитов