openmpi

Author	SHA1	Message	Date
Gleb Natapov	cf40674369	Decide if sends should be throttled at the receiver and pass this to the sender in an ACK message. The decision can't be done reliably at the sender. This commit was SVN r17987.	2008-03-27 08:56:43 +00:00
George Bosilca	6310ce955c	The first patch related to the Active Message stuff. So far, here is what we have: - the registration array is now global instead of one by BTL. - each framework have to declare the entries in the registration array reserved. Then it have to define the internal way of sharing (or not) these entries between all components. As an example, the PML will not share as there is only one active PML at any moment, while the BTLs will have to. The tag is 8 bits long, the first 3 are reserved for the framework while the remaining 5 are use internally by each framework. - The registration function is optional. If a BTL do not provide such function, nothing happens. However, in the case where such function is provided in the BTL structure, it will be called by the BML, when a tag is registered. Now, it's time for the second step... Converting OB1 from a switch based PML to an active message one. This commit was SVN r17140.	2008-01-15 05:32:53 +00:00
Gleb Natapov	35bf8c7c46	Rewrite OB1 matching logic. Get rid of macros, make the code shorter. This commit was SVN r16993.	2007-12-19 09:16:20 +00:00
Gleb Natapov	5cd38b8b06	Better encapsulate heterogeneous arch handling in ob1. This commit was SVN r16970.	2007-12-16 08:45:44 +00:00
George Bosilca	05ae27c68b	Don't segfault if we receive a fragment for a non existing communicator. Instead, drop it by now. This commit was SVN r16105.	2007-09-12 17:52:02 +00:00
Gleb Natapov	0b0f9d14aa	Mark send request complete on PML level only when absolutely sure there is no more work associated with this request. No more outstanding completions or packets and send scheduling isn't running in another thread. This commit was SVN r16013.	2007-08-30 12:08:33 +00:00
Brian Barrett	59b22533f2	Enable RDMA for heterogeneous situations. Currently done by overloading the ompi_convertor_need_buffers function to only return 0 if the convertor is homogeneous (which it never does on the trunk, but does to on v1.2, but that's a different issue). Only enable the heterogeneous rdma code for a btl if it supports it (via a flag), as some btls need some work for this to work properly. Currently only TCP and OpenIB extensively tested This commit was SVN r15990.	2007-08-28 21:23:44 +00:00
Rainer Keller	1b5fa48a29	- Add missing PERUSE_COMM_REQ_REMOVE_FROM_POSTED_Q when matching from the posted generic_recv-queue. - Move the PERUSE_COMM_MSG_MATCH_POSTED_REQ from MCA_PML_OB1_RECV_REQUEST_MATCHED to mca_pml_ob1_recv_frag_match() as suggested by Terry Dontje Only post, if this is not a probe/iprobe request. - Do not post PERUSE_COMM_REQ_MATCH_UNEX for probes / iprobes and do in correct order before PERUSE_COMM_MSG_REMOVE_FROM_UNEX_Q This commit was SVN r15947.	2007-08-23 07:09:43 +00:00
George Bosilca	e19777e910	A more consistent version. As we now share the send and receive queue, we have to construct/destruct only once. Therefore, the construction will happens before digging for a PML, while the destruction just before finalizing the component. Add some OPAL_LIKELY/OPAL_UNLIKELY. This commit was SVN r15347.	2007-07-10 23:45:23 +00:00
George Bosilca	433f8a7694	This patch bring full support for message queues in Open MPI. Now the send and receive queues are shared among all PMLs, they are declared in the base PML, and the selected PML is in charge of initializing and releasing them. The CM PML is slightly different compared with OB1 or DR. Internally it use 2 different types of requests: light and heavy. However, now with this patch both types of requests are stored in the same queue, and cast appropriately on the allocation macro. This means we might use less memory than we allocate, but in exchange we got full support for most of the parallel debuggers. Another thing with this patch, is that now for all PML (CM included) the basic PML requests start with the same fields, and they are declared in the same order in the request structure. Moreover, the fields have been moved in such a way that only one volatile/atomic will exist per line of cache (hopefully). This commit was SVN r15346.	2007-07-10 22:16:38 +00:00
George Bosilca	951e4929b9	Usually it's unlikely to have additional fragments. This commit was SVN r15253.	2007-07-01 16:19:53 +00:00
Gleb Natapov	10266fb467	Fix deadlock in OB1 protocol by sending memory by copying if registration fails. This commit was SVN r14842.	2007-06-03 08:31:58 +00:00
Gleb Natapov	a25e1e7b15	Implement new function mca_pml_ob1_send_requst_copy_in_out(req, offset, len) that allows to send any range of a request by send/recv instead of RDMA and use it to send data from the end of a request in pipeline protocol. This commit was SVN r14841.	2007-06-03 08:30:07 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	1ad6c41735	Sender can start scheduling send fragments immediately after receiving ACK. No need to wait for RNDV completion. This commit was SVN r12965.	2007-01-03 12:37:11 +00:00
George Bosilca	3edd850d2e	Some indentation and code arrangement. However, there is a bug fix. Force the PUT protocol to always obey to the btl_max_rdma_size. This commit was SVN r12721.	2006-12-01 22:26:14 +00:00
Andrew Friedley	a4bdcb4faa	Fix a segfault that turned up in more MPI_THREAD_MULTIPLE testing. Same sort of problem and fix as described in r12323 - mca_pml_ob1_recv_frag_progress() was segfaulting due to a NULL req_proc pointer. The path leading to this was through the mca_pml_ob1_check_cantmatch_for_match() function, where we can match a frag using the same macros as mca_pml_ob1_frag_match() and never initialize the req_proc pointer. This commit was SVN r12582. The following SVN revision numbers were found above: r12323 --> open-mpi/ompi@c752502dee	2006-11-13 20:12:51 +00:00
Andrew Friedley	c752502dee	Fix for a common race condition when running the Sandia mt_send_recv.cc test. A segfault would occur in mca_pml_ob1_recv_request_progress() when trying to prepare the convertor for unpacking, because the request's req_proc field was NULL. Turns out that we weren't setting the req_proc field in the MCA_PML_OB1_CHECK_SPECIFIC_AND_WILD_RECEIVES_FOR_MATCH macro. Instead of just setting it there I removed the other place req_proc was being set correctly, and instead took care of all the cases at once in mca_pml_ob1_recv_frag_match(). This commit was SVN r12323.	2006-10-26 19:09:39 +00:00
George Bosilca	e5ccc1aece	Keep the loop as short as possible. And specialize the search for ANY_TAG. This commit was SVN r11874.	2006-09-28 22:47:40 +00:00
George Bosilca	688a16ea78	A long time waiting patch. Get rid of the comm->c_pml_procs. It was (and that was long ago) supposed to be used as a cache for accessing the PML procs. But in all of the PMLs the PML proc contain only one field i.e. a pointer to the ompi_proc. This pointer can be accessed using the c_remote_group easily. Therefore, there is no meaning of keeping the PML procs around. Slim fast commit ... This commit was SVN r11730.	2006-09-20 22:14:46 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
George Bosilca	5617cb1a0a	Make some function static. Optimize the fast path. Still working on the latency ... This commit was SVN r10787.	2006-07-13 16:52:40 +00:00
Galen Shipman	18dda70fd0	make ompi_free_list_item_t a class.. This will go to the 1.1 branch but will probably require a few changes as ompi_free_list_t is different in the branch.. This commit was SVN r10306.	2006-06-12 16:44:00 +00:00
George Bosilca	58cd591d3b	PERUSE support for OB1. There we go, now the trunk has a partial peruse implementation. We support all the events in the PERUSE specifications, but right now only one event of each type can be attached to a communicator. This will be worked out in the future. The events were places in such a way, that we will be able to measure the overhead for our threading implementation (the cost of the synchronization objects). This commit was SVN r9500.	2006-03-31 17:09:09 +00:00
George Bosilca	612570134f	The request management framework has been redesigned. The main idea is to let the PML (or io, more generally the low level request manager) to have it's own release function (what was before the req_fini). This function will only be called from the low level while the req_free will be called from the upper level (MPI layer) in order to mark the request as not used by the user anymore. From the request point of view the requests will be marked as inactive everytime we read their status (true for persistent as well). As MPI_REQUEST_NULL is already marked as inactive, the test and wait functions are simpler. The drawback is that now we have to change in the ompi_request_{test\|wait} the req_status of the request once we get it's status. This commit was SVN r9290.	2006-03-15 22:53:41 +00:00
Brian Barrett	1479a90b39	* assert() that endianness doesn't need to change if we are sending RDMA headers around, since OB1 currently doesn't do the right thing there, but that should not happen in the near future because the R2 BML should not make any RDMA networks available between machines with different architectures * Clean up the #ifs a little bit so that we don't do unneeded work when on big endian machines and heterogeneous support is disabled... This commit was SVN r9184.	2006-02-28 19:54:46 +00:00
Brian Barrett	285581dff2	More endian-related cleanups: - moved hton64 and ntoh64 from the bunch of places it had been copied into one header file - properly set and use the btl_tcp's nbo option to put things in network byte order on the wire if both sides don't have the same endianness - Put the OB1 PML's headers (with a couple exceptions I need to discuss with Tim) in network byte order on the wire if both sides don't have the same endianness - since it was needed for the TCP BTL, move the orte_process_name_t HTON and NTOH macros from the TCP OOB to ns_types.h This commit was SVN r9145.	2006-02-26 00:45:54 +00:00
Brian Barrett	2eb76ff0cd	* finish the TEG/UNIQ/PTL removal This commit was SVN r9118.	2006-02-23 00:39:01 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
George Bosilca	0376dce258	Keep track of the ompi_proc in the comm_proc. This avoid a lookup for the processor and simplify the execution path. The peer proc (ompi_proc_t) is set at the matching stage. This commit was SVN r8962.	2006-02-10 18:55:43 +00:00
George Bosilca	269fc0c13a	Cleanup: 1. remove all useless macros from the proc header file 2. merge 2 of the match macros (they share the same logic except one list) This commit was SVN r8946.	2006-02-09 06:59:54 +00:00
George Bosilca	5c8c939713	Move the comment at the right place. This commit was SVN r8445.	2005-12-10 23:25:29 +00:00
Tim Woodall	e135f850af	backed out to much :-) This commit was SVN r8356.	2005-12-01 17:32:27 +00:00
Tim Woodall	394bf196bd	back out changes to match only one probe - consensus was we should allow this if multiple threads post multiple probes This commit was SVN r8353.	2005-12-01 17:17:06 +00:00
Tim Woodall	53a33f3bed	dont allow fragment to match more than one probe This commit was SVN r8352.	2005-12-01 17:06:40 +00:00
Tim Woodall	d7c1c23e3f	corrections for handling probe with out of order delivery - when processing out of order list - reset match to null on each iteration - check matched request type and if probe - complete probe and queue fragment on unexpected list This commit was SVN r8339.	2005-11-30 17:57:59 +00:00
Jeff Squyres	42ec26e640	Update the copyright notices for IU and UTK. This commit was SVN r7999.	2005-11-05 19:57:48 +00:00
Tim Woodall	ee58631c82	corrections for probe/iprobe This commit was SVN r7342.	2005-09-13 16:45:41 +00:00
Tim Woodall	f274f524ab	- added get based protocol (if supported by btl) for pre-registered memory - removed 8 bytes from the majority of the pml headers This commit was SVN r6916.	2005-08-17 18:23:38 +00:00
Jeff Squyres	cf16a521c8	Ensure to get ompi/include/constants.h This commit was SVN r6845.	2005-08-12 21:42:07 +00:00
Galen Shipman	b01ebf45c9	Fixed build error related to direct call (bml_direct_call.h). Misc bug fixes and compiler warning issues. Fixed threaded build issue. This commit was SVN r6819.	2005-08-12 14:08:40 +00:00
Galen Shipman	c3c83aa3e1	BML (BTL Managment Layer). Allows BTL's to be used outside of the PML. See bml.h and PML-OB1 for usage. This commit was SVN r6815.	2005-08-12 02:41:14 +00:00
Tim Woodall	e9ca560f16	corrections for probe/iprobe This commit was SVN r6770.	2005-08-08 21:07:12 +00:00
Brian Barrett	24116a3935	* fix up a bunch of threading issues when progress and/or mpi threads are enabled. Mostly just ADD32 -> ADD_SIZE_T issues and naming of variables in THREAD_{LOCK,UNLOCK} This commit was SVN r6706.	2005-08-02 17:36:01 +00:00
Tim Woodall	0423d414ef	- correction for sync send - now passing all of the intel p2p list This commit was SVN r6543.	2005-07-18 18:54:25 +00:00
Brian Barrett	39dbeeedfb	* rename locking code from ompi to opal This commit was SVN r6327.	2005-07-03 22:45:48 +00:00
Brian Barrett	761402f95f	* rename ompi_list to opal_list This commit was SVN r6322.	2005-07-03 16:22:16 +00:00
Jeff Squyres	4ab17f019b	Rename src -> ompi This commit was SVN r6269.	2005-07-02 13:43:57 +00:00

49 commits