As mentioned in the comment, the completion/callback of the triggered
send operation can happen before the call returns. If this happens, if
the pipeline depth is 0 before we trigger the send operation, and if
this is the last send operation of the request, then the completion-detection
code will decrement the pipeline depth and check it for equality to 0.
Because (0 - 1) != 0, the PML completion function for this request will
*not* be called.
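A minimal sketch of the failure mode, using invented names (pipeline_depth, last_send_started) rather than the real ob1 fields:

    #include <stdio.h>

    /* Hypothetical, heavily simplified request bookkeeping. */
    struct sketch_request {
        int pipeline_depth;      /* outstanding pipelined send fragments  */
        int last_send_started;   /* set once the final fragment is queued */
    };

    /* Completion callback: decrement the depth and test for zero. */
    static void sketch_send_complete(struct sketch_request *req)
    {
        req->pipeline_depth--;
        if (req->last_send_started && 0 == req->pipeline_depth) {
            printf("request complete\n");   /* the PML completion path */
        }
    }

    int main(void)
    {
        struct sketch_request req = { .pipeline_depth = 0, .last_send_started = 0 };

        /* The callback fires before the sender returns from triggering the
         * operation: the depth goes from 0 to -1, (0 - 1) != 0, and the
         * completion path above is never taken. */
        sketch_send_complete(&req);

        /* Only now does the sender account for the fragment it just sent;
         * the counter is back to 0 but nobody re-checks it. */
        req.pipeline_depth++;
        req.last_send_started = 1;
        return 0;
    }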
This is part 2 of the fix for ticket #246.
This commit was SVN r12292.
size and displacement of data-types. After this patch all data can contain up to size_t bytes
and the displacements are defined as ptrdiff_t. All of the files I was able to compile
have been modified to match this requirement.
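Purely as an illustration (hypothetical field names, not the actual datatype engine), the intended types look like this:

    #include <stddef.h>

    /* Hypothetical description of one datatype block: the size is large
     * enough for any object (size_t) and the displacement is signed and
     * may span the whole address space (ptrdiff_t). */
    typedef struct {
        size_t    size;          /* number of bytes in this block       */
        ptrdiff_t displacement;  /* signed offset from the base address */
    } sketch_dt_block_t;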
This commit was SVN r12146.
long ago) supposed to be used as a cache for accessing the PML procs. But in
all of the PMLs the PML proc contains only one field, i.e. a pointer to the ompi_proc.
This pointer can easily be accessed through the c_remote_group. Therefore, there is no
point in keeping the PML procs around. Slim fast commit ...
This commit was SVN r11730.
There was some old code regarding the convertor that no longer has to be there
(the problem was corrected a while ago). In the PML we already know how the progress
function is defined, so call the BML progress directly instead, which saves one function
call.
The macro MCA_PML_OB1_COMPUTE_SEGMENT_LENGTH is already defined in pml_ob1.h,
so it should not be in endpoint.h.
Remove a double definition of the mca_pml_ob1_progress function in pml_ob1.h.
This commit was SVN r10775.
is the one provided by the user. For the buffered send the real datatype used
for the communication is always MPI_BYTE and the count can be retrieved from
the req_bytes_packed field. This will decrease the size of the request by
one pointer and one size_t (8 bytes or 16 bytes depending on the architecture).
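As a back-of-the-envelope check of that number (hypothetical layouts, not the real ob1 request), removing a datatype pointer and a count saves sizeof(void *) + sizeof(size_t) per request:

    #include <stddef.h>
    #include <stdio.h>

    /* Hypothetical before/after layouts of a buffered-send request. */
    typedef struct {
        size_t req_bytes_packed;   /* packed size, kept in both versions       */
        void  *req_datatype;       /* redundant: always MPI_BYTE for bsend     */
        size_t req_count;          /* redundant: derivable from bytes_packed   */
    } sketch_req_before_t;

    typedef struct {
        size_t req_bytes_packed;
    } sketch_req_after_t;

    int main(void)
    {
        /* Prints 8 on a 32-bit build and 16 on a 64-bit build. */
        printf("saved %zu bytes per request\n",
               sizeof(sketch_req_before_t) - sizeof(sketch_req_after_t));
        return 0;
    }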
This commit was SVN r10680.
standard). This macro allows us to specify the length of the fragment. Now we are
able to know how the message is fragmented across the network devices or inside
the communication protocol.
This commit was SVN r10508.
The sender can receive and complete a PUT request before it gets the completion of the first rndv packet. The sendreq struct may be reused for the next MPI_Send, and the unexpected completion messes things up. I sometimes got a SEGV and sometimes data corruption.
This commit was SVN r10301.
We support all the events in the PERUSE specifications, but right now only one event
of each type can be attached to a communicator. This will be worked out in the future.
The events were placed in such a way that we will be able to measure the overhead
of our threading implementation (the cost of the synchronization objects).
This commit was SVN r9500.
only as a pointer reference completely confuses some compilers (gcc 4.1
included). Removing the inline (it was there from before, when the function
was used in the same file) seems to solve the problem. However, the
strangest thing is that the bug only appears when we compile directly in
the trunk directory. It just doesn't happen when we're using the VPATH
build.
This commit was SVN r9408.
flag, new flags to be included when the convertor is initialized
- modified pml/btl module defs and added stub functions for diagnostic
output routines to dump state of queues / endpoints
- updates to data reliability pml
This commit was SVN r9329.
to let the PML (or io, more generally the low-level request manager)
have its own release function (what was previously the req_fini). This
function will only be called from the low level, while the req_free will
be called from the upper level (the MPI layer) in order to mark the request
as no longer used by the user.
From the request point of view, requests will be marked as inactive
every time we read their status (true for persistent requests as well). As
MPI_REQUEST_NULL is already marked as inactive, the test and wait functions
become simpler. The drawback is that we now have to update the req_status
of the request in ompi_request_{test|wait} once we get its status.
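A hedged sketch of the split, with invented names standing in for the real request structure (only req_free, req_status and the old req_fini role come from the text above):

    #include <string.h>

    typedef struct sketch_status  { int source; int tag; int error; } sketch_status_t;
    typedef struct sketch_request sketch_request_t;

    struct sketch_request {
        int              active;              /* cleared once the status is read    */
        sketch_status_t  status;
        void (*release)(sketch_request_t *);  /* low level (PML / io), old req_fini */
        void (*free)(sketch_request_t *);     /* MPI layer: the user is done        */
    };

    /* Roughly what the test/wait completion path has to do now: copy the
     * status out, then mark the request inactive (MPI_REQUEST_NULL stays
     * permanently inactive, so the same code handles it for free). */
    static void sketch_report_status(sketch_request_t *req, sketch_status_t *out)
    {
        memcpy(out, &req->status, sizeof(*out));
        req->active = 0;
    }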
This commit was SVN r9290.
- initial support for gm progress thread
- corrected threading issue in pml
- added polling progress for a configurable number of cycles to wait in the threaded case (see the sketch below)
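A minimal sketch of that last item, with an invented progress callback and parameter name; the real code is driven by the component's own progress function and MCA parameters:

    /* Spin on the progress engine for a bounded number of cycles before
     * falling back to the blocking wait used in the threaded build. */
    typedef int (*sketch_progress_fn)(void);

    static int sketch_wait_with_polling(sketch_progress_fn progress, int poll_cycles)
    {
        int completed = 0;
        for (int i = 0; i < poll_cycles && 0 == completed; ++i) {
            completed = progress();
        }
        if (0 == completed) {
            /* hypothetical blocking path, e.g. a condition-variable wait */
        }
        return completed;
    }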
This commit was SVN r9188.
around, since OB1 currently doesn't do the right thing there, but that should
not happen in the near future because the R2 BML should not make any RDMA
networks available between machines with different architectures
* Clean up the #ifs a little bit so that we don't do unneeded work when we are
on big-endian machines and heterogeneous support is disabled...
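A hedged sketch of that #if shape; the macro and helper names are invented, not the actual configure symbols or ob1 conversion helpers:

    #include <stdint.h>

    #define SKETCH_HETEROGENEOUS_SUPPORT 1   /* stand-in for the configure flag      */
    /* #define SKETCH_WORDS_BIGENDIAN 1 */   /* would be defined on big-endian hosts */

    typedef struct { uint16_t tag; uint16_t seq; } sketch_hdr_t;

    static inline void sketch_hdr_hton(sketch_hdr_t *hdr)
    {
    #if SKETCH_HETEROGENEOUS_SUPPORT && !defined(SKETCH_WORDS_BIGENDIAN)
        /* little-endian host in a heterogeneous build: byte-swap the header */
        hdr->tag = (uint16_t)((hdr->tag << 8) | (hdr->tag >> 8));
        hdr->seq = (uint16_t)((hdr->seq << 8) | (hdr->seq >> 8));
    #else
        /* big-endian host (already network byte order) or homogeneous build:
         * nothing to do, and no dead code is generated */
        (void)hdr;
    #endif
    }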
This commit was SVN r9184.
- moved hton64 and ntoh64 from the bunch of places they had been copied
into one header file (a sketch follows after this list)
- properly set and use the btl_tcp's nbo option to put things in
network byte order on the wire if both sides don't have the same
endianness
- Put the OB1 PML's headers (with a couple exceptions I need to discuss
with Tim) in network byte order on the wire if both sides don't have
the same endianness
- since it was needed for the TCP BTL, move the orte_process_name_t
HTON and NTOH macros from the TCP OOB to ns_types.h
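A hedged sketch of a 64-bit host/network swap of the kind that was consolidated; the names are invented, the endianness check leans on GCC/Clang predefined macros, and the real hton64/ntoh64 may be implemented differently:

    #include <stdint.h>
    #include <arpa/inet.h>   /* htonl() */

    static inline uint64_t sketch_hton64(uint64_t v)
    {
    #if defined(__BYTE_ORDER__) && (__BYTE_ORDER__ == __ORDER_BIG_ENDIAN__)
        return v;   /* big-endian host: already in network byte order */
    #else
        /* little-endian host: swap each 32-bit half, then exchange them */
        const uint32_t hi = htonl((uint32_t)(v >> 32));
        const uint32_t lo = htonl((uint32_t)(v & 0xffffffffu));
        return ((uint64_t)lo << 32) | hi;
    #endif
    }

    #define sketch_ntoh64(v) sketch_hton64(v)   /* the swap is its own inverse */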
This commit was SVN r9145.
counterparts, the reset to MPI_REQUEST_NULL of the upper
struct ompi_request_t was broken. Nightly mpi_test_suite
failed, e.g.
mpirun -np 2 ./mpi_test_suite -t "Ring Isend"
This commit was SVN r9028.
The following SVN revision numbers were found above:
r8945 --> open-mpi/ompi@83f83e5730
- move files out of toplevel include/ and etc/, moving them into the
sub-projects
- rather than including config headers with <project>/include,
have them as <project>
- require all headers to be included with a project prefix, with
the exception of the config headers ({opal,orte,ompi}_config.h,
mpi.h, and mpif.h)
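For illustration (the specific header paths are only examples), the convention reads roughly like this:

    /* Config headers keep their short, unprefixed form. */
    #include "ompi_config.h"
    #include "mpi.h"

    /* Everything else is included with its project prefix rather than
     * relying on a shared toplevel include/ directory. */
    #include "ompi/communicator/communicator.h"
    #include "opal/util/output.h"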
This commit was SVN r8985.
of request we are playing with (send or receive). Therefore, it's useless to have another
switch inside this macro and make the code bigger. Now we have 2 versions:
MCA_PML_OB1_SEND_REQUEST_FREE and MCA_PML_OB1_RECV_REQUEST_FREE.
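A heavily simplified sketch of the idea, with invented helper names (the real macros do the actual free-list bookkeeping): because the call site already knows the request type, each type gets its own macro and no run-time switch is needed.

    typedef struct sketch_send_request sketch_send_request_t;
    typedef struct sketch_recv_request sketch_recv_request_t;

    void sketch_send_request_return(sketch_send_request_t *req);
    void sketch_recv_request_return(sketch_recv_request_t *req);

    /* One specialized macro per request type instead of a single macro
     * that branches on the kind of request at run time. */
    #define SKETCH_SEND_REQUEST_FREE(req)  sketch_send_request_return(req)
    #define SKETCH_RECV_REQUEST_FREE(req)  sketch_recv_request_return(req)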
This commit was SVN r8945.
and add a new macro that can be used for both sends and receives.
Move to atomic operations to manage the length of the sent or received
status. There is one instance where the atomic operation is not required,
as the code cannot be executed at the same time by 2 different threads.
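A hedged sketch of the threaded update, using C11 atomics as a stand-in for the project's own atomic wrappers and assuming the counter tracks how many bytes have been handled so far:

    #include <stdatomic.h>
    #include <stddef.h>

    typedef struct {
        _Atomic size_t bytes_done;   /* length handled so far for this request */
    } sketch_req_progress_t;

    /* Completion callbacks may run from several threads, so the add is atomic. */
    static void sketch_add_bytes(sketch_req_progress_t *p, size_t n)
    {
        atomic_fetch_add(&p->bytes_done, n);
    }
    /* In the one place that only a single thread can ever reach, a plain
     * (non-atomic) counter and a simple += are sufficient. */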
This commit was SVN r8933.
- remove windows socket initialization (it's already in the TCP component)
- protect all used header files
- remove the unused ones.
This commit was SVN r8434.
the base send and receive request from the pml_base, we can solve our problem
if we construct the convertor attached to any request in the pml_base_construct
function. At the end of the lifetime of each request (here lifetime refers to
one use, without taking the cache into account) we release all information
attached to the convertors in the _FINI macro by calling
ompi_convertor_cleanup.
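A hedged sketch of that lifetime split with invented types and names; sketch_convertor_cleanup stands in for ompi_convertor_cleanup:

    /* Hypothetical request/convertor shapes; only the lifetime matters. */
    typedef struct { int prepared; } sketch_convertor_t;
    typedef struct { sketch_convertor_t convertor; } sketch_base_request_t;

    void sketch_convertor_cleanup(sketch_convertor_t *c);

    /* Construct-time: the convertor is set up once, together with the
     * base request it is embedded in (the pml_base_construct role). */
    static void sketch_base_request_construct(sketch_base_request_t *req)
    {
        req->convertor.prepared = 0;
    }

    /* End of one use of the request (the cache aside): drop whatever the
     * convertor accumulated during this use. */
    #define SKETCH_BASE_REQUEST_FINI(req) \
        sketch_convertor_cleanup(&(req)->convertor)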
This commit was SVN r7910.
convertor (when prepared) increases the reference count on the datatype being used. This reference count
will be released only when OBJ_DESTRUCT is called on the convertor. However, having to call
OBJ_CONSTRUCT and OBJ_DESTRUCT on each request every time we want to use it (even when it comes
from the cache) is an expensive operation. This can be avoided if OBJ_DESTRUCT leaves the
convertor in exactly the same state as OBJ_CONSTRUCT does. With this approach we just have to call
OBJ_CONSTRUCT for each convertor once, when we initially create the request.
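A hedged sketch of the invariant with invented names: destruct drops the datatype reference and puts the convertor back into exactly the freshly constructed state, so a cached request never needs another construct/destruct pair.

    #include <stddef.h>

    typedef struct { int refcount; } sketch_datatype_t;
    typedef struct { sketch_datatype_t *dt; } sketch_convertor_t;

    static void sketch_convertor_construct(sketch_convertor_t *c) { c->dt = NULL; }

    /* Preparing the convertor retains the datatype it will work on. */
    static void sketch_convertor_prepare(sketch_convertor_t *c, sketch_datatype_t *dt)
    {
        c->dt = dt;
        dt->refcount++;
    }

    /* Destruct releases the reference and restores the constructed state. */
    static void sketch_convertor_destruct(sketch_convertor_t *c)
    {
        if (NULL != c->dt) {
            c->dt->refcount--;
        }
        sketch_convertor_construct(c);
    }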
This commit was SVN r7813.