openmpi

Автор	SHA1	Сообщение	Дата
Brian Barrett	8900d3ae43	Second take at fixing the issues with using ompi_ptr_t. Add helper functions for converting from .pval to .lval and vice-versa. Users of ompi_ptr_t types should only use one of the fields in the union unless using the helper conversion functions. For the BTLs, local pointers will always be stored in the .pval field and remote pointers always stored in the .lval field. George wrote the initial patch, I extended it slightly and am responsible for all bugs found. Refs trac:587 This commit was SVN r13023. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-07 01:48:57 +00:00
Jelena Pjesivac-Grbovic	eae3df4904	Updated broadcast decision function based on MX results up to 64 nodes. (The previous decision function did not consider binomial algorithm (since we did not have it at the time)). This commit was SVN r13007.	2007-01-06 00:37:40 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Galen Shipman	d207a6c988	endpoint should use a uint64_t for subnet, as everyone else does.. makes bad things happen when packing into a 64 bit buffer... Misc cleanup.. This commit was SVN r12993.	2007-01-04 20:25:28 +00:00
Brian Barrett	936fdd2ae1	remove some code that accidently came in with r12974. Refs trac:587 This commit was SVN r12991. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-04 20:17:07 +00:00
Galen Shipman	931a389c4f	fix deadlock on rendezvous protocol.. This commit was SVN r12982.	2007-01-04 03:46:11 +00:00
Galen Shipman	f12bbe0591	Handle different subnets correctly and multiple nic endpoint negotiation This is somewhat limited currently for expample, if you have 3 ports on Node A and 5 ports on Node B then the peers will use 3 ports to communicate with each other. This is on a subnet basis, so for any pair of nodes we take the intersection of the available ports within a subnet. We use subnets to determine reachability for lazy connection establishment. So if Node A and Node B each have two HCA's (on seperate networks) then the subnet's must be distinct, otherwise we will try to wire up HCA's on seperate networks. This commit was SVN r12978.	2007-01-03 22:35:41 +00:00
Brian Barrett	7cac26d240	* fix some typos that slipped in with r12974. Refs trac:587 This commit was SVN r12976. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 20:14:45 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	a6127fd8ce	Increase req_bytes_delivered atomically. This commit was SVN r12971.	2007-01-03 15:19:34 +00:00
Gleb Natapov	79202561f6	Don't check req_pipeline_depth on frag completion. Checking of req_bytes_delivered should be enough. This commit was SVN r12967.	2007-01-03 14:44:20 +00:00
Gleb Natapov	1ad6c41735	Sender can start scheduling send fragments immediately after receiving ACK. No need to wait for RNDV completion. This commit was SVN r12965.	2007-01-03 12:37:11 +00:00
Rich Graham	8a9da02063	change code to conform with coding standard. Handle error condition where shared memory file is not created. This commit was SVN r12964.	2007-01-03 00:06:02 +00:00
Donald Kerr	899297c8f4	udapl btl was not compiling after r12878 on 12/17/2006, some minor changes to allow btl to compile This commit was SVN r12963. The following SVN revision numbers were found above: r12878 --> open-mpi/ompi@190e7a27cd	2007-01-02 21:44:12 +00:00
George Bosilca	d8dee3a740	If the MX driver was unable to load correctly, or if the endpoint was not created then don't try to call the MX endpoint close function. This commit was SVN r12950.	2007-01-02 00:01:50 +00:00
Rich Graham	6cb2377015	Change the allocation of the shared memory backing file. The file is allocated on a per comm_world instance, with the lowest rank in comm_world on the given host creating and initializing the file, and then notifying the remaining files via the OOB. Reviewed: Ralph Castain, Brian Barrett Addressing ticket #674. This commit was SVN r12949.	2007-01-01 02:39:02 +00:00
George Bosilca	e223b27268	A fragment is marked completed by the PML when the peer signal the completion of the RDMA operation associated with the fragment. The PML will call the BML free which in turn will call the BTL free. The MX BTL will not release the fragment if it not tagged with 0xff. This commit was SVN r12947.	2006-12-31 03:17:47 +00:00
George Bosilca	47601e315e	Allow the MX BTL to select at runtime if the unexpected handler will be activated or not. This commit was SVN r12944.	2006-12-30 20:57:50 +00:00
Brian Barrett	99c0a29602	Disable CM and DR PMLs in heterogeneous situtations as neither are heterogeneous safe. Refs trac:587 This commit was SVN r12942. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-30 16:17:56 +00:00
George Bosilca	d401a65975	Minor cleanups. Don't set the fields that will never be used. This commit was SVN r12941.	2006-12-29 07:55:17 +00:00
George Bosilca	0b5d879a63	ompi_convertor_pack do not return errors (all checkings are done when the convertor is created). This commit was SVN r12940.	2006-12-29 07:40:02 +00:00
George Bosilca	d8db9e49f3	Set the bml_btl to NULL or segfault !!! This commit was SVN r12939.	2006-12-29 07:38:24 +00:00
Brian Barrett	c010119667	If a BTL isn't needed due to exclusivity ranking, need to call a matching inuse decrement for the increment that was at the start of the procs loop. Otherwise, the inuse count can end up higher than it actually is and a btl can end up in the progress loop when it isn't active to any peer. Refs trac:543 This commit was SVN r12938. The following Trac tickets were found above: Ticket 543 --> https://svn.open-mpi.org/trac/ompi/ticket/543	2006-12-29 02:22:40 +00:00
George Bosilca	416e5b5f6a	Enable the MX extensions if and only if the mx_extensions.h header is installed on the system. This commit was SVN r12937.	2006-12-29 00:31:32 +00:00
George Bosilca	d7bc180a90	The max allocated tag is not 16. Use the define instead. This commit was SVN r12936.	2006-12-28 22:48:58 +00:00
George Bosilca	3eeecc3838	Add support for faster small messages. While sending a message, we check if the data was buffered by the MX library. If it's the case then we declare the send as completed and disable the completion event for the mx request. This commit was SVN r12935.	2006-12-28 22:34:24 +00:00
George Bosilca	b996c00d1a	Set the limits for the MX fragments to 4K. Add code to dump the state of the MX hardware (not activated). This commit was SVN r12931.	2006-12-28 08:40:37 +00:00
George Bosilca	3903009b8b	Add a check for the unexpected handler. If enabled, allow the zero-copy protocol over the MX BTL. Now, we have only one matching, the one in Open MPI. The problem is that when the unexpected handler is triggered, not all the message is on the host memory. In the best case we get one MX fragment (internal MX fragment), in the worst we get NULL. The only way to fit this with the design of the PML is to force the eager protocol at the MX internal fragment size, and to limit the send/receive protocol at the same size. Tests show the outcome is not far from optimal (if the pipeline depth is increased a little bit). Set MX_PIPELINE_LOG in order to allow MX to use internal fragments of 4K. This commit was SVN r12930.	2006-12-28 03:35:41 +00:00
George Bosilca	ff2319dcb7	Complete the OUT protocol. Small latency improvements. Some minor cleanups. Create some macros, reorder some functions. Make sure all fragments are correctly released at the end. This commit was SVN r12926.	2006-12-26 18:15:24 +00:00
George Bosilca	75a35ed7ee	Implement the PUT protocol over MX. The send/receive approach give the best performance on a 2G Myrinet card, as it look like pipelining the messages by 1M is faster than a simple send/receive. However, when using a 10G card the send/receive will limit the maximum bandwidth to 2.5Gbs. The reason is the scarce bus resources that have to be shared between the Myrinet hardware and the memcpy operation. The PUT protocol remove the memcpy, we now have a true zero-copy mechanism. But, there is no pipelining yet as it look like the RDMA pipeline somehow disappeared from the OB1 PML ... This commit was SVN r12925.	2006-12-24 22:52:46 +00:00
George Bosilca	e8bd985870	Add more output when calls to the MX library fails. Move the connection status from theproc into the endpoint. This commit was SVN r12924.	2006-12-24 22:34:48 +00:00
George Bosilca	14dc72f595	Allow the user to change the MX flags. This commit was SVN r12923.	2006-12-24 22:21:00 +00:00
George Bosilca	dbe2798638	Allow MX to handle shared memory and self communications. By default these features are disabled (btl_mx_shared_mem respectively btl_mx_self have to be set in order to activate them). This commit was SVN r12922.	2006-12-24 22:18:41 +00:00
Jelena Pjesivac-Grbovic	3494e1bb05	- Updated decision function for Alltoall collective. Fixes "jump" for intermediate sizes message on 24+ number of nodes (at least on Grig cluster). This commit was SVN r12920.	2006-12-22 19:59:17 +00:00
George Bosilca	b1725e02d4	No more warnings plus some code reordering. This commit was SVN r12919.	2006-12-21 22:42:15 +00:00
Jelena Pjesivac-Grbovic	f1aec23507	Adding tuned allgather implementation. It contains four algorithms: Bruck (ciel(logP) steps), Recursive Doubling (log(P) for power-of-2 processes), Ring (P-1 steps), and Neighbor Exchange (P/2 steps for even number of processes). All algorithms passed occ, IMB-2.3, and intel verification tests from ompi-tests/ for up to 56 processes. The fixed decision function is based on results collected over MX on the Grig cluster at the University of Tennessee at Knoxville. I have also added (and commented out) copy of MPICH2 decision function for allgather (from their IJHPCA 2005 paper). This commit was SVN r12910.	2006-12-21 18:40:02 +00:00
Brian Barrett	7880353fcc	Need to close every endpoint we open, or the MX progress thread doesn't die, which can cause segfaults on shutdown. Calling mx_finalize() isn't enough to shutdown the thread, so must close endpoints as well. Refs trac:513 This commit was SVN r12908. The following Trac tickets were found above: Ticket 513 --> https://svn.open-mpi.org/trac/ompi/ticket/513	2006-12-21 18:13:22 +00:00
Gleb Natapov	484c6a2c1a	Use OPAL_ALIGN() macro to align length. Return address from mpool_alloc is now properly aligned so no need to align it once more. This commit was SVN r12899.	2006-12-19 08:34:48 +00:00
Brian Barrett	2ab65eb521	Remove some debugging output that was #if 0'ed out but shouldn't have been committed into the trunk anyway This commit was SVN r12897.	2006-12-19 02:34:41 +00:00
Gleb Natapov	b910d10a81	Add general functions for alignment and change rdma_mpool_align to always honor an alignment event if posix_memalign() is not available. This commit was SVN r12892.	2006-12-18 10:52:18 +00:00
Brian Barrett	c554638446	Support systems without malloc.h or posix_memalign (ie, pretty much every one we support that isn't Linux) This commit was SVN r12880.	2006-12-17 17:28:59 +00:00
Brian Barrett	b448b4e47e	More heterogeneous fixes. Don't set reachability bit on a remote proc if the remote architecture differs from the local architecture and the btl doesn't support heterogeneous transport. Refs trac:587 This commit was SVN r12879. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-17 17:27:08 +00:00
Gleb Natapov	190e7a27cd	Merge with gleb-mpool branch. All RDMA components use same mpool now (rdma). udapl/openib/vapi/gm mpools a deprecated. rdma mpool has parameter that allows to limit its size mpool_rdma_rcache_size_limit (default is 0 - unlimited). This commit was SVN r12878.	2006-12-17 12:26:41 +00:00
Brian Barrett	0653dc3f24	Pad headers to eliminate heterogeneous issues. Add conversion functions for switching endianness of headers. Galen is going to add the code to use the endian stuff... This commit was SVN r12876.	2006-12-17 00:50:59 +00:00
Brian Barrett	01e8fc5f91	Redo of r12871, without the preconnect code change: Move the req_mtl structure back to the end of each of the structures in the CM PML. The req_mtl structure is cast into a mtl__request_structure for each MTL, which is larger than the req_mtl itself. The cast will cause the _request to overwrite parts of the heavy requests if the req_mtl isn't the LAST thing on each structure (hence the comment). This was moved as an optimization at some point, which caused buffer sends to fail... Refs trac:669 This commit was SVN r12873. The following SVN revision numbers were found above: r12871 --> open-mpi/ompi@597598b712 The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:54:14 +00:00
Brian Barrett	bdf0b231b2	Undo r12871, as it contained some code in ompi/runtime that shouldn't have been committed Refs trac:669 This commit was SVN r12872. The following SVN revision numbers were found above: r12871 --> open-mpi/ompi@597598b712 The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:52:13 +00:00
Brian Barrett	597598b712	Move the req_mtl structure back to the end of each of the structures in the CM PML. The req_mtl structure is cast into a mtl__request_structure for each MTL, which is larger than the req_mtl itself. The cast will cause the _request to overwrite parts of the heavy requests if the req_mtl isn't the LAST thing on each structure (hence the comment). This was moved as an optimization at some point, which caused buffer sends to fail... Refs trac:669 This commit was SVN r12871. The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:46:53 +00:00
Brad Benton	18da4c40d3	Set the QP's static rate from the associated MCA parameter, rather than just defaulting to 0. Fixes trac:675 This commit was SVN r12855. The following Trac tickets were found above: Ticket 675 --> https://svn.open-mpi.org/trac/ompi/ticket/675	2006-12-14 19:42:24 +00:00
Brian Barrett	38c2e43ac2	Print out error string rather than errno for TCP-related errors, making it easier for both the user and us to debug issues with BTL and OOB issues... This commit was SVN r12852.	2006-12-14 18:20:43 +00:00
Jeff Squyres	0ca8cb35b7	Fixes trac:366 Add ability for ini files to recognize "use_eager_rdma" flag. Set the default to "no" (because we should assume that HCAs cannot support the property necessary for using RDMA for eager messages -- that the last byte of the message is guaranteed to be written to memory last -- unless proven otherwise. For example, iWARP cards apparently do not provide this guarantee), and then set all Mellanox and IBM HCAs to override the default to enable this behavior on these cards. This commit was SVN r12851. The following Trac tickets were found above: Ticket 366 --> https://svn.open-mpi.org/trac/ompi/ticket/366	2006-12-14 15:52:13 +00:00

... 3 4 5 6 7 ...

1637 Коммитов