openmpi

Автор	SHA1	Сообщение	Дата
Jelena Pjesivac-Grbovic	d2921a9d42	Cleanup of Barrier implementation: - utilizing coll_tuned_util functions - setting line length to 80. This implementation uses standard send messages (instead of synchronous ones). The change improved our performance over MX multiple number of times, however, there exists a small potential that last message to be sent can be delayed (until next mpi call, which means potentially infinitely). If this shows to be a problem, I will modify the algorithms to use synchronous send as last operation (which will incur performance penalty again). This commit was SVN r13071.	2007-01-10 22:49:43 +00:00
Jelena Pjesivac-Grbovic	ccc3ee0b6b	Minor changes to allgather implementation with some clean-up of util code. - in allgather algorithms I replaces irecv-isend-waitall sequence with call to ompi_coll_tuned_sendrecv - most of the functions in util code and allgather decision function conform to 80 character line width. - This commit was SVN r13069.	2007-01-10 21:56:59 +00:00
Josh Hursey	93208445fd	Make sure we wireup the 'verbose' MCA parameter for the BTL's. This commit was SVN r13067.	2007-01-10 21:24:35 +00:00
Gleb Natapov	624f139bd8	This commit fixes trac:729. Initialize pointer to registration to NULL. Otherwise it may contain garbage and we will try to unregister it later in btl_free(). This commit was SVN r13054. The following Trac tickets were found above: Ticket 729 --> https://svn.open-mpi.org/trac/ompi/ticket/729	2007-01-09 10:29:20 +00:00
Gleb Natapov	d3ac56272a	Prevent access to openib_btl after free(). This commit was SVN r13052.	2007-01-09 09:07:32 +00:00
George Bosilca	87ff2b5ce8	Cast to the correct type. This commit was SVN r13046.	2007-01-08 22:04:01 +00:00
George Bosilca	f419960c7f	All files have to include ompi_config.h before anything else. This commit was SVN r13045.	2007-01-08 22:03:16 +00:00
George Bosilca	53ddbe8446	Nothing relevant. This commit was SVN r13044.	2007-01-08 22:02:17 +00:00
Brian Barrett	e130f18cc2	Fix some compiler warnings that have slipped in lately... This commit was SVN r13037.	2007-01-08 17:20:09 +00:00
Brian Barrett	a34e67d743	Remove unneeded PARAM_INIT_FILE variable in configure.params files used by components that use configure.m4 for configuration or are always built. The macro has not been needed since moving to configure types other than configure.stub Fixes trac:590 This commit was SVN r13031. The following Trac tickets were found above: Ticket 590 --> https://svn.open-mpi.org/trac/ompi/ticket/590	2007-01-08 03:44:22 +00:00
Brian Barrett	b8413fb1d5	Just cast the pointer to a uintptr_t then to the match bits, instead of abusing the ompi_ptr_t interface. Not critical for v1.2, as there are no portals platforms that are big endian, so the code in v1.2 will work well enough for now This commit was SVN r13024.	2007-01-07 03:11:27 +00:00
Brian Barrett	8900d3ae43	Second take at fixing the issues with using ompi_ptr_t. Add helper functions for converting from .pval to .lval and vice-versa. Users of ompi_ptr_t types should only use one of the fields in the union unless using the helper conversion functions. For the BTLs, local pointers will always be stored in the .pval field and remote pointers always stored in the .lval field. George wrote the initial patch, I extended it slightly and am responsible for all bugs found. Refs trac:587 This commit was SVN r13023. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-07 01:48:57 +00:00
Jelena Pjesivac-Grbovic	eae3df4904	Updated broadcast decision function based on MX results up to 64 nodes. (The previous decision function did not consider binomial algorithm (since we did not have it at the time)). This commit was SVN r13007.	2007-01-06 00:37:40 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Galen Shipman	d207a6c988	endpoint should use a uint64_t for subnet, as everyone else does.. makes bad things happen when packing into a 64 bit buffer... Misc cleanup.. This commit was SVN r12993.	2007-01-04 20:25:28 +00:00
Brian Barrett	936fdd2ae1	remove some code that accidently came in with r12974. Refs trac:587 This commit was SVN r12991. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-04 20:17:07 +00:00
Galen Shipman	931a389c4f	fix deadlock on rendezvous protocol.. This commit was SVN r12982.	2007-01-04 03:46:11 +00:00
Galen Shipman	f12bbe0591	Handle different subnets correctly and multiple nic endpoint negotiation This is somewhat limited currently for expample, if you have 3 ports on Node A and 5 ports on Node B then the peers will use 3 ports to communicate with each other. This is on a subnet basis, so for any pair of nodes we take the intersection of the available ports within a subnet. We use subnets to determine reachability for lazy connection establishment. So if Node A and Node B each have two HCA's (on seperate networks) then the subnet's must be distinct, otherwise we will try to wire up HCA's on seperate networks. This commit was SVN r12978.	2007-01-03 22:35:41 +00:00
Brian Barrett	7cac26d240	* fix some typos that slipped in with r12974. Refs trac:587 This commit was SVN r12976. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 20:14:45 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	a6127fd8ce	Increase req_bytes_delivered atomically. This commit was SVN r12971.	2007-01-03 15:19:34 +00:00
Gleb Natapov	79202561f6	Don't check req_pipeline_depth on frag completion. Checking of req_bytes_delivered should be enough. This commit was SVN r12967.	2007-01-03 14:44:20 +00:00
Gleb Natapov	1ad6c41735	Sender can start scheduling send fragments immediately after receiving ACK. No need to wait for RNDV completion. This commit was SVN r12965.	2007-01-03 12:37:11 +00:00
Rich Graham	8a9da02063	change code to conform with coding standard. Handle error condition where shared memory file is not created. This commit was SVN r12964.	2007-01-03 00:06:02 +00:00
Donald Kerr	899297c8f4	udapl btl was not compiling after r12878 on 12/17/2006, some minor changes to allow btl to compile This commit was SVN r12963. The following SVN revision numbers were found above: r12878 --> open-mpi/ompi@190e7a27cd	2007-01-02 21:44:12 +00:00
George Bosilca	d8dee3a740	If the MX driver was unable to load correctly, or if the endpoint was not created then don't try to call the MX endpoint close function. This commit was SVN r12950.	2007-01-02 00:01:50 +00:00
Rich Graham	6cb2377015	Change the allocation of the shared memory backing file. The file is allocated on a per comm_world instance, with the lowest rank in comm_world on the given host creating and initializing the file, and then notifying the remaining files via the OOB. Reviewed: Ralph Castain, Brian Barrett Addressing ticket #674. This commit was SVN r12949.	2007-01-01 02:39:02 +00:00
George Bosilca	e223b27268	A fragment is marked completed by the PML when the peer signal the completion of the RDMA operation associated with the fragment. The PML will call the BML free which in turn will call the BTL free. The MX BTL will not release the fragment if it not tagged with 0xff. This commit was SVN r12947.	2006-12-31 03:17:47 +00:00
George Bosilca	47601e315e	Allow the MX BTL to select at runtime if the unexpected handler will be activated or not. This commit was SVN r12944.	2006-12-30 20:57:50 +00:00
Brian Barrett	99c0a29602	Disable CM and DR PMLs in heterogeneous situtations as neither are heterogeneous safe. Refs trac:587 This commit was SVN r12942. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-30 16:17:56 +00:00
George Bosilca	d401a65975	Minor cleanups. Don't set the fields that will never be used. This commit was SVN r12941.	2006-12-29 07:55:17 +00:00
George Bosilca	0b5d879a63	ompi_convertor_pack do not return errors (all checkings are done when the convertor is created). This commit was SVN r12940.	2006-12-29 07:40:02 +00:00
George Bosilca	d8db9e49f3	Set the bml_btl to NULL or segfault !!! This commit was SVN r12939.	2006-12-29 07:38:24 +00:00
Brian Barrett	c010119667	If a BTL isn't needed due to exclusivity ranking, need to call a matching inuse decrement for the increment that was at the start of the procs loop. Otherwise, the inuse count can end up higher than it actually is and a btl can end up in the progress loop when it isn't active to any peer. Refs trac:543 This commit was SVN r12938. The following Trac tickets were found above: Ticket 543 --> https://svn.open-mpi.org/trac/ompi/ticket/543	2006-12-29 02:22:40 +00:00
George Bosilca	416e5b5f6a	Enable the MX extensions if and only if the mx_extensions.h header is installed on the system. This commit was SVN r12937.	2006-12-29 00:31:32 +00:00
George Bosilca	d7bc180a90	The max allocated tag is not 16. Use the define instead. This commit was SVN r12936.	2006-12-28 22:48:58 +00:00
George Bosilca	3eeecc3838	Add support for faster small messages. While sending a message, we check if the data was buffered by the MX library. If it's the case then we declare the send as completed and disable the completion event for the mx request. This commit was SVN r12935.	2006-12-28 22:34:24 +00:00
George Bosilca	b996c00d1a	Set the limits for the MX fragments to 4K. Add code to dump the state of the MX hardware (not activated). This commit was SVN r12931.	2006-12-28 08:40:37 +00:00
George Bosilca	3903009b8b	Add a check for the unexpected handler. If enabled, allow the zero-copy protocol over the MX BTL. Now, we have only one matching, the one in Open MPI. The problem is that when the unexpected handler is triggered, not all the message is on the host memory. In the best case we get one MX fragment (internal MX fragment), in the worst we get NULL. The only way to fit this with the design of the PML is to force the eager protocol at the MX internal fragment size, and to limit the send/receive protocol at the same size. Tests show the outcome is not far from optimal (if the pipeline depth is increased a little bit). Set MX_PIPELINE_LOG in order to allow MX to use internal fragments of 4K. This commit was SVN r12930.	2006-12-28 03:35:41 +00:00
George Bosilca	ff2319dcb7	Complete the OUT protocol. Small latency improvements. Some minor cleanups. Create some macros, reorder some functions. Make sure all fragments are correctly released at the end. This commit was SVN r12926.	2006-12-26 18:15:24 +00:00
George Bosilca	75a35ed7ee	Implement the PUT protocol over MX. The send/receive approach give the best performance on a 2G Myrinet card, as it look like pipelining the messages by 1M is faster than a simple send/receive. However, when using a 10G card the send/receive will limit the maximum bandwidth to 2.5Gbs. The reason is the scarce bus resources that have to be shared between the Myrinet hardware and the memcpy operation. The PUT protocol remove the memcpy, we now have a true zero-copy mechanism. But, there is no pipelining yet as it look like the RDMA pipeline somehow disappeared from the OB1 PML ... This commit was SVN r12925.	2006-12-24 22:52:46 +00:00
George Bosilca	e8bd985870	Add more output when calls to the MX library fails. Move the connection status from theproc into the endpoint. This commit was SVN r12924.	2006-12-24 22:34:48 +00:00
George Bosilca	14dc72f595	Allow the user to change the MX flags. This commit was SVN r12923.	2006-12-24 22:21:00 +00:00
George Bosilca	dbe2798638	Allow MX to handle shared memory and self communications. By default these features are disabled (btl_mx_shared_mem respectively btl_mx_self have to be set in order to activate them). This commit was SVN r12922.	2006-12-24 22:18:41 +00:00
Jelena Pjesivac-Grbovic	3494e1bb05	- Updated decision function for Alltoall collective. Fixes "jump" for intermediate sizes message on 24+ number of nodes (at least on Grig cluster). This commit was SVN r12920.	2006-12-22 19:59:17 +00:00
George Bosilca	b1725e02d4	No more warnings plus some code reordering. This commit was SVN r12919.	2006-12-21 22:42:15 +00:00
Jelena Pjesivac-Grbovic	f1aec23507	Adding tuned allgather implementation. It contains four algorithms: Bruck (ciel(logP) steps), Recursive Doubling (log(P) for power-of-2 processes), Ring (P-1 steps), and Neighbor Exchange (P/2 steps for even number of processes). All algorithms passed occ, IMB-2.3, and intel verification tests from ompi-tests/ for up to 56 processes. The fixed decision function is based on results collected over MX on the Grig cluster at the University of Tennessee at Knoxville. I have also added (and commented out) copy of MPICH2 decision function for allgather (from their IJHPCA 2005 paper). This commit was SVN r12910.	2006-12-21 18:40:02 +00:00
Brian Barrett	7880353fcc	Need to close every endpoint we open, or the MX progress thread doesn't die, which can cause segfaults on shutdown. Calling mx_finalize() isn't enough to shutdown the thread, so must close endpoints as well. Refs trac:513 This commit was SVN r12908. The following Trac tickets were found above: Ticket 513 --> https://svn.open-mpi.org/trac/ompi/ticket/513	2006-12-21 18:13:22 +00:00
Gleb Natapov	484c6a2c1a	Use OPAL_ALIGN() macro to align length. Return address from mpool_alloc is now properly aligned so no need to align it once more. This commit was SVN r12899.	2006-12-19 08:34:48 +00:00
Brian Barrett	2ab65eb521	Remove some debugging output that was #if 0'ed out but shouldn't have been committed into the trunk anyway This commit was SVN r12897.	2006-12-19 02:34:41 +00:00

1 2 3 4 5 ...

1448 Коммитов