openmpi

Автор	SHA1	Сообщение	Дата
Abhishek Kulkarni	afbe3e99c6	* Wrap all the direct error-code checks of the form (OMPI_ERR_* == ret) with (OMPI_ERR_* = OPAL_SOS_GET_ERR_CODE(ret)), since the return value could be a SOS-encoded error. The OPAL_SOS_GET_ERR_CODE() takes in a SOS error and returns back the native error code. * Since OPAL_SUCCESS is preserved by SOS, also change all calls of the form (OPAL_ERROR == ret) to (OPAL_SUCCESS != ret). We thus avoid having to decode 'ret' to get the native error code. This commit was SVN r23162.	2010-05-17 23:08:56 +00:00
Rainer Keller	6c5532072a	- Split the datatype engine into two parts: an MPI specific part in OMPI and a language agnostic part in OPAL. The convertor is completely moved into OPAL. This offers several benefits as described in RFC http://www.open-mpi.org/community/lists/devel/2009/07/6387.php namely: - Fewer basic types (int* and float* types, boolean and wchar - Fixing naming scheme to ompi-nomenclature. - Usability outside of the ompi-layer. - Due to the fixed nature of simple opal types, their information is completely known at compile time and therefore constified - With fewer datatypes (22), the actual sizes of bit-field types may be reduced from 64 to 32 bits, allowing reorganizing the opal_datatype structure, eliminating holes and keeping data required in convertor (upon send/recv) in one cacheline... This has implications to the convertor-datastructure and other parts of the code. - Several performance tests have been run, the netpipe latency does not change with this patch on Linux/x86-64 on the smoky cluster. - Extensive tests have been done to verify correctness (no new regressions) using: 1. mpi_test_suite on linux/x86-64 using clean ompi-trunk and ompi-ddt: a. running both trunk and ompi-ddt resulted in no differences (except for MPI_SHORT_INT and MPI_TYPE_MIX_LB_UB do now run correctly). b. with --enable-memchecker and running under valgrind (one buglet when run with static found in test-suite, commited) 2. ibm testsuite on linux/x86-64 using clean ompi-trunk and ompi-ddt: all passed (except for the dynamic/ tests failed!! as trunk/MTT) 3. compilation and usage of HDF5 tests on Jaguar using PGI and PathScale compilers. 4. compilation and usage on Scicortex. - Please note, that for the heterogeneous case, (-m32 compiled binaries/ompi), neither ompi-trunk, nor ompi-ddt branch would successfully launch. This commit was SVN r21641.	2009-07-13 04:56:31 +00:00
Rainer Keller	221fb9dbca	... Delayed due to notifier commits earlier this day ... - Delete unnecessary header files using contrib/check_unnecessary_headers.sh after applying patches, that include headers, being "lost" due to inclusion in one of the now deleted headers... In total 817 files are touched. In ompi/mpi/c/ header files are moved up into the actual c-file, where necessary (these are the only additional #include), otherwise it is only deletions of #include (apart from the above additions required due to notifier...) - To get different MCAs (OpenIB, TM, ALPS), an earlier version was successfully compiled (yesterday) on: Linux locally using intel-11, gcc-4.3.2 and gcc-SVN + warnings enabled Smoky cluster (x86-64 running Linux) using PGI-8.0.2 + warnings enabled Lens cluster (x86-64 running Linux) using Pathscale-3.2 + warnings enabled This commit was SVN r21096.	2009-04-29 01:32:14 +00:00
Ralph Castain	d70e2e8c2b	Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer This commit was SVN r17632.	2008-02-28 01:57:57 +00:00
Gleb Natapov	e2e211f23b	Add flags parameter to btl_alloc() and btl_prepare_src() functions. If BTL knows at the time of allocation priority of a descriptor it may do some optimizations. This commit was SVN r16901.	2007-12-09 14:08:01 +00:00
George Bosilca	433f8a7694	This patch bring full support for message queues in Open MPI. Now the send and receive queues are shared among all PMLs, they are declared in the base PML, and the selected PML is in charge of initializing and releasing them. The CM PML is slightly different compared with OB1 or DR. Internally it use 2 different types of requests: light and heavy. However, now with this patch both types of requests are stored in the same queue, and cast appropriately on the allocation macro. This means we might use less memory than we allocate, but in exchange we got full support for most of the parallel debuggers. Another thing with this patch, is that now for all PML (CM included) the basic PML requests start with the same fields, and they are declared in the same order in the request structure. Moreover, the fields have been moved in such a way that only one volatile/atomic will exist per line of cache (hopefully). This commit was SVN r15346.	2007-07-10 22:16:38 +00:00
Galen Shipman	3401bd2b07	Add optional ordering to the BTL interface. This is required to tighten up the BTL semantics. Ordering is not guaranteed, but, if the BTL returns a order tag in a descriptor (other than MCA_BTL_NO_ORDER) then we may request another descriptor that will obey ordering w.r.t. to the other descriptor. This will allow sane behavior for RDMA networks, where local completion of an RDMA operation on the active side does not imply remote completion on the passive side. If we send a FIN message after local completion and the FIN is not ordered w.r.t. the RDMA operation then badness may occur as the passive side may now try to deregister the memory and the RDMA operation may still be pending on the passive side. Note that this has no impact on networks that don't suffer from this limitation as the ORDER tag can simply always be specified as MCA_BTL_NO_ORDER. This commit was SVN r14768.	2007-05-24 19:51:26 +00:00
Pavel Shamis	2483cefc57	Additional check if descriptor is NULL. It prevents mca_pml_dr_sendreq_cleanup_active failure on segfault. This commit was SVN r13647.	2007-02-14 10:43:43 +00:00
George Bosilca	22eca30b45	One less compiler warning. This commit was SVN r13633.	2007-02-13 09:32:57 +00:00
George Bosilca	79ea6d471b	Even less warnings. This commit was SVN r13429.	2007-02-01 19:27:11 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Rainer Keller	6f8f28f40f	- Get rid of inline definition, otherwise static-compilation fails. This commit was SVN r12735.	2006-12-03 14:52:17 +00:00
George Bosilca	eab1776e9a	Explicit casts for our friendly Windows environment... This commit was SVN r12496.	2006-11-08 17:02:46 +00:00
Galen Shipman	f7c554df65	Try to failover when we get an async error from the lower layer (BTL).. This commit was SVN r12420.	2006-11-03 15:40:26 +00:00
George Bosilca	126a68dc9a	Big datatype commit. Remove all unused features of the datatype engine. As the memory allocation logic is completely done outside the data-type engine (in the PML) there is no need for any special case inside the data-type engine. There is less arguments for the ompi_convertor_pack and ompi_convertor_unpack as well (the last field free_after is not required anymore as there is no memory allocated in the engine itself). This change affect all components using datatypes. I test most of them, but it might happens that I miss some ... If it's the case please let me know (don't shoot the pianist!!). This commit was SVN r12331.	2006-10-26 23:11:26 +00:00
Andrew Friedley	836261b85a	Fixes ticket 186. First, move the OPAL_THREAD_LOCK out to the same level as its corresponding UNLOCK. It was possible to hit the UNLOCK without ever acquiring the lock. Since the OPAL_THREAD_ADD64() is now protected by this lock, we can just do the decrement non-atomically. This commit was SVN r11958.	2006-10-03 18:15:26 +00:00
George Bosilca	688a16ea78	A long time waiting patch. Get rid of the comm->c_pml_procs. It was (and that was long ago) supposed to be used as a cache for accessing the PML procs. But in all of the PMLs the PML proc contain only one field i.e. a pointer to the ompi_proc. This pointer can be accessed using the c_remote_group easily. Therefore, there is no meaning of keeping the PML procs around. Slim fast commit ... This commit was SVN r11730.	2006-09-20 22:14:46 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Galen Shipman	7473d04a9a	Simple failover is working.. ;-) This commit was SVN r11237.	2006-08-16 22:32:18 +00:00
Galen Shipman	3b49953ce2	Add error callback to the btl interface, this allows error to be delivered to the upperlayer assynchronously although there are some issues with this.. such as there are multiple consumers of the btl's.. who get's the This commit was SVN r11232.	2006-08-16 20:21:38 +00:00
George Bosilca	476c9e64df	Don't keep multiples copies of the datatype and count. The only one we really need is the one provided by the user. For the buffered send the real datatype used for the communication is always MPI_BYTE and the count can be retrieved from the req_bytes_packed field. This will decrease the size of the request by one pointer and one size_t (8 bytes or 16 bytes depending on the architecture). This commit was SVN r10680.	2006-07-06 17:58:25 +00:00
Brian Barrett	47725c9b02	* Add new PML (CM) and network drivers (MTL) for high speed interconnects that provide matching logic in the library. Currently includes support for MX and some support for Portals * Fix overuse of proc_pml pointer on the ompi_proc structuer, splitting into proc_pml for pml data and proc_bml for the BML endpoint data * bug fixes in bsend init code, which wasn't being used by the OB1 or DR PMLs... This commit was SVN r10642.	2006-07-04 01:20:20 +00:00
Galen Shipman	e6cd8db0e5	DR will now checksum on a per btl basis (see MCA_BTL_FLAGS_NEED_CSUM). We still always send ACK's, teasing apart completion for ACK/no ACK looks like a pain in the .. This commit was SVN r10530.	2006-06-27 20:23:47 +00:00
Galen Shipman	8855e5b73a	Fixes for DR as well as better diagnostic.. Successfully passing the intel test suite with/without induced errors/drops. This commit was SVN r10518.	2006-06-26 22:29:29 +00:00
George Bosilca	3727fa2ae6	Nothing relevant. I add some more output in the case we have a checksum error. Just to be able to know more information about the failure. This commit was SVN r10337.	2006-06-13 19:36:38 +00:00
Brian Barrett	c70fff6ed0	* Fix for bug #44 for the trunk -- remove a bunch of warnings from the DR PML when compiling on Solaris. Patch won't apply cleanly to the v1.1 branch, so a diff for that is coming up soon. This commit was SVN r10173.	2006-06-01 18:58:38 +00:00
Tim Woodall	d8ff8010f3	track wether the vfrag is being retransmitted This commit was SVN r9817.	2006-05-04 17:30:58 +00:00
Tim Woodall	1b26caa95b	first cut at btl failover - seems to be working for simple test case This commit was SVN r9816.	2006-05-04 16:16:26 +00:00
Galen Shipman	ba0aa46220	make csum's optional in pml dr, on by default, see mca param pml_dr_enable_csum This commit was SVN r9608.	2006-04-10 21:54:46 +00:00
Galen Shipman	5271948ec0	--- opal object changes add object size to opal class no longer need the size when allocating a new object as this is stored in the class structure --- dr changes Previous rev. maintained state on the communicator used for acking duplicate fragments, but the communicator may be destroyed prior to successfull delivery of an ack to the peer. We must therefore maintain this state globally on a per peer, not a per peer, per communicator basis. This requires that we use a global rank on the wire and translate this as appropriate to a local rank within the communicator. This commit was SVN r9454.	2006-03-29 16:19:17 +00:00
Tim Woodall	c724e4c804	- removed unused flags - updated copyrights This commit was SVN r9430.	2006-03-27 22:44:26 +00:00
Galen Shipman	1677ca1cd4	continue to debug retransmission of incorrect offset, only occurs on vfrag timeout.. This commit was SVN r9421.	2006-03-24 22:28:43 +00:00
Tim Woodall	2e376e0ee8	misc cleanup This commit was SVN r9410.	2006-03-24 06:49:45 +00:00
Tim Woodall	1aaad721e8	clear state on rndv ack This commit was SVN r9404.	2006-03-23 23:36:07 +00:00
Tim Woodall	0fa49f1297	set requests vfrag id when matched This commit was SVN r9402.	2006-03-23 23:04:20 +00:00
Tim Woodall	996a1b56df	more tweaking This commit was SVN r9399.	2006-03-23 22:08:59 +00:00
Galen Shipman	e01cf0a166	Seperate out sequence tracking list as stand alone class. This commit was SVN r9391.	2006-03-23 17:02:17 +00:00
Tim Woodall	d9dc534c08	fix bogus comment This commit was SVN r9388.	2006-03-23 16:41:37 +00:00
Tim Woodall	28fa260404	for frag case don't use retrans flag, simply retransmit all segments of vfrag that have not been acked This commit was SVN r9387.	2006-03-23 16:36:13 +00:00
Tim Woodall	dc125cf7d5	misc corrections This commit was SVN r9380.	2006-03-23 15:11:06 +00:00
Galen Shipman	70cf1ce562	more work in progress.. This commit was SVN r9369.	2006-03-22 23:06:18 +00:00
Tim Woodall	0f6161c6da	reorg This commit was SVN r9366.	2006-03-22 15:02:36 +00:00
Galen Shipman	bcb23dc762	rework rndv and eager data timeout/retrans This commit was SVN r9358.	2006-03-21 21:23:33 +00:00
Tim Woodall	7a1ad5b6fb	corrections to scheduling logic This commit was SVN r9354.	2006-03-21 14:30:54 +00:00
Tim Woodall	797a6b2887	dont compute checksum over header - data only This commit was SVN r9343.	2006-03-20 23:08:14 +00:00
Galen Shipman	ca13833e95	more dr work This commit was SVN r9340.	2006-03-20 21:57:30 +00:00
Tim Woodall	bd870519fd	- modified convertor copy_and_prepare routines to accept an addition flag, new flags to be included when convertor is initialized - modified pml/btl module defs and added stub functions for diagnostic output routines to dump state of queues / endpoints - updates to data reliability pml This commit was SVN r9329.	2006-03-17 18:46:48 +00:00
Galen Shipman	a465047e97	enable timeouts and retransmissions This commit was SVN r9322.	2006-03-16 22:33:08 +00:00
Galen Shipman	ff75de8c52	more dr work, add destination check on all receives, misc This commit was SVN r9317.	2006-03-16 19:38:21 +00:00

1 2

66 Коммитов