openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	33e7cc864c	The ops should be indexed based on the MPI datatype index (which is actually the same as the Fortran value of the MPI type). This commit was SVN r21689.	2009-07-15 21:30:09 +00:00
Rainer Keller	6c5532072a	- Split the datatype engine into two parts: an MPI specific part in OMPI and a language agnostic part in OPAL. The convertor is completely moved into OPAL. This offers several benefits as described in RFC http://www.open-mpi.org/community/lists/devel/2009/07/6387.php namely: - Fewer basic types (int* and float* types, boolean and wchar - Fixing naming scheme to ompi-nomenclature. - Usability outside of the ompi-layer. - Due to the fixed nature of simple opal types, their information is completely known at compile time and therefore constified - With fewer datatypes (22), the actual sizes of bit-field types may be reduced from 64 to 32 bits, allowing reorganizing the opal_datatype structure, eliminating holes and keeping data required in convertor (upon send/recv) in one cacheline... This has implications to the convertor-datastructure and other parts of the code. - Several performance tests have been run, the netpipe latency does not change with this patch on Linux/x86-64 on the smoky cluster. - Extensive tests have been done to verify correctness (no new regressions) using: 1. mpi_test_suite on linux/x86-64 using clean ompi-trunk and ompi-ddt: a. running both trunk and ompi-ddt resulted in no differences (except for MPI_SHORT_INT and MPI_TYPE_MIX_LB_UB do now run correctly). b. with --enable-memchecker and running under valgrind (one buglet when run with static found in test-suite, commited) 2. ibm testsuite on linux/x86-64 using clean ompi-trunk and ompi-ddt: all passed (except for the dynamic/ tests failed!! as trunk/MTT) 3. compilation and usage of HDF5 tests on Jaguar using PGI and PathScale compilers. 4. compilation and usage on Scicortex. - Please note, that for the heterogeneous case, (-m32 compiled binaries/ompi), neither ompi-trunk, nor ompi-ddt branch would successfully launch. This commit was SVN r21641.	2009-07-13 04:56:31 +00:00
Terry Dontje	0178b6c45f	Added padding to predefined handle structures to maintain library version to version compatibility. This commit was SVN r20627.	2009-02-24 17:17:33 +00:00
Jeff Squyres	bcdd3ddbde	Ensure to zero out all the pointers in the op so that the destructor knows what it can and cannot free (these pointers are largely unused and therefore otherwise uninitialized in user-defined op's and MPI_REPLACE). This commit was SVN r20532.	2009-02-12 19:15:37 +00:00
George Bosilca	0dab6eb93d	Release the memory on finalize. This commit was SVN r20521.	2009-02-11 20:58:41 +00:00
Jeff Squyres	4d8a187450	Two major things in this commit: * New "op" MPI layer framework * Addition of the MPI_REDUCE_LOCAL proposed function (for MPI-2.2) = Op framework = Add new "op" framework in the ompi layer. This framework replaces the hard-coded MPI_Op back-end functions for (MPI_Op, MPI_Datatype) tuples for pre-defined MPI_Ops, allowing components and modules to provide the back-end functions. The intent is that components can be written to take advantage of hardware acceleration (GPU, FPGA, specialized CPU instructions, etc.). Similar to other frameworks, components are intended to be able to discover at run-time if they can be used, and if so, elect themselves to be selected (or disqualify themselves from selection if they cannot run). If specialized hardware is not available, there is a default set of functions that will automatically be used. This framework is ''not'' used for user-defined MPI_Ops. The new op framework is similar to the existing coll framework, in that the final set of function pointers that are used on any given intrinsic MPI_Op can be a mixed bag of function pointers, potentially coming from multiple different op modules. This allows for hardware that only supports some of the operations, not all of them (e.g., a GPU that only supports single-precision operations). All the hard-coded back-end MPI_Op functions for (MPI_Op, MPI_Datatype) tuples still exist, but unlike coll, they're in the framework base (vs. being in a separate "basic" component) and are automatically used if no component is found at runtime that provides a module with the necessary function pointers. There is an "example" op component that will hopefully be useful to those writing meaningful op components. It is currently .ompi_ignore'd so that it doesn't impinge on other developers (it's somewhat chatty in terms of opal_output() so that you can tell when its functions have been invoked). See the README file in the example op component directory. Developers of new op components are encouraged to look at the following wiki pages: https://svn.open-mpi.org/trac/ompi/wiki/devel/Autogen https://svn.open-mpi.org/trac/ompi/wiki/devel/CreateComponent https://svn.open-mpi.org/trac/ompi/wiki/devel/CreateFramework = MPI_REDUCE_LOCAL = Part of the MPI-2.2 proposal listed here: https://svn.mpi-forum.org/trac/mpi-forum-web/ticket/24 is to add a new function named MPI_REDUCE_LOCAL. It is very easy to implement, so I added it (also because it makes testing the op framework pretty easy -- you can do it in serial rather than via parallel reductions). There's even a man page! This commit was SVN r20280.	2009-01-14 23:44:31 +00:00
Jeff Squyres	a7586bdd90	Cosmetic changes: * Update to 4 space tabs where relevant (and some irrelevant white space changes) * Move a few constants to the left of !=/== * Add a few {}'s are one line blocks * Use BEGIN/END_C_DECLS * Change /< to / in a few places This commit was SVN r20177.	2008-12-31 14:50:54 +00:00
Jeff Squyres	4f028171a2	Refs trac:1603: * Add OMPI_F77_CHECK_REAL16_C_EQUV test whether REAL16 is bit equivalent to long double. AC_DEFINE OMPI_REAL16_MATCHES_C with result (0 or 1). Update ompi_info to only show real16 support if OMPI_REAL16_MATCHES_C is 1. * Update DDT to only support REAL16 and COMPLEX32 if 1==OMPI_REAL16_MATCHES_C. * MPI Op function pointer tabls will have NULL for the REAL16 and COMPLEX32 entries if 0==OMPI_REAL16_MATCHES_C. * Slightly cleaned up OMPI_F77_GET_ALIGNMENT and OMPI_F77_CHECK m4 tests (use OMPI_VAR_SCOPE_PUSH/POP). This commit was SVN r19948. The following Trac tickets were found above: Ticket 1603 --> https://svn.open-mpi.org/trac/ompi/ticket/1603	2008-11-07 20:37:21 +00:00
Jeff Squyres	b50a4f126a	Gahh! Today is the day for accidental commits. Back out the op portion of r19826. This commit was SVN r19827. The following SVN revision numbers were found above: r19826 --> open-mpi/ompi@6b6c08ef67	2008-10-28 18:56:06 +00:00
Jeff Squyres	6b6c08ef67	Fixes trac:1588: have several BTLs disable themselves in the presence of THREAD_MULTIPLE. There's a new (hidden) MCA parameter to re-enable these BTLs in the presence of THREAD_MULTIPLE: btl_base_thread_multiple_override. This MCA parameter should ''only'' be used by developers who are working on make their BTLs thread safe; it should ''not'' be used by end-users! This commit was SVN r19826. The following Trac tickets were found above: Ticket 1588 --> https://svn.open-mpi.org/trac/ompi/ticket/1588	2008-10-28 18:29:57 +00:00
Shiqing Fan	b82f062f24	- double declaration of extern "C" make MS compiler complain. Change them to *_C_DECLS. This commit was SVN r19432.	2008-08-27 15:49:40 +00:00
Rich Graham	afd71abde6	remove some useless qualifiers. This commit was SVN r18469.	2008-05-21 01:11:49 +00:00
Rich Graham	3b42d2268d	add functions to handle two different input buffers and a separate output buffer. User defined data types have not way to make use of these. This commit was SVN r18012.	2008-03-28 23:45:44 +00:00
Jeff Squyres	4fbcb75ce8	With 5 commits over a 16 hour period and 3 broken tarball builds and a still-broken trunk build on common platforms (e.g., 64 bit Linux RHEL4U4), I think it's clear that this code is not ready for prime-time. I'm backing out all the commits in the trunk/ompi/op tree from r17901 onwards. This code can be re-committed when compiles and runs on common platforms. cd ompi/op svn merge -r 17907:17900 https://svn.open-mpi.org/svn/ompi/trunk/ompi/op . This commit was SVN r17908. The following SVN revision numbers were found above: r17901 --> open-mpi/ompi@b9520e61dc	2008-03-21 14:47:01 +00:00
Jeff Squyres	8284f64af1	With r17906, this commit should make the trunk compile again. This commit was SVN r17907. The following SVN revision numbers were found above: r17906 --> open-mpi/ompi@df4a6c3fc5	2008-03-21 13:49:23 +00:00
Rich Graham	df4a6c3fc5	fix function prototypes for new 3 buffer routines. This commit was SVN r17906.	2008-03-21 13:44:15 +00:00
Rich Graham	0974160e29	correct several of the new macros. This commit was SVN r17904.	2008-03-21 03:45:43 +00:00
Rich Graham	a7c836a2b0	fix location of the restrict key word. Make the tag in the fan-in/fan-out algorithm be fragment based. This commit was SVN r17903.	2008-03-21 01:40:36 +00:00
Rich Graham	b9520e61dc	get the sm optimized allreduce working for all but user defined operations. Added to the reduction operations a set of reduction functions that take 2 input buffers and one output buffer to avoid some extra memory copies. These can't be used with user defined operations. The intel c collective suite passes both original, and new (new, not the user defined operations). This commit was SVN r17901.	2008-03-20 23:51:16 +00:00
Galen Shipman	b3b3c98c89	missing include file This commit was SVN r17484.	2008-02-17 19:38:20 +00:00
George Bosilca	906e8bf1d1	Replace the ompi_pointer_array with opal_pointer_array. The next step (sometimes after the merge with the ORTE branch), the opal_pointer_array will became the only pointer_array implementation (the orte_pointer_array will be removed). This commit was SVN r17007.	2007-12-21 06:02:00 +00:00
Rainer Keller	d3372729bb	- Support for opt. MPI_REAL2 (who has that?) to make checks for MPI-implementations fail in the right way ,-] - check in configure.ac - BINARY INCOMPATIBLE change to mpif-common.h (if implemented the right way) Actually OMPI_F90_CHECK takes two arguments, not three. - Only have corresponding C-Type, if the opt. Fortran type is really supported, Otherwise pass ompi_mpi_unavailable to DECLARE_MPI_SYNONYM_DDT; - Reviewed by George and Jeff This commit was SVN r15133.	2007-06-19 05:03:11 +00:00
Rainer Keller	1feb5fb21a	- Initializaton fixes of structure (o_f_to_c_index)... - Mainly indentation, except for ompi_op_create, here just dont nest into ifs... This commit was SVN r15131.	2007-06-18 23:03:56 +00:00
Rainer Keller	4a462eed3d	- Should make declare only, when we do have long double. This commit was SVN r15130.	2007-06-18 22:59:21 +00:00
Sven Stork	037b01ce9e	- more symbols that need to be exported This commit was SVN r14415.	2007-04-18 14:53:56 +00:00
Jeff Squyres	3c5c8c3c4c	Refinement of Rainer's r13227 and r13228 (worked with Rainer, Ralph, and George on these refinements): * Rename the static OBJ initializer macro to be OPAL_OBJ_STATIC_INIT(class) * Ensure that all static OBJ initializations get a refcount of 1 (doesn't ''really'' matter, since they're static, it should never get to the point where the OBJ is DESTRUCTed, but more correct nonetheless) * Add a "magic number" to the OBJ when compiling with debug support. The magic number does some rudimentary support to ensure that you're operating on a valid OBJ (and fails an assertion if you're not). Check to ensure that the memory contains the magic number when performing actions of OBJ's. Also remove the magic number when DESTRUCTing OBJs, so that if, for example, an OBJ is DESTRUCTed more than once, we'll fail the magic number assert. This commit was SVN r13338. The following SVN revision numbers were found above: r13227 --> open-mpi/ompi@96030de97b r13228 --> open-mpi/ompi@c2e9075d29	2007-01-27 13:44:03 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Jeff Squyres	3c265958ba	@#$%@#%#% Fix one more typo that was missed last night. This commit was SVN r10038.	2006-05-24 10:30:08 +00:00
Jeff Squyres	8c0ebb4897	Drat -- forgot the copyright. This commit was SVN r10025.	2006-05-23 18:42:11 +00:00
Jeff Squyres	dc9a16581e	Unbelieveable how this lived so long. Thanks to Bert Wesarg for reporting this. This commit was SVN r10023.	2006-05-23 18:00:44 +00:00
George Bosilca	a297a7ae67	MPI standard state that MPI_LONG_LONG and MPI_LONG_LONG_INT are synonyms. Thanks to Martin audet for finding out this one. This commit was SVN r9699.	2006-04-24 21:24:10 +00:00
Rainer Keller	b4e7f38360	- Well, well, the OMPI_OP_TYPE_CHAR was not supposed to stay in the enum. Actually, the current ordering of the enum is the nice however, at the moment for 1.0, signed_char is not supported. This commit was SVN r9246.	2006-03-10 16:02:45 +00:00
Rainer Keller	0fa295dc28	- Allow MPI_UNSIGNED_CHAR and MPI_SIGNED_CHAR for Reduction operations as described by MPI2, p77. This commit was SVN r9229.	2006-03-09 16:51:59 +00:00
Jeff Squyres	a192af34e7	Shame on me for not looking at the expanded collective datatype tables for the C++ bindings in MPI-2 p276-278 to see that MPI_BOOL should work with MPI_LAND, MPI_LOR, and MPI_LXOR. Thanks to Andy Selle for pointing this out. This commit was SVN r9200.	2006-03-04 18:35:33 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
George Bosilca	e20265bd2b	Dont let any external to the data-type code check directly for the predefined data-types. Instead, use the newly provided data-type function ompi_ddt_is_predefined.. This commit was SVN r8903.	2006-02-06 18:01:45 +00:00
Jeff Squyres	5f96a74e33	Make user-defined MPI::Op's be thread safe (the previous implementation was not thread safe). See lengthy comment in ompi/mpi/cxx/intercepts.cc::ompi_mpi_cxx_op_intercept() for a full explanation. This commit was SVN r8606.	2005-12-23 16:49:09 +00:00
George Bosilca	6fb4ce5e2e	Some dependencies cleanups (there were on hold for a while). This commit was SVN r8425.	2005-12-09 05:14:18 +00:00
Jeff Squyres	60b19dcf63	Add missing functions for MPI_LONG_LONG, MPI_LONG_LONG_INT, and MPI_UNSIGNED_LONG_LONG. This commit was SVN r8010.	2005-11-07 14:42:46 +00:00
Jeff Squyres	21be5e18ee	- Fix the MPI_Op intrinsic operation string names ("MPI_foo", not "MPI_OP_foo") - Remove all the handlers for MPI_REPLACE for general reductions (it's only defined for MPI_ACCUMULATE, and ACCUMULATE is handled differently than the other reductions, so it's safe to make all the maps for REPLACE be empty) This commit was SVN r8008.	2005-11-07 13:30:17 +00:00
Jeff Squyres	42ec26e640	Update the copyright notices for IU and UTK. This commit was SVN r7999.	2005-11-05 19:57:48 +00:00
Jeff Squyres	2e10d0c099	Forgot to add the intrinsic op MPI_REPLACE (Brian, the One Sided Bug Finder, gets credit) This commit was SVN r7993.	2005-11-04 19:00:35 +00:00
Jeff Squyres	4fc135fd2b	Looks like I forgot to put DDT support for the optional C datatypes MPI_UNSIGNED_LONG_LONG, MPI_LONG_LONG, and MPI_LONG_LONG_INT -- although I already had implementations of all the relevant functions for these types. Doh! This commit was SVN r7944.	2005-11-01 03:28:59 +00:00
Jeff Squyres	7bdfe6557b	- Update the checks in REDUCE, ALLREDUCE, SCAN, EXSCAN, and REDUCE_SCATTER to more thoroughly check the datatype/op combination to see if it's valid or not. If it's not, print a meaningful error message rather than "Invalid MPI_Op" indicating what specifically was wrong (therefore hopefully helping users track down where in the code the problem is, and/or telling us that there's a reduction operation combo that we don't support that we should) - The check for whether a datatype is intrinsic needed to be updated -- it's not sufficient to check that dtype->id < DT_MAX_PREDEFINED; you really need to check the PREDEFINED flag on the datatype. Thanks to George for this fix (only intrinsics have a meaningful value in dtype->id). This commit was SVN r7923.	2005-10-28 16:47:32 +00:00
Jeff Squyres	23ab9e0277	A better solution to the previous commit -- RETAIN/RELEASE the MPI_Op at the top-level MPI API function. This allows two kinds of scenarios: 1. MPI_Ireduce(..., op, ...); MPI_Op_free(op); MPI_Wait(...); For the non-blocking collectives that we're someday planning -- to make them analogous to non-blocking point-to-point stuff. 2. Thread 1: MPI_Reduce(..., op, ...); Thread 2: MPI_Op_free(op); Granted, for #2 to occur would tread a fine line between a correct and erroneous MPI program, but it is possible (as long as the Op_free was after MPI_reduce() had started to execute). It's more realistic with case #1, where the Op_free() could be executed in the same thread or a different thread. This commit was SVN r7870.	2005-10-25 19:20:42 +00:00
Jeff Squyres	ef09e768e0	Ensure to OBJ_DESTRUCT to free memory during finalize (caught by Brian). This commit was SVN r7864.	2005-10-25 17:27:58 +00:00
Jeff Squyres	f8fd10715c	- Minor style fix - Be sure to properly OBJ_CONSTRUCT the intrinsic MPI_Op's - RETAIN/RELEASE the op's when used in the invoke function This commit was SVN r7863.	2005-10-25 16:24:00 +00:00
Brian Barrett	1e2f7d6a3d	* make sure to expose ompi_op_t as an object This commit was SVN r7848.	2005-10-24 20:31:14 +00:00
Brian Barrett	1302cb4072	The next in a long line of crazed build system changes from Brian. This was originally suggested by Ralf Wildenhues, to try to speed autogen, configure, and make (and possibly even make install). Use automake's include directive to drastically reduce the number of Makefile files (although the number of Makefile.am files is the same - most are just included in a top-level Makefile.am). Also use an Automake SUBDIRs feature to eliminate the dynamic-mca tree, which was no longer really needed. This makes adding a framework easier (since you don't have to remember the dynamic-mca tree) and makes building faster (as make doesn't have to recurse through the dynamic-mca tree) This commit was SVN r7777.	2005-10-17 00:21:10 +00:00
Jeff Squyres	727a2cf8b2	Correct a few #if issues that George identified in a code review This commit was SVN r7724.	2005-10-12 13:19:46 +00:00

1 2

57 Коммитов