openmpi

Автор	SHA1	Сообщение	Дата
Brian Barrett	a0b40ce45a	Fix race condition in setting MPI_ERROR -- with buffered sends, the request can complete before the operation, meaning that a bogus MPI_ERROR is read This commit was SVN r13401.	2007-01-31 21:40:14 +00:00
Brian Barrett	039a3d8c17	add comment about why there's no status update here, since I always forget This commit was SVN r13400.	2007-01-31 21:39:20 +00:00
Brian Barrett	846eed84f1	When receiving a message, need to account for the fact that the displacement of the first entry might not be the start of the user's buffer. This is similar to what ompi_convertor_unpack does. This is the solution for the test case attached to ticket #690. Refs trac:690 This commit was SVN r13397. The following Trac tickets were found above: Ticket 690 --> https://svn.open-mpi.org/trac/ompi/ticket/690	2007-01-31 18:18:19 +00:00
Brian Barrett	65b07140c0	clean up some of the printf warnings caused by the attribute code This commit was SVN r13395.	2007-01-31 17:11:06 +00:00
Pak Lui	289fbd08de	* Fix the MPI::Datatype::Create_keyval() & MPI::Win::Create_keyval() not being able to take C function pointers for either of the copy or the delete fn. Fix by overloading the Create_keyval methods. Fix trac #737, #738. Reviewed by jsquyres. * A couple of cxx tests in ompi-tests (winkeyval.cc & typekeyval.cc) will be re-enabled to regression test for this fix. This commit was SVN r13391.	2007-01-31 15:00:41 +00:00
George Bosilca	a02d1c7c8d	No more warnings. This commit was SVN r13382.	2007-01-31 04:27:41 +00:00
George Bosilca	8cb4024903	Assert in debug mode before segfault. This commit was SVN r13380.	2007-01-31 04:08:13 +00:00
Brian Barrett	ee753694e0	Print out the memlock limit when we can't allocate memory This commit was SVN r13372.	2007-01-30 21:22:56 +00:00
Rainer Keller	061ba05439	- Fixes uncovered with the format attribute to opal_output and opal_output_verbose This commit was SVN r13371.	2007-01-30 20:56:31 +00:00
Rainer Keller	01ba38661d	- First usage of the some of the attributes This commit was SVN r13370.	2007-01-30 20:54:06 +00:00
Jeff Squyres	86f8c66a27	Turns out that the leave_pinned stuff isn't used in these BTLs at all. So just remove it. This commit was SVN r13360.	2007-01-30 15:39:49 +00:00
Rainer Keller	3669e8921e	- Fix further compiler warnings regarding initialization and shadowing variables. This commit was SVN r13358.	2007-01-30 06:34:38 +00:00
Jeff Squyres	e90b3e415b	* Before this commit, if we called ompi_mpi_abort() before MPI_INIT completed successfully, Bad Things(tm) could happen. * Now we explicitly check orte_initialized (a new global in ORTE indicating whether we are between orte_init() and orte_finalize() or not), and if so, react accordingly. * If ORTE is initialized, use orte_system_info.nodename; otherwise, use gethostname(). * Add loop protection to ensure that ompi_mpi_abort() is not invoked multiple times recursively. This commit was SVN r13354.	2007-01-29 22:01:28 +00:00
Jeff Squyres	a45e8bea05	Fixes trac:830. Put in argument checking for MPI_INIT_THREAD. This commit was SVN r13353. The following Trac tickets were found above: Ticket 830 --> https://svn.open-mpi.org/trac/ompi/ticket/830	2007-01-29 21:56:15 +00:00
Jeff Squyres	0ce78f22fa	Compliments r13351 -- use the new field on the ompi_proc_t struct to know what my local rank is, and therefore set my paffinity ID as appropriate. Specifically, we're no longer relying on the special/secret mpi_paffinity_processor MCA parameter that the orted would set for us. This allows processor affinity to be used in environments where the orted is not used (e.g., bproc, and someday in the hopefully not too-distant future, SLURM). This commit was SVN r13352. The following SVN revision numbers were found above: r13351 --> open-mpi/ompi@a338b7e533	2007-01-29 21:53:04 +00:00
Ralph Castain	a338b7e533	Update the proc system to compute and store the number of local procs and my local relative rank during the callback at STG1. This info will be used to locally compute and set the processor affinity and sched_yield (for oversubscribed conditions). Over to Jeff now for modifying mpi_init accordingly. Until Jeff makes his changes, nobody should see anything different as the new info just isn't used by anything! This commit was SVN r13351.	2007-01-29 20:54:55 +00:00
Jeff Squyres	c9f072b84f	Strike down a few more stray places that were registering mpi_leave_pinned and replace them with the one central global variable. This commit was SVN r13349.	2007-01-29 20:24:31 +00:00
Brian Barrett	93a2f31932	Use a recursive halving communication algorithm similar to the one used by MPICH2 for "small" commutative operations in the reduce_scatter basic implementation. "small" is currently pretty big, as it doesn't take much to beat reduce/scatterv. Need to do much more than this for better all around performance of MPI_Reduce_scatter, but this was enough to solve the problems I was having. This commit was SVN r13348.	2007-01-29 19:29:35 +00:00
Jeff Squyres	61bc9fed3c	Duh. Free the temp array when we're done with it. While we're making this switch, use a clear variable name, too. This commit was SVN r13347.	2007-01-29 19:04:57 +00:00
Jeff Squyres	e6a0069e95	Fix from an as-yet uncommitted test that Pak will commit shortly. Found another places that we were incorrectly casting a C++ MPI handle array to the corresponding C array type and hoping for the best (which won't work at all). This commit fixes things so that we now do the proper conversion between C<-->C++ handles. This commit was SVN r13346.	2007-01-29 18:44:46 +00:00
Rainer Keller	ca35881cd0	- Minor bugfixes and removed compiler warnings This commit was SVN r13343.	2007-01-28 19:52:09 +00:00
Jeff Squyres	3c5c8c3c4c	Refinement of Rainer's r13227 and r13228 (worked with Rainer, Ralph, and George on these refinements): * Rename the static OBJ initializer macro to be OPAL_OBJ_STATIC_INIT(class) * Ensure that all static OBJ initializations get a refcount of 1 (doesn't ''really'' matter, since they're static, it should never get to the point where the OBJ is DESTRUCTed, but more correct nonetheless) * Add a "magic number" to the OBJ when compiling with debug support. The magic number does some rudimentary support to ensure that you're operating on a valid OBJ (and fails an assertion if you're not). Check to ensure that the memory contains the magic number when performing actions of OBJ's. Also remove the magic number when DESTRUCTing OBJs, so that if, for example, an OBJ is DESTRUCTed more than once, we'll fail the magic number assert. This commit was SVN r13338. The following SVN revision numbers were found above: r13227 --> open-mpi/ompi@96030de97b r13228 --> open-mpi/ompi@c2e9075d29	2007-01-27 13:44:03 +00:00
Jelena Pjesivac-Grbovic	33dcb4f810	Minor change to linear alltoall algorithm: - post isends in reverse order of posting irecvs. if the messages arrive approximately in order, this should minimize the time spent in matching the requests. I did not see any performance difference over MX up to 64 nodes, but the change makes sense and may have some impact when we have (many) more nodes. This commit was SVN r13337.	2007-01-26 21:59:31 +00:00
Brian Barrett	385a435813	Start long message send as soon as possible, to minimze ack time for the receive, greatly increasing mid-range bandwidth This commit was SVN r13317.	2007-01-25 23:07:03 +00:00
Rich Graham	1c20feb52b	Take into account constants that in the cray headers are defined different than in the portals spec. This commit was SVN r13311.	2007-01-25 18:32:47 +00:00
Jeff Squyres	7b6ed64c7b	Add in the hostname to the BTL_* output macros so that you can tell on which node an event occurred. This commit was SVN r13302.	2007-01-25 14:02:54 +00:00
George Bosilca	f9a3bbfd7a	Don't miss the ODLS component from the output. This commit was SVN r13299.	2007-01-25 08:37:36 +00:00
Jeff Squyres	b9120aed6d	Always make sure to create $(includedir)/openmpi, even if we were configured with --disable-mpi-cxx so that the default -I flags in the wrapper compilers don't point to a directory that doesn't exist. Thanks to Martin Audet for identifying the problem. This commit was SVN r13296.	2007-01-25 01:59:22 +00:00
Jeff Squyres	6fea000e5f	Oops -- get the right function name (copy-n-paste error). This commit was SVN r13290.	2007-01-24 22:31:13 +00:00
Jeff Squyres	6b69ea664d	Make a much, much better error message for a not-uncommon failure scenario (user/sysadmin forgot to set the memlock limits high enough). This commit was SVN r13289.	2007-01-24 22:25:40 +00:00
Patrick Geoffray	b252cb82c8	oops, ".", not "->", copy error... This commit was SVN r13287.	2007-01-24 19:16:46 +00:00
Patrick Geoffray	d58f6b2451	Free memory in synchronous send case if free_after requires it. Fixes memory leak using synchronous sends and custom data types. This commit was SVN r13286.	2007-01-24 19:10:38 +00:00
George Bosilca	d19a4f4740	Cast it to make cl happy. This commit was SVN r13267.	2007-01-24 00:51:01 +00:00
George Bosilca	790f175d4e	Explicit conversions to make the code Windows friendly. This commit was SVN r13266.	2007-01-24 00:50:24 +00:00
George Bosilca	a4488ff8d2	Add explicit conversions. This commit was SVN r13265.	2007-01-24 00:49:08 +00:00
George Bosilca	6f720f0d26	Add all required explicit conversions in order to be able to build on Windows. This commit was SVN r13264.	2007-01-24 00:48:16 +00:00
Jeff Squyres	c9fe68c406	Better patch from Gleb to do the per-port (endpoint) specification of whether to use eager RDMA or not This commit was SVN r13262.	2007-01-23 22:40:59 +00:00
Jelena Pjesivac-Grbovic	5cbcf42dc3	Removing yet another unsed variable (missed it in previous submit). This commit was SVN r13259.	2007-01-23 21:30:57 +00:00
Jelena Pjesivac-Grbovic	afbd032ff9	Removing compiler warnings about comparison of unsigned values to signed ones, and unused variables. This commit was SVN r13258.	2007-01-23 21:10:07 +00:00
Jelena Pjesivac-Grbovic	568477ade8	Adding new Allreduce algorithms, updating allreduce decision function, and cleaning up util. - Allreduce algorithms: - Recursive doubling is used for small messages (up to 10KB) and can be used for both commutative and non-commutative operations. Recursive doubling passed OCC, IMB-3.2, Intel (Allreduce_c, Allreduce_loc_c, and Allreduce_user_c), mpi_test_suite (Allreduce MIN/MAX, and Allreduce MIN/MAX with MPI_IN_PLACE) tests on TCP up to 36 nodes and MX up to 64 nodes. - Ring algorithms performs well for larger messages but cannot be used for non-commutative operations. It passed the same tests as recursive doubling, except some of the non-commutative tests in Intel benchmarks Allreduce_loc_c and Allreduce_user_c (which was expected). - MPI_Allreduce with new decision function passed all of the tests mentioned above. - Cleaning up coll_tuned_util. Moving isendrecv to static inline just like sendrecv. This commit was SVN r13252.	2007-01-23 01:19:11 +00:00
Jeff Squyres	3389a523e9	Arrgh. That printf should not have been in there! This commit was SVN r13243.	2007-01-22 18:52:49 +00:00
Jeff Squyres	a24f3c0886	Move the "use eager RDMA" flag to the individual openib BTL modules, not the component. This potentially allows for a mix of HCAs that support eager RDMA and those who do not on a port-by-port basis. This commit was SVN r13242.	2007-01-22 18:49:32 +00:00
Jeff Squyres	91b855c2f4	Minor fixes in the help messages * If the text to cite where the problem occurred is "\n", prettyprint somethign a little nicer so that it's clear that we're talking about the end of line * Add a missing help message ("ini file:unknown field"), and display it a little better (i.e., show the erroneous field, not a misleading "end of line" marker) * It's "OpenIB", not "Open IB" This commit was SVN r13241.	2007-01-22 18:45:43 +00:00
George Bosilca	242292673a	sendrecv is a static inline. This commit was SVN r13237.	2007-01-22 05:50:23 +00:00
George Bosilca	c0b22678a7	OMPI_ENABLE_DEBUG is always defined. Remove all warnings when compiled in non debug mode. This commit was SVN r13230.	2007-01-21 18:14:06 +00:00
Rainer Keller	c2e9075d29	- Define a OPAL_CLASS_EMPTY to be used for initialization. Similar within the dt_module for the predefined datatypes. This commit was SVN r13228.	2007-01-21 15:52:06 +00:00
Rainer Keller	96030de97b	- Initialize the size of the opal_object class. - Use the OBJ_CLASS_INSTANCE macro to initialize classes. This also gets rid of several missing initialization errors. This commit was SVN r13227.	2007-01-21 14:24:29 +00:00
Jeff Squyres	52ca6cf86c	The mpi_leave_pinned and mpi_leave_pinned_pipeline MCA parameters were needlessly registered in multiple different places, and none of them had a good help string. There was also an inconsistent check for setting both mpi_leave_pinned and mpi_leave_pinned_pipeline (i.e., it was only in ob1). This commit moves the registration of these params to one central place (ompi/runtime/ompi_mpi_params.c, with all other mpi_* MCA params) and uses globals to propagate the values as relevant. The error check was also moved to the central location to ensure that we can consistency everywhere. This commit was SVN r13226.	2007-01-21 14:02:06 +00:00
Jeff Squyres	32bfbfc735	Correct a filename that would prevent show_help messages from appearing properly. This commit was SVN r13225.	2007-01-21 13:56:16 +00:00
Rainer Keller	3f397f3848	- Do not convert the cart_comm -- its intent is out. This commit was SVN r13224.	2007-01-21 13:00:23 +00:00
Rainer Keller	235f87fd14	- Small cleanups, getting rid of variables, using index early, etc. This commit was SVN r13222.	2007-01-19 23:28:04 +00:00
Rainer Keller	125ba1acfa	- Reduce the amount of warnings with -Wshadow -- mainly due to usage of index and abs in inline-fcts in header files. This commit was SVN r13217.	2007-01-19 19:48:06 +00:00
Jeff Squyres	e934272f3e	Commit data supplied by Christian Bell at QLogic for their vendor and part ID's. This commit was SVN r13216.	2007-01-19 19:46:29 +00:00
Sven Stork	862dcb1a34	- fix compiler warning in ia64 This commit was SVN r13212.	2007-01-19 14:48:47 +00:00
Rolf vandeVaart	6a260e4a9a	Fix two problems. For MPI_Buffer_detach, do not attempt to return the buffer address from Fortran. It is not expected behavior. For MPI_Buffer_attach, adjust the address of the buffer handed in so it is always aligned. Refs trac:750 Buffer detach reviewed by Jeff Squyres Buffer attach alignment reviewed by George Bosilca This commit was SVN r13205. The following Trac tickets were found above: Ticket 750 --> https://svn.open-mpi.org/trac/ompi/ticket/750	2007-01-18 23:32:39 +00:00
Ralph Castain	4ef4cbb5ad	Fix a compiler warning about comparing signed/unsigned values This commit was SVN r13190.	2007-01-18 17:14:06 +00:00
Gleb Natapov	4c7dbd36c7	Balance RDMA operation in round robin fashion between all available RDMA BTLs. OB1 always use first element from array of BTLs available for RDMA. The patch change the array creation algorithm, it puts different BTL in the first element in round robin fashion. This commit was SVN r13174.	2007-01-18 09:15:18 +00:00
Jeff Squyres	53c4bcccd5	Fix a problem noted by Tim Campbell: the type of the first parameter to the F90 binding for MPI_INITIALIZED was wrong (should have been logical, not integer). Fixes trac:782. This commit was SVN r13170. The following Trac tickets were found above: Ticket 782 --> https://svn.open-mpi.org/trac/ompi/ticket/782	2007-01-17 22:44:47 +00:00
Jeff Squyres	a3aae09ca3	Since we've found a few MPI_Fint/MPI_Aint problems in the F77 bindings recently, I went through and took a closer look this afternoon and fixed a bunch more places where this problem occurred. This commit was SVN r13166.	2007-01-17 22:17:34 +00:00
Brian Barrett	860fd63710	lower priority of rdma one-sided component so that pt2pt is preferred for most people, so that it gets more testing This commit was SVN r13163.	2007-01-17 22:01:03 +00:00
Jelena Pjesivac-Grbovic	85192c01b0	Modifying util functionality: - removing static qualification on ompi_coll_tuned_sendrecv - adding ompi_coll_tuned_isendrecv function which posts isend and irecv requests These changes are separate from but necessary for new algorithms I am working on. This commit was SVN r13161.	2007-01-17 21:29:13 +00:00
Brian Barrett	95c0a17b9a	Send the unlock request before starting the requests. We won't unlock until we get an ack from the remote side, so there's no longer a race there (I used to do the unlock request last, after local completion of all the requests completed, to try to avoid having the passive side reply to the active side, but I don't do that anymore). The unlock side will not "unlock" the window until it actually receives the correct number of results, so we're good there. This fixes an issue where we would receive data on the remote side we weren't expecting that could cause us to release a lock before it really should have been released to the requesting peer. It could also cause a deadlock if one of the processes trying to unlock was "self", as that would result in the active unlock never sending the unlock request, even though it sent the payload, which could cause a counter that should always be positive to hit -1, causing an infinite loop that could only be solved by popping up the stack, which was an impossibility. Refs trac:785 This commit was SVN r13160. The following Trac tickets were found above: Ticket 785 --> https://svn.open-mpi.org/trac/ompi/ticket/785	2007-01-17 21:13:12 +00:00
Brian Barrett	c1be97199b	Fix an issue with recursive calls into the component progress caused by btls sometimes calling opal_progress() during their send calls by dropping the loop through the list of pending control messages if any are marked as completed. Refs trac:784 This commit was SVN r13159. The following Trac tickets were found above: Ticket 784 --> https://svn.open-mpi.org/trac/ompi/ticket/784	2007-01-17 20:48:35 +00:00
Jeff Squyres	52e8089600	Fix compiler warning. This commit was SVN r13148.	2007-01-17 14:23:46 +00:00
Jeff Squyres	9073b9cd3c	Ensure to use datatype_index consistently. Only a problem for compilers where the MPI_Fint representation is not the same as int, but still... This commit was SVN r13141.	2007-01-17 12:34:30 +00:00
Brian Barrett	32bdc47816	Fix a number of places where MPI_Fint was used where the standard specifies MPI_Aint. On 64-bit big endian machines, these can have some unpleasent issues. Refs trac:734 This commit was SVN r13140. The following Trac tickets were found above: Ticket 734 --> https://svn.open-mpi.org/trac/ompi/ticket/734	2007-01-17 06:41:53 +00:00
Brian Barrett	35c57457c6	Don't call ompi_request_test() if the request isn't likely to finish. Otherwise, we end up recursively calling into the progress functions and corrupting a list that doesn't like to be corrupted. Refs trac:561 This commit was SVN r13138. The following Trac tickets were found above: Ticket 561 --> https://svn.open-mpi.org/trac/ompi/ticket/561	2007-01-17 02:30:11 +00:00
Jeff Squyres	754042f1fc	Fix a compiler warning. This commit was SVN r13134.	2007-01-16 23:03:17 +00:00
Jeff Squyres	fbeaf0919d	Fixes trac:732. The 2nd parameter in MPI_WIN_CREATE is actually an address integer, not a regular integer. The F77 prototype for this function was wrong, causing Bad Things on some 64 bit platforms (on other 64 bit platforms, we just got lucky). This commit was SVN r13133. The following Trac tickets were found above: Ticket 732 --> https://svn.open-mpi.org/trac/ompi/ticket/732	2007-01-16 22:22:08 +00:00
George Bosilca	3a07982ae7	This file is just a left over from a dark past. This commit was SVN r13132.	2007-01-16 22:02:13 +00:00
Brian Barrett	f03ffb3a62	Send reply from the passive side of an unlock request back to the active side and only let MPI_WIN_UNLOCK return when the passive side has actively replied that the window is unlocked. Refs trac:761 This commit was SVN r13118. The following Trac tickets were found above: Ticket 761 --> https://svn.open-mpi.org/trac/ompi/ticket/761	2007-01-14 22:08:38 +00:00
Brian Barrett	ba8c8ce92c	Improve checking of origin and target datatypes when using MPI_ACCUMULATE to pass some of the tests provided by Sun. These will, of course, greatly slow down calls to MPI_ACCUMULATE, but there's no way to pass the test suite without them :/. Refs trac:760 This commit was SVN r13117. The following Trac tickets were found above: Ticket 760 --> https://svn.open-mpi.org/trac/ompi/ticket/760	2007-01-14 21:42:25 +00:00
Brian Barrett	e93eaa0790	Remote pointers are always in .lval, not .pval, so need to read the .lval and convert it to a pointer when finding the destination addr. Refs trac:587 This commit was SVN r13116. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-14 20:34:41 +00:00
Jeff Squyres	d5404f21a3	Make the trunk openib btl compile again. This commit was SVN r13110.	2007-01-13 14:22:42 +00:00
Galen Shipman	4a6ad30440	remove unused macro calls.. This commit was SVN r13107.	2007-01-12 23:17:17 +00:00
Galen Shipman	2097d174f6	heterogeneous fixes to the OpenIB BTL. This includes work by nysal, brian and I. This commit was SVN r13106.	2007-01-12 23:14:45 +00:00
Galen Shipman	df099a4731	call it what it is... we are looking at subnet_id's and we are counting active ports per subnet. move subnet count out of procs loop,, no need to do it there... This commit was SVN r13105.	2007-01-12 22:42:20 +00:00
Brian Barrett	075161afa9	Enable MX wireup in heterogeneous situations. Refs trac:587 This commit was SVN r13095. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-12 15:44:58 +00:00
Donald Kerr	ed097d17c1	fix for bug #749 , though I can not confirm without a linux compiler This commit was SVN r13090.	2007-01-11 22:25:13 +00:00
Donald Kerr	80f2cbb498	add udapl rdma capabilities into the udapl btl This commit was SVN r13082.	2007-01-11 15:22:08 +00:00
Jeff Squyres	e5205657cf	A much better fix for #739 . No configure test -- just do a simple memcpy() instead of assigning the struct's by value. Fixes trac:739. This commit was SVN r13081. The following Trac tickets were found above: Ticket 739 --> https://svn.open-mpi.org/trac/ompi/ticket/739	2007-01-11 14:30:32 +00:00
Jeff Squyres	add3909096	Back out 13076 and 13077 in favor of a much simpler approach. Sorry for the configure change -- hopefully it's early enough in the morning that it won't affect people... (new approach won't have a configure change). Refs trac:739. This commit was SVN r13080. The following Trac tickets were found above: Ticket 739 --> https://svn.open-mpi.org/trac/ompi/ticket/739	2007-01-11 14:07:15 +00:00
George Bosilca	53d87897c8	Once we start the C requests we have to put back their f_to_c index in the fortran array, as we might get new C requests from the startall function. This commit was SVN r13079.	2007-01-11 08:39:42 +00:00
George Bosilca	ceb5d436d8	Missing an include. This commit was SVN r13078.	2007-01-11 05:45:13 +00:00
George Bosilca	24a91fad1d	OPAL_BOOL_STRUCT_COPY or OMPI_BOOL_STRUCT_COPY that's the question! Let's minimize the disturbances and say that the configure system is right. From now on it's OPAL_BOOL_STRUCT_COPY. This one is related to r13076 and has to follow when r13076 goes in the 1.2. This commit was SVN r13077. The following SVN revision numbers were found above: r13076 --> open-mpi/ompi@f0932a0701	2007-01-11 05:44:48 +00:00
Jeff Squyres	f0932a0701	A workaround for a bug in the PGI 6.2 compiler series. This bug has been fixed in the 7.0 PGI series, but is unlikely to be fixed in the 6.2 series: * Add a configure test looking for the bad behavior (the PGI compiler chokes on C code where structs containing bool's are copied by value) * Set OMPI_BOOL_STRUCT_COPY to 1 if it's ok, 0 if it's not (i.e., PGI 6.2 series will have this value set to 0) * In two places in the code base -- orte-clean and btl_openib_ini.h, we have a struct that contains a bool that is copied by value. In these two places, check OMPI_BOOL_STRUCT_COPY and if it's 1, use the "int" type instead of "bool". Fixes trac:739 This commit was SVN r13076. The following Trac tickets were found above: Ticket 739 --> https://svn.open-mpi.org/trac/ompi/ticket/739	2007-01-11 02:21:26 +00:00
Jelena Pjesivac-Grbovic	d2921a9d42	Cleanup of Barrier implementation: - utilizing coll_tuned_util functions - setting line length to 80. This implementation uses standard send messages (instead of synchronous ones). The change improved our performance over MX multiple number of times, however, there exists a small potential that last message to be sent can be delayed (until next mpi call, which means potentially infinitely). If this shows to be a problem, I will modify the algorithms to use synchronous send as last operation (which will incur performance penalty again). This commit was SVN r13071.	2007-01-10 22:49:43 +00:00
Jelena Pjesivac-Grbovic	ccc3ee0b6b	Minor changes to allgather implementation with some clean-up of util code. - in allgather algorithms I replaces irecv-isend-waitall sequence with call to ompi_coll_tuned_sendrecv - most of the functions in util code and allgather decision function conform to 80 character line width. - This commit was SVN r13069.	2007-01-10 21:56:59 +00:00
Josh Hursey	93208445fd	Make sure we wireup the 'verbose' MCA parameter for the BTL's. This commit was SVN r13067.	2007-01-10 21:24:35 +00:00
Pak Lui	ff1dd4f16d	* fixes ticket #730 for supporting both of the pre-errata argument set and post-errata argument set of MPI::Win::Get_attr This commit was SVN r13063.	2007-01-10 16:41:21 +00:00
Pak Lui	de51464aad	* fix #730 for removing an extra argument from MPI::Win::Get_attr as being corrected in the MPI 2.0 errata. * initialized some variables to make our sensitive sun compiler not to not warn about them when user apps are compiling. This commit was SVN r13058.	2007-01-09 23:48:39 +00:00
Gleb Natapov	624f139bd8	This commit fixes trac:729. Initialize pointer to registration to NULL. Otherwise it may contain garbage and we will try to unregister it later in btl_free(). This commit was SVN r13054. The following Trac tickets were found above: Ticket 729 --> https://svn.open-mpi.org/trac/ompi/ticket/729	2007-01-09 10:29:20 +00:00
Gleb Natapov	d3ac56272a	Prevent access to openib_btl after free(). This commit was SVN r13052.	2007-01-09 09:07:32 +00:00
George Bosilca	87ff2b5ce8	Cast to the correct type. This commit was SVN r13046.	2007-01-08 22:04:01 +00:00
George Bosilca	f419960c7f	All files have to include ompi_config.h before anything else. This commit was SVN r13045.	2007-01-08 22:03:16 +00:00
George Bosilca	53ddbe8446	Nothing relevant. This commit was SVN r13044.	2007-01-08 22:02:17 +00:00
George Bosilca	e72b0c1044	Some compilers (VC) don't like conversions between int and bool. This commit was SVN r13043.	2007-01-08 22:00:49 +00:00
Brian Barrett	e130f18cc2	Fix some compiler warnings that have slipped in lately... This commit was SVN r13037.	2007-01-08 17:20:09 +00:00
Brian Barrett	a34e67d743	Remove unneeded PARAM_INIT_FILE variable in configure.params files used by components that use configure.m4 for configuration or are always built. The macro has not been needed since moving to configure types other than configure.stub Fixes trac:590 This commit was SVN r13031. The following Trac tickets were found above: Ticket 590 --> https://svn.open-mpi.org/trac/ompi/ticket/590	2007-01-08 03:44:22 +00:00
Brian Barrett	b8413fb1d5	Just cast the pointer to a uintptr_t then to the match bits, instead of abusing the ompi_ptr_t interface. Not critical for v1.2, as there are no portals platforms that are big endian, so the code in v1.2 will work well enough for now This commit was SVN r13024.	2007-01-07 03:11:27 +00:00
Brian Barrett	8900d3ae43	Second take at fixing the issues with using ompi_ptr_t. Add helper functions for converting from .pval to .lval and vice-versa. Users of ompi_ptr_t types should only use one of the fields in the union unless using the helper conversion functions. For the BTLs, local pointers will always be stored in the .pval field and remote pointers always stored in the .lval field. George wrote the initial patch, I extended it slightly and am responsible for all bugs found. Refs trac:587 This commit was SVN r13023. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-07 01:48:57 +00:00
Jelena Pjesivac-Grbovic	eae3df4904	Updated broadcast decision function based on MX results up to 64 nodes. (The previous decision function did not consider binomial algorithm (since we did not have it at the time)). This commit was SVN r13007.	2007-01-06 00:37:40 +00:00
Jeff Squyres	1bdf883277	Oops --- don't need those header files. This commit was SVN r13000.	2007-01-05 00:08:16 +00:00
Jeff Squyres	fc3637a7c6	Take out the checks for NULL and STATUS[ES]_IGNORE -- it turns out that a) STATUS[ES]_IGNORE is NULL, and b) ROMIO blindly sends its status through to STATUS_SET_ELEMENTS, even if the status is IGNORE. So we have some legal cases where IGNORE can be passed through here. Well, that's what we get for trying to do good error checking. :-( This commit was SVN r12999.	2007-01-04 23:03:36 +00:00
Jeff Squyres	75df4ca602	Minor fixes for MPI-level aborting: - Fix some fpritnf's in ompi_mpi_abort() that incorrectly assumed that we were always being invoked from MPI_ABORT (ompi_mpi_abort() may be invoked from a bunch of different places) - Also try to opal_backtrace_print() if opal_bactrace_buffer() is not supported. - Print a message in MPI_ABORT if we're aborting. This commit was SVN r12998.	2007-01-04 22:30:28 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Galen Shipman	d207a6c988	endpoint should use a uint64_t for subnet, as everyone else does.. makes bad things happen when packing into a 64 bit buffer... Misc cleanup.. This commit was SVN r12993.	2007-01-04 20:25:28 +00:00
Brian Barrett	936fdd2ae1	remove some code that accidently came in with r12974. Refs trac:587 This commit was SVN r12991. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-04 20:17:07 +00:00
Galen Shipman	931a389c4f	fix deadlock on rendezvous protocol.. This commit was SVN r12982.	2007-01-04 03:46:11 +00:00
Galen Shipman	f12bbe0591	Handle different subnets correctly and multiple nic endpoint negotiation This is somewhat limited currently for expample, if you have 3 ports on Node A and 5 ports on Node B then the peers will use 3 ports to communicate with each other. This is on a subnet basis, so for any pair of nodes we take the intersection of the available ports within a subnet. We use subnets to determine reachability for lazy connection establishment. So if Node A and Node B each have two HCA's (on seperate networks) then the subnet's must be distinct, otherwise we will try to wire up HCA's on seperate networks. This commit was SVN r12978.	2007-01-03 22:35:41 +00:00
Brian Barrett	7cac26d240	* fix some typos that slipped in with r12974. Refs trac:587 This commit was SVN r12976. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 20:14:45 +00:00
George Bosilca	1be60e802f	Add the DECLSPEC. This commit was SVN r12975.	2007-01-03 19:59:18 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	a6127fd8ce	Increase req_bytes_delivered atomically. This commit was SVN r12971.	2007-01-03 15:19:34 +00:00
Gleb Natapov	79202561f6	Don't check req_pipeline_depth on frag completion. Checking of req_bytes_delivered should be enough. This commit was SVN r12967.	2007-01-03 14:44:20 +00:00
Gleb Natapov	1ad6c41735	Sender can start scheduling send fragments immediately after receiving ACK. No need to wait for RNDV completion. This commit was SVN r12965.	2007-01-03 12:37:11 +00:00
Rich Graham	8a9da02063	change code to conform with coding standard. Handle error condition where shared memory file is not created. This commit was SVN r12964.	2007-01-03 00:06:02 +00:00
Donald Kerr	899297c8f4	udapl btl was not compiling after r12878 on 12/17/2006, some minor changes to allow btl to compile This commit was SVN r12963. The following SVN revision numbers were found above: r12878 --> open-mpi/ompi@190e7a27cd	2007-01-02 21:44:12 +00:00
Jeff Squyres	ea2d49e55d	Because of r12712, we actually need to allocate one ''more'' item than the value of __n holds. This is not a problem in the first case because sizeof(int) == sizeof(MPI_Flogical), so no alloc is actually performed (which is most compilers, and why we haven't been bitten by this yet). But the second case -- where sizeof(int) != sizeof(MPI_Flogical) -- is definitely a problem and needs the "+1" in the alloc, or Bad Things will happen. This commit was SVN r12953. The following SVN revision numbers were found above: r12712 --> open-mpi/ompi@3e11c76d4c	2007-01-02 16:16:29 +00:00
George Bosilca	3cde822d98	Add missing headers. This commit was SVN r12951.	2007-01-02 08:04:34 +00:00
George Bosilca	d8dee3a740	If the MX driver was unable to load correctly, or if the endpoint was not created then don't try to call the MX endpoint close function. This commit was SVN r12950.	2007-01-02 00:01:50 +00:00
Rich Graham	6cb2377015	Change the allocation of the shared memory backing file. The file is allocated on a per comm_world instance, with the lowest rank in comm_world on the given host creating and initializing the file, and then notifying the remaining files via the OOB. Reviewed: Ralph Castain, Brian Barrett Addressing ticket #674. This commit was SVN r12949.	2007-01-01 02:39:02 +00:00
George Bosilca	e223b27268	A fragment is marked completed by the PML when the peer signal the completion of the RDMA operation associated with the fragment. The PML will call the BML free which in turn will call the BTL free. The MX BTL will not release the fragment if it not tagged with 0xff. This commit was SVN r12947.	2006-12-31 03:17:47 +00:00
Brian Barrett	2951586b43	Fix type casting issue with MPI_ERRCODES_IGNORE that would cause errors when using a C++ compiler. This commit was SVN r12946.	2006-12-31 02:51:13 +00:00
Brian Barrett	1ba97181dc	A number of MPI-2 compliance fixes for the C++ bindings: * Added Create_errhandler for MPI::File * Make errors_throw_exceptions a first-class predefined exception handler, and make it work for Comm, File, and Win * Deal with error handlers and attributes for Files, Types, and Wins like we do with Comms - can't just cast the callbacks from C++ signatures to C signatures. Callbacks will then fire with the C object, not the C++ object. That's bad. Refs trac:455 This commit was SVN r12945. The following Trac tickets were found above: Ticket 455 --> https://svn.open-mpi.org/trac/ompi/ticket/455	2006-12-30 23:41:42 +00:00
George Bosilca	47601e315e	Allow the MX BTL to select at runtime if the unexpected handler will be activated or not. This commit was SVN r12944.	2006-12-30 20:57:50 +00:00
Brian Barrett	4e157380bf	Heterogeneous support changes: * Add line about heterogeneous support to ompi_info output * Print warning and abort if heterogeneous detected and no heterogeneous support available. Refs trac:587 This commit was SVN r12943. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-30 17:13:18 +00:00
Brian Barrett	99c0a29602	Disable CM and DR PMLs in heterogeneous situtations as neither are heterogeneous safe. Refs trac:587 This commit was SVN r12942. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-30 16:17:56 +00:00
George Bosilca	d401a65975	Minor cleanups. Don't set the fields that will never be used. This commit was SVN r12941.	2006-12-29 07:55:17 +00:00
George Bosilca	0b5d879a63	ompi_convertor_pack do not return errors (all checkings are done when the convertor is created). This commit was SVN r12940.	2006-12-29 07:40:02 +00:00
George Bosilca	d8db9e49f3	Set the bml_btl to NULL or segfault !!! This commit was SVN r12939.	2006-12-29 07:38:24 +00:00
Brian Barrett	c010119667	If a BTL isn't needed due to exclusivity ranking, need to call a matching inuse decrement for the increment that was at the start of the procs loop. Otherwise, the inuse count can end up higher than it actually is and a btl can end up in the progress loop when it isn't active to any peer. Refs trac:543 This commit was SVN r12938. The following Trac tickets were found above: Ticket 543 --> https://svn.open-mpi.org/trac/ompi/ticket/543	2006-12-29 02:22:40 +00:00
George Bosilca	416e5b5f6a	Enable the MX extensions if and only if the mx_extensions.h header is installed on the system. This commit was SVN r12937.	2006-12-29 00:31:32 +00:00
George Bosilca	d7bc180a90	The max allocated tag is not 16. Use the define instead. This commit was SVN r12936.	2006-12-28 22:48:58 +00:00
George Bosilca	3eeecc3838	Add support for faster small messages. While sending a message, we check if the data was buffered by the MX library. If it's the case then we declare the send as completed and disable the completion event for the mx request. This commit was SVN r12935.	2006-12-28 22:34:24 +00:00
Brian Barrett	f191a042f6	Fix compile errors in heterogeneous code when not building heterogeneous support. Refs trac:701 This commit was SVN r12932. The following Trac tickets were found above: Ticket 701 --> https://svn.open-mpi.org/trac/ompi/ticket/701	2006-12-28 20:28:14 +00:00
George Bosilca	b996c00d1a	Set the limits for the MX fragments to 4K. Add code to dump the state of the MX hardware (not activated). This commit was SVN r12931.	2006-12-28 08:40:37 +00:00
George Bosilca	3903009b8b	Add a check for the unexpected handler. If enabled, allow the zero-copy protocol over the MX BTL. Now, we have only one matching, the one in Open MPI. The problem is that when the unexpected handler is triggered, not all the message is on the host memory. In the best case we get one MX fragment (internal MX fragment), in the worst we get NULL. The only way to fit this with the design of the PML is to force the eager protocol at the MX internal fragment size, and to limit the send/receive protocol at the same size. Tests show the outcome is not far from optimal (if the pipeline depth is increased a little bit). Set MX_PIPELINE_LOG in order to allow MX to use internal fragments of 4K. This commit was SVN r12930.	2006-12-28 03:35:41 +00:00
George Bosilca	ff2319dcb7	Complete the OUT protocol. Small latency improvements. Some minor cleanups. Create some macros, reorder some functions. Make sure all fragments are correctly released at the end. This commit was SVN r12926.	2006-12-26 18:15:24 +00:00
George Bosilca	75a35ed7ee	Implement the PUT protocol over MX. The send/receive approach give the best performance on a 2G Myrinet card, as it look like pipelining the messages by 1M is faster than a simple send/receive. However, when using a 10G card the send/receive will limit the maximum bandwidth to 2.5Gbs. The reason is the scarce bus resources that have to be shared between the Myrinet hardware and the memcpy operation. The PUT protocol remove the memcpy, we now have a true zero-copy mechanism. But, there is no pipelining yet as it look like the RDMA pipeline somehow disappeared from the OB1 PML ... This commit was SVN r12925.	2006-12-24 22:52:46 +00:00
George Bosilca	e8bd985870	Add more output when calls to the MX library fails. Move the connection status from theproc into the endpoint. This commit was SVN r12924.	2006-12-24 22:34:48 +00:00
George Bosilca	14dc72f595	Allow the user to change the MX flags. This commit was SVN r12923.	2006-12-24 22:21:00 +00:00
George Bosilca	dbe2798638	Allow MX to handle shared memory and self communications. By default these features are disabled (btl_mx_shared_mem respectively btl_mx_self have to be set in order to activate them). This commit was SVN r12922.	2006-12-24 22:18:41 +00:00
Jelena Pjesivac-Grbovic	3494e1bb05	- Updated decision function for Alltoall collective. Fixes "jump" for intermediate sizes message on 24+ number of nodes (at least on Grig cluster). This commit was SVN r12920.	2006-12-22 19:59:17 +00:00
George Bosilca	b1725e02d4	No more warnings plus some code reordering. This commit was SVN r12919.	2006-12-21 22:42:15 +00:00
Edgar Gabriel	dc532577db	Adding more accurate checking of the input parameters for the add_error_class and add_error_code files. Also fixed the update of the lastusedcode attribute, all of work according to my tests pretty fine. Please note: the testcode attached to the bug 683 still reports some bugs. I am however pretty sure that the testcode is wrong at that points: - the standard says that the attribute MPI_LASTUSEDCODE has to be updated for a new error_class or a new error_code. The test currently assumes, that only the add_error_code call changes the attribute value. - you have to comment out the two lines 73 and 74 in order to make the test finish, since these lines check for the error string of non-existent codes. - line 126 the error-string of MPI_ERR_ARG is not "invalid argument" but a little bit more, so the test thinks the output is wrong. So probably the test has to be update to match the according error string of MPI_ERR_ARG. Fixes trac:682 This commit was SVN r12913. The following Trac tickets were found above: Ticket 682 --> https://svn.open-mpi.org/trac/ompi/ticket/682	2006-12-21 19:36:31 +00:00
Jelena Pjesivac-Grbovic	f1aec23507	Adding tuned allgather implementation. It contains four algorithms: Bruck (ciel(logP) steps), Recursive Doubling (log(P) for power-of-2 processes), Ring (P-1 steps), and Neighbor Exchange (P/2 steps for even number of processes). All algorithms passed occ, IMB-2.3, and intel verification tests from ompi-tests/ for up to 56 processes. The fixed decision function is based on results collected over MX on the Grig cluster at the University of Tennessee at Knoxville. I have also added (and commented out) copy of MPICH2 decision function for allgather (from their IJHPCA 2005 paper). This commit was SVN r12910.	2006-12-21 18:40:02 +00:00
Brian Barrett	7880353fcc	Need to close every endpoint we open, or the MX progress thread doesn't die, which can cause segfaults on shutdown. Calling mx_finalize() isn't enough to shutdown the thread, so must close endpoints as well. Refs trac:513 This commit was SVN r12908. The following Trac tickets were found above: Ticket 513 --> https://svn.open-mpi.org/trac/ompi/ticket/513	2006-12-21 18:13:22 +00:00
Galen Shipman	faa0bafa96	modify preconnect to use a rotating ring algorithm, OOB connections are brought up lazily so we want to be a bit less agressive. This commit was SVN r12906.	2006-12-21 01:36:57 +00:00
Gleb Natapov	484c6a2c1a	Use OPAL_ALIGN() macro to align length. Return address from mpool_alloc is now properly aligned so no need to align it once more. This commit was SVN r12899.	2006-12-19 08:34:48 +00:00
Brian Barrett	2ab65eb521	Remove some debugging output that was #if 0'ed out but shouldn't have been committed into the trunk anyway This commit was SVN r12897.	2006-12-19 02:34:41 +00:00
Gleb Natapov	b910d10a81	Add general functions for alignment and change rdma_mpool_align to always honor an alignment event if posix_memalign() is not available. This commit was SVN r12892.	2006-12-18 10:52:18 +00:00
Brian Barrett	c554638446	Support systems without malloc.h or posix_memalign (ie, pretty much every one we support that isn't Linux) This commit was SVN r12880.	2006-12-17 17:28:59 +00:00
Brian Barrett	b448b4e47e	More heterogeneous fixes. Don't set reachability bit on a remote proc if the remote architecture differs from the local architecture and the btl doesn't support heterogeneous transport. Refs trac:587 This commit was SVN r12879. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2006-12-17 17:27:08 +00:00
Gleb Natapov	190e7a27cd	Merge with gleb-mpool branch. All RDMA components use same mpool now (rdma). udapl/openib/vapi/gm mpools a deprecated. rdma mpool has parameter that allows to limit its size mpool_rdma_rcache_size_limit (default is 0 - unlimited). This commit was SVN r12878.	2006-12-17 12:26:41 +00:00
Brian Barrett	f1fdd7c041	Handle case where remote process is of different architecture than the local process when creating a datatype from an internal description. Refs trac:640 This commit was SVN r12877. The following Trac tickets were found above: Ticket 640 --> https://svn.open-mpi.org/trac/ompi/ticket/640	2006-12-17 04:39:16 +00:00
Brian Barrett	0653dc3f24	Pad headers to eliminate heterogeneous issues. Add conversion functions for switching endianness of headers. Galen is going to add the code to use the endian stuff... This commit was SVN r12876.	2006-12-17 00:50:59 +00:00
Brian Barrett	01e8fc5f91	Redo of r12871, without the preconnect code change: Move the req_mtl structure back to the end of each of the structures in the CM PML. The req_mtl structure is cast into a mtl__request_structure for each MTL, which is larger than the req_mtl itself. The cast will cause the _request to overwrite parts of the heavy requests if the req_mtl isn't the LAST thing on each structure (hence the comment). This was moved as an optimization at some point, which caused buffer sends to fail... Refs trac:669 This commit was SVN r12873. The following SVN revision numbers were found above: r12871 --> open-mpi/ompi@597598b712 The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:54:14 +00:00
Brian Barrett	bdf0b231b2	Undo r12871, as it contained some code in ompi/runtime that shouldn't have been committed Refs trac:669 This commit was SVN r12872. The following SVN revision numbers were found above: r12871 --> open-mpi/ompi@597598b712 The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:52:13 +00:00
Brian Barrett	597598b712	Move the req_mtl structure back to the end of each of the structures in the CM PML. The req_mtl structure is cast into a mtl__request_structure for each MTL, which is larger than the req_mtl itself. The cast will cause the _request to overwrite parts of the heavy requests if the req_mtl isn't the LAST thing on each structure (hence the comment). This was moved as an optimization at some point, which caused buffer sends to fail... Refs trac:669 This commit was SVN r12871. The following Trac tickets were found above: Ticket 669 --> https://svn.open-mpi.org/trac/ompi/ticket/669	2006-12-15 17:46:53 +00:00
Rainer Keller	b99e5a71d1	- Help message in case of MPI-application with two init or calling init functions after finalize. This commit was SVN r12858.	2006-12-14 19:58:04 +00:00
Brad Benton	18da4c40d3	Set the QP's static rate from the associated MCA parameter, rather than just defaulting to 0. Fixes trac:675 This commit was SVN r12855. The following Trac tickets were found above: Ticket 675 --> https://svn.open-mpi.org/trac/ompi/ticket/675	2006-12-14 19:42:24 +00:00
Brian Barrett	38c2e43ac2	Print out error string rather than errno for TCP-related errors, making it easier for both the user and us to debug issues with BTL and OOB issues... This commit was SVN r12852.	2006-12-14 18:20:43 +00:00
Jeff Squyres	0ca8cb35b7	Fixes trac:366 Add ability for ini files to recognize "use_eager_rdma" flag. Set the default to "no" (because we should assume that HCAs cannot support the property necessary for using RDMA for eager messages -- that the last byte of the message is guaranteed to be written to memory last -- unless proven otherwise. For example, iWARP cards apparently do not provide this guarantee), and then set all Mellanox and IBM HCAs to override the default to enable this behavior on these cards. This commit was SVN r12851. The following Trac tickets were found above: Ticket 366 --> https://svn.open-mpi.org/trac/ompi/ticket/366	2006-12-14 15:52:13 +00:00
Dan Lacher	e3f749acc4	Ticket: #673 Submitted by: Dan Lacher This commit was SVN r12844.	2006-12-13 20:01:16 +00:00
George Bosilca	80bc0c8868	Allow the MX to survive if we are unable to connect to a peer. The PML will try to find another route. This commit was SVN r12837.	2006-12-13 01:12:07 +00:00
Mohamad Chaarawi	cae083dec6	replaced the old CID allocation algorithm with the blocked algorithm. The impace in the communicator directory is still not great since the interface for allocating a Cid has not changed.. This commit was SVN r12836.	2006-12-12 22:01:39 +00:00
Brian Barrett	10af8ab454	Corrections for when threading is enabled. Refs trac:564 This commit was SVN r12830. The following Trac tickets were found above: Ticket 564 --> https://svn.open-mpi.org/trac/ompi/ticket/564	2006-12-12 18:48:42 +00:00
Brad Benton	337116d5fd	Added IBM eHCA vendor and part id info This commit was SVN r12827.	2006-12-12 14:12:39 +00:00
Brian Barrett	cf196ce420	Instead of an unknown proc list that requires ownership transfer of data (which, in turn, requires a complex series of locks to be held during the transfer), use a modex backing store with backpointers from the proc to the backing store. The proc structures no longer own the modex data, which greatly simplifies locking when an unknown proc suddenly becomes known. Refs trac:564 This commit was SVN r12822. The following Trac tickets were found above: Ticket 564 --> https://svn.open-mpi.org/trac/ompi/ticket/564	2006-12-11 21:27:30 +00:00
Ralph Castain	0a5d41857a	Complete next round of message size reduction: "strip" the descriptive info from the returned values. I have now added a flag to the gpr address mode (ORTE_GPR_STRIPPED) that instructs the gpr to not include segment names or tokens in the returned gpr_value_t objects. I found only two places that were looking at the tokens: 1. the odls - we used the tokens to separately process the globals container data from everything else. In this case, I left the subscription that returned the globals data alone, but "stripped" the subscription that returned the launch data for the procs. These subscriptions have nothing to do with the xcast message. 2. the pml_base_modex - the callback function was getting process names from the returned tokens. Actually, this function was doing a very bad thing - it was assuming that the first token returned was always the process name. This is currently true, but is one of those assumptions that someone could have easily changed - and suddenly found the system inexplicably failing. I modified the function to (a) get the name sent back to us, (b) "stripped" the value structures of tokens and segment strings, and (c) correctly obtained process names from the returned values. I also reindented the heck out of the code so it was legible (at least, to my old eyes). This commit was SVN r12813.	2006-12-09 23:10:25 +00:00
Jeff Squyres	e70ef98ea6	Update the help message to be a bit more specific and refer to the web FAQ. This commit was SVN r12812.	2006-12-09 15:13:03 +00:00
Jeff Squyres	c7282855e7	Fixes trac:659 This commit fixes several aspects regarding MPI conformance of requests. * Eliminate the last argument of ompi_errhandler_request_invoke(); we ''always'' want to invoke the back-end exception handler with the real error code. * Make it clear in comments that we only invoke the ''first'' exception in a given array of requests, even if there's more than one request with a non-MPI_SUCCESS value for MPI_ERROR. * Defer the freeing of requests upon exception in the back-end functions to MPI_WAIT* and MPI_TEST* until later; the requests are kept so that we know what handler to invoke when we actually invoke the exception. After figuring that out, ''then'' we free requests with pending exceptions on them. * Clean up return codes from the back-end MPI_TEST* and MPI_WAIT* functions. * Slightly modify ompi_errcode_get_mpi_code() to return unity if it receives an MPI error code (vs. an OMPI error code). This commit was SVN r12810. The following Trac tickets were found above: Ticket 659 --> https://svn.open-mpi.org/trac/ompi/ticket/659	2006-12-09 14:20:08 +00:00
Patrick Geoffray	58c6f8c8e1	Copyright update. Thanks to Jeff to remind me. This commit was SVN r12803.	2006-12-07 23:55:00 +00:00
Patrick Geoffray	6e09b0c23f	lval is not defined when pval is assigned on 32 bit systems. this usually is ok on little-endian systems, as the upper 32 bits will likely be ignored, but on 32-bit big-endian systems, lval is complete junk. Use ival if 32 bit mode, lval if 64. Mixing of 32 and 64 bit architectures won't work without more changes. This commit was SVN r12802.	2006-12-07 23:34:04 +00:00
Brian Barrett	98884e45e4	Clean up the way procs are added to the global process list after MPI_INIT: * Do not add new procs to the global list during modex callback or when sharing orte names during accept/connect. For modex, we cache the modex info for later, in case that proc ever does get added to the global proc list. For accept/connect orte name exchange between the roots, we only need the orte name, so no need to add a proc structure anyway. The procs will be added to the global process list during the proc exchange later in the wireup process * Rename proc_get_namebuf and proc_get_proclist to proc_pack and proc_unpack and extend them to include all information needed to build that proc struct on a remote node (which includes ORTE name, architecture, and hostname). Change unpack to call pml_add_procs for the entire list of new procs at once, rather than one at a time. * Remove ompi_proc_find_and_add from the public proc interface and make it a private function. This function would add a half-created proc to the global proc list, so making it harder to call is a good thing. This means that there's only two ways to add new procs into the global proc list at this time: During MPI_INIT via the call to ompi_proc_init, where my job is added to the list and via ompi_proc_unpack using a buffer from a packed proc list sent to us by someone else. Currently, this is enough to implement MPI semantics. We can extend the interface more if we like, but that may require HNP communication to get the remote proc information and I wanted to avoid that if at all possible. Refs trac:564 This commit was SVN r12798. The following Trac tickets were found above: Ticket 564 --> https://svn.open-mpi.org/trac/ompi/ticket/564	2006-12-07 19:56:54 +00:00
Brian Barrett	b07dfa7841	* remove unused variable in ompi_comm_get_rprocs * don't load data into a buffer until we have the data, as the data contains some header information needed to properly load the data This commit was SVN r12792.	2006-12-07 16:19:44 +00:00
Ralph Castain	a1153fdc8f	Eliminate virtually all of the attribute_predefined data from the STG1 message. We now compute the total number of slots allocated to us and save that in the registry - the attributed_predefined then retrieves it via the STG1 message. The app_num is passed via the process_info structure, which gets the value from the ODLS in the environment. Obviously, people like bproc will have to get the app_num via another avenue...but that's a problem for another day. Several options are easily available. This commit was SVN r12788.	2006-12-07 03:11:20 +00:00
Brian Barrett	41a70a8f01	indent, this time with the right coding standards... This commit was SVN r12787.	2006-12-07 00:24:01 +00:00
Brian Barrett	f9ec8d6f2a	reindent file to make it easier to deal with... This commit was SVN r12786.	2006-12-07 00:21:25 +00:00
Edgar Gabriel	1359ba9b13	Rewriting much of the errorcode and errorclass code, since - we have to be able to attach a string to an error class, not just to an error code - according to MPI-2 the attribute MPI_LASTUSEDCODE has to be updated everytime you add a new code or a new class. Thus, you have to have single list for both. Thus, we got rid of the error_class structure. In the error-code structure, we can distinguish whether we are dealing with an error code or an error class by looking at the err->code element of the structure. In case its value is MPI_UNDEFINED, the according entry is a class, else it is an error code. All predefined error codes have the code and the class field set to the same value. The test MPI_Add_error_class1 passes now. Fixes trac:418 This commit was SVN r12764. The following Trac tickets were found above: Ticket 418 --> https://svn.open-mpi.org/trac/ompi/ticket/418	2006-12-05 19:07:02 +00:00
Brian Barrett	6f8b366acb	Rename liborte to libopen-rte and libopal to libopen-pal per telecon today and bug #632. Refs trac:632 This commit was SVN r12762. The following Trac tickets were found above: Ticket 632 --> https://svn.open-mpi.org/trac/ompi/ticket/632	2006-12-05 18:27:24 +00:00
Brian Barrett	441432950f	Merge in changes from the bwb-heterogeneous temp branch (r12491 - r12714) for supporting compilers / architectures with different padding rules. This commit was SVN r12749. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r12491 r12714	2006-12-04 20:11:42 +00:00
Rainer Keller	6f8f28f40f	- Get rid of inline definition, otherwise static-compilation fails. This commit was SVN r12735.	2006-12-03 14:52:17 +00:00
Gleb Natapov	30ca7457b4	Some BTLs (e.g TCP) can report put/get completion before data actually hits the buffer on the other side. For this kind of BTLs we need to send FIN through the same BTL, PUT was performed with so network will handle ordering for us. If we will use another BTL, receiver can get FIN before data will hit the buffer and complete request prematurely. We mark such problematic BTLs with MCA_BTL_FLAGS_FAKE_RDMA flag (this kind of RDMA is really fake, because the real one guaranties that sender will see the completion only after receiver's NIC confirmed that all the data was received). This commit was SVN r12732.	2006-12-03 10:12:09 +00:00
Gleb Natapov	39c930b160	The bug fixing part of r12720 introduce much more serious bug that it fixes. It calls mca_pml_ob1_send_fin_btl() which may fail and doesn't check return code. This breaks all RDMA transports event when only one BTL is used. Revert it for now, I am working on a real fix for the problem (I hope). This commit was SVN r12731. The following SVN revision numbers were found above: r12720 --> open-mpi/ompi@3e3689320b	2006-12-03 08:55:59 +00:00
Gleb Natapov	65d7ad4581	The "bug fix" from the r12721 reverts part of the r12433 that fixed regresion from v1.1 was reviewed and put to v1.2 branch. So revert this part of r12721 back. This commit was SVN r12730. The following SVN revision numbers were found above: r12433 --> open-mpi/ompi@82f7c0dd69 r12721 --> open-mpi/ompi@3edd850d2e	2006-12-03 08:29:55 +00:00
George Bosilca	a0ed53d70b	Make the compilers happy. This commit was SVN r12729.	2006-12-03 00:19:11 +00:00
George Bosilca	3edd850d2e	Some indentation and code arrangement. However, there is a bug fix. Force the PUT protocol to always obey to the btl_max_rdma_size. This commit was SVN r12721.	2006-12-01 22:26:14 +00:00
George Bosilca	3e3689320b	Some indentations and one BIG fix. Avoid race conditions on the PUT RDMA protocol when multiple NICS are available between 2 peers. The fix force the FIN message to take exactly the same path as the fragment it describe (i.e. same path means same BTL). Otherwise, the FIN can be received by the peer before the RDMA complete and the request will get freed too early. This commit was SVN r12720.	2006-12-01 21:52:07 +00:00
George Bosilca	658879232b	Several small improvements: - consistent error message when something fails (via BTL_ERROR macro) - decrease the number of jumps. - cleanup some parts of the code. This commit was SVN r12719.	2006-12-01 21:48:06 +00:00
Brian Barrett	657168a74c	A couple of Window related error handling cleanups: * Always invoke the error handler on MPI_COMM_WORLD for invalid windows (except in win_create, which should instead be on the given communicator). * Allow get_errhandler in addition to set_errhandler on MPI_WIN_NULL Refs trac:647 This commit was SVN r12718. The following Trac tickets were found above: Ticket 647 --> https://svn.open-mpi.org/trac/ompi/ticket/647	2006-12-01 20:05:06 +00:00
Brian Barrett	2a09fa2d9d	* silence compiler warning This commit was SVN r12717.	2006-12-01 20:01:53 +00:00
Brian Barrett	bfbc281e93	Fix slow startup issue with the MX MTL. The problem is caused by mx_connect() being a one-sided operation from the API level, but not being an interrupting call when the target is not entering the MX library. So if most of the processes exit mtl_mx_add_procs() and enter the stage gate 2 barrier, the other processes can only progress their mx_connect() calls when the targets enter the mx library. Because the event library is in EV_ONELOOP mode, this only happens once a second. The mx progress thread (hidden in the MX library) also only wakes up once a second, so mx_connect calls can take a second to complete. The temporary solution is to switch into EV_NONBLOCK mode earlier (right after the mx_connect loop) so that there isn't a giant slowdown when processes enter the stage gate 2 barrier before other proesses. They will now not block in the event library for any period of time, which appears to have a 50% speedup when running at > 64 procs. Refs trac:645 This commit was SVN r12713. The following Trac tickets were found above: Ticket 645 --> https://svn.open-mpi.org/trac/ompi/ticket/645	2006-12-01 02:49:01 +00:00
Bill D'Amico	3e11c76d4c	Fixes trac:482 OMPI_ARRAY_INT_2_LOGICAL had an array bounds error - fixed this and the analogous error in OMPI_ARRAY_LOGICAL_2_INT. This commit was SVN r12712. The following Trac tickets were found above: Ticket 482 --> https://svn.open-mpi.org/trac/ompi/ticket/482	2006-11-30 19:48:18 +00:00
Pavel Shamis	f08bc818c4	Cleaning mca_btl_openib_progress_thread from unused variables. This commit was SVN r12709.	2006-11-30 18:28:45 +00:00
Jeff Squyres	384caeacf4	More fixes similar to r12684 -- fix bad lvalues in assignments for the case where sizeof(INTEGER) > sizeof(int). This commit was SVN r12707. The following SVN revision numbers were found above: r12684 --> open-mpi/ompi@e2c605f32a	2006-11-30 16:41:56 +00:00
Ralph Castain	79bd8a842e	Add xcast timing in mpi_init since we cannot directly measure it (tree-based comm). This commit was SVN r12706.	2006-11-30 15:06:40 +00:00
Galen Shipman	4e857f0384	add odls framework to ompi_info This commit was SVN r12699.	2006-11-29 16:24:49 +00:00
Jeff Squyres	e2c605f32a	Fix a problem cited by Pierre-Matthieu Anglade: some typos in the code path in the MPI F77 API functions where sizeof(int) != sizeof(INTEGER). This commit was SVN r12684.	2006-11-28 12:21:42 +00:00
Ralph Castain	bc4e97a435	First stage in the move to a faster startup. Change the ORTE stage gate xcast into a binary tree broadcast (away from a linear broadcast). Also, removed the timing report in the gpr_proxy component that printed out the number of bytes in the compound command message as the answer was "not much" - reduces the clutter in the data. This commit was SVN r12679.	2006-11-28 00:06:25 +00:00
Brian Barrett	bc3250fe6f	Fix issue where messages could start arriving before the window was fully initialized, especially during lock/unlock, which doesn't use MPI collectives for synchronization (unlike Fence) This commit was SVN r12676.	2006-11-27 22:42:21 +00:00
Brian Barrett	b5b6f4c4bf	Remove check for epoch on the receiver side, as this is a race condition between message arrival, and because we'll be still blocking in a synchronization call, is of no consequence to the user. We do an epoch check on the sending side, so this isn't an issue there either. Refs trac:488 This commit was SVN r12675. The following Trac tickets were found above: Ticket 488 --> https://svn.open-mpi.org/trac/ompi/ticket/488	2006-11-27 22:15:34 +00:00
Brian Barrett	beb1e9d4dd	* finish move from hard coded tag to #define'd constant tag This commit was SVN r12674.	2006-11-27 21:55:41 +00:00
Brian Barrett	0c25f7be09	More One-sided fixes: * Fix a counter roll-over issue that could result from a large (but not excessive) number of outstanding put/get/accumulate calls during a single synchronization issues (Refs trac:506) * Fix epoch issue with rdma component that would effect PWSC synchronization (Refs trac:507) This commit was SVN r12673. The following Trac tickets were found above: Ticket 506 --> https://svn.open-mpi.org/trac/ompi/ticket/506 Ticket 507 --> https://svn.open-mpi.org/trac/ompi/ticket/507	2006-11-27 21:41:29 +00:00
George Bosilca	59cfee0cd2	Use the MX infinite timeout by default. The user can modify it using an MCA parameter. This commit was SVN r12670.	2006-11-27 20:18:58 +00:00
Brian Barrett	993d2a7753	Fix for issue IU is seeing on BigRed with connections timing out during MPI_INIT. Use an infinite timeout, which is exactly what MPICH-MX does. This commit was SVN r12669.	2006-11-27 20:10:27 +00:00
Brian Barrett	63e5668e29	Number of one-sided fixes: * use one-sided datatype check instead of send/receive and check both the origin and target datatypes * allow error handler to be set on MPI_WIN_NULL, per standard * Allow recursive calls into the pt2pt osc component's progress function * Fix an uninitialized variable problem in the unlock header This commit was SVN r12667.	2006-11-27 03:22:44 +00:00
Brian Barrett	0895f5e08d	Rename OMPI_PROCESS_NAME_{HTON, NTOH} macros to ORTE_PROCESS_NAME_{HTON, NTOH} because they are in ORTE, not OMPI. Also, remove the ORTE_PROCESS_NAME macros in iof base as they are duplicates of the ones that were in ns_types, which meant that bad things happened if you changed what an orte_process_name_t looked like. This commit was SVN r12646.	2006-11-22 03:03:21 +00:00
Brian Barrett	33320b7165	Rework the opal_progress interface to better support dynamic processes and at the same time, remove some of the MPI-related options from OPAL: - provide mechanism to change at runtime whether sched_yield() should be called when the progress engine is idle - provide mechanism for changing the rate at which the event engine is called when there are "no" users of the event engine (ie, when using MPI but not TCP) - fix some function names in the progress engine to better match their intended use (and remove MPI naming scheme) - remove progress_mpi_enable / progress_mpi_disable because we can now use the functions to set the sched_yield and tick rate interfaces - rename opal_progress_events() to opal_progress_set_event_flag() because the first really isn't descriptive of what the function does and I always got confused by it This commit was SVN r12645.	2006-11-22 02:06:52 +00:00
Jeff Squyres	922f335678	Fixes trac:624 Ensure that the new predefined MPI-2 attribute callback functions take the proper types (INTEGER, kind=MPI_ADDRESS_KIND instead of just INTEGER). This commit was SVN r12639. The following Trac tickets were found above: Ticket 624 --> https://svn.open-mpi.org/trac/ompi/ticket/624	2006-11-21 13:54:13 +00:00
Ralph Castain	6d6cebb4a7	Bring over the update to terminate orteds that are generated by a dynamic spawn such as comm_spawn. This introduces the concept of a job "family" - i.e., jobs that have a parent/child relationship. Comm_spawn'ed jobs have a parent (the one that spawned them). We track that relationship throughout the lineage - i.e., if a comm_spawned job in turn calls comm_spawn, then it has a parent (the one that spawned it) and a "root" job (the original job that started things). Accordingly, there are new APIs to the name service to support the ability to get a job's parent, root, immediate children, and all its descendants. In addition, the terminate_job, terminate_orted, and signal_job APIs for the PLS have been modified to accept attributes that define the extent of their actions. For example, doing a "terminate_job" with an attribute of ORTE_NS_INCLUDE_DESCENDANTS will terminate the given jobid AND all jobs that descended from it. I have tested this capability on a MacBook under rsh, Odin under SLURM, and LANL's Flash (bproc). It worked successfully on non-MPI jobs (both simple and including a spawn), and MPI jobs (again, both simple and with a spawn). This commit was SVN r12597.	2006-11-14 19:34:59 +00:00
George Bosilca	139f9cf3d0	Make sure we disable the MX shared memory when we use the MX BTL. This commit was SVN r12587.	2006-11-13 22:17:06 +00:00
Andrew Friedley	a4bdcb4faa	Fix a segfault that turned up in more MPI_THREAD_MULTIPLE testing. Same sort of problem and fix as described in r12323 - mca_pml_ob1_recv_frag_progress() was segfaulting due to a NULL req_proc pointer. The path leading to this was through the mca_pml_ob1_check_cantmatch_for_match() function, where we can match a frag using the same macros as mca_pml_ob1_frag_match() and never initialize the req_proc pointer. This commit was SVN r12582. The following SVN revision numbers were found above: r12323 --> open-mpi/ompi@c752502dee	2006-11-13 20:12:51 +00:00
George Bosilca	b621069638	Don't copy the reference count when we duplicate a datatype. Thanks to Andreas Schafer for the fix. This commit was SVN r12566.	2006-11-13 05:45:29 +00:00
Ralph Castain	8b6921f297	Initialize the rank variable before it is used. This commit was SVN r12565.	2006-11-13 02:37:12 +00:00
George Bosilca	2b9f6613c9	Rewrite the ompi_ddt_sndrcv function. This commit was SVN r12561.	2006-11-12 05:56:02 +00:00
George Bosilca	2ee30fd468	This wasn't the correct way to write the if statement. What I want is that if one of them is zero then the if will get executed. This commit was SVN r12560.	2006-11-12 05:27:27 +00:00
Gleb Natapov	9933a6f469	Previous fix doesn't fix the case when opcode is changed in put/get functions. The fix is to set opcode to SEND at the entrance to the send function before checking credits and putting fragment to the pending list. We do the same thing in put/get functions i.e setting opcode at the entrance to the function. This commit was SVN r12559.	2006-11-11 07:51:06 +00:00
Ralph Castain	4e50cdae52	This commit accomplishes two things: 1. Fix the "hang" condition when an application isn't found. It turned out that the ODLS had some difficulty with the process actually not having been started - hence, it never called the waitpid callback. As a result, the "terminated" trigger didn't fire, and so mpirun didn't wake up. With this change, the HNP's errmgr forces the issue by causing the trigger to fire itself when an abort condition occurs. 2. Shift the recording of the pid and the nodename from mpi_init to the orted launcher. This allows programs such as Eclipse PTP to get the pids even for non-MPI applications. In the case of bproc, the pls handles this chore since we don't use orteds in that system. This commit was SVN r12558.	2006-11-11 04:03:45 +00:00
George Bosilca	ec410644ce	Implement the send receive as 2 non blocking operations. That will help us avoiding too many calls to opal_progress. This commit was SVN r12553.	2006-11-10 23:06:19 +00:00
George Bosilca	c2c6a1b37e	Correctly compute the number of elements in a segment. For broadcast send the correct size for all intermediary nodes. This commit was SVN r12552.	2006-11-10 23:04:50 +00:00
George Bosilca	7102147b9f	Correctly detect when the specified algorithm is out of range. In this case we reset it to zero. This commit was SVN r12551.	2006-11-10 21:47:07 +00:00
George Bosilca	bfbd0e61f6	Minimize the number of lines of code :) This commit was SVN r12550.	2006-11-10 20:56:08 +00:00
George Bosilca	a38cd366d7	Construct the convertor. It's not really required, but it's not in the critical path anyway. At least in debug mode we get nice informations about where the convertor was created. This commit was SVN r12549.	2006-11-10 20:55:06 +00:00
George Bosilca	858ab24e8e	The req_mtl field has to be the last in the struct or bad things happen. This commit was SVN r12548.	2006-11-10 20:53:41 +00:00
George Bosilca	af68171253	Use the macro to compute the number of elements in a segment in both bcast and reduce and update the default values for the variables as required by the comment in the coll_tuned.h file. This commit was SVN r12546.	2006-11-10 20:04:08 +00:00
George Bosilca	476b922074	Updates & upgrades: - consistent arguments checking (not allowing to select an algorithm which is not available) - consistent way of computing the segcount (number of datatypes by segment). - small cleanups. - more informative debugging messages. This commit was SVN r12545.	2006-11-10 19:54:09 +00:00
Gleb Natapov	7e03b83d23	Reset opcode field to SEND. It is checked later in pending progress function. This commit was SVN r12531.	2006-11-10 06:17:00 +00:00
George Bosilca	77ef979457	New architecture for broadcast. A generic broadcast working on a tree description. Most of the bcast algorithms can be completed using this generic function once we create the tree structure. Add all kind of trees. There are 2 versions of the generic bcast function. One using overlapping between receives (for intermediary nodes) and then blocking sends to all childs and another where all sends are non blocking. I still have to figure out which one give the smallest overhead. This commit was SVN r12530.	2006-11-10 05:53:50 +00:00
George Bosilca	17405cd9c6	A temporary fix, until we figure out a better approach. The problem is that if one add "pml=" to the configuration file, really bad things happen. All PMLs will get initialize, and each of them will initialize all BTLs. This patch force the mca_pml_base_pml to get initialized in all cases before we go out of the mca_pml_base_open function. This commit was SVN r12527.	2006-11-10 04:53:00 +00:00
George Bosilca	ab1655079d	Turn off the CONVERTOR_NO_OP once we reach this point. This commit was SVN r12526.	2006-11-09 23:56:31 +00:00
Jeff Squyres	8a08b092f6	Check to see if we need to do anything. If we don't (i.e., if all the rcounts are 0), then just return MPI_SUCCESS. This commit was SVN r12525.	2006-11-09 23:21:34 +00:00
George Bosilca	4cb7910a8b	And now the optimization step. This commit was SVN r12521.	2006-11-09 20:26:51 +00:00
George Bosilca	1d80f685b5	Remove one compiler warning. This commit was SVN r12520.	2006-11-09 20:08:43 +00:00
George Bosilca	e33d1dedab	Rewrite the conditions to keep them as small as possible. Correct some of these conditions. Optimize the flags generation for the convertor. This commit was SVN r12518.	2006-11-09 19:33:19 +00:00
George Bosilca	da1720d1ef	Indentation only. This commit was SVN r12517.	2006-11-09 19:31:52 +00:00
George Bosilca	73eec4bfef	Show the MCA parameter coll_base_verbose only if Open MPI is compiled in debug mode. Otherwise there is no debug anyway ... This commit was SVN r12516.	2006-11-09 19:02:32 +00:00
Jeff Squyres	0a28212392	This is a workaround to bug in the Intel C++ compiler, version 9.1 (all versions up to and including 20060925). The issue has been reported to Intel, along with a small [non-MPI] test program that reproduces the problem (the test program and the OMPI C++ bindings work fine with Intel C++ 9.0 and many other C++ compilers). In short, a static initializer for a global variable (i.e., its constructor is fired before main()) that takes as an argument a reference to a typedef'd type will simply get the wrong value in the argument. Specifically: {{{ namespace MPI { Intracomm COMM_WORLD(MPI_COMM_WORLD); } }}} The constructor for MPI::Intracomm should get the value of &ompi_mpi_comm_world. It does not; it seems to get a random value. As mandated by MPI-2, annex B.13.4, for C/C++ interoperability, the prototype for this constructor is: {{{ class Intracomm { public: Intracomm(const MPI_Comm& data); }; }}} Experiments with icpc 9.1/20060925 have shown that removing the reference from the prototype makes it work (!). After lots of discussions about this issue with a C++ expert (Doug Gregor from IU), we decided the following (cut-n-paste from an e-mail): ----- > So here's my question: given that OMPI's MPI_<CLASS> types are all > pointers, is there any legal MPI program that adheres to the above > bindings that would fail to compile or work properly if we simply > removed the "&" from the second binding, above? I don't know of any way that a program could detect this change. FWIW, the C++ committee has agreed that implementation of the C++ standard library are allowed to decide arbitrarily between const& and by-value. If they don't care, MPI users won't care. When you remove the '&', I suggest also removing the "const". It is redundant, but can trigger some strange name mangling in Sun's C++ compiler. ----- So with this change: * we now work again with the Intel 9.1 compiler * our C++ bindings do not exactly conform to the MPI-2 spec, but valid/legal MPI C++ apps cannot tell the difference (i.e., the functionality is the same) This commit was SVN r12514.	2006-11-09 17:34:12 +00:00
George Bosilca	a14ff905f8	Update the convertor.c file. This commit was supposed to go together with r12486 but somehow I miss it. Update the pack and unpack functions for contigusous datatypes to minimize their impact on the performance. Keep them as condensed as possible. This commit was SVN r12513. The following SVN revision numbers were found above: r12486 --> open-mpi/ompi@8746369338	2006-11-09 17:26:55 +00:00
George Bosilca	a82ce427e4	Update the number of reduce algorithms available. This commit was SVN r12503.	2006-11-08 22:20:34 +00:00
George Bosilca	0dcf0097db	They are supposed to be ints not size_t. This commit was SVN r12497.	2006-11-08 17:05:15 +00:00
George Bosilca	eab1776e9a	Explicit casts for our friendly Windows environment... This commit was SVN r12496.	2006-11-08 17:02:46 +00:00
George Bosilca	f9f1087e7f	Be nice and let everybody access this variable. This commit was SVN r12495.	2006-11-08 16:56:17 +00:00
George Bosilca	0914892044	Small cleanups, some explicit casts. This commit was SVN r12494.	2006-11-08 16:54:03 +00:00
George Bosilca	74d3946342	Remove the call to set_args. This is only required for the MPI level, because there we have to be able to return to the user the description of the data. This commit was SVN r12493.	2006-11-08 16:52:48 +00:00
George Bosilca	b53a313904	These functions dont have to be exported. The outside world access them only via the pointer in the convertor structure. This commit was SVN r12492.	2006-11-08 16:49:34 +00:00
George Bosilca	8746369338	This commit was supposed to go with the r12485. This commit was SVN r12486. The following SVN revision numbers were found above: r12485 --> open-mpi/ompi@b4e9199651	2006-11-08 06:21:15 +00:00
George Bosilca	b4e9199651	Minimize the impact on contiguous and predefined data-types. Most of the usefull things are now done without the call to any external function, directly in the macros in this file. Create a new flag for the convertor to help figure out when we have to do anything special for pack/unpack. A convertor with the flag CONVERTOR_NO_OP is a basic convertor, designed for contiguous types. This commit was SVN r12485.	2006-11-08 06:16:47 +00:00
George Bosilca	915d748d72	Initialize the convertor on _START not on _INIT. This allow us to set it up before the match when we know the peer, saving some time on the critical path. If the receive is ANY_SOURCE then we initialize the convertor on _MATCHED. Anyway, we will set it up only once per receive. This commit was SVN r12484.	2006-11-08 05:42:29 +00:00

... 3 4 5 6 7 ...

2528 Коммитов