openmpi

Author	SHA1	Message	Date
Vasily Filipov	87e71b26fe	Jeff Squyres fixes. This commit was SVN r22319.	2009-12-16 10:23:58 +00:00
George Bosilca	b3d3a8e7b3	Remove useless lines. This commit was SVN r22316.	2009-12-15 23:55:14 +00:00
George Bosilca	b85c3ca081	Enable support for the INRIA knem (http://runtime.bordeaux.inria.fr/knem/) kernel device. This is part of Ma Teng's work on Open MPI. This commit was SVN r22315.	2009-12-15 23:34:09 +00:00
Vasily Filipov	c036c6ef95	Adding support for on-demand SRQ pre-post (receive wqe allocation). This commit was SVN r22313.	2009-12-15 15:52:10 +00:00
Vasily Filipov	354bfe527f	Improving support for non-homogeneous OpenFabrics network configurations. This commit was SVN r22312.	2009-12-15 14:25:07 +00:00
Pavel Shamis	4d02aea54c	Enabling, by default, RDMACM connection manager for RDMAoE devices. This commit was SVN r22311.	2009-12-15 13:52:19 +00:00
Jeff Squyres	4f68dfb03c	Remove some dead code (thanks to George for pointing it out). This commit was SVN r22309.	2009-12-14 21:20:41 +00:00
Christopher Yeoh	848bf0f5cd	Fixes deadlock in osc rdma module. See #2102 for details. This commit was SVN r22299.	2009-12-14 01:52:57 +00:00
Christopher Yeoh	d5253aa0f1	Fixes multithread race which causes corruption of no_credits_pending_frags list in the ib btl. See #2128 for details. This commit was SVN r22298.	2009-12-14 01:41:45 +00:00
Eugene Loh	8177d91835	Minor change so that if the number of shared-memory FIFOs is greater than can be used (e.g., number of on-node peers), that no additional room is set aside for those FIFOs that will never be created. This makes it easier to have dedicated FIFOs: just set btl_sm_num_fifos to be very large rather than setting it to be the local number of procs. In practice, we ask for extra headroom anyhow, so this change generally won't matter. This commit was SVN r22291.	2009-12-10 19:28:39 +00:00
George Bosilca	76222eb869	Get rid of the useless mca_pml_base_endpoint_t and replace it by [the well known and widely used!] mca_pml_endpoint_t. This commit was SVN r22277.	2009-12-08 17:29:54 +00:00
Pavel Shamis	b024aee10c	Removing unused lists from mca_btl_openib_qp_info_t. The lists were moved to device. This commit was SVN r22271.	2009-12-07 17:42:09 +00:00
George Bosilca	f0303a8b25	Indentation. This commit was SVN r22254.	2009-12-02 22:03:52 +00:00
Pavel Shamis	7d46985096	Removing unneeded spaces. This commit was SVN r22246.	2009-12-01 11:15:40 +00:00
Pavel Shamis	75a48f4b3c	Bugfix for possible race in rdmacm_destroy_dummy_qp. This commit was SVN r22245.	2009-12-01 08:09:43 +00:00
Shiqing Fan	7cf427c39b	Include the missing thread header, which is needed when building with --enable-progress-thread. This commit was SVN r22239.	2009-11-27 14:49:24 +00:00
Brian Barrett	b57b8c5b3f	Clean up request handling in the I/O framework to be more consistent with other request-using frameworks. - Rather than having mpi/c/* functions allocate requests explicitly, pass the MPI_Request* down to the I/O component and have it perform the allocation. - While the I/O base provides a base request which can be used, it is not required and all request management occurs within the component. - Push progress management into the component, rather than having it happen in the base. Progress functions are now easily registered, and not all (ie, the one existing) components use progress functions in any rational way. ROMIO switched to generalized requests instead of MPIO_Requests many moons ago, and Open MPI now uses ROMIO's generalized requests, so there is no reason to wrap those requests (which are OMPI requests) in another level of request. Now the file function passes the MPI_Request* to the ROMIO component, which passes it to the underlying ROMIO function, which calls MPI_Grequest_start to create an OMPI request, which is what gets set as the request to the user. Much cleaner. This patch has two motivations. One, a whole heck of a lot of code just got removed, and request handling is now much cleaner for I/O components. Two, by adding support for Argonne's proposed generalized request extensions, we can allow ROMIO to provide async I/O through generalized requests, which we couldn't rationally do in the old setup due to the crazy request completion rules. This commit was SVN r22235.	2009-11-26 05:13:43 +00:00
Brian Barrett	8075640ef1	The tests are MPI programs and are built using mpicc, so including OMPI headers won't work. This commit was SVN r22233.	2009-11-25 18:06:15 +00:00
Rainer Keller	276b813f48	- Output according to their type. This commit was SVN r22206.	2009-11-09 14:28:15 +00:00
Rainer Keller	366bd96c88	- Allow working without the xt-catamount module on Jaguar, reducing the number of components that until now needed to be deselected. This commit was SVN r22205.	2009-11-09 14:26:24 +00:00
Eugene Loh	88c0921c5e	Corrected the usage of "rc" in mca_btl_sm_component_progress. The return code for this function should be the number of events received. This commit was SVN r22191.	2009-11-04 03:10:35 +00:00
Jeff Squyres	ab00aea1ff	Per http://www.open-mpi.org/community/lists/devel/2009/10/7025.php , use the new Automake "silent rules" if available. If you are using an Automake prior to v1.11, you won't see the new silent rules -- it will automatically default back to the "verbose" rules. Note, too, that even with these changes, you can enable the verbose "make all" output in one of two ways: 1. Add "V=1" to your "make" command line {{{ shell$ make all V=1 }}} 2. Add "--disable-silent-rules" to your "configure" command line: {{{ shell$ ./configure --disable-silent-rules ... }}} The one down side of using the silent rules by default is that we'll get less diagnostic information when users send their build logs. I think we should update the web page to request that users send build logs of "make V=1", but I'm guessing that not everyone will do it. Note that I did ''not'' silent-ize the libltdl build (which is a dozen or so files in the beginning of the build) because we wholly import libltdl at autogen time. I therefore didn't want to patch libltdl (further) after importing it a) to remain as forward- compatible as possible, and b) patching the imported libltdl build system might be tricky in terms of timestamps / dependencies. So those dozen-or-so files will still be "verbose", but the rest of the files in OMPI will be "silent". This commit was SVN r22189.	2009-11-04 02:07:02 +00:00
Eugene Loh	1a44fc478d	In sm_btl_first_time_init(), when we figure the size of the shared area, we cap the size at LONG_MAX. But we are figuring out how much we need. So, if that amount exceeds LONG_MAX, we should return an "out of resource" error code. This commit was SVN r22172.	2009-10-29 23:06:32 +00:00
Rainer Keller	5be03b8fc0	- Patch r22148 overwrites the already defined LDFLAGS, losing e.g. -L... Needs to be moved to cmr:v1.3. This commit was SVN r22152. The following SVN revision numbers were found above: r22148 --> open-mpi/ompi@a6c1fe888f	2009-10-28 14:25:10 +00:00
Jeff Squyres	a6c1fe888f	We also need .so versioning of the OMPI "common" components since they are installed as standalone libraries in $libdir. This commit was SVN r22148.	2009-10-27 20:58:34 +00:00
Aurelien Bouteiller	59156cd92a	Fix gcc 4.3 warning berserk about non-literal string format. This commit was SVN r22147.	2009-10-27 20:45:02 +00:00
George Bosilca	3a2f071018	If the user asked for dynamic rules but "forgot" to provide them, nicely complain and switch back to the default behavior (fixed rules). This commit was SVN r22109.	2009-10-19 17:58:47 +00:00
Jeff Squyres	9afe50d886	Update Cisco copyrights for consistency. This commit was SVN r22072.	2009-10-07 22:02:32 +00:00
Jeff Squyres	0d1e177453	Remove 2 extraneous ORTE_ERROR_LOGs and 1 extraneous opal_output. This commit was SVN r22071.	2009-10-07 20:12:37 +00:00
Jeff Squyres	d56b8d9183	Fix CID 1369: minor memory leak. This commit was SVN r22067.	2009-10-07 19:40:00 +00:00
Jeff Squyres	de59a24593	Fix CID 1384. Also remove some opal_output(0,...)'s in favor of ORTE_ERROR_LOG. This commit was SVN r22066.	2009-10-07 18:58:58 +00:00
Jeff Squyres	ec71acf7ca	Fix CID 1385: fix an over-aggressive use of close, munmap, etc. in the error case. Also check for MAP_FAILED (instead of -1) from mmap(). This commit was SVN r22065.	2009-10-07 18:43:37 +00:00
Jeff Squyres	5ec86e5fe5	Fix CID 1386: fd can't be valid here, so don't bother to close/unlink. This commit was SVN r22064.	2009-10-07 18:30:26 +00:00
Jeff Squyres	0f8ac9223f	Refs trac:2023, #2027 . This commit does a bunch of things: * Address all remaining code review items from CMR #2023: * Defer mmap setup to be lazy; only set it up the first time we invoke a collective. In this way, we don't penalize apps that make lots of communicators but don't invoke collectives on them (per #2027). * Remove the extra assignments of mca_coll_sm_one (fixing a convertor count setup that was the real problem). * Remove another extra/unnecessary assignment. * Increase libevent polling frequency when using the RML to bootstrap mmap'ed memory. * Fix a minor procs-related memory leak in btl_sm. * Commit a datatype fix that George and I discovered along the way to fixing the coll sm. * Improve error messages when mmap fails, potentially trying to de-alloc any allocated memory when that happens. * Fix a previously-unnoticed confusion between extent and true_extent in coll sm reduce. This commit was SVN r22049. The following Trac tickets were found above: Ticket 2023 --> https://svn.open-mpi.org/trac/ompi/ticket/2023	2009-10-02 17:13:56 +00:00
George Bosilca	16c6370b73	A little bit of cleanup, the main logic is still the same. This commit was SVN r22043.	2009-10-01 14:05:25 +00:00
Shiqing Fan	21f6a1cb7c	Update the corresponding part of mmap for Windows. This commit was SVN r22038.	2009-09-30 14:50:17 +00:00
Shiqing Fan	96e9ffa016	Fix a type cast. This commit was SVN r22034.	2009-09-30 14:02:47 +00:00
Jeff Squyres	152bc14079	Rename the help file to be consistent with others; add it to the Makefile.am. This commit was SVN r22005.	2009-09-23 20:28:49 +00:00
Jeff Squyres	ef338602ef	Arrgh -- effectively revert r21997. We ''do'' need that header file... This commit was SVN r21998. The following SVN revision numbers were found above: r21997 --> open-mpi/ompi@bf5f14ab32	2009-09-22 21:19:38 +00:00
Jeff Squyres	bf5f14ab32	Remove some debugging stuff. This commit was SVN r21997.	2009-09-22 19:39:01 +00:00
Jeff Squyres	bb69bf22c0	Fix dumb logic in common sm setup that determines which nodes are local and who has the lowest name. This commit was SVN r21994.	2009-09-22 17:54:43 +00:00
Jeff Squyres	b91e7ba91f	This is no longer necessary. This commit was SVN r21991.	2009-09-22 15:01:00 +00:00
Jeff Squyres	1ef988c3d9	A slight optimization: no longer call sched_yield() when polling for shmem progress (or the Windows equiv). Instead, poll hard on the condition, but periodically call opal_progress(). This allows badly-formed apps (e.g., the ibm test communicator/bsend_free) to actually complete. To be clear, there are far too many apps out there that assume that MPI collectives will actually progress the rest of MPI. I don't like putting in a feature to enable broken apps, but I have a dim recollection of this issue coming up before (apps "hanging" when testing the sm coll because they assumed that calling collectives would trigger other MPI progress). Rather than have people claim that OMPI is broken, I prefer to put in this "workaround". :-( Indeed, the bsend_free test ''may'' be coded that way for exactly that reason...? I don't remember offhand... This commit was SVN r21984.	2009-09-21 22:20:44 +00:00
Jeff Squyres	64e3689a52	Grr -- test ''before'' committing! Sorry for all the noise folks; this one really fixes the problem. One more optimization coming later (separately). This commit was SVN r21983.	2009-09-21 21:32:26 +00:00
Jeff Squyres	bc43b6a085	Arrgh -- there was an extra assignment in there. Additionally, clean it up a little to drive the point home that the lowest named proc goes into array position [0]. This commit was SVN r21982.	2009-09-21 21:15:32 +00:00
Jeff Squyres	f9dfa03fde	Fix a potential ordering issue with the names and RML exchange during sm coll setup. This commit was SVN r21981.	2009-09-21 21:10:45 +00:00
Josh Hursey	7ac8d89f12	Since r21967 converted the mpool sm module into a real module, it broke some of the C/R logic in the ft_event function (actually it wouldn't build after that patch). This commit fixes the ft_event logic so that it uses the normal destroy functionality instead of the workaround with the component that was previously there. All in all it made for cleaner code, which is always good. If r21967 moves to v1.3, this patch will need to be moved as well. This commit was SVN r21972. The following SVN revision numbers were found above: r21967 --> open-mpi/ompi@533633b8cb	2009-09-17 14:45:17 +00:00
Josh Hursey	59143be39d	Fix a minor C/R bug related to cleaning up session directories when sm is present. Before this, we would restore the topmost old session directory. This commit makes sure that we remove it when we are done with it. This commit was SVN r21971.	2009-09-17 14:43:06 +00:00
Edgar Gabriel	9abeaad6e2	So here is what happens: in the v1.2 series the cid's could never go above the max. allowed for a particular pml. Because of that, pml_add_comm never checked for the cid, and in fact pml_add_comm was called in comm_set, which is before we knew the cid. In the v1.3 series (and trunk) we now check the cid to detect overflow, and because of that pml_add_comm has been moved after the cid allocation routine, namely into the comm_activate routine. In the v1.2 series, the comm_activate contained a synchronization step of the old communicator in order to prevent incoming fragments on the new communicator, with the main problem being that the allreduce in the communicator allocation finished at different times on different processes, and thus, this scenario could and did really occur. In the v1.3 series, the comm_activate does not contain the synchronization step anymore, since we introduced the new queue for fragments with unknown cid. The problem, however, is that whether a fragment is known or not is decided by using ompi_comm_lookup(), which will return something useful as soon as the cid allocation finished, even before pml_add_comm has been called. So there is a small time gap where we will not post a message into the queue for unknown cid's, but we can also not look up the process structure belonging to the rank in that comm (that is in pml_ob1_match_recv_frag or something like that). The current fix reintroduces the synchronization step in comm_activate, and ensures that no fragment can be received for a new communicator before the synchronization occurs, and thus comm_nextcid() and pml_add_comm have been called. It seems to be the safest and easiest way for now. Welcome back, v1.2. This commit was SVN r21970.	2009-09-17 14:37:02 +00:00
Jeff Squyres	4a40be650e	Improve the MCA param help messages for btl_tcp_if_in\|exclude. This commit was SVN r21968.	2009-09-15 17:19:57 +00:00


3139 commits