openmpi

Author	SHA1	Message	Date
Brian Barrett	9a476c7fdd	The print function doesn't change the process name, so take a const... This commit was SVN r15531.	2007-07-20 02:54:43 +00:00
Brian Barrett	5b9fa7e998	reapply r15517 and r15520, which were removed in r15527 so that I could get the RML/OOB merge in slightly easier This commit was SVN r15530. The following SVN revision numbers were found above: r15517 --> open-mpi/ompi@41977fcc95 r15520 --> open-mpi/ompi@9cbc9df1b8 r15527 --> open-mpi/ompi@2d17dd9516	2007-07-20 02:34:29 +00:00
Tim Prins	9602230ad6	remove defunct file This commit was SVN r15529.	2007-07-20 02:10:38 +00:00
Brian Barrett	39a6057fc6	A number of improvements / changes to the RML/OOB layers: * General TCP cleanup for OPAL / ORTE * Simplifying the OOB by moving much of the logic into the RML * Allowing the OOB RML component to do routing of messages * Adding a component framework for handling routing tables * Moving the xcast functionality from the OOB base to its own framework Includes merge from tmp/bwb-oob-rml-merge revisions: r15506, r15507, r15508, r15510, r15511, r15512, r15513 This commit was SVN r15528. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r15506 r15507 r15508 r15510 r15511 r15512 r15513	2007-07-20 01:34:02 +00:00
Brian Barrett	2d17dd9516	temporarily back out r15517 and 15520 so that I can get the RML / OOB changes to cleanly apply This commit was SVN r15527. The following SVN revision numbers were found above: r15517 --> open-mpi/ompi@41977fcc95	2007-07-20 01:10:34 +00:00
Tim Mattox	824ef791f9	Updated the 1.2.4 section of the NEWS file with yet more changes. This commit was SVN r15525.	2007-07-20 00:05:14 +00:00
Tim Mattox	0b4dfe812b	Updated the NEWS file due to reversion of CMR Refs trac:1054 in the 1.2 branch. This commit was SVN r15522. The following Trac tickets were found above: Ticket 1054 --> https://svn.open-mpi.org/trac/ompi/ticket/1054	2007-07-19 23:36:27 +00:00
George Bosilca	9cbc9df1b8	Export the orte_ns_base_print_name_args function. This commit was SVN r15520.	2007-07-19 21:45:27 +00:00
Tim Prins	0b06832fc7	Properly return a value in all cases. This commit was SVN r15519.	2007-07-19 21:33:23 +00:00
Ralph Castain	41977fcc95	Remove the cellid field from the orte_process_name_t structure. This only affects a handful of files in itself, but... Cleanup ALL instances of output involving the printing of orte_process_name_t structures using the ORTE_NAME_ARGS macro so that the number of fields and type of data match. Replace those values with a new macro/function pair ORTE_NAME_PRINT that outputs a string (using the new thread safe data capability) so that any future changes to the printing of those structures can be accomplished with a change to a single point. Note that I could not possibly find outputs that directly print the orte_process_name_t fields, but only dealt with those that used ORTE_NAME_ARGS. Hence, you may still have a few outputs that bark during compilation. Also, I could only verify those that fall within environments I can compile on, so other environments may yield some minor warnings. This commit was SVN r15517.	2007-07-19 20:56:46 +00:00
Ralph Castain	2110064a9a	Ensure that the LD_LIBRARY_PATH and PATH get properly set for procs locally spawned by mpirun. This commit was SVN r15516.	2007-07-19 19:00:06 +00:00
Josh Hursey	6026929490	Fix compiler error on Cray by adding in the std io/lib headers. This commit was SVN r15515.	2007-07-19 18:26:10 +00:00
Jeff Squyres	7ae4a22ed5	Wow. How did that one get through? This commit was SVN r15514.	2007-07-19 17:18:10 +00:00
Brian Barrett	6427c9f92a	oops, need a return statement there... This commit was SVN r15509.	2007-07-19 16:21:11 +00:00
Brian Barrett	52ee1cb5da	fix missing ; in solaris functions This commit was SVN r15505.	2007-07-19 15:15:41 +00:00
Ralph Castain	ccdb834574	Fix a couple of compile errors. Also, we need to ensure that we only attempt to call destructors on tsd keys that were defined. This commit was SVN r15501.	2007-07-19 12:56:41 +00:00
Jeff Squyres	7c52a0ce17	Help track down when NULL is passed to %s for OPAL replacements of asprintf and friends. This is not a failsafe; there are many cases where this check will not be used. But at least it's something... This commit was SVN r15500.	2007-07-19 12:28:43 +00:00
Brian Barrett	9b14008f61	add a couple of comments, clean up the organization a bit This commit was SVN r15499.	2007-07-18 22:56:33 +00:00
Brian Barrett	5c1c3cdf1c	remove debugging output This commit was SVN r15497.	2007-07-18 21:57:58 +00:00
Brian Barrett	8a7b6656b3	Reference count calls to the util access as well as the main initialized code This commit was SVN r15495.	2007-07-18 20:28:19 +00:00
Brian Barrett	916397f358	Use thread specific data and static buffers for the return type of opal_net_get_hostname() rather than malloc, because no one was freeing the buffer and the common use case was for printfs, where calling free is a pain. This commit was SVN r15494.	2007-07-18 20:25:01 +00:00
Brian Barrett	c5d0066c27	add ability to have thread-specific data on windows, pthreads, solaris threads, and non-threaded builds This commit was SVN r15492.	2007-07-18 20:23:45 +00:00
Ralph Castain	b6c60dfc07	Bring over the extra debugging output that helped a user find his NFS mount problems. This just adds ERROR_LOG messages when the session directory creation process fails so we can see where it is happening - really helps users (and us as well) figure out what specifically went wrong. This commit was SVN r15491.	2007-07-18 19:50:54 +00:00
Pavel Shamis	d837f1446b	This is a workaround for Ticket #1092. It will prevent the failure in openib finalize but it doesn't resolve the actual issue. I guess that the one-sided tests somehow allocate memory (mpool?) and don't release it. Need to check it. This commit was SVN r15488.	2007-07-18 18:02:13 +00:00
Jeff Squyres	0c321d798f	Group the "expected" NEWS items just so it's easier to see when changes are expected to hit releases This commit was SVN r15485.	2007-07-18 15:42:29 +00:00
Tim Prins	e41f86dfe6	add a small amount of debugging output This commit was SVN r15483.	2007-07-18 15:20:55 +00:00
Sven Stork	a6d04c60b4	- the use_component function is always present independent of OMPI_WANT_LIBLTDL This commit was SVN r15481.	2007-07-18 14:25:51 +00:00
Sven Stork	92fce998fe	- the use_component function is always present independent of OMPI_WANT_LIBLTDL This commit was SVN r15480.	2007-07-18 14:19:24 +00:00
Gleb Natapov	45fcb45e31	Remove debug checks that produce lots of warnings during compilation. This commit was SVN r15479.	2007-07-18 13:49:15 +00:00
Gleb Natapov	30b2183314	Remove debug output from a hot path. This commit was SVN r15478.	2007-07-18 12:48:34 +00:00
Sven Stork	2ab401dc3c	- export required symbols used by OSC This commit was SVN r15476.	2007-07-18 11:51:52 +00:00
Jeff Squyres	3bc940ac27	Fix three things from r15474 (thanks to Brian for noticing): * bml.h had a change that introduced a variable named "_order" to avoid a conflict with a local variable. The namespace starting with _ belongs to the os/compiler/kernel/not us. So we can't start symbols with _. So I replaced it with arg_order, and also updated the threaded equivalent of the macro that was modified. * in btl_openib_proc.c, one opal_output accidentally had its string reverted from "ompi_modex_recv..." to "mca_pml_base_modex_recv....". This was fixed. * The change to ompi/runtime/ompi_preconnect.c was entirely reverted; it was an artifact of debugging. This commit was SVN r15475. The following SVN revision numbers were found above: r15474 --> open-mpi/ompi@8ace07efed	2007-07-18 11:38:06 +00:00
Jeff Squyres	8ace07efed	This commit brings in two major things: 1. Galen's fine-grain control of queue pair resources in the openib BTL. 2. Pasha's new implementation of asynchronous HCA event handling. Pasha's new implementation doesn't take much explanation, but the new "multifrag" stuff does. Note that "svn merge" was not used to bring this new code from the /tmp/ib_multifrag branch -- something Bad happened in the periodic trunk pulls on that branch making an actual merge back to the trunk effectively impossible (i.e., lots and lots of arbitrary conflicts and artificial changes). :-( == Fine-grain control of queue pair resources == Galen's fine-grain control of queue pair resources to the OpenIB BTL (thanks to Gleb for fixing broken code and providing additional functionality, Pasha for finding broken code, and Jeff for doing all the svn work and regression testing). Prior to this commit, the OpenIB BTL created two queue pairs: one for eager size fragments and one for max send size fragments. When the use of the shared receive queue (SRQ) was specified (via "-mca btl_openib_use_srq 1"), these QPs would use a shared receive queue for receive buffers instead of the default per-peer (PP) receive queues and buffers. One consequence of this design is that receive buffer utilization (the size of the data received as a percentage of the receive buffer used for the data) was quite poor for a number of applications. The new design allows multiple QPs to be specified at runtime. Each QP can be set up to use PP or SRQ receive buffers as well as giving fine-grained control over receive buffer size, number of receive buffers to post, when to replenish the receive queue (low water mark) and for SRQ QPs, the number of outstanding sends can also be specified. 
The following is an example of the syntax to describe QPs to the OpenIB BTL using the new MCA parameter btl_openib_receive_queues: {{{ -mca btl_openib_receive_queues \ "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32" }}} Each QP description is delimited by ";" (semicolon) with individual fields of the QP description delimited by "," (comma). The above example therefore describes 4 QPs. The first QP is: P,128,16,4 Meaning: per-peer receive buffer QPs are indicated by a starting field of "P"; the first QP (shown above) is therefore a per-peer based QP. The second field indicates the size of the receive buffer in bytes (128 bytes). The third field indicates the number of receive buffers to allocate to the QP (16). The fourth field indicates the low watermark for receive buffers at which time the BTL will repost receive buffers to the QP (4). The second QP is: S,1024,256,128,32 Shared receive queue based QPs are indicated by a starting field of "S"; the second QP (shown above) is therefore a shared receive queue based QP. The second, third and fourth fields are the same as in the per-peer based QP. The fifth field is the number of outstanding sends that are allowed at a given time on the QP (32). This provides a "good enough" mechanism of flow control for some regular communication patterns. QPs MUST be specified in ascending receive buffer size order. This requirement may be removed prior to 1.3 release. This commit was SVN r15474.	2007-07-18 01:15:59 +00:00
George Bosilca	e3ad495e7b	Remove an unused variable. This commit was SVN r15473.	2007-07-17 22:34:59 +00:00
George Bosilca	59ee366728	Remove a compilation warning. This commit was SVN r15472.	2007-07-17 22:32:59 +00:00
Rich Graham	f2a30cde5d	add table of send completion callback functions, on a per send-type basis. This commit was SVN r15471.	2007-07-17 21:26:56 +00:00
Rich Graham	0991c3d5f5	move buffered send component clean up out of the pml to ompi_mpi_finalize. This commit was SVN r15463.	2007-07-17 14:50:52 +00:00
Sven Stork	73f1d800cf	- Make component selection work with static builds. Move the matching logic out of the dynamic path into an extra function. Add the corresponding check to the static component path. This commit was SVN r15458.	2007-07-17 12:06:51 +00:00
George Bosilca	e782da00e0	Don't allow the same communicator to be used in a multi-threaded build by several threads to create new communicators. There is nothing in the standard about threading and communicator functions, but as they include collective communications I expect the same rules have to be applied. As such, on an incorrect MPI program we deadlock (!). This commit was SVN r15456.	2007-07-17 00:33:27 +00:00
Rich Graham	de5670cd79	add missing header file - Thanks Brian. This commit was SVN r15455.	2007-07-17 00:06:35 +00:00
Rich Graham	1a4ce2a961	move setting of the component used to manage buffered sends out of the pmls, and into ompi_mpi_init. This is the first of several steps to pull buffered send management out of the pmls. This commit was SVN r15451.	2007-07-16 21:52:25 +00:00
Ralph Castain	511457feb5	Remove stale test code. At least we were wise enough to have eliminated this code from the "make check" tree, but almost none of it compiles and of what does compile, nothing seems to really work. This commit was SVN r15446.	2007-07-16 16:34:14 +00:00
Brian Barrett	6a1f876e98	Don't inline this function so that we can access the predefined datatype array even when visibility is turned on This commit was SVN r15444.	2007-07-16 16:29:51 +00:00
Sven Stork	c43335d671	- release the lock before we forward a message. In the threaded build it's possible that we have to process an ack before this function returns. If we don't release the lock here we cause a deadlock later in ack processing function. This commit was SVN r15441.	2007-07-16 14:43:24 +00:00
Ralph Castain	5121dfe7e7	With the changes to the failed-to-start logic, we need to revise the odls so it doesn't overwrite the exit status on procs that are not found. Otherwise, we lose the appropriate error message to the user. This commit was SVN r15440.	2007-07-16 13:50:26 +00:00
Sven Stork	804f3bee41	- export symbols that are required for the fortran bindings This commit was SVN r15439.	2007-07-16 13:23:57 +00:00
Ralph Castain	cd9213a9f0	Son of a gun - how did that fix not get into r15390? Fix the blasted iof null component so it only is selected if/when directed. This commit was SVN r15437. The following SVN revision numbers were found above: r15390 --> open-mpi/ompi@bd65f8ba88	2007-07-16 12:38:11 +00:00
Ralph Castain	d109e9a6f4	Roll in the Voltaire core/socket/etc process mapping implementation. Only change I made was to cleanup some of the diagnostic output in the odls_default component so it uses the -mca odls_base_verbose parameter. You will not see any impact from this change unless you use the syntax described in ticket #1023. I've tried as many of the RAS components as possible and saw no problem - there may be issues with other RAS components that would not compile on any of my systems. Anything that appears should be trivial to fix. This commit was SVN r15427.	2007-07-14 15:14:07 +00:00
George Bosilca	c839694fb8	Don't print anything when the user requested a specific MX interface. This commit was SVN r15426.	2007-07-14 00:04:50 +00:00
George Bosilca	1e825888a5	Fix the problem reported in #1087. The global send and receive request queues are now released in the base close, so there is no need for the cm PML to destroy them. This commit was SVN r15425.	2007-07-13 23:56:09 +00:00
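
The `btl_openib_receive_queues` syntax described in the r15474 message (8ace07efed) can be illustrated with a small parser. This is only a sketch of the format as that message describes it, not the OpenIB BTL's actual parsing code: the names `QueuePairSpec` and `parse_receive_queues` are hypothetical, and treating "ascending receive buffer size order" as non-decreasing is an assumption.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class QueuePairSpec:
    """One QP description (hypothetical type, for illustration only)."""
    kind: str                                # "P" = per-peer, "S" = shared receive queue
    buffer_size: int                         # receive buffer size in bytes
    num_buffers: int                         # number of receive buffers to allocate
    low_watermark: int                       # repost receive buffers at this level
    max_pending_sends: Optional[int] = None  # fifth field, "S" entries only


def parse_receive_queues(spec: str) -> List[QueuePairSpec]:
    """Parse a ';'-delimited list of ','-delimited QP descriptions."""
    queues = []
    for entry in spec.split(";"):
        fields = entry.strip().split(",")
        kind = fields[0]
        if kind == "P" and len(fields) == 4:
            size, count, low = (int(f) for f in fields[1:])
            queues.append(QueuePairSpec("P", size, count, low))
        elif kind == "S" and len(fields) == 5:
            size, count, low, sends = (int(f) for f in fields[1:])
            queues.append(QueuePairSpec("S", size, count, low, sends))
        else:
            raise ValueError(f"malformed QP description: {entry!r}")

    # The commit message requires ascending receive buffer size order;
    # accepting equal sizes here is an assumption of this sketch.
    sizes = [q.buffer_size for q in queues]
    if sizes != sorted(sizes):
        raise ValueError("QPs must be listed in ascending receive buffer size order")
    return queues


if __name__ == "__main__":
    example = "P,128,16,4;S,1024,256,128,32;S,4096,256,128,32;S,65536,256,128,32"
    for qp in parse_receive_queues(example):
        print(qp)
```

Run on the example string from the commit message, the sketch yields four entries: one per-peer QP followed by three shared receive queue QPs.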


10110 commits