openmpi

Автор	SHA1	Сообщение	Дата
Aurelien Bouteiller	73edda92db	Take into account Ralph's comments. There is no duplicate functionnality with the rml base. Also fix a stupid typo bug. This commit was SVN r19667.	2008-10-01 02:31:14 +00:00
George Bosilca	00d24bf8ab	Scalability patch, or slim-fast effect #1 . All BML structures just got a whole lot smaller, decreasing the memory footprint of the running application. How much it's a good question. Here is a breakdown: - in mca_bml_base_endpoint_t: 3 size_t + 1 uint32_t - in mca_bml_base_btl_t: 1 * int + 1 * double - 1 * float + 6 * size_t + 9 * (void) The decrease in mca_bml_base_endpoint_t is for each peer and the decrease in mca_bml_base_btl_t is for each BTL for each peer. So, if we consider the most convenient case where there is only one network between all peers, this decrease the memory foot print per peer by 9size_t + 9(void) + 2 * int32_t + 1 * double - 1 * float. On a 64 bits machine this will be 156 bytes per peer. Now we access all these fields directly from the underlying BTL structure, and as this structure is common to multiple BML endpoint, we are a lot more cache friendly. Even if this do not improve the latency, it makes the SM performance graph a lot smoother. This commit was SVN r19659.	2008-09-30 21:02:37 +00:00
George Bosilca	b32e4e7f34	Nothing important, mainly replacing tabs with spaces. This commit was SVN r19658.	2008-09-30 18:30:35 +00:00
George Bosilca	325d006577	Mostly cleanups, and eventually a little bit more scalable add_procs. There was an argument that was barely used, and on return at the PML level it contained nothing usable. It has been removed, so now we're using less memory ... This commit was SVN r19657.	2008-09-30 15:47:43 +00:00
Aurelien Bouteiller	89a2eea37a	Add functions to access the opaque port_string and to add routes to a remote port. This is usefull for FT, but could also turn usefull when considering MPI3 extentions to the MPI2 dynamics. This commit was SVN r19653.	2008-09-27 13:22:32 +00:00
Jeff Squyres	8b786cac04	The configure test we had for checking whether openib could build (related to the presence of posix threads and ptmalloc2) is now a little outdated: since we don't build ptmalloc2 as part of libopal anymore, the openib BTL's requirements are not directly tied to ptmalloc2's anymore. Specifically, I altered the test to: 1. At compile time, if no threads are found, the ptmalloc2 component is going to be built, '''and the ptmalloc2 component is going to be inside libopal,''' then refuse to build the openib BTL. 1. At run time, if no threads were available at compile time and the ptmalloc2 component is part of the process, then refuse to use the openib BTL. Fixes trac:1537. This commit was SVN r19652. The following Trac tickets were found above: Ticket 1537 --> https://svn.open-mpi.org/trac/ompi/ticket/1537	2008-09-27 11:19:21 +00:00
George Bosilca	2803de5298	Remove the protection around computing the remote size. This has to be done always in a heterogeneous way in order to be able to support extern32. It doesn't really matter as it is outside the critical path. This commit was SVN r19651.	2008-09-26 23:11:53 +00:00
George Bosilca	ed0f5a18ce	Typo. This commit was SVN r19650.	2008-09-26 23:09:48 +00:00
Brian Barrett	76f47aaa1c	Fix a bunch of compiler warnings. Refs trac:1458 This commit was SVN r19649. The following Trac tickets were found above: Ticket 1458 --> https://svn.open-mpi.org/trac/ompi/ticket/1458	2008-09-26 16:15:05 +00:00
Jeff Squyres	8de0663ae0	Increase the size of MPI_MAX_PORT_NAME from 256 to 1024. Rationale: 1. This value has already changed since v1.2 (v1.2 MPI_MAX_PORT_NAME == 36). Hence, this commit simply increases the value from a previous change. 1. The changes does increase OMPI's memory footprint slightly, but only when using MPI-2 dynamics. So it is expected that the change will have minimal impact on the overall footprint. 1. The change is helpful for nodes that have 4 or more IP networks (e.g., regular ethernet and multiple IP-over-<pick your favorite high-speed network> networks). Without this change, invoking MPI_COMM_SPAWN on hosts with 4 or more IP networks will fail because we'll exceed 256 bytes for the port name. Some OMPI developer test clusters already have this kind of configuration (e.g., Cisco); it is expected that this is not too common in the real world yet, but with "manycore" coming, having multiple IP-based networks in a single server will likely become more common. This commit was SVN r19638.	2008-09-25 16:47:17 +00:00
Ralph Castain	55738aeabe	Very tiny modification of the output when displaying mca param values to clarify that ones found in the environment could have also been set on the cmd line - we don't have a way to distinguish them internally. This commit was SVN r19636.	2008-09-25 13:08:17 +00:00
Jeff Squyres	627f1ecd36	Oops -- fix silly cut-n-paste error... This commit was SVN r19630.	2008-09-24 20:17:54 +00:00
Jeff Squyres	d85aaf521a	Also show the process name; it is useful, at least to us developers ;-) This commit was SVN r19629.	2008-09-24 19:32:34 +00:00
Jeff Squyres	bbfac2dfb5	Based on a review by Ralph, no need to call getpid() or gethostname(); we already have them in orte_process_info. Refs trac:1523. This commit was SVN r19615. The following Trac tickets were found above: Ticket 1523 --> https://svn.open-mpi.org/trac/ompi/ticket/1523	2008-09-23 20:04:34 +00:00
Jeff Squyres	4c558ed637	Enable aggregation checking for "*** An error occurred..." MPI layer help messages so that users only see the message once instead of N times when their MPI app crashes. Note that there is a tradeoff here -- we now call malloc in this particular "show the error" code path. This shouldn't usually be a problem, because the errors typically displayed through this mechanism are MPI API argument problems (e.g., sending a negative count to MPI_SEND), and not memory errors. But such API argument errors could be a consequence of of a prior memory error, so there's a nonzero chance that the error failure will fail to print because malloc failed. In this case, the user can disable help message aggregation (via the orte_base_want_aggregate MCA parameter) and we'll fall back to the no-malloc code path (but without aggregation). Note that we won't aggregate before MPI_INIT or after MPI_FINALIZE. So if you call an MPI function before MPI_INIT / after MPI_FINALIZE, you'll still see the error message N times. Nothing we can do about that; we need ORTE to do the aggregation properly (which is obviously unavailable before MPI_INIT / after MPI_FINALIZE). This commit was SVN r19611.	2008-09-23 17:19:24 +00:00
Jeff Squyres	a676874f47	Disable global ID resolution when sparse groups are used. Tested by Terry and George in the non-sparse-groups scenarios. Fixes trac:1464. Will file a new ticket to actually resolve IDs when sparse groups are used. This commit was SVN r19610. The following Trac tickets were found above: Ticket 1464 --> https://svn.open-mpi.org/trac/ompi/ticket/1464	2008-09-23 16:27:01 +00:00
Jeff Squyres	d6696c46a6	Oops -- sometimes we actually pass NULL for the error_code. Make sure to handle that nicely without segv'ing. This commit was SVN r19603.	2008-09-22 17:41:39 +00:00
Jeff Squyres	5fd742e769	Add in the standardized way to notify a debugger if the MPI job is about to abort. Fixes trac:1509. This commit was SVN r19596. The following Trac tickets were found above: Ticket 1509 --> https://svn.open-mpi.org/trac/ompi/ticket/1509	2008-09-20 11:34:37 +00:00
Matthias Jurenz	5755d35045	Removed - This file will be created by autotools This commit was SVN r19591.	2008-09-19 15:09:46 +00:00
Jeff Squyres	7d119a1c3b	Fix CID 1116: ensure to check return code (patch approved by George :-) ). This commit was SVN r19584.	2008-09-19 13:28:04 +00:00
Jeff Squyres	d0a8be6d2f	Fix CID 1117: ensure to check return values. This commit was SVN r19583.	2008-09-19 13:27:30 +00:00
Matthias Jurenz	d42592113b	Fixed compiler warning (unused variable) This commit was SVN r19577.	2008-09-17 14:39:19 +00:00
Josh Hursey	778e387618	fix a compiler warning This commit was SVN r19574.	2008-09-17 14:01:31 +00:00
Josh Hursey	80d05cf957	Cleanup the patch from r19566. Thanks to George and Jeff for pointing out a better way to do this. This commit was SVN r19573. The following SVN revision numbers were found above: r19566 --> open-mpi/ompi@351c3a3a86	2008-09-17 13:55:21 +00:00
Jeff Squyres	d2d06008a0	Change the default value of mpi_leave_pinned to -1, meaning that we'll figure it out at runtime (really meaning: we'll still default to "0" unless something explicitly overrides to 1, such as the openib BTL). This way, ompi_info doesn't confusingly report mpi_leave_pinned==0 for mpi_leave_pinned, but we end up running with mpi_leave_pinned==1. Fixes trac:1502. This commit was SVN r19571. The following Trac tickets were found above: Ticket 1502 --> https://svn.open-mpi.org/trac/ompi/ticket/1502	2008-09-16 22:06:14 +00:00
Josh Hursey	351c3a3a86	The ft_event function needs access to the bml_r2_remove_btl_progress() to ensure that all progress events are flushed as needed across a checkpoint/restart. This commit was SVN r19566.	2008-09-16 19:06:53 +00:00
Jeff Squyres	270f482fea	Addendum to r19561: also remove a comment that is no longer true and some code that is commented out. This commit was SVN r19564. The following SVN revision numbers were found above: r19561 --> open-mpi/ompi@17e65369be	2008-09-16 13:02:10 +00:00
George Bosilca	6a9514ee08	Make the code match the comment. I checked with Jelena, and based on the papers we published this is the expected algorithm for the specified message and communicator size. This commit closes ticket #1330. This commit was SVN r19563.	2008-09-15 23:28:40 +00:00
George Bosilca	acd3406aa7	Never drop messages. No never no more. This is supposed to fix the ticket #1460. This commit was SVN r19562.	2008-09-15 23:04:18 +00:00
George Bosilca	17e65369be	Fix the deadlock when we run out of resources on the BTLs. Move the progress function from the BML into the PML. The BTL progress functions are now directly registered with the event library. This commit was SVN r19561.	2008-09-15 22:56:23 +00:00
Jeff Squyres	4cf909a932	Fix a typpppo (thanks Rolf!). This commit was SVN r19549.	2008-09-11 21:43:01 +00:00
Jeff Squyres	f794580bbe	Print a [much] better error message when MPI processes are unable to reach each other (this problem just bit me; I had forgotten how horrid our previous error message was). This commit was SVN r19548.	2008-09-11 20:52:58 +00:00
Jeff Squyres	eeabae49b9	Per http://www.open-mpi.org/community/lists/devel/2008/09/4648.php , remove the unconditional opal_output's when mmap() fails, and instead, conditionally output the failure message via btl_base_verbose settings. This commit was SVN r19547.	2008-09-11 19:02:33 +00:00
Ralph Castain	e44ac3f36d	Remove some historical, but now unused, cruft This commit was SVN r19545.	2008-09-11 12:49:11 +00:00
Rainer Keller	5c58b4e7cd	- Move the OMPI_DECLSPEC from .c to .h file. breaks windows compilation. see r19502 This commit was SVN r19544. The following SVN revision numbers were found above: r19502 --> open-mpi/ompi@ce42e749a0	2008-09-11 12:26:33 +00:00
Josh Hursey	36185ad964	Replace the old coordinated component ('coord') and replace it with a much more refined version ('bkmrk'). The new component fixes a number of problems with the old component. The core algorithm is the same, but by changing the data strucutres a bit we have improved performance and memory utilization. There are still a couple corner cases that still need some work. However, I did not want to delay bringing this into the trunk (and v1.3 branch) for too much longer. This commit was SVN r19537.	2008-09-10 18:29:17 +00:00
Rolf vandeVaart	1ad9d0459e	Add a check for LOCK_SHARED in the sys/synch.h file. If it exists then smash it to avoid problems with preprocessor and C++. This fixes trac:1477. Help provided by Jeff and Terry. This commit was SVN r19533. The following Trac tickets were found above: Ticket 1477 --> https://svn.open-mpi.org/trac/ompi/ticket/1477	2008-09-10 12:58:30 +00:00
Rolf vandeVaart	3193cf1008	Fix workaround for Sun Studio compilers and message queue when compiling with threads. This commit was SVN r19528.	2008-09-09 13:46:36 +00:00
Jeff Squyres	0ae2c27d3b	Ensure that the mutex is properly constructed/destructed. This commit was SVN r19527.	2008-09-09 12:57:45 +00:00
Rainer Keller	1d7fd1f51d	- Initialize the lock as well Found by Christoph Niethammer using the mpi_test_suite This commit was SVN r19523.	2008-09-09 08:01:41 +00:00
Shiqing Fan	04ee20a880	- Mainly type casts. Microsoft VC++ compiler is too strict. This commit was SVN r19517.	2008-09-08 15:39:30 +00:00
Jeff Squyres	ec6cffcd71	Patch originally from George to improve ROMIO's performance when checking for contiguous datatypes by using our native DDT engine (rather than several MPI_* calls). The majority of the work is in the IO ROMIO module.c file, but there's a small part in adio/common/iscontig.c that we're also submitting upstream. This commit was SVN r19509.	2008-09-06 09:50:12 +00:00
George Bosilca	579d70edad	We should use #ifdef and not #if This commit was SVN r19504.	2008-09-05 12:44:19 +00:00
Jeff Squyres	1382b29aa3	Add the MPI::Comm::Call_errhandler() function (it was previously missing!). Fixes trac:1483. This commit was SVN r19503. The following Trac tickets were found above: Ticket 1483 --> https://svn.open-mpi.org/trac/ompi/ticket/1483	2008-09-05 12:26:06 +00:00
Rainer Keller	ce42e749a0	- Well, thanks MTT, OPAL_DECLSPEC needs to be on, since the attribute_visibility - Some other older minor changes (indentation) This commit was SVN r19502.	2008-09-05 07:47:54 +00:00
Nysal Jan	3b51258b13	Remove redundant code Refs trac:1463 This commit was SVN r19501. The following Trac tickets were found above: Ticket 1463 --> https://svn.open-mpi.org/trac/ompi/ticket/1463	2008-09-05 04:29:21 +00:00
Nysal Jan	4b68803260	Should be coords(i) >= dims(i) Refs trac:1463 This commit was SVN r19500. The following Trac tickets were found above: Ticket 1463 --> https://svn.open-mpi.org/trac/ompi/ticket/1463	2008-09-05 04:20:48 +00:00
Jeff Squyres	35c5863739	Remove a bit of redundant code -- this is error condition is now checked in the top-level API. This commit was SVN r19497.	2008-09-04 22:24:59 +00:00
George Bosilca	10b4664e9c	Fix the problem with the unexpected MX handler. The Solution so far is to mostly don't use this mechanism, as we have to be thread safe in order to really take full advantage of it. The unexpected handler is called by one of th MX threads, and we do not have control on the moment. In a non threaded case, this will completely destroy our recv requests queues, so the safest approach is to don't the unexpected handler. This commit was SVN r19496.	2008-09-04 15:46:29 +00:00
George Bosilca	9e0d64b9e5	Add a send immediate for MX. This commit was SVN r19495.	2008-09-04 15:43:56 +00:00

1 2 3 4 5 ...

4046 Коммитов