openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	70b02a0178	Sometimes we don't have a valid error code, so don't segv if ompi_mpi_errnum_get_string() returns a NULL. This commit was SVN r19670.	2008-10-01 21:42:08 +00:00
Aurelien Bouteiller	77fa3b5d4c	Code to connect to event loggers over optimized MPI communication channels. This commit was SVN r19669.	2008-10-01 18:42:43 +00:00
Aurelien Bouteiller	aded765084	Upgrade to the new mca version This commit was SVN r19668.	2008-10-01 18:40:44 +00:00
Aurelien Bouteiller	73edda92db	Take into account Ralph's comments. There is no duplicate functionnality with the rml base. Also fix a stupid typo bug. This commit was SVN r19667.	2008-10-01 02:31:14 +00:00
Tim Mattox	1995fb23cb	Resync the NEWS file for a 1.2.8 change. This commit was SVN r19662.	2008-10-01 00:01:01 +00:00
George Bosilca	00d24bf8ab	Scalability patch, or slim-fast effect #1 . All BML structures just got a whole lot smaller, decreasing the memory footprint of the running application. How much it's a good question. Here is a breakdown: - in mca_bml_base_endpoint_t: 3 size_t + 1 uint32_t - in mca_bml_base_btl_t: 1 * int + 1 * double - 1 * float + 6 * size_t + 9 * (void) The decrease in mca_bml_base_endpoint_t is for each peer and the decrease in mca_bml_base_btl_t is for each BTL for each peer. So, if we consider the most convenient case where there is only one network between all peers, this decrease the memory foot print per peer by 9size_t + 9(void) + 2 * int32_t + 1 * double - 1 * float. On a 64 bits machine this will be 156 bytes per peer. Now we access all these fields directly from the underlying BTL structure, and as this structure is common to multiple BML endpoint, we are a lot more cache friendly. Even if this do not improve the latency, it makes the SM performance graph a lot smoother. This commit was SVN r19659.	2008-09-30 21:02:37 +00:00
George Bosilca	b32e4e7f34	Nothing important, mainly replacing tabs with spaces. This commit was SVN r19658.	2008-09-30 18:30:35 +00:00
George Bosilca	325d006577	Mostly cleanups, and eventually a little bit more scalable add_procs. There was an argument that was barely used, and on return at the PML level it contained nothing usable. It has been removed, so now we're using less memory ... This commit was SVN r19657.	2008-09-30 15:47:43 +00:00
Ralph Castain	aa11e0977c	Correct a bug in the bookmarking code that incorrectly looked at #slots instead of #slots_allocated, thus causing slot reductions in hostfiles to be ignored when selecting our starting node. Fixes trac:1527 This commit was SVN r19656. The following Trac tickets were found above: Ticket 1527 --> https://svn.open-mpi.org/trac/ompi/ticket/1527	2008-09-29 14:09:02 +00:00
Ralph Castain	4f89adae0c	Prettify the user level display of allocation and map to make it easier to see and understand This commit was SVN r19655.	2008-09-28 16:44:09 +00:00
Ralph Castain	508cb45583	Add a little more diagnostic info when we cannot do an rml send This commit was SVN r19654.	2008-09-28 02:13:49 +00:00
Aurelien Bouteiller	89a2eea37a	Add functions to access the opaque port_string and to add routes to a remote port. This is usefull for FT, but could also turn usefull when considering MPI3 extentions to the MPI2 dynamics. This commit was SVN r19653.	2008-09-27 13:22:32 +00:00
Jeff Squyres	8b786cac04	The configure test we had for checking whether openib could build (related to the presence of posix threads and ptmalloc2) is now a little outdated: since we don't build ptmalloc2 as part of libopal anymore, the openib BTL's requirements are not directly tied to ptmalloc2's anymore. Specifically, I altered the test to: 1. At compile time, if no threads are found, the ptmalloc2 component is going to be built, '''and the ptmalloc2 component is going to be inside libopal,''' then refuse to build the openib BTL. 1. At run time, if no threads were available at compile time and the ptmalloc2 component is part of the process, then refuse to use the openib BTL. Fixes trac:1537. This commit was SVN r19652. The following Trac tickets were found above: Ticket 1537 --> https://svn.open-mpi.org/trac/ompi/ticket/1537	2008-09-27 11:19:21 +00:00
George Bosilca	2803de5298	Remove the protection around computing the remote size. This has to be done always in a heterogeneous way in order to be able to support extern32. It doesn't really matter as it is outside the critical path. This commit was SVN r19651.	2008-09-26 23:11:53 +00:00
George Bosilca	ed0f5a18ce	Typo. This commit was SVN r19650.	2008-09-26 23:09:48 +00:00
Brian Barrett	76f47aaa1c	Fix a bunch of compiler warnings. Refs trac:1458 This commit was SVN r19649. The following Trac tickets were found above: Ticket 1458 --> https://svn.open-mpi.org/trac/ompi/ticket/1458	2008-09-26 16:15:05 +00:00
Jeff Squyres	c11bee41da	Add bullet about updated SLURM support This commit was SVN r19648.	2008-09-26 12:36:59 +00:00
Ralph Castain	edb3d99687	Update SLURM environmental variables used to describe allocation. Retain backwards compatibility to SLURM 1.1 and earlier versions. This commit was SVN r19647.	2008-09-26 02:38:37 +00:00
Tim Mattox	aaae28b6c7	Resync the NEWS file with the 1.2 branch. This commit was SVN r19646.	2008-09-25 21:48:17 +00:00
Aurelien Bouteiller	4be474f727	CRS is now an opal framework. It should use OPAL version defines. This commit was SVN r19643.	2008-09-25 21:01:04 +00:00
Kenneth Matney	91bbc6b919	Change algorithm from spawning a shell that spawns another shell, and thereby runs apstat twice; and in the process thereof reads the ALPS appinfo file TWICE; and in addition, experiences a failure sometimes which causes mpirun to hang. Change this to a looped read attempt that breaks on success, thereby avoiding failure (except in the most This commit was SVN r19642.	2008-09-25 20:44:16 +00:00
Tim Mattox	821d81304f	Testing... This commit was SVN r19641.	2008-09-25 20:34:00 +00:00
Jeff Squyres	8de0663ae0	Increase the size of MPI_MAX_PORT_NAME from 256 to 1024. Rationale: 1. This value has already changed since v1.2 (v1.2 MPI_MAX_PORT_NAME == 36). Hence, this commit simply increases the value from a previous change. 1. The changes does increase OMPI's memory footprint slightly, but only when using MPI-2 dynamics. So it is expected that the change will have minimal impact on the overall footprint. 1. The change is helpful for nodes that have 4 or more IP networks (e.g., regular ethernet and multiple IP-over-<pick your favorite high-speed network> networks). Without this change, invoking MPI_COMM_SPAWN on hosts with 4 or more IP networks will fail because we'll exceed 256 bytes for the port name. Some OMPI developer test clusters already have this kind of configuration (e.g., Cisco); it is expected that this is not too common in the real world yet, but with "manycore" coming, having multiple IP-based networks in a single server will likely become more common. This commit was SVN r19638.	2008-09-25 16:47:17 +00:00
Ralph Castain	037231fbcb	MOdify the node_rank and local_rank fields to be uint16_t so we can handle more than 256 procs/node. Change the type to a defined one so that any future change can be easily done, if required. This commit was SVN r19637.	2008-09-25 13:39:08 +00:00
Ralph Castain	55738aeabe	Very tiny modification of the output when displaying mca param values to clarify that ones found in the environment could have also been set on the cmd line - we don't have a way to distinguish them internally. This commit was SVN r19636.	2008-09-25 13:08:17 +00:00
Tim Mattox	6a3e28a3b6	Resync the NEWS file with the 1.3 branch. This commit was SVN r19635.	2008-09-24 21:06:34 +00:00
Tim Mattox	fae18d1ea2	Ugh, whitespace differences suck... fixing. This commit was SVN r19632.	2008-09-24 20:51:18 +00:00
Tim Mattox	1d5a6602b6	Resync the NEWS on the trunk with the 1.2 branch. This commit was SVN r19631.	2008-09-24 20:41:17 +00:00
Jeff Squyres	627f1ecd36	Oops -- fix silly cut-n-paste error... This commit was SVN r19630.	2008-09-24 20:17:54 +00:00
Jeff Squyres	d85aaf521a	Also show the process name; it is useful, at least to us developers ;-) This commit was SVN r19629.	2008-09-24 19:32:34 +00:00
Josh Hursey	77e6b72c06	Update my entry to reflect all of the affiliations. This commit was SVN r19627.	2008-09-24 17:34:58 +00:00
Jeff Squyres	78a25cf116	Commit a few missing header files, etc. This commit was SVN r19626.	2008-09-24 15:41:42 +00:00
Brian Barrett	1f69ae5356	Add SNL affiliation for me. SNL is plural. This commit was SVN r19625.	2008-09-24 15:20:45 +00:00
Ralph Castain	8d1ecdb361	Correct the creation of MPIR_Proctable so that the structs in the array correspond to the order of the ranks. This commit was SVN r19624.	2008-09-24 14:55:46 +00:00
Jeff Squyres	bbfac2dfb5	Based on a review by Ralph, no need to call getpid() or gethostname(); we already have them in orte_process_info. Refs trac:1523. This commit was SVN r19615. The following Trac tickets were found above: Ticket 1523 --> https://svn.open-mpi.org/trac/ompi/ticket/1523	2008-09-23 20:04:34 +00:00
Jeff Squyres	2879de60a1	Update a name spelling This commit was SVN r19614.	2008-09-23 19:59:57 +00:00
Jeff Squyres	ca323aae8e	Very minor updates. Refs trac:1399. This commit was SVN r19613. The following Trac tickets were found above: Ticket 1399 --> https://svn.open-mpi.org/trac/ompi/ticket/1399	2008-09-23 19:50:31 +00:00
Jeff Squyres	ef6a216771	Update AUTHORS file with all the IDs that have committed so far on the OMPI trunk. Need all organizations to ensure I got spellings and affiliations correct. Also commit a helper script to help keep AUTHORS up to date on the trunk; it should be run before we create release branches. This commit was SVN r19612.	2008-09-23 19:38:53 +00:00
Jeff Squyres	4c558ed637	Enable aggregation checking for "*** An error occurred..." MPI layer help messages so that users only see the message once instead of N times when their MPI app crashes. Note that there is a tradeoff here -- we now call malloc in this particular "show the error" code path. This shouldn't usually be a problem, because the errors typically displayed through this mechanism are MPI API argument problems (e.g., sending a negative count to MPI_SEND), and not memory errors. But such API argument errors could be a consequence of of a prior memory error, so there's a nonzero chance that the error failure will fail to print because malloc failed. In this case, the user can disable help message aggregation (via the orte_base_want_aggregate MCA parameter) and we'll fall back to the no-malloc code path (but without aggregation). Note that we won't aggregate before MPI_INIT or after MPI_FINALIZE. So if you call an MPI function before MPI_INIT / after MPI_FINALIZE, you'll still see the error message N times. Nothing we can do about that; we need ORTE to do the aggregation properly (which is obviously unavailable before MPI_INIT / after MPI_FINALIZE). This commit was SVN r19611.	2008-09-23 17:19:24 +00:00
Jeff Squyres	a676874f47	Disable global ID resolution when sparse groups are used. Tested by Terry and George in the non-sparse-groups scenarios. Fixes trac:1464. Will file a new ticket to actually resolve IDs when sparse groups are used. This commit was SVN r19610. The following Trac tickets were found above: Ticket 1464 --> https://svn.open-mpi.org/trac/ompi/ticket/1464	2008-09-23 16:27:01 +00:00
Pavel Shamis	bd09bbf851	Disabling IBCM support by default. The component still is not stable. This commit was SVN r19609.	2008-09-23 15:57:55 +00:00
Ralph Castain	e64b79f30f	Modify the --display-map and --display-alloc per note on devel list to reduce info for user understanding. Add --display-devel-map and --display-devel-alloc to display all the detailed info we used to provide - it is only of use/interest to developers anyway and confuses users. This commit was SVN r19608.	2008-09-23 15:46:34 +00:00
Josh Hursey	90c936b292	Cleanup BLCR configure logic. Add a '--with-blcr-libdir' option to allow a user to specify a library directory outside of the '--with-blcr' option. Needs to be moved to v1.3 This commit was SVN r19607.	2008-09-22 19:48:47 +00:00
Josh Hursey	36b824effd	Make sure to protect the symbol, so builds that do not involve threads will build properly. Thanks to Jeff for pointing this out to me. This commit was SVN r19606.	2008-09-22 19:03:41 +00:00
Kenneth Matney	68248a32ef	Add #include for stdio.h to allow make check to run with gcc 4.2.4 (on Cray XT platform). This commit was SVN r19605.	2008-09-22 18:00:30 +00:00
Jeff Squyres	e0a991a8c2	Print out a message telling the user how to enable non-aggregated help / error messages. This commit was SVN r19604.	2008-09-22 17:42:56 +00:00
Jeff Squyres	d6696c46a6	Oops -- sometimes we actually pass NULL for the error_code. Make sure to handle that nicely without segv'ing. This commit was SVN r19603.	2008-09-22 17:41:39 +00:00
Josh Hursey	0cd65bfaa8	Fix a SIGPIPE that may occur when checkpointing a restarted process. This was a result of calling system() in the BLCR CRS. After inspection and testing it was determined that the operation was no longer necessary. So the call was removed thus fixing the bug. This commit was SVN r19601.	2008-09-22 16:49:56 +00:00
Jeff Squyres	8eccda391a	Fix comment to match the code. This commit was SVN r19598.	2008-09-20 12:35:48 +00:00
Jeff Squyres	02f2cbe85a	* Added bullet about upgrading autotools * Added bullet about removing duplicate error messages * Some minor grammar and syntax fixes. This commit was SVN r19597.	2008-09-20 11:42:59 +00:00

1 2 3 4 5 ...

12266 Коммитов