openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	9252afdcd9	Updates and tweaks to the documentation of the new MCA parameter system (written in conjunction with Nathan). This commit was SVN r28758.	2013-07-11 20:04:51 +00:00
Nathan Hjelm	a694bcb6b6	Add support for the MCA variable information level to ompi_info. Add an option to ompi_info (-l, --level) that takes a number in the interval (1,9). Only MCA variables up to this level will be printed. The default level is 1. Print the level as part of both the parsable and readable output. This commit was SVN r28750.	2013-07-10 18:52:36 +00:00
Ralph Castain	028f5ee7a6	Cleanup some bitrot from moving the db framework to opal and from the new mca param system This commit was SVN r28741.	2013-07-09 14:37:08 +00:00
Ralph Castain	315da8125d	Remove stale headers cmr:v1.7.3:reviewer=jsquyres This commit was SVN r28732.	2013-07-08 18:26:58 +00:00
Ralph Castain	eac174e624	For purposes of testing the RFC, make libevent2021 the default for now so it gets tested by MTT This commit was SVN r28730.	2013-07-05 23:14:22 +00:00
Brian Barrett	ea9cee73c1	Per RFC, remove darwin backtrace, since OS X since 10.5 has supported the execinfo() interface (which has been the default for OMPI to use on Darwin) This commit was SVN r28727.	2013-07-05 19:06:27 +00:00
Ralph Castain	21c8041a40	Update libevent 2021 component so it also only warns once when detecting reentrant behavior This commit was SVN r28721.	2013-07-04 04:41:04 +00:00
Ralph Castain	45fad1ddcc	We really should be closing the event framework when told to do so. cmr:v1.7.3,reviewer=jsquyres This commit was SVN r28714.	2013-07-03 16:57:14 +00:00
Ralph Castain	9166a8cc95	Per telecon today, add a flag so we only warn once about reentrant libevent loops - this will allow developers to better diagnose the problem as we won't swamp filesystems with warning messages. This commit was SVN r28712.	2013-07-03 04:51:36 +00:00
Jeff Squyres	ad16bcd6d1	Followup from Justin Bronder: Looks like I spoke too soon. The sandbox team has informed me that they are getting rid of SANDBOX_PID in the future and that using SANDBOX_ON would be preferred. This commit was SVN r28708.	2013-07-03 01:38:26 +00:00
Jeff Squyres	fea15ec34e	Add memory hooks override for Gentoo sandbox v2.5, too. Thanks to Justin Bronder for the patch. This commit was SVN r28702.	2013-07-02 12:34:51 +00:00
Ralph Castain	446e33a5d8	There are cases where we want to use the novm state machine, but the backend node topology differs from that where mpirun is executing. In those cases, we can wind up thinking we are oversubscribed because the head node has fewer cores than the compute nodes. To resolve this situation, add the ability to specify a backend topology file that mpirun shall use for its mapping operations. Create a new "set_topology" function in opal hwloc to support it. This commit was SVN r28682.	2013-06-27 03:04:50 +00:00
Jeff Squyres	dd25421d48	Convert strcpy() to strncpy(), and just to be extra-super paranoid, use memset(0) for extra bonus points. This commit was SVN r28668.	2013-06-22 12:21:18 +00:00
Joshua Ladd	0b5c1f2ea8	Add 'generic' support for PMI2 (previously, we checked for PMI2 only on Cray systems.) If your resource manager (e.g. SLURM) has support for PMI2, then the --with-pmi configure flag will enable its usage. If you don't have PMI2, then you will fallback to regular old PMI1. This patch was submitted by Ralph Castain and reviewed and pushed by Josh Ladd. This should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28666.	2013-06-21 15:28:14 +00:00
Nathan Hjelm	518d1fe200	Fix two typos that prevented alps direct launch from working This commit was SVN r28628.	2013-06-13 17:04:08 +00:00
Joshua Ladd	46362d2761	Stomps compiler warnings in HCA min-dist calculation. This should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28620.	2013-06-12 16:25:25 +00:00
Tom Naughton	d86c3ce669	+ remove autogenerated 'install-sh' This commit was SVN r28602.	2013-06-07 20:40:24 +00:00
Jeff Squyres	6d173af329	This commit introduces a new "mindist" ORTE RMAPS mapper, as well as some relevant updates/new functionality in the opal/mca/hwloc and orte/mca/rmaps bases. This work was mainly developed by Mellanox, with a bunch of advice from Ralph Castain, and some minor advice from Brice Goglin and Jeff Squyres. Even though this is mainly Mellanox's work, Jeff is committing only for logistical reasons (he holds the hg+svn combo tree, and can therefore commit it directly back to SVN). ----- Implemented distance-based mapping algorithm as a new "mindist" component in the rmaps framework. It allows mapping processes by NUMA due to PCI locality information as reported by the BIOS - from the closest to device to furthest. To use this algorithm, specify: {{{mpirun --map-by dist:<device_name>}}} where <device_name> can be mlx5_0, ib0, etc. There are two modes provided: 1. bynode: load-balancing across nodes 1. byslot: go through slots sequentially (i.e., the first nodes are more loaded) These options are regulated by the optional ''span'' modifier; the command line parameter looks like: {{{mpirun --map-by dist:<device_name>,span}}} So, for example, if there are 2 nodes, each with 8 cores, and we'd like to run 10 processes, the mindist algorithm will place 8 processes to the first node and 2 to the second by default. But if you want to place 5 processes to each node, you can add a span modifier in your command line to do that. If there are two NUMA nodes on the node, each with 4 cores, and we run 6 processes, the mindist algorithm will try to find the NUMA closest to the specified device, and if successful, it will place 4 processes on that NUMA but leaving the remaining two to the next NUMA node. You can also specify the number of cpus per MPI process. This option is handled so that we map as many processes to the closest NUMA as we can (number of available processors at the NUMA divided by number of cpus per rank) and then go on with the next closest NUMA. The default binding option for this mapping is bind-to-numa. It works if you don't specify any binding policy. But if you specified binding level that was "lower" than NUMA (i.e hwthread, core, socket) it would bind to whatever level you specify. This commit was SVN r28552.	2013-05-22 13:04:40 +00:00
Jeff Squyres	55382c1bf8	Bring over upstream hwloc trunk commit https://svn.open-mpi.org/trac/hwloc/changeset/5592 to fix the merging of groups when they are I/O objects. This commit was SVN r28551.	2013-05-22 12:34:59 +00:00
Nathan Hjelm	721779d7ab	Per RFC: remove old MCA parameter system. This commit was SVN r28541.	2013-05-20 15:36:13 +00:00
Jeff Squyres	089c632cce	Remove a bunch of dead code: gcc 4.7 warns of set-but-unused variables. So get rid of them. This commit was SVN r28538.	2013-05-17 21:45:49 +00:00
Jeff Squyres	4d9da92e60	Fixes trac:376: bu default the wrappr compilers will enable rpath support in generated executables on systems that support it. Use --disable-wrapper-rpath to disable this behavior. See text in README about --disable-wrapper-rpath for more details. This commit was SVN r28479. The following Trac tickets were found above: Ticket 376 --> https://svn.open-mpi.org/trac/ompi/ticket/376	2013-05-11 00:49:17 +00:00
Jeff Squyres	cad1d920b2	Check to ensure that we have struct ifreq.ifr_mtu before we try to use it, because Solaris although has SIOCFIGMTU, it curiously does not have ifreq.ifr_mtu. This commit was SVN r28460.	2013-05-07 13:51:50 +00:00
Jeff Squyres	4b9b3a81ff	Update the list of post-1.5.2 r numbers from hwloc that we have committed here. This commit was SVN r28458.	2013-05-07 01:22:06 +00:00
Jeff Squyres	ee0cdf86fd	Fix issue raised by Stefan Friedel: remove an extraneous -L that is added by hwloc's embedding so that it doesn't appear in libhwloc_embedded.la (and therefore propogate all the way up to libmpi.la). Committed upstream in hwloc SVN r5588. This commit was SVN r28457. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r5588	2013-05-07 01:21:18 +00:00
Ralph Castain	527ea1d090	Per the RFC, always enable libevent thread support. This commit was SVN r28443.	2013-05-03 15:39:05 +00:00
Ralph Castain	4c0dcb1aa2	Update ignores and remove build product This commit was SVN r28412.	2013-04-29 19:02:03 +00:00
Ralph Castain	5d7a93c032	Add the ability to use an external version of libevent. Clearly not recommended at this time. I've verified that it works in limited scenarios, but more thorough testing and performance impacts need to be assessed. Interesting how many includes had to be fixed here and there to fill in missing dependencies :-) This commit was SVN r28411.	2013-04-29 17:02:37 +00:00
Ralph Castain	3052acd968	Fix minor typo This commit was SVN r28410.	2013-04-29 17:02:11 +00:00
Ralph Castain	3818e88365	Remove and ignore build products This commit was SVN r28404.	2013-04-27 00:07:18 +00:00
Ralph Castain	c081a520a3	Fix --without-hwloc This commit was SVN r28396.	2013-04-25 19:13:56 +00:00
Nathan Hjelm	bccf8c657a	Per RFC add initial support for the MPI 3.0 tools interface. Current MPI_T support: - Full cvar interface. - Full categories interface. - No pvar support at this time. This commit was SVN r28376.	2013-04-24 15:59:23 +00:00
Ralph Castain	d721437c8d	Somebody (accidentally) removed the instructions for updating libevent releases in OMPI, so replace them with at least an outline on how to do it. This commit was SVN r28349.	2013-04-22 17:05:56 +00:00
Ralph Castain	1dc65b5fd7	Update libevent to 2.0.21-stable, but currently ignore it for all but those testing it This commit was SVN r28348.	2013-04-22 17:01:07 +00:00
Ralph Castain	6c6681e880	Fix an error in a test in the libevent configure.ac that we introduced - there are two brackets around the entire test code, so no need for double-brackets around the array indices within it cmr:v1.7.2 This commit was SVN r28347.	2013-04-22 15:29:44 +00:00
Jeff Squyres	e88881c25f	Also support getting the MAC and MTU. This commit was SVN r28344.	2013-04-17 22:17:42 +00:00
Jeff Squyres	eb012c2aad	Defensive programming: add a constructor for opal_if_t that zeros everything out before using it. This is not in response to any known bug, but rather just a pre-emptive, defensive move to help prevent bugs in code that forgets to initialize a field. This commit was SVN r28343.	2013-04-17 22:09:02 +00:00
Jeff Squyres	349ee654c1	Fix some --without-hwloc compile errors. Also remove one assigned-but-not-used variable assignment. This commit was SVN r28321.	2013-04-10 15:08:31 +00:00
Jeff Squyres	aef371c8f6	Fix bug introduced by r28236: make declaration and instantiation agree on "const". This commit was SVN r28320. The following SVN revision numbers were found above: r28236 --> open-mpi/ompi@cf377db823	2013-04-10 14:10:47 +00:00
Ralph Castain	45af6cf59e	The move of the orte_db framework to opal required that we create an opaque opal_identifier_t type as OPAL cannot know anything about the ORTE process name. However, passing a value down to opal and then having the db components reference it causes alignment issues on Solaris Sparc platforms. So pass the pointer instead and do the old "memcpy" trick to avoid the problem. This commit was SVN r28308.	2013-04-08 23:34:16 +00:00
Ralph Castain	4dbc468c3c	Remove stale file This commit was SVN r28299.	2013-04-07 13:52:48 +00:00
Ralph Castain	c121a784ae	Remove some weird code around opal_db_close and cleanup that framework's open/close operation This commit was SVN r28298.	2013-04-07 13:52:28 +00:00
Ralph Castain	10257b8b43	Add missing include This commit was SVN r28297.	2013-04-07 01:32:08 +00:00
Ralph Castain	1067b1f5ee	Add a little debug This commit was SVN r28295.	2013-04-06 15:24:35 +00:00
Ralph Castain	3bfa53eb91	Cleanup (again) the solaris topology code in hwloc...sigh. This commit was SVN r28294.	2013-04-06 14:45:32 +00:00
Ralph Castain	ec00fa3132	Fix missing variable declaration in hwloc 1.5.2 This commit was SVN r28293.	2013-04-05 17:43:34 +00:00
Ralph Castain	39a4e93e44	Correct the includes so that compiling with devel headers works This commit was SVN r28267.	2013-03-30 16:25:24 +00:00
Ralph Castain	d12eed0703	Silence warning This commit was SVN r28249.	2013-03-27 22:07:29 +00:00
Nathan Hjelm	3b3506717e	de-deprecate mca_base_param_init mca_base_param_finalize as they will be needed until the mca_base_param shim layer goes away This commit was SVN r28248.	2013-03-27 22:07:23 +00:00
Ralph Castain	95cf39b224	Fix non-updated opal_output channel This commit was SVN r28245.	2013-03-27 21:57:24 +00:00

1 2 3 4 5 ...

1127 Коммитов