openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	2ba10c37fe	Per RFC, bring in the following changes: * Remove paffinity, maffinity, and carto frameworks -- they've been wholly replaced by hwloc. * Move ompi_mpi_init() affinity-setting/checking code down to ORTE. * Update sm, smcuda, wv, and openib components to no longer use carto. Instead, use hwloc data. There are still optimizations possible in the sm/smcuda BTLs (i.e., making multiple mpools). Also, the old carto-based code found out how many NUMA nodes were ''available'' -- not how many were used ''in this job''. The new hwloc-using code computes the same value -- it was not updated to calculate how many NUMA nodes are used ''by this job.'' * Note that I cannot compile the smcuda and wv BTLs -- I ''think'' they're right, but they need to be verified by their owners. * The openib component now does a bunch of stuff to figure out where "near" OpenFabrics devices are. '''THIS IS A CHANGE IN DEFAULT BEHAVIOR!!''' and still needs to be verified by OpenFabrics vendors (I do not have a NUMA machine with an OpenFabrics device that is a non-uniform distance from multiple different NUMA nodes). * Completely rewrite the OMPI_Affinity_str() routine from the "affinity" mpiext extension. This extension now understands hyperthreads; the output format of it has changed a bit to reflect this new information. * Bunches of minor changes around the code base to update names/types from maffinity/paffinity-based names to hwloc-based names. * Add some helper functions into the hwloc base, mainly having to do with the fact that we have the hwloc data reporting ''all'' topology information, but sometimes you really only want the (online \| available) data. This commit was SVN r26391.	2012-05-07 14:52:54 +00:00
George Bosilca	f09e3ce5a4	Spring cleanup. Nothing important. This commit was SVN r26247.	2012-04-06 15:48:07 +00:00
George Bosilca	654c75ff24	As suggested on the mailing list a while back, switch the default alltoallv algorithm to pairwise exchange instead of the default one. This might improve the scheduling and relax the pressure on the network. This commit was SVN r26246.	2012-04-06 15:47:29 +00:00
Josh Hursey	d1571b027a	Fix a few error return paths This commit was SVN r26233.	2012-04-04 15:11:03 +00:00
Mike Dubman	a45898ea9c	fix support for fca 2.2, warning fixes on rhel 6.x This commit was SVN r26166.	2012-03-20 10:00:52 +00:00
George Bosilca	a78a7bd8e8	The tuned collectives can now deal with more than 2Gb of data. This commit was SVN r26103.	2012-03-05 22:23:44 +00:00
George Bosilca	762b3e13a9	Use the correct name for the datatype destruction function. This commit was SVN r26100.	2012-03-05 15:54:53 +00:00
George Bosilca	7d523a8852	Avoid calling the bcast with counts larger than INT_MAX. This commit was SVN r26098.	2012-03-05 14:30:30 +00:00
George Bosilca	e8c358c188	Allow Open MPI to deal with size_t internally. This commit was SVN r26097.	2012-03-05 14:10:26 +00:00
George Bosilca	f83670211e	Allow the user to define dynamic rules for messages larger than 2GB. This commit was SVN r26084.	2012-03-02 21:16:23 +00:00
George Bosilca	8791ade293	Help he selection of the right algorithm for large data (> 2Gb). Thanks to Fujitsu for the patch. This commit was SVN r26080.	2012-03-02 19:12:22 +00:00
George Bosilca	72f731f25f	The SM2 collective component has not been updated in a long time. Rich, the original developer, agrees with this removal. This commit was SVN r25368.	2011-10-25 22:07:09 +00:00
Rainer Keller	4e6a6fc146	- Check, whether the compiler supports __builtin_clz (count leading zeroes); if so, use it for bit-operations like opal_cube_dim and opal_hibit. Implement two versions of power-of-two. In case of opal_next_poweroftwo, this reduces the average execution time from 83 cycles to 4 cycles (Intel Nehalem, icc, -O2, inlining, measured rdtsc, with loop over 2^27 values). Numbers for other functions are similar (but of course heavily depend on the usage, e.g. opal_hibit() with a start of 4 does not save much). The bsr instruction on AMD Opteron is also not as fast. - Replace various places where the next power-of-two is computed. Tested on Intel Nehalem Cluster with openib, compilers GNU-4.6.1 and Intel-12.0.4 using mpi_testsuite -t "Collective" with 128 processes. This commit was SVN r25270.	2011-10-11 22:49:01 +00:00
George Bosilca	2fefd3a928	Don't forget to move the pointer back by the true_lb. This commit was SVN r25262.	2011-10-11 20:15:49 +00:00
George Bosilca	ce7935c8fa	Obviously these were not needed. This commit was SVN r25231.	2011-10-04 14:56:34 +00:00
George Bosilca	80c02647c8	Each level (OPAL/ORTE/OMPI) should only return it's own constants, instead of the current mismatch. This commit was SVN r25230.	2011-10-04 14:50:31 +00:00
Rolf vandeVaart	0749a220e8	Add support for MPI_IN_PLACE to MPI_Exscan. Required for MPI 2.2 compliance. Reviewed by Jeff Squyres. This fixes trac:2221. This commit was SVN r25165. The following Trac tickets were found above: Ticket 2221 --> https://svn.open-mpi.org/trac/ompi/ticket/2221	2011-09-20 14:54:41 +00:00
George Bosilca	9687e7f38e	This commit fixes trac:2679 and should be added to cmr:v1.4:reviewer=jsquyres and cmr:v1.5:reviewer=jsquyres This commit was SVN r25155. The following Trac tickets were found above: Ticket 2679 --> https://svn.open-mpi.org/trac/ompi/ticket/2679	2011-09-18 00:58:26 +00:00
Wesley Bland	4e7ff0bd5e	By popular demand the epoch code is now disabled by default. To enable the epochs and the resilient orte code, use the configure flag: --enable-resilient-orte This will define both: ORTE_ENABLE_EPOCH ORTE_RESIL_ORTE This commit was SVN r25093.	2011-08-26 22:16:14 +00:00
Mike Dubman	96ef2fc0e4	fix handling datatypes which have a gap in the beginning This commit was SVN r24936.	2011-07-25 06:30:09 +00:00
Jeff Squyres	b2b781e537	Fix a few miscelaneous memory leaks. This commit was SVN r24865.	2011-07-08 16:39:58 +00:00
Wesley Bland	e1ba09ad51	Add a resilience to ORTE. Allows the runtime to continue after a process (or ORTED) failure. Note that more work will be necessary to allow the MPI layer to take advantage of this. Per RFC: http://www.open-mpi.org/community/lists/devel/2011/06/9299.php This commit was SVN r24815.	2011-06-23 20:38:02 +00:00
Samuel Gutierrez	81f38b258a	commit of new shared memory backing facility framework (shmem) and its components. This commit was SVN r24795.	2011-06-21 15:41:57 +00:00
George Bosilca	65661a3cb4	Dont use a temporary string. This commit was SVN r24786.	2011-06-20 09:29:19 +00:00
Mike Dubman	36db9c6233	* updated copyrights * added support for non-contig data layout in FCA This commit was SVN r24702.	2011-05-16 14:43:11 +00:00
Jeff Squyres	ec90a3ba6d	Fix a few memory leaks, and ensure that coll sm is also registering the common SM MCA params. This commit was SVN r24497.	2011-03-08 17:36:59 +00:00
Mike Dubman	70392ac1dc	fca: broadcast comm_new return status to from rank0 to all ranks prior to exiting with an error This commit was SVN r24481.	2011-03-02 22:18:43 +00:00
George Bosilca	87f3109df4	Cleanups. This commit was SVN r24458.	2011-02-25 00:28:32 +00:00
Mike Dubman	89ba89e812	- added support for upcomming FCA v2.1 version This commit was SVN r24418.	2011-02-21 14:08:24 +00:00
Mike Dubman	81222e1fe7	* fix PGI compiler support which does not have __BASE_FILE__ macro This commit was SVN r24369.	2011-02-10 06:42:37 +00:00
Mike Dubman	4a2e29eb32	updated Makefile with a new file This commit was SVN r24199.	2011-01-01 14:11:49 +00:00
Mike Dubman	c56e3141cb	fca: fix segmentation fault when no underlying collective implementation is found This commit was SVN r24198.	2010-12-31 12:03:49 +00:00
Mike Dubman	3d517c0285	ABI cleanups This commit was SVN r24193.	2010-12-28 07:11:46 +00:00
Mike Dubman	b339a7a07b	Add FCA 1.2/2.0 backward compatibility, depending on OMPI_FCA_VERSION_xx macro definition. This commit was SVN r24192.	2010-12-27 21:32:34 +00:00
Shiqing Fan	f43862420c	Convert the bad dos line endings to unix style for all windows related files. This commit was SVN r24137.	2010-12-02 12:08:08 +00:00
Mike Dubman	956e030f28	support for dynamic rules to control offload This commit was SVN r24094.	2010-11-29 04:11:57 +00:00
Mike Dubman	5a7d76bb9c	resolve many warnings, comply to c99 This commit was SVN r24040.	2010-11-11 12:14:31 +00:00
Mike Dubman	f9bebe53f9	- fix fca support for MPI_IN_PLACE in allgather and allgatherv collectives This commit was SVN r23841.	2010-10-06 19:09:02 +00:00
Mike Dubman	f525245498	- support for MPI_IN_PLACE during gather ops - fix ABI check and message This commit was SVN r23840.	2010-10-06 16:27:45 +00:00
Jeff Squyres	73bcc4a36b	Fix mistake that came in via the ompi-agen tree in r23764. The mistake wasn't part of the core autogen upgrade; it was an additional 'bonus' cleanup. Oops. The mistake will always create a set of directories under installdir, even if you do not --with-devel-headers. The set of directories will be empty, but still -- they should not be there at all. This commit fixes that -- the directories are not created at all if you do not --with-devel-headers This commit was SVN r23801. The following SVN revision numbers were found above: r23764 --> open-mpi/ompi@40a2bfa238	2010-09-24 22:53:28 +00:00
Mike Dubman	58aa7fd161	enabling gather This commit was SVN r23773.	2010-09-20 06:29:54 +00:00
Mike Dubman	f754bde8eb	fixing r23764 leftovers, adopting Jeff's note This commit was SVN r23772. The following SVN revision numbers were found above: r23764 --> open-mpi/ompi@40a2bfa238	2010-09-20 06:27:43 +00:00
Mike Dubman	bd9a1f28a3	revert r23764 in ompi/mca/coll/fca This commit was SVN r23771. The following SVN revision numbers were found above: r23764 --> open-mpi/ompi@40a2bfa238	2010-09-20 06:06:45 +00:00
Ralph Castain	40a2bfa238	WARNING: Work on the temp branch being merged here encountered problems with bugs in subversion. Considerable effort has gone into validating the branch. However, not all conditions can be checked, so users are cautioned that it may be advisable to not update from the trunk for a few days to allow MTT to identify platform-specific issues. This merges the branch containing the revamped build system based around converting autogen from a bash script to a Perl program. Jeff has provided emails explaining the features contained in the change. Please note that configure requirements on components HAVE CHANGED. For example. a configure.params file is no longer required in each component directory. See Jeff's emails for an explanation. This commit was SVN r23764.	2010-09-17 23:04:06 +00:00
Mike Dubman	104d57f69a	* Support allgatherv, convert displs and rcounts arrays to bytes. * change comm_init API - no need to pass local rank groups, fca calculates that on its own. * remove local rank list from module - libfca maintains that now. * in fca_bcast and fca_reduce - pass root rank index and let libfca figure out the local rank index. This commit was SVN r23716.	2010-09-05 09:49:59 +00:00
Mike Dubman	48274c1c77	better control for enable/disable specific coll APIs This commit was SVN r23708.	2010-09-02 09:22:24 +00:00
Mike Dubman	8ef56bf258	* drop support for FCA v1.2 * add support for FCA ABI * add support for allgather This commit was SVN r23705.	2010-09-01 11:29:10 +00:00
Mike Dubman	fca50c4a09	comply to code-style: no c++ style commends This commit was SVN r23645.	2010-08-24 13:42:21 +00:00
Mike Dubman	9cb2e0490b	removed #if 0 This commit was SVN r23643.	2010-08-24 13:32:28 +00:00
Mike Dubman	a036c24253	revert fix to comply with #2534 - use op->o_name directly - cosmetic prints This commit was SVN r23614.	2010-08-15 11:04:34 +00:00

1 2 3 4 5 ...

547 Коммитов