openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	aa0ceaa78b	Move common code to ompi/mpi/fortran/base. The attribute and conversion callback subroutine interfaces are used by all 3 modules, and belong in the fortran/base directory, not the directory of a specific module. Also clean up some comments. cmr=v1.7.4:ticket=4162 This commit was SVN r30378. The following Trac tickets were found above: Ticket 4162 --> https://svn.open-mpi.org/trac/ompi/ticket/4162	2014-01-23 01:28:04 +00:00
Jeff Squyres	19617394f0	Add profiling versions of dist_graph functions into the library Also fix the interfaces that have logical parameters (the non-profiling versions were added/fixed a long time ago; it looks like the profiling versions were inadvertantly skipped). cmr=v1.7.4:ticket=4162 This commit was SVN r30377. The following Trac tickets were found above: Ticket 4162 --> https://svn.open-mpi.org/trac/ompi/ticket/4162	2014-01-23 01:24:54 +00:00
Jeff Squyres	5aa75d0ed9	Add missing pmpi interfaces for neighbor routines Somehow these interfaces were missed when adding these interfaces. cmr=v1.7.4:ticket=4162 This commit was SVN r30376. The following Trac tickets were found above: Ticket 4162 --> https://svn.open-mpi.org/trac/ompi/ticket/4162	2014-01-23 01:23:31 +00:00
Jeff Squyres	fe76eac8ab	Revert part of SVN r30273: remove "protected" from special Fortran sentinels r30273 made the use of the Fortran "protected" keyword be compiler-specific (i.e., configure/macro-ized it). But it inadvertantly added the use of "protected" to some sentinel constants that should not be protected (e.g., MPI_STATUS_IGNORE). This commit reverts the addition of "protected" to the constants that should not be protected. cmr=v1.7.4:subject=Rollup of Fortran fixes for v1.7.4 This commit was SVN r30375. The following SVN revision numbers were found above: r30273 --> open-mpi/ompi@5f17bc3c2c	2014-01-23 01:21:42 +00:00
Ralph Castain	06e6a06f3e	Cleanup a couple of abstraction breaks found by Thomas Naughton This commit was SVN r30371.	2014-01-22 21:36:24 +00:00
Hadi Montakhabi	8af6b8b4e4	add support for PLFS filesystem This commit was SVN r30370.	2014-01-22 21:16:15 +00:00
Ralph Castain	0dbbfd49fa	Update platform file cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r30369.	2014-01-22 21:01:33 +00:00
Ralph Castain	de07a64599	Cleanup the sensor code: * use the global flags for linux and apple being found instead of re-doing the case statements * update select procedure to ignore components that measure the same thing (e.g., resusage and sigar), taking the higher priority module cmr=v1.7.5:reviewer=jsquyres:subject=Cleanup the sensor code This commit was SVN r30368.	2014-01-22 21:01:09 +00:00
Nathan Hjelm	7ba8bd81fa	coll/ml: remove debug fprintfs cmr=v1.7.5:ticket=trac:4158 This commit was SVN r30367. The following Trac tickets were found above: Ticket 4158 --> https://svn.open-mpi.org/trac/ompi/ticket/4158	2014-01-22 17:21:05 +00:00
Nathan Hjelm	82d996fb76	coll/ml: cleanup some merge related errors cmr=v1.7.5:ticket=trac:4158 This commit was SVN r30366. The following Trac tickets were found above: Ticket 4158 --> https://svn.open-mpi.org/trac/ompi/ticket/4158	2014-01-22 16:48:09 +00:00
Nathan Hjelm	ff4c9c808a	btl/ugni: fix leak in new sendi function. cmr=v1.7.5:ticket=trac:4151 This commit was SVN r30365. The following Trac tickets were found above: Ticket 4151 --> https://svn.open-mpi.org/trac/ompi/ticket/4151	2014-01-22 16:32:07 +00:00
Nathan Hjelm	66b69da394	Fix a bug in the ob1 optimizations that can cause a segfault. btl sendi functions currently can not handle the descriptor being NULL. The send inline optimization was assuming (incorrectly) that NULL was ok. cmr=v1.7.5:ticket=trac:4149 This commit was SVN r30364. The following Trac tickets were found above: Ticket 4149 --> https://svn.open-mpi.org/trac/ompi/ticket/4149	2014-01-22 16:31:58 +00:00
Nathan Hjelm	1a021b8f2d	coll/ml: add support for blocking and non-blocking allreduce, reduce, and allgather. The new collectives provide a signifigant performance increase over tuned for small and medium messages. We are initially setting the priority lower than tuned until this has had some time to soak in the trunk. Please set coll_ml_priority to 90 for MTT runs. Credit for this work goes to Manjunath Gorentla Venkata (ORNL), Pavel Shamis (ORNL), and Nathan Hjelm (LANL). Commit details (for reference): Import ORNL's collectives for MPI_Allreduce, MPI_Reduce, and MPI_Allgather. We need to take the basesmuma header into account when calculating the ptpcoll small message thresholds. Add a define to bcol.h indicating the maximum header size so we can take the header into account while not making ptpcoll dependent on information from basesmuma. This resolves an issue with allreduce where ptpcoll overwrites the header of the next buffer in the basesmuma bank. Fix reduce and make a sequential collective launcher in coll_ml_inlines.h The root calculation for reduce was wrong for any root != 0. There are four possibilities for the root: - The root is not the current process but is in the current hierarchy. In this case the root is the index of the global root as specified in the root vector. - The root is not the current process and is not in the next level of the hierarchy. In this case 0 must be the local root since this process will never communicate with the real root. - The root is not the current process but will be in next level of the hierarchy. In this case the current process must be the root. - I am the root. The root is my index. Tested with IMB which rotates the root on every call to MPI_Reduce. Consider IMB the reproducer for the issue this commit solves. Make the bcast algorithm decision an enumerated variable Resolve various asset failures when destructing coll ml requests. Two issues: - Always reset the request to be invalid before returning it to the free list. This will avoid an asset in ompi_request_t's destructor. OMPI_REQUEST_FINI does this (and also releases the fortran handle index). - Never explicitly construct or destruct the superclass of an opal object. This screws up the class function tables and will cause either an assert failure or a segmentation fault when destructing coll ml requests. Cleanup allgather. I removed the duplicate non-blocking and blocking functions and modeled the cleanup after what I found in allreduce. Also cleaned up the code somewhat. Don't bother copying from the send to the recieve buffer in bcol_basesmuma_allreduce_intra_fanin_fanout if the pointers are the same. The eliminates a warning about memcpy and aliasing and avoids an unnecessary call to memcpy. Alwasy call CHECK_AND_RELEASE on memsync collectives. There was a call to OBJ_RELEASE on the collective communicator but because CHECK_AND_RECYLCE was never called there was not matching call to OBJ_RELEASE. This caused coll ml to leak communicators. Make allreduce use the sequential collective launcher in coll_ml_inlines.h Just launch the next collective in the component progress. I am a little unsure about this patch. There appears to be some sort of race between collectives that causes buffer exhaustion in some cases (IMB Allreduce is a reproducer). Changing progress to only launch the next bcol seems to resolve the issue but might not be the best fix. Note that I see little-no performance penalty for this change. Fix allreduce when there are extra sources. There was an issue with the buffer offset calculation when there are extra sources. In the case of extra sources == 1 the offset was set to buffer_size (just past the header of the next buffer). I adjusted the buffer size to take into accoun the maximum header size (see the earlier commit that added this) and simplified the offset calculation. Make reduce/allreduce non-blocking. This is required for MPI_Comm_idup to work correctly. This has been tested with various layouts using the ibm testsuite and imb and appears to have the same performance as the old blocking version. Fix allgather for non-contiguous layouts and simplify parsing the topology. Some things in this patch: - There were several comments to the effect that level 0 of the hierarchy MUST contain all of the ranks. At least one function made this assumption but it was not true. I changed the sbgp components and the coll ml initization code to enforce this requirement. - Ensure that hierarchy level 0 has the ranks in the correct scatter gather order. This removes the need for a separate sort list and fixes the offset calculation for allgather. - There were several passes over the hierarchy to determine properties of the hierarchy. I eliminated these extra passes and the memory allocation associated with them and calculate the tree properties on the fly. The same DFS recursion also handles the re-order of level 0. All these changes have been verified with MPI_Allreduce, MPI_Reduce, and MPI_Allgather. All functions now pass all IBM/Open MPI, and IMB tests. coll/ml: correct pointer usage for MPI_BOTTOM Since contiguous datatypes are copied via memcpy (bypassing the convertor) we need to adjust for the lb of the datatype. This corrects problems found testing code that uses MPI_BOTTOM (NULL) as the send pointer. Add fallback collectives for allreduce and reduce. cmr=v1.7.5:reviewer=pasha This commit was SVN r30363.	2014-01-22 15:39:19 +00:00
Jeff Squyres	7768828d2d	Addendum to r30298: tweak the wording of the help messages a bit. Refs trac:4117. Please use this commit rather than the patch attached to the ticket; the patch had a few mistakes in the tweaked wording. This commit was SVN r30362. The following SVN revision numbers were found above: r30298 --> open-mpi/ompi@58479399c3 The following Trac tickets were found above: Ticket 4117 --> https://svn.open-mpi.org/trac/ompi/ticket/4117	2014-01-22 12:17:14 +00:00
Jeff Squyres	be0e557d3c	Revert r30164: it was just the wrong thing to do. Fixes trac:4155. This commit was SVN r30360. The following SVN revision numbers were found above: r30164 --> open-mpi/ompi@ca84ffdbd4 The following Trac tickets were found above: Ticket 4155 --> https://svn.open-mpi.org/trac/ompi/ticket/4155	2014-01-22 00:51:03 +00:00
Nathan Hjelm	c9c335544e	btl/ugni: fix a typo in r30353 cmr=v1.7.5:ticket=trac:4151 This commit was SVN r30354. The following SVN revision numbers were found above: r30353 --> open-mpi/ompi@aa3fea55b2 The following Trac tickets were found above: Ticket 4151 --> https://svn.open-mpi.org/trac/ompi/ticket/4151	2014-01-21 21:02:28 +00:00
Nathan Hjelm	aa3fea55b2	btl/ugni: re-add a sendi function to exploit the new optimization in ob1. Also update LANL platform files to use the latest version of ugni. cmr=v1.7.5:reviewer=manjugv This commit was SVN r30353.	2014-01-21 20:53:35 +00:00
Mike Dubman	34b3718042	revert, need rework. This commit was SVN r30351.	2014-01-21 17:56:01 +00:00
Ralph Castain	9a2dc54311	Apply a patch provided by Paul Hargrove back in Jan 2013 that fixes MIPS assembly issues. This patch was originally reviewed and RM-approved to go into the 1.6 branch (which never happened for logistical reasons), and subsequently the trunk patch was provided. Paul has verified the patch and its application to 1.7.4, so we will consider it reviewed for that purpose. cmr=v1.7.4:reviewer=ompi-rm1.7:subject=Fix MIPS assembly This commit was SVN r30348.	2014-01-21 16:42:49 +00:00
Nathan Hjelm	2b57f4227e	ob1: optimize blocking send and receive paths Per RFC. There are two optimizations in this commit: - Allocate requests for blocking sends and receives on the stack. This bypasses the request free list and saves two atomics on the critical path. This change improves the small message ping-pong by 50-200ns on both AMD and Intel CPUs. - For small messages try to use the btl sendi function before intializing a send request. If the sendi fails or the btl does not have a sendi function silently fallback on the standard send path. cmr=v1.7.5:reviewer=brbarret This commit was SVN r30343.	2014-01-21 15:16:21 +00:00
George Bosilca	7e1593ef80	Prevent integer overflow in datatype creation. Patch based on Gilles Gouaillardet solution attached to ticket #4145. Closes trac:4145. cmr=v1.7.4:reviewer=ompi-rm1.7 cmr=v1.6.6:reviewer=ompi-rm1.6 This commit was SVN r30342. The following Trac tickets were found above: Ticket 4145 --> https://svn.open-mpi.org/trac/ompi/ticket/4145	2014-01-21 14:44:00 +00:00
Mike Dubman	b8550a55a7	HCOLL: many fixes Adds coll_hcoll_np mca parameter similar to that of fca component (defaults to 32). Those who use hcoll be aware that from now on the communicators less than 32 procs will run w/o hcoll by default. - Resolves fallback issue in case libhcoll runs out of allowed contexts. The solution is moving hcoll_context_create from comm_enable to comm_query. Shortly, comm_enable should never return OMPI_ERROR in the coll component with highest priority (hcoll). Otherwise the ompi coll_base_select will unselect the coll funtion pointers and module references leaving the communicator w/o coll pointer. This will cause the fail. Same behavior can be reproduced even with tuned if one would hardcore some "return OMPI_ERROR" into it's module_enable funtion. - Additionally, removed all the dead code under #if 0; removed unused variables (path for library, active_modules list) and classes (module list wrapper) Fixed by Val, Reviewed by Devendar/Josh/Miked cmr=v1.7.4:reviewer=ompi-rm1.7 This commit was SVN r30341.	2014-01-21 12:19:47 +00:00
Mike Dubman	37343574e0	OSHMEM: fix fortran binding The check to enable shmem fortran was too early, MPI can disable fortran but SHMEM fortran check was already done. Refs trac:3763 This commit was SVN r30340. The following Trac tickets were found above: Ticket 3763 --> https://svn.open-mpi.org/trac/ompi/ticket/3763	2014-01-21 09:20:51 +00:00
Brian Barrett	fe093556f7	Only provide OPAL_THREAD_ADD64 if we have 64 bit atomics This commit was SVN r30339.	2014-01-20 20:22:38 +00:00
Ralph Castain	26fbb4e77b	Necessary constants for postgress module This commit was SVN r30338.	2014-01-20 19:58:56 +00:00
Ralph Castain	12e4f8a71d	Add support for postgres database This commit was SVN r30337.	2014-01-20 19:56:26 +00:00
Ralph Castain	e0edc29029	Add comment on future work This commit was SVN r30336.	2014-01-20 19:54:31 +00:00
Ralph Castain	9b2066cfba	Add two new sensor modules - one to monitor core temperatures, and the other to monitor resource usage using the sigar library This commit was SVN r30335.	2014-01-20 19:35:48 +00:00
Ralph Castain	d2d4eeb2d6	Let the floating point pack/unpack work at arbitrary precision cmr=v1.7.5:reviewer=rhc This commit was SVN r30334.	2014-01-20 19:34:15 +00:00
Ralph Castain	2cf4862b49	Cleanup warnings for use of void* - requires intermediate cast to uintptr_t. Thanks to Paul Hargrove for reporting it cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30333.	2014-01-20 15:44:45 +00:00
Ralph Castain	3e9c8497e0	Shift the verbose output a bit Refs trac:4136 This commit was SVN r30332. The following Trac tickets were found above: Ticket 4136 --> https://svn.open-mpi.org/trac/ompi/ticket/4136	2014-01-20 14:41:37 +00:00
Ralph Castain	5ad9795bd8	Cleanup some potential memory overruns cmr=v1.7.5:reviewer=jsquyres This commit was SVN r30331.	2014-01-19 16:31:26 +00:00
Ralph Castain	9f6fd7b98d	A few corrections to hostfile parsing - thanks to Tetsuya Mishima for the review Refs trac:4136 This commit was SVN r30330. The following Trac tickets were found above: Ticket 4136 --> https://svn.open-mpi.org/trac/ompi/ticket/4136	2014-01-19 16:26:12 +00:00
Mike Dubman	79366f8ce7	fix tabulation This commit was SVN r30329.	2014-01-19 12:40:21 +00:00
Ralph Castain	657796f9e0	Revert r30327 - turns out it isn't quite right just yet. :-( Closes trac:4138 This commit was SVN r30328. The following SVN revision numbers were found above: r30327 --> open-mpi/ompi@87d5f86025 The following Trac tickets were found above: Ticket 4138 --> https://svn.open-mpi.org/trac/ompi/ticket/4138	2014-01-18 23:38:39 +00:00
Ralph Castain	87d5f86025	Enable use of unix domain sockets for local OOB communications, thereby removing the requirement for an active network interface when running strictly on a single node. Update the overall OOB system to support cross-transport movement of messages so that the OOB can move a received message to another transport for transmission. cmr=v1.7.5:reviewer=jsquyres:subject=Enable use of unix domain sockets for local OOB communications This commit was SVN r30327.	2014-01-18 21:36:49 +00:00
Ralph Castain	fd5c1d5b63	Update platform file cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r30326.	2014-01-18 21:33:52 +00:00
Ralph Castain	fcdd904af4	Simplify and update hostfile handling to correctly support hostfiles that list nodes multiple times, once for each slot, and those that list a host once and include an explicit slot count. Eliminate support for mixing those two modes as this logic became just too complex when attempting to handle all the corner cases. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30325.	2014-01-18 16:08:40 +00:00
Edgar Gabriel	be5d5834c5	fix the problem identified by a user on the mailing list with MPI_MODE_EXCL cmr=v1.7.4:reviewer=vvenkatesan:subject=fix a problem when opening a file with MODE_EXCL This commit was SVN r30324.	2014-01-18 16:06:27 +00:00
Ralph Castain	16c061a5f8	Per Paul Hargrove, ROMIO now supports OpenBSD cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r30321.	2014-01-18 15:33:08 +00:00
Ralph Castain	abb432aef6	The portable_platform file moved to opal at some point, but this .m4 didn't get updated to match. Thanks to Paul Hargrove for spotting it and providing a patch! cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30320.	2014-01-18 03:19:34 +00:00
Nathan Hjelm	c88626510c	Fix a merge issues with new ROMIO and fix obvious ROMIO bug. cmr=v1.7.4:reviewer=jsquyres This commit was SVN r30319.	2014-01-18 00:29:16 +00:00
Jeff Squyres	770bf30915	Clarifications that we're talking about the Fortran PGI compiler. This commit was SVN r30317.	2014-01-17 21:39:59 +00:00
Ralph Castain	87f34860fe	Protect array against crossing boundaries cmr=v1.7.5:reviewer=jsquyres This commit was SVN r30316.	2014-01-17 21:36:20 +00:00
Hadi Montakhabi	c1ba8c1219	adding Hadi Montakhabi as an author This commit was SVN r30315.	2014-01-17 18:47:03 +00:00
Hadi Montakhabi	8c14411289	f_cc_size is contiguous chunk size, not the stripe width. There is no stripe_width in the file handle structure. This commit was SVN r30314.	2014-01-17 18:35:55 +00:00
Mike Dubman	0d8424e0a6	OSHMEM: Fixes issue with recent segfault in finalize related to mca_base_group_unselect. We need to explicitly call mca_base_group_unselect in finalize for each group that are not freed with oshmem_group_cache_list_free before we unloading scoll framework. Refs trac:3763 This commit was SVN r30311. The following Trac tickets were found above: Ticket 3763 --> https://svn.open-mpi.org/trac/ompi/ticket/3763	2014-01-17 16:57:12 +00:00
Mike Dubman	2af0f878bc	remove bml_init call, called from btl add_proc. Refs trac:3763 This commit was SVN r30310. The following Trac tickets were found above: Ticket 3763 --> https://svn.open-mpi.org/trac/ompi/ticket/3763	2014-01-17 16:52:20 +00:00
Tom Naughton	41d06d33ae	+ fix typo in previous README addition This commit was SVN r30309.	2014-01-17 15:32:21 +00:00
Jeff Squyres	2091530c9b	Add the results of Paul's Fortran testing as release notes in the README. cmr=v1.7.4:reviewer=rhc:subject=Update README with PGI testing release notes This commit was SVN r30308.	2014-01-17 15:26:51 +00:00

1 2 3 4 5 ...

19342 Коммитов