openmpi

Автор	SHA1	Сообщение	Дата
Ralph Castain	3d46850c4d	Per patch from Marco Atzeri, have the fortran wrapper links go directly to opal_wrapper to avoid breaks in the chain in some environments.	2015-04-25 17:09:06 -07:00
Jeff Squyres	a026456bef	(orte\|ompi\|oshmem)info tools: convert to opal_dl interface Noe that this commit removes option:lt_dladvise from the various "info" tools output. This technically breaks our CLI "ABI" because we're not deprecating it / replacing it with an alias to some other "into" tool output. Although the dl/libltdl component contains an "have_lt_dladvise" MCA var that contains the same information, the "option:lt_dladvise" output from the various "info" tools is not* an MCA var, and therefore we can't alias it. So it just has to die.	2015-03-09 08:18:13 -07:00
Jeff Squyres	9b716d946e	wrappers: fix errant @{libdir} reference in pkg-config files The RPATH support added a @{libdir} token into <package>_WRAPPER_EXTRA_LDFLAGS. However, these flags are also substituted into the pkg-config data files, and they don't understand the @{foo} notation. So convert @{libdir} into ${libdir}, which pkg-config does understand. Thanks to Christoph Junghans (@junghans) for notifying us of the issue. Fixes #406.	2015-02-20 08:43:19 -08:00
Jeff Squyres	1e58920b4d	*info param.c: use stack string buffers Coverity identified that we treated the possibility that one of the message buffers could be NULL in some places (because strdup() could fail), but not in others. So just use stack buffers that will never be NULL. This was CID 1269914.	2015-02-12 10:24:02 -08:00
Jeff Squyres	3ac1d0dae5	*-info: add "lt_dladvise support" lines	2015-02-11 12:25:20 -08:00
Bert Wesarg	0d0a754c42	Remove VampirTrace.	2015-01-22 08:08:07 +01:00
Gilles Gouaillardet	661c35ca67	cleanup dead code caused by the removal of the --with-threads configure option	2015-01-16 19:13:59 +09:00
Gilles Gouaillardet	27aec2ef5b	configury: disable f08 fortran bindings if the compiler does not support c_funloc with TS 29113 subclause 8.1 aka removed restrictions on ISO_C_BINDING module procedures.	2014-12-17 17:35:45 +09:00
Artem Polyakov	8ffad75a0a	Introduce timing interval measurement facility in timing framework	2014-12-10 16:47:49 +06:00
Nadezhda Kogteva	315a240899	Timing framework: pack timing scripts to tarball always	2014-12-02 12:22:46 +02:00
Nadezhda Kogteva	45ed55afd7	Adding of missed time measurement scripts in tarball	2014-11-28 12:15:30 +02:00
Gilles Gouaillardet	eef7590e58	wrappers: add the $(EXEEXT) extension to the installed symbolic links	2014-10-28 16:42:51 +09:00
Jeff Squyres	c22e1ae33b	configury: new OPAL_SET_LIB_PREFIX/ORTE_SET_LIB_PREFIX macros These two macros set the prefix for the OPAL and ORTE libraries, respectively. Specifically, the OPAL library will be named libPREFIXopen-pal.la and the ORTE library will be named libPREFIXopen-rte.la. These macros must be called, even if the prefix argument is empty. The intent is that Open MPI will call these macros with an empty prefix, but other projects (such as ORCM) will call these macros with a non-empty prefix. For example, ORCM libraries can be named liborcm-open-pal.la and liborcm-open-rte.la. This scheme is necessary to allow running Open MPI applications under systems that use their own versions of ORTE and OPAL. For example, when running MPI applications under ORTE, if the ORTE and OPAL libraries between OMPI and ORCM are not identical (which, because they are released at different times, are likely to be different), we need to ensure that the OMPI applications link against their ORTE and OPAL libraries, but the ORCM executables link against their ORTE and OPAL libraries.	2014-10-22 10:32:19 -07:00
Jeff Squyres	01fd96bfa5	Revert "Provide a mechanism by which an upstream project can rename the OPAL and ORTE libraries. This is required by projects such as ORCM that have their own ORTE and OPAL libraries in order to avoid library confusion. By renaming their version of the libraries, the OMPI applications can correctly dynamically load the correct one for their build." This reverts commit `63f619f871`.	2014-10-22 10:32:11 -07:00
Ralph Castain	63f619f871	Provide a mechanism by which an upstream project can rename the OPAL and ORTE libraries. This is required by projects such as ORCM that have their own ORTE and OPAL libraries in order to avoid library confusion. By renaming their version of the libraries, the OMPI applications can correctly dynamically load the correct one for their build.	2014-10-10 11:39:08 -07:00
Jeff Squyres	72704441a2	URLs: update URLs for GitHub	2014-10-01 14:44:09 -07:00
Jeff Squyres	d13034d0b0	fortran: add configury to check for storage_size() gfortran 4.8 does not support storage_size() on all relevant types that we need. So add a configure test to check and see if the compiler's storage_size() intrinsic supports enough types for us to do MPI_SIZEOF. Also remove an accidentally redundant check for fortran INTERFACE. Refs trac:4917 This commit was SVN r32790. The following Trac tickets were found above: Ticket 4917 --> https://svn.open-mpi.org/trac/ompi/ticket/4917	2014-09-25 00:17:29 +00:00
Ralph Castain	4024c8af9e	Have to include the mpisync directory so the Makefile.in gets built - just don't build the binary and install it if timing isn't enabled This commit was SVN r32781.	2014-09-24 01:18:21 +00:00
Artem Polyakov	f2e586980b	Fix timing framework: 1. Fixes according to (http://www.open-mpi.org/community/lists/devel/2014/09/15869.php) 2. Force mpisync:rank0 to gather results. Now sync info is written by rank0 to the output file. 3. Improve mpirun_prof: 1) adopt to the environment (SLURM/TORQUE); 2) recognize some noteset-related mpirun options. This commit was SVN r32772.	2014-09-23 12:59:54 +00:00
Ralph Castain	70896550bf	Per input from Artem, update the copyrights on these files, ensuring to include all the licensing info for the files broght over from the mpiperf project. This commit was SVN r32770.	2014-09-20 14:54:24 +00:00
Jeff Squyres	d7eaca83fa	Fortran: Fix MPI_SIZEOF. What a disaster. :-( What started as a simple ticket ended up reaching the way up to the MPI Forum. It turns out that we are supposed to have MPI_SIZEOF for all Fortran interfaces: mpif.h, the mpi module, and the mpi_f08 module. It further turns out that to properly support MPI_SIZEOF, your Fortran compiler has support the INTERFACE keyword and ISO_FORTRAN_ENV. We can't use "ignore TKR" functionality, because the whole point of MPI_SIZEOF is that the implementation knows what type was passed to it ("ignore TKR" functionality, by definition, throws that information away). Hence, we have to have an MPI_SIZEOF interface+implementation for all intrinsic types, kinds, and ranks. This commit therefore adds a perl script that generates both the interfaces and implementations for MPI_SIZEOF in each of mpif.h, the mpi module, and mpi_f08 module (yay consolidation!). The perl script uses the results of some new configure tests: * check if the Fortran compiler supports the INTERFACE keyword * check if the Fortran compiler supports ISO_FORTRAN_ENV * find the max array rank (i.e., dimension) that the compiler supports If the Fortran compiler supports both INTERFACE and ISO_FORTRAN_ENV, then we'll build the MPI_SIZEOF interfaces. If not, we'll skip MPI_SIZEOF in mpif.h and the mpi module. Note that we won't build the mpi_f08 module -- to include the MPI_SIZEOF interfaces -- if the Fortran compiler doesn't support INTERFACE, ISO_FORTRAN_ENV, and a whole bunch of ther modern Fortran stuff. Since MPI_SIZEOF interfaces are now generated by the perl script, this commit also removes all the old MPI_SIZEOF implementations (which were laden with a zillion #if blocks). cmr=v1.8.3 This commit was SVN r32764.	2014-09-19 13:44:52 +00:00
Ralph Castain	dfb952fa78	[Contribution from Artem - moved it to svn from git for him] Replace our old, clunky timing setup with a much nicer one that is only available if configured with --enable-timing. Add a tool for profiling clock differences between the nodes so you can get more precise timing measurements. I'll ask Artem to update the Github wiki with full instructions on how to use this setup. This commit was SVN r32738.	2014-09-15 18:00:46 +00:00
Ralph Castain	552c9ca5a0	George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-) WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have express interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK with support from Sandia, developped a version of Open MPI where the entire communication infrastucture has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to completing this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic. This commit was SVN r32317.	2014-07-26 00:47:28 +00:00
Mike Dubman	e342a11c2e	opal envlist mca: implement Jeff`s quibbles fixed by Elena, reviewed by Miked This commit was SVN r32216.	2014-07-11 07:23:20 +00:00
Ralph Castain	796f57f709	Protect against problems if someone passes us thru a pipe and then abnormally terminates the pipe early This commit was SVN r32189.	2014-07-09 22:41:53 +00:00
Joshua Ladd	057370364d	Opal: Add a new MCA variable type "version_string". Also add a new flag to ompi_info that allows a user to print all MCA variables of a specific type. --type version_string This command will print all MCA variables of type version_string. This feature was developed by Elena Shipunova and was reviewed by Josh Ladd. This commit was SVN r32166.	2014-07-09 01:37:23 +00:00
Ralph Castain	f3cb124e50	Revert r32082 and r32070 - the developer's conference has decided to go a different direction on the threaded progress effort. This will involve some degree of prototyping to understand the tradeoffs prior to making a final design decision, and so we'll hold off on the final change until that is completed. This commit was SVN r32089. The following SVN revision numbers were found above: r32070 --> open-mpi/ompi@12d92d0c22 r32082 --> open-mpi/ompi@aa6438ef7a	2014-06-25 20:43:28 +00:00
Ralph Castain	12d92d0c22	Per the OMPI developer conference, remove the last vestiges of OMPI_USE_PROGRESS_THREADS This commit was SVN r32070.	2014-06-24 17:05:11 +00:00
Oscar Vega-Gisbert	83bdebbf81	Java bindings for OSHMEM. This commit was SVN r31810.	2014-05-18 21:48:09 +00:00
Ralph Castain	4def94900a	Per RFC: OMPI_INSTALL_BINARIES -> OPAL_INSTALL_BINARIES This commit was SVN r31634.	2014-05-05 21:43:05 +00:00
Jeff Squyres	173c046617	build: add Automake-like silent/verbose macros for "ln -s ..." operations Also, since I put some of the macros for these silent/verbose rules up in the top-level Makefile.man-page-rules file, I renamed it to Makefile.ompi-rules. I've had this sitting around for a while; now seems like as good a time as any to commit it. This commit was SVN r31271.	2014-03-28 18:24:32 +00:00
Jeff Squyres	224842e4c9	ompi_info.1: Include much more info about the --level CLI option Add a lot more information about the --level CLI option, and the nine levels. Also remove some now-erroneous examples regarding --version. cmr=v1.8:reviewer=rhc This commit was SVN r31246.	2014-03-27 12:23:21 +00:00
Oscar Vega-Gisbert	66e2e337f3	Fix mpijavac: -cp classpath This commit was SVN r30724.	2014-02-14 08:46:23 +00:00
Jeff Squyres	1a9cdcc8ff	Restore version numbers to "ompi_info --all" output. cmr=v1.7.4:reviewer=rhc This commit was SVN r30523.	2014-01-31 16:20:46 +00:00
Jeff Squyres	5f17bc3c2c	Make the use of PROTECTED in the mpi_f08 module be optional. Add a configure test to see if the Fortran compiler supports the PROTECTED keyword. If it does, use in mpi-f08-types.F90 (via a macro defined in configure-fortran-output-bottom.h). This is needed to support the PGI 9 Fortran compiler, which does not support the PROTECTED keyword. Note that regardless of whether we want to support the PGI 9 Fortran compiler + mpi_f08, we need to correctly detect whether PROTECTED works or not, and then use that determination as a criteria for building the mpi_f08 module. Previously, mpi-f08-types.F90 used PROTECTED unconditionally, and we didn't test for it in configure. So if a compiler (e.g., PGI 9) supported everything else but didn't support PROTECTED, it would try to compile the mpi_f08 stuff and choke on the use of PROTECTED. Refs trac:4093 This commit was SVN r30273. The following Trac tickets were found above: Ticket 4093 --> https://svn.open-mpi.org/trac/ompi/ticket/4093	2014-01-13 18:35:42 +00:00
Jeff Squyres	b0ffdb3ae5	As noted by Paul Hargrove, older PGI compilers support ''some'' of BIND(C), but not ''all'' of it. So expand our configure checks to look for multiple different forms of BIND(C): * ISO_C_BINDING * SUBROUTINE ... BIND(C) * TYPE, BIND(C) * TYPE(foo), BIND(C, name="bar") If the compiler supports all of these, then declare that we support BIND(C), and the rest of the mpi_f08 checks can continue. If we miss any one of those, don't bother continuing -- we won't build the mpi_f08 module. Also push the results of all of these tests down to ompi_info so that they can be reported easily (e.g., "Hey, why doesn't my OMPI installation have the mpi_f08 module?"). cmr=v1.7.4:reviewer=jsquyres:subject=Expand Fortran BIND(C) configure checks This commit was SVN r30247.	2014-01-10 23:44:55 +00:00
Brian Barrett	8b778903d8	Fix longstanding issue with our multi-project support. Rather than using pkg{data,lib,includedir}, use our own ompi{data,lib,includedir}, which is always set to {datadir,libdir,includedir}/openmpi. This will keep us from having help files in prefix/share/open-rte when building without Open MPI, but in prefix/share/openmpi when building with Open MPI. This commit was SVN r30140.	2014-01-07 22:11:15 +00:00
George Bosilca	efb32da1e0	There is no need for this include. This commit was SVN r29918.	2013-12-15 17:04:45 +00:00
Brian Barrett	121ca26c59	Per discussion at Develoepr's Meeting, remove Solaris threads support. Solaris will just fall back to pthreads, which should be no problem. This commit was SVN r29893.	2013-12-13 20:07:11 +00:00
Ralph Castain	eee7e49a4b	Ensure the Java wrapper compiler files are in the tarball This commit was SVN r29584.	2013-11-01 20:08:45 +00:00
Ralph Castain	01c9973a29	Don't include AM directives in continued lines This commit was SVN r29537.	2013-10-27 05:44:55 +00:00
Ralph Castain	ed3bbb977e	Cleanup wrapper makefile when java bindings not enabled This commit was SVN r29532.	2013-10-27 04:35:43 +00:00
Ralph Castain	3ec27b00ae	Cleanup the Java integration - don't install the mpijavac compiler if the user didn't ask for Java bindings This commit was SVN r29526.	2013-10-26 16:18:18 +00:00
Ralph Castain	bd0b13221b	Cleanup ompi_info change to silence compiler warning This commit was SVN r29436.	2013-10-14 16:57:50 +00:00
Mike Dubman	2141e9e6b4	tools: Add oshmem_info utility Reworked ompi_info tool to be close with orte_info implementation. ompi_info_register_types(), ompi_info_close_components() and ompi_info_show_ompi_version() are moved to runtime/ompi_info_support.c. Added runtime/oshmem_info_support layer that exports following api to be used into oshmem_info tool as oshmem_info_register_types() oshmem_info_register_framework_params() oshmem_info_close_components() oshmem_info_show_oshmem_version() These functions call ompi_info_support related interfaces as long as Oshmem supports Open MPI/SHMEM combination. Now orte_info/ompi_info/oshmem_info have identical implementation approach. Possible improvement: OSHMEM processing of --config option is the same as OMPI`s (code is duplicated). Probably list of info_support interfaces can be extended by xxx_info_do_config(). developed by Igor, reviewed by miked This commit was SVN r29429.	2013-10-12 19:03:32 +00:00
Jeff Squyres	c065839d1f	Make the Java wrapper compiler behave like the other wrapper compilers (specifically, with regards to the --showme flag and return code). This commit was SVN r29270.	2013-09-26 22:48:51 +00:00
Nathan Hjelm	c699ee7812	Update the ompi_info man page with information about variable levels and improve the behavior of ompi_info. This commit changes the default behavior of ompi_info --all when a level is not specified. Instead of assuming level 1 in this case we now assume level 9. This change is due to feedback from the community after the introduction of the --level option. I also added a new option: --selected-only. This option will limit the displayed variables to components that can be selected (ie. if there is a selection parameter set-- btl self,sm) cmr=v1.7.3:reviewer=jsquyres This commit was SVN r29070.	2013-08-27 19:11:37 +00:00
Ralph Castain	a200e4f865	As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: * THIS RFC INCLUDES A MINOR CHANGE TO THE MPI-RTE INTERFACE * Note: during the course of this work, it was necessary to completely separate the MPI and RTE progress engines. There were multiple places in the MPI layer where ORTE_WAIT_FOR_COMPLETION was being used. A new OMPI_WAIT_FOR_COMPLETION macro was created (defined in ompi/mca/rte/rte.h) that simply cycles across opal_progress until the provided flag becomes false. Places where the MPI layer blocked waiting for RTE to complete an event have been modified to use this macro. *************************************************************************************** I am reissuing this RFC because of the time that has passed since its original release. Since its initial release and review, I have debugged it further to ensure it fully supports tests like loop_spawn. It therefore seems ready for merge back to the trunk. Given its prior review, I have set the timeout for one week. The code is in https://bitbucket.org/rhc/ompi-oob2 WHAT: Rewrite of ORTE OOB WHY: Support asynchronous progress and a host of other features WHEN: Wed, August 21 SYNOPSIS: The current OOB has served us well, but a number of limitations have been identified over the years. Specifically: * it is only progressed when called via opal_progress, which can lead to hangs or recursive calls into libevent (which is not supported by that code) * we've had issues when multiple NICs are available as the code doesn't "shift" messages between transports - thus, all nodes had to be available via the same TCP interface. * the OOB "unloads" incoming opal_buffer_t objects during the transmission, thus preventing use of OBJ_RETAIN in the code when repeatedly sending the same message to multiple recipients * there is no failover mechanism across NICs - if the selected NIC (or its attached switch) fails, we are forced to abort * only one transport (i.e., component) can be "active" The revised OOB resolves these problems: * async progress is used for all application processes, with the progress thread blocking in the event library * each available TCP NIC is supported by its own TCP module. The ability to asynchronously progress each module independently is provided, but not enabled by default (a runtime MCA parameter turns it "on") * multi-address TCP NICs (e.g., a NIC with both an IPv4 and IPv6 address, or with virtual interfaces) are supported - reachability is determined by comparing the contact info for a peer against all addresses within the range covered by the address/mask pairs for the NIC. * a message that arrives on one TCP NIC is automatically shifted to whatever NIC that is connected to the next "hop" if that peer cannot be reached by the incoming NIC. If no TCP module will reach the peer, then the OOB attempts to send the message via all other available components - if none can reach the peer, then an "error" is reported back to the RML, which then calls the errmgr for instructions. * opal_buffer_t now conforms to standard object rules re OBJ_RETAIN as we no longer "unload" the incoming object * NIC failure is reported to the TCP component, which then tries to resend the message across any other available TCP NIC. If that doesn't work, then the message is given back to the OOB base to try using other components. If all that fails, then the error is reported to the RML, which reports to the errmgr for instructions * obviously from the above, multiple OOB components (e.g., TCP and UD) can be active in parallel * the matching code has been moved to the RML (and out of the OOB/TCP component) so it is independent of transport * routing is done by the individual OOB modules (as opposed to the RML). Thus, both routed and non-routed transports can simultaneously be active * all blocking send/recv APIs have been removed. Everything operates asynchronously. KNOWN LIMITATIONS: * although provision is made for component failover as described above, the code for doing so has not been fully implemented yet. At the moment, if all connections for a given peer fail, the errmgr is notified of a "lost connection", which by default results in termination of the job if it was a lifeline * the IPv6 code is present and compiles, but is not complete. Since the current IPv6 support in the OOB doesn't work anyway, I don't consider this a blocker * routing is performed at the individual module level, yet the active routed component is selected on a global basis. We probably should update that to reflect that different transports may need/choose to route in different ways * obviously, not every error path has been tested nor necessarily covered * determining abnormal termination is more challenging than in the old code as we now potentially have multiple ways of connecting to a process. Ideally, we would declare "connection failed" when all transports can no longer reach the process, but that requires some additional (possibly complex) code. For now, the code replicates the old behavior only somewhat modified - i.e., if a module sees its connection fail, it checks to see if it is a lifeline. If so, it notifies the errmgr that the lifeline is lost - otherwise, it notifies the errmgr that a non-lifeline connection was lost. * reachability is determined solely on the basis of a shared subnet address/mask - more sophisticated algorithms (e.g., the one used in the tcp btl) are required to handle routing via gateways * the RML needs to assign sequence numbers to each message on a per-peer basis. The receiving RML will then deliver messages in order, thus preventing out-of-order messaging in the case where messages travel across different transports or a message needs to be redirected/resent due to failure of a NIC This commit was SVN r29058.	2013-08-22 16:37:40 +00:00
Jeff Squyres	4d9da92e60	Fixes trac:376: bu default the wrappr compilers will enable rpath support in generated executables on systems that support it. Use --disable-wrapper-rpath to disable this behavior. See text in README about --disable-wrapper-rpath for more details. This commit was SVN r28479. The following Trac tickets were found above: Ticket 376 --> https://svn.open-mpi.org/trac/ompi/ticket/376	2013-05-11 00:49:17 +00:00
Ralph Castain	ae68a953f4	Sigh - one more place This commit was SVN r28447.	2013-05-05 00:25:14 +00:00

1 2 3 4 5 ...

388 Коммитов