openmpi

Author	SHA1	Message	Date
George Bosilca	3ec865558d	Don't miss the OS X atomics on "make dist". This commit was SVN r32390.	2014-08-01 03:35:38 +00:00
Ralph Castain	552c9ca5a0	George did the work and deserves all the credit for it. Ralph did the merge, and deserves whatever blame results from errors in it :-) WHAT: Open our low-level communication infrastructure by moving all necessary components (btl/rcache/allocator/mpool) down in OPAL All the components required for inter-process communications are currently deeply integrated in the OMPI layer. Several groups/institutions have expressed interest in having a more generic communication infrastructure, without all the OMPI layer dependencies. This communication layer should be made available at a different software level, available to all layers in the Open MPI software stack. As an example, our ORTE layer could replace the current OOB and instead use the BTL directly, gaining access to more reactive network interfaces than TCP. Similarly, external software libraries could take advantage of our highly optimized AM (active message) communication layer for their own purpose. UTK, with support from Sandia, developed a version of Open MPI where the entire communication infrastructure has been moved down to OPAL (btl/rcache/allocator/mpool). Most of the moved components have been updated to match the new schema, with few exceptions (mainly BTLs where I have no way of compiling/testing them). Thus, the completion of this RFC is tied to being able to complete this move for all BTLs. For this we need help from the rest of the Open MPI community, especially those supporting some of the BTLs. A non-exhaustive list of BTLs that qualify here is: mx, portals4, scif, udapl, ugni, usnic. This commit was SVN r32317.	2014-07-26 00:47:28 +00:00
Jeff Squyres	e532ab3020	atomic_impl.h: fix trivial typos in comments cmr=v1.8.2:ticket=trac:4664 This commit was SVN r31858. The following Trac tickets were found above: Ticket 4664 --> https://svn.open-mpi.org/trac/ompi/ticket/4664	2014-05-21 17:18:18 +00:00
George Bosilca	7ca7b54718	Removes a few unreachable warnings. This is somehow related to ticket #2645. cmr=v1.8.2:reviewer=jsquyres This commit was SVN r31845.	2014-05-21 00:06:04 +00:00
Ralph Castain	11faab1091	The final step of the RFC: convert the <foo>libdir and friends to fit their respective code areas, and equate them all at the top. Note that we can't entirely separate things as the opal_install_dirs framework can't handle separated locations for the various trees. This commit was SVN r31679.	2014-05-08 02:01:35 +00:00
Ralph Castain	f4c31cae9b	Per RFC, another round in the renaming game - nearly complete This commit was SVN r31668.	2014-05-07 03:01:47 +00:00
Ralph Castain	a54dbb17d2	Per RFC, continue renaming project This commit was SVN r31667.	2014-05-07 01:00:06 +00:00
Ryan Grant	6d577a2663	Update to atomics selection, fix octal issue and rename defines This commit was SVN r31396.	2014-04-15 15:40:30 +00:00
Ryan Grant	e67ca81dca	Fixing atomics selection issue, to be CMR'd after it passes the nightly tests This commit was SVN r31393.	2014-04-15 13:17:04 +00:00
Ralph Castain	230336b6a8	Upgrade the security framework to avoid multiple hits against the global security server. Add support for future case where mpirun assigns a global security credential for a given run, though we need to work out how to handle connect-accept from other mpiruns in that case. Remove a bunch of duplicate code in the OOB by consolidating the connection handshake code. Refs trac:4221 This commit was SVN r30554. The following Trac tickets were found above: Ticket 4221 --> https://svn.open-mpi.org/trac/ompi/ticket/4221	2014-02-04 14:47:04 +00:00
Ralph Castain	883c1a1c57	Fix ia64 operations by correcting a couple of bugs in the ia64 atomics. Thanks to Paul Hargrove for the patch! Since Paul is the only one of the team with the required hardware to test it, and he has done so, consider this RM-approved. cmr=v1.7.5:reviewer=ompi-gk1.7 This commit was SVN r30401.	2014-01-24 00:14:37 +00:00
Ralph Castain	a19510187b	This is an old patch (r26226) from two years ago that somehow went directly into the 1.6 branch without first entering the trunk. Hence, the problem it fixed still remains in the trunk, and in the 1.7 series as a regression :-( Thanks to Paul Hargrove for tracking it down. RM-approved cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r30397. The following SVN revision numbers were found above: r26226 --> open-mpi/ompi@12781482b9	2014-01-23 15:47:49 +00:00
Ralph Castain	26fbb4e77b	Necessary constants for postgres module This commit was SVN r30338.	2014-01-20 19:58:56 +00:00
Brian Barrett	8b778903d8	Fix longstanding issue with our multi-project support. Rather than using pkg{data,lib,includedir}, use our own ompi{data,lib,includedir}, which is always set to {datadir,libdir,includedir}/openmpi. This will keep us from having help files in prefix/share/open-rte when building without Open MPI, but in prefix/share/openmpi when building with Open MPI. This commit was SVN r30140.	2014-01-07 22:11:15 +00:00
George Bosilca	6189d5968b	Make the builtin atomics follow the same convention as every other atomic support we have ([op]_and_fetch instead of fetch_and_[op]). This commit was SVN r29915.	2013-12-15 16:48:27 +00:00
Brian Barrett	121ca26c59	Per discussion at Developer's Meeting, remove Solaris threads support. Solaris will just fall back to pthreads, which should be no problem. This commit was SVN r29893.	2013-12-13 20:07:11 +00:00
Brian Barrett	6d7a1fbb82	Move opal_portable_platform.h to opal/include/opal, which is where it really should have been all along and fix one place that uses the file Update opal_portable_platform.h with changes to mpi_portable_platform.h made in r29608. Make mpi_portable_platform.h a symlink to opal_portable_platform.h, so that they won't get out of sync. I'd like to remove mpi_portable_platform.h, but we don't automatically add -I${includedir}/openmpi/ to make that sane from a header include point of view, so that's future work. This commit was SVN r29618. The following SVN revision numbers were found above: r29608 --> open-mpi/ompi@b71bd51cdd	2013-11-06 17:12:26 +00:00
Nathan Hjelm	9cd18f926c	Add missing OSX builtin define This commit was SVN r29576.	2013-10-31 02:06:39 +00:00
Nathan Hjelm	b922cd1583	Add support for OSX builtin atomics. OSX atomic support is disabled by default. Enable with --enable-osx-builtin-atomics. Fixes trac:2120 This commit was SVN r29568. The following Trac tickets were found above: Ticket 2120 --> https://svn.open-mpi.org/trac/ompi/ticket/2120	2013-10-30 17:48:15 +00:00
Dave Goodell	25dd719d4d	opal: support __attribute__((__noinline__)) First cut does not attempt any "cross-check". As we discover compilers which complain about __noinline__, we will add specific cross checks to handle those cases. Reviewed-by: Jeff Squyres <jsquyres@cisco.com> This commit was SVN r29488.	2013-10-23 15:52:05 +00:00
Joshua Ladd	b3f88c4a1d	Per the RFC schedule, this commit adds Mellanox OpenSHMEM to the trunk. It does not yet run on OSX or with CM PML for an MTL other than MXM. Mellanox is aware of these issues and is in the process of resolving them. This should be added to cmr=v1.7.4:subject=Move OSHMEM to 1.7.4:reviewer=rhc This commit was SVN r29153.	2013-09-10 15:34:09 +00:00
Ralph Castain	a200e4f865	As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: * THIS RFC INCLUDES A MINOR CHANGE TO THE MPI-RTE INTERFACE * Note: during the course of this work, it was necessary to completely separate the MPI and RTE progress engines. There were multiple places in the MPI layer where ORTE_WAIT_FOR_COMPLETION was being used. A new OMPI_WAIT_FOR_COMPLETION macro was created (defined in ompi/mca/rte/rte.h) that simply cycles across opal_progress until the provided flag becomes false. Places where the MPI layer blocked waiting for RTE to complete an event have been modified to use this macro. *************************************************************************************** I am reissuing this RFC because of the time that has passed since its original release. Since its initial release and review, I have debugged it further to ensure it fully supports tests like loop_spawn. It therefore seems ready for merge back to the trunk. Given its prior review, I have set the timeout for one week. The code is in https://bitbucket.org/rhc/ompi-oob2 WHAT: Rewrite of ORTE OOB WHY: Support asynchronous progress and a host of other features WHEN: Wed, August 21 SYNOPSIS: The current OOB has served us well, but a number of limitations have been identified over the years. Specifically: * it is only progressed when called via opal_progress, which can lead to hangs or recursive calls into libevent (which is not supported by that code) * we've had issues when multiple NICs are available as the code doesn't "shift" messages between transports - thus, all nodes had to be available via the same TCP interface. 
* the OOB "unloads" incoming opal_buffer_t objects during the transmission, thus preventing use of OBJ_RETAIN in the code when repeatedly sending the same message to multiple recipients * there is no failover mechanism across NICs - if the selected NIC (or its attached switch) fails, we are forced to abort * only one transport (i.e., component) can be "active" The revised OOB resolves these problems: * async progress is used for all application processes, with the progress thread blocking in the event library * each available TCP NIC is supported by its own TCP module. The ability to asynchronously progress each module independently is provided, but not enabled by default (a runtime MCA parameter turns it "on") * multi-address TCP NICs (e.g., a NIC with both an IPv4 and IPv6 address, or with virtual interfaces) are supported - reachability is determined by comparing the contact info for a peer against all addresses within the range covered by the address/mask pairs for the NIC. * a message that arrives on one TCP NIC is automatically shifted to whatever NIC that is connected to the next "hop" if that peer cannot be reached by the incoming NIC. If no TCP module will reach the peer, then the OOB attempts to send the message via all other available components - if none can reach the peer, then an "error" is reported back to the RML, which then calls the errmgr for instructions. * opal_buffer_t now conforms to standard object rules re OBJ_RETAIN as we no longer "unload" the incoming object * NIC failure is reported to the TCP component, which then tries to resend the message across any other available TCP NIC. If that doesn't work, then the message is given back to the OOB base to try using other components. 
If all that fails, then the error is reported to the RML, which reports to the errmgr for instructions * obviously from the above, multiple OOB components (e.g., TCP and UD) can be active in parallel * the matching code has been moved to the RML (and out of the OOB/TCP component) so it is independent of transport * routing is done by the individual OOB modules (as opposed to the RML). Thus, both routed and non-routed transports can simultaneously be active * all blocking send/recv APIs have been removed. Everything operates asynchronously. KNOWN LIMITATIONS: * although provision is made for component failover as described above, the code for doing so has not been fully implemented yet. At the moment, if all connections for a given peer fail, the errmgr is notified of a "lost connection", which by default results in termination of the job if it was a lifeline * the IPv6 code is present and compiles, but is not complete. Since the current IPv6 support in the OOB doesn't work anyway, I don't consider this a blocker * routing is performed at the individual module level, yet the active routed component is selected on a global basis. We probably should update that to reflect that different transports may need/choose to route in different ways * obviously, not every error path has been tested nor necessarily covered * determining abnormal termination is more challenging than in the old code as we now potentially have multiple ways of connecting to a process. Ideally, we would declare "connection failed" when all transports can no longer reach the process, but that requires some additional (possibly complex) code. For now, the code replicates the old behavior only somewhat modified - i.e., if a module sees its connection fail, it checks to see if it is a lifeline. If so, it notifies the errmgr that the lifeline is lost - otherwise, it notifies the errmgr that a non-lifeline connection was lost. 
* reachability is determined solely on the basis of a shared subnet address/mask - more sophisticated algorithms (e.g., the one used in the tcp btl) are required to handle routing via gateways * the RML needs to assign sequence numbers to each message on a per-peer basis. The receiving RML will then deliver messages in order, thus preventing out-of-order messaging in the case where messages travel across different transports or a message needs to be redirected/resent due to failure of a NIC This commit was SVN r29058.	2013-08-22 16:37:40 +00:00
Nathan Hjelm	c3b67d0187	Automatically generate a list of installed frameworks in project/include/project/frameworks.h This commit was SVN r28238.	2013-03-27 21:10:32 +00:00
Ralph Castain	a4b6fb241f	Remove all remaining vestiges of the Windows integration This commit was SVN r28137.	2013-02-28 17:31:47 +00:00
Ralph Castain	bd9265c560	Per the meeting on moving the BTLs to OPAL, move the ORTE database "db" framework to OPAL so the relocated BTLs can access it. Because the data is indexed by process, this requires that we define a new "opal_identifier_t" that corresponds to the orte_process_name_t struct. In order to support multiple run-times, this is defined in opal/mca/db/db_types.h as a uint64_t without identifying the meaning of any part of that data. A few changes were required to support this move: 1. the PMI component used to identify rte-related data (e.g., host name, bind level) and package them as a unit to reduce the number of PMI keys. This code was moved up to the ORTE layer as the OPAL layer has no understanding of these concepts. In addition, the component locally stored data based on process jobid/vpid - this could no longer be supported (see below for the solution). 2. the hash component was updated to use the new opal_identifier_t instead of orte_process_name_t as its index for storing data in the hash tables. Previously, we did a hash on the vpid and stored the data in a 32-bit hash table. In the revised system, we don't see a separate "vpid" field - we only have a 64-bit opaque value. The orte_process_name_t hash turned out to do nothing useful, so we now store the data in a 64-bit hash table. Preliminary tests didn't show any identifiable change in behavior or performance, but we'll have to see if a move back to the 32-bit table is required at some later time. 3. the db framework was a "select one" system. However, since the PMI component could no longer use its internal storage system, the framework has now been changed to a "select many" mode of operation. This allows the hash component to handle all internal storage, while the PMI component only handles pushing/pulling things from the PMI system. 
This was something we had planned for some time - when fetching data, we first check internal storage to see if we already have it, and then automatically go to the global system to look for it if we don't. Accordingly, the framework was provided with a custom query function used during "select" that lets you separately specify the "store" and "fetch" ordering. 4. the ORTE grpcomm and ess/pmi components, and the nidmap code, were updated to work with the new db framework and to specify internal/global storage options. No changes were made to the MPI layer, except for modifying the ORTE component of the OMPI/rte framework to support the new db framework. This commit was SVN r28112.	2013-02-26 17:50:04 +00:00
Joshua Ladd	70ad711337	Backing out the Open SHMEM project This commit was SVN r28050.	2013-02-12 17:45:27 +00:00
Mike Dubman	ff384daab4	Added new project: oshmem. This commit was SVN r28048.	2013-02-12 15:33:21 +00:00
Brian Barrett	57b21014f8	Fix issue where the static inline part of the declaration would be improperly set when using C++ This commit was SVN r28034.	2013-02-05 18:15:32 +00:00
George Bosilca	15b18cd2cf	Make CMA compile and run. This commit was SVN r27873.	2013-01-19 14:27:54 +00:00
Brian Barrett	579cf4adcd	After discussion with Jeff, don't do C++ inline assembly (there is a non-inline version still available for C++). This is yet another push to try to make OPAL a C only interface... This commit was SVN r27828.	2013-01-15 17:04:42 +00:00
Ralph Castain	ee6c7702d2	Ensure the cma.h file is included in the tarball This commit was SVN r27235.	2012-09-04 19:34:09 +00:00
Shiqing Fan	42dfbc7d2f	Another CMake scripts update for: correctly generate hwloc library; automatically define OMPI/OPAL/ORTE_OMPORTS for user applications; update the f77 bindings. This commit was SVN r26893.	2012-07-27 11:49:09 +00:00
Shiqing Fan	5d81c27282	Update the CMake files for Fortran 77 bindings, get ready for F90 bindings. Change several variable names and update the macros. This commit was SVN r26851.	2012-07-24 08:49:34 +00:00
Shiqing Fan	8c4a3e1269	correct the symbol dllexports for windows build This commit was SVN r26827.	2012-07-22 08:54:50 +00:00
Christopher Yeoh	9c353cf9d8	This adds a file that was missed in r26134 (adds support for Cross Memory Attach). It is a header file that contains syscall defs for process_vm_readv and process_vm_writev. It is only used on systems where glibc does not yet have support for the new syscalls and where --with-cma has been passed to configure. The syscall numbers are hardcoded but have been in a released kernel and so will not change in the future Once all linux distros have the new glibc this file can be removed. This commit was SVN r26615. The following SVN revision numbers were found above: r26134 --> open-mpi/ompi@524de80eaa	2012-06-18 05:01:09 +00:00
Shiqing Fan	aa6cde9886	Change f77 to fortran for the rest of windows build files. This commit was SVN r26558.	2012-06-06 14:09:51 +00:00
Jeff Squyres	2ba10c37fe	Per RFC, bring in the following changes: * Remove paffinity, maffinity, and carto frameworks -- they've been wholly replaced by hwloc. * Move ompi_mpi_init() affinity-setting/checking code down to ORTE. * Update sm, smcuda, wv, and openib components to no longer use carto. Instead, use hwloc data. There are still optimizations possible in the sm/smcuda BTLs (i.e., making multiple mpools). Also, the old carto-based code found out how many NUMA nodes were ''available'' -- not how many were used ''in this job''. The new hwloc-using code computes the same value -- it was not updated to calculate how many NUMA nodes are used ''by this job.'' * Note that I cannot compile the smcuda and wv BTLs -- I ''think'' they're right, but they need to be verified by their owners. * The openib component now does a bunch of stuff to figure out where "near" OpenFabrics devices are. '''THIS IS A CHANGE IN DEFAULT BEHAVIOR!!''' and still needs to be verified by OpenFabrics vendors (I do not have a NUMA machine with an OpenFabrics device that is a non-uniform distance from multiple different NUMA nodes). * Completely rewrite the OMPI_Affinity_str() routine from the "affinity" mpiext extension. This extension now understands hyperthreads; the output format of it has changed a bit to reflect this new information. * Bunches of minor changes around the code base to update names/types from maffinity/paffinity-based names to hwloc-based names. * Add some helper functions into the hwloc base, mainly having to do with the fact that we have the hwloc data reporting ''all'' topology information, but sometimes you really only want the (online \| available) data. This commit was SVN r26391.	2012-05-07 14:52:54 +00:00
Jeff Squyres	c30d1ef0df	Patch from Evan Clinton, reviewed by Leif Lindholm, for supporting ARM5 and ARM6. This commit was SVN r26361.	2012-04-30 20:49:55 +00:00
Nathan Hjelm	e84f9ec8c3	don't define OPAL_HAVE_ATOMIC_SWAP_64/32 in amd/atomic.h unless we have inlined assembly. fixes pgi compilation on XE/XK-6 This commit was SVN r26343.	2012-04-26 20:43:30 +00:00
Jeff Squyres	b6a90434e4	Fix some include file header ordering issues for some BSDs, suggested by Paul Hargrove. This commit was SVN r25984.	2012-02-21 13:32:14 +00:00
Jeff Squyres	63a96e92b5	In a recent v1.5 branch issue, it took a while to figure out that paffinity hwloc was returning "NOT_SUPPORTED" when the real problem was that the underlying hwloc simply hadn't been initialized yet. So let's clearly delineate this case: return OPAL_ERR_NOT_INITIALIZED if the underlying hwloc is not initialized. This commit was SVN r25902.	2012-02-10 18:29:52 +00:00
Brian Barrett	2bb447c804	* Shouldn't have a timer header for sync_builtin, since it doesn't actually have timer support * Default timer size should be a long, not an int. Int will roll over way too fast, with no performance benefit on 64 bit machines... This commit was SVN r25501.	2011-11-23 17:05:01 +00:00
Brian Barrett	5cd5ef623d	Fix compatibility implementation of swap. Turns out that you shouldn't test the compatibility code on a platform which has a native swap. Sorry to all! This commit was SVN r25500.	2011-11-23 16:28:00 +00:00
Brian Barrett	f971a541f1	Implement swap in terms of compare and swap if it isn't implemented directly This commit was SVN r25499.	2011-11-23 05:57:52 +00:00
Brian Barrett	86f555121c	Add (optional/last ditch effort) support for GCC/Intel __sync_ builtin atomic operations. Much easier than adding support for a new architecture. This commit was SVN r25498.	2011-11-23 04:25:41 +00:00
Ralph Castain	6310361532	At long last, the fabled revision to the affinity system has arrived. A more detailed explanation of how this all works will be presented here: https://svn.open-mpi.org/trac/ompi/wiki/ProcessPlacement The wiki page is incomplete at the moment, but I hope to complete it over the next few days. I will provide updates on the devel list. As the wiki page states, the default and most commonly used options remain unchanged (except as noted below). New, esoteric and complex options have been added, but unless you are a true masochist, you are unlikely to use many of them beyond perhaps an initial curiosity-motivated experimentation. In a nutshell, this commit revamps the map/rank/bind procedure to take into account topology info on the compute nodes. I have, for the most part, preserved the default behaviors, with three notable exceptions: 1. I have at long last bowed my head in submission to the system admins of managed clusters. For years, they have complained about our default of allowing users to oversubscribe nodes - i.e., to run more processes on a node than allocated slots. Accordingly, I have modified the default behavior: if you are running off of hostfile/dash-host allocated nodes, then the default is to allow oversubscription. If you are running off of RM-allocated nodes, then the default is to NOT allow oversubscription. Flags to override these behaviors are provided, so this only affects the default behavior. 2. both cpus/rank and stride have been removed. The latter was demanded by those who didn't understand the purpose behind it - and I agreed as the users who requested it are no longer using it. The former was removed temporarily pending implementation. 3. vm launch is now the sole method for starting OMPI. It was just too darned hard to maintain multiple launch procedures - maybe someday, provided someone can demonstrate a reason to do so. As Jeff stated, it is impossible to fully test a change of this size. 
I have tested it on Linux and Mac, covering all the default and simple options, singletons, and comm_spawn. That said, I'm sure others will find problems, so I'll be watching MTT results until this stabilizes. This commit was SVN r25476.	2011-11-15 03:40:11 +00:00
Nathan Hjelm	ad9005820f	fixed typo in last commit This commit was SVN r25306.	2011-10-17 21:35:22 +00:00
Nathan Hjelm	e6ead53eef	add opal_atomic_swap_xx for amd64 This commit was SVN r25305.	2011-10-17 21:33:44 +00:00
Brian Barrett	431f1b6f8c	Remove long-dead sparc support. Sparc (v8) has not been a supported platform since the 1.4 release (and configure would abort when run with sparc v8), but the code was left in place. Sparc v9 (32 or 64 bit) are still supported targets. This commit was SVN r25258.	2011-10-11 18:46:06 +00:00
Brian Barrett	98e98ce2c5	* opal_atomic_trylock is documented to return 0 if the lock was acquired, 1 otherwise. It was doing the opposite, so this patch fixes the return values. All uses (all in ORTE) used the actual return values, not the documented values, so fix them as well. This commit was SVN r25257.	2011-10-11 18:43:45 +00:00
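
The return convention described in the last entry above (0 if the lock was acquired, 1 otherwise) can be illustrated with a minimal sketch. This is not the OPAL implementation: it assumes C11 `<stdatomic.h>`, and the names `demo_lock_t`, `demo_lock_init`, `demo_trylock`, and `demo_unlock` are hypothetical, introduced only for illustration.

```c
#include <stdatomic.h>

/* Hypothetical lock type for illustration; not the OPAL definition. */
typedef struct {
    atomic_int value;
} demo_lock_t;

enum { DEMO_UNLOCKED = 0, DEMO_LOCKED = 1 };

static void demo_lock_init(demo_lock_t *lock)
{
    atomic_init(&lock->value, DEMO_UNLOCKED);
}

/* Follows the documented convention from the commit message:
 * returns 0 if the lock was acquired, 1 otherwise. */
static int demo_trylock(demo_lock_t *lock)
{
    int expected = DEMO_UNLOCKED;
    /* The compare-exchange succeeds only if the lock is currently unlocked. */
    return atomic_compare_exchange_strong(&lock->value, &expected, DEMO_LOCKED)
               ? 0
               : 1;
}

static void demo_unlock(demo_lock_t *lock)
{
    atomic_store(&lock->value, DEMO_UNLOCKED);
}
```

With this convention a caller writes `if (0 == demo_trylock(&lock)) { /* critical section */ demo_unlock(&lock); }`. Treating the return value as a boolean "acquired" flag would invert the logic, which is the kind of mismatch between documented and actual values that the commit message describes.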

233 commits