openmpi

Автор	SHA1	Сообщение	Дата
Nathan Hjelm	6186b5ed9d	Remove extra file that made its way into r29490. cmr=v1.7.4:ticket=3862 This commit was SVN r29491. The following SVN revision numbers were found above: r29490 --> open-mpi/ompi@cde3b05ed3 The following Trac tickets were found above: Ticket 3862 --> https://svn.open-mpi.org/trac/ompi/ticket/3862	2013-10-23 16:17:51 +00:00
Nathan Hjelm	cde3b05ed3	Add support for the Intel scif interface. Depends on #3847. cmr=v1.7.4:reviewer=rhc This commit was SVN r29490.	2013-10-23 15:59:14 +00:00
Dave Goodell	d969cfa513	usnic: correctly clean up verbs resources Due to deallocation ordering (and an entirely missed deallocation), we were leaking modest amounts of memory inside libusnic_verbs. Reviewed-by: Jeff Squyres <jsquyres@cisco.com> This commit was SVN r29485.	2013-10-23 15:51:33 +00:00
Dave Goodell	a6ed232a10	usnic: fix several memory leaks - some free lists simply were not being OBJ_DESTRUCTed, so they never freed their internal memory - channel->recv_segs.ctx was being assigned in a way that got clobbered by ompi_free_list_init_new, so the cleanup code that relied on it being set never ran - numerous other ".ctx" assignments were similarly ineffectual and were not being consumed, so I deleted them Reviewed-by: Jeff Squyres <jsquyres@cisco.com> This commit was SVN r29484.	2013-10-23 15:51:22 +00:00
Dave Goodell	c9b2343982	usnic: add ompi_btl_usnic_component_debug helper This new routine can be called in exceptional situations, either conditionally in BTL code or from a debugger, to help with debugging in cases where MSGDEBUG1/2 or stats logging are impractical but more detail is needed. Reviewed-by: Jeff Squyres <jsquyres@cisco.com> This commit was SVN r29483.	2013-10-23 15:51:11 +00:00
Dave Goodell	d0b7d125b2	usnic: refactor usnic_stats_callback Pull the bulk of the functionality out into a new routine, ompi_btl_usnic_print_stats, which can be used in other debugging contexts. This also lets us eliminate the module->final_stats state tracking. Reviewed-by: Jeff Squyres <jsquyres@cisco.com> This commit was SVN r29482.	2013-10-23 15:50:57 +00:00
Jeff Squyres	0fb8edd720	Trivial comment change This commit was SVN r29480.	2013-10-23 10:15:18 +00:00
Mike Dubman	d6ead2a3a5	Add support for routable ROCE where different subnet_id is a valid to proceed with MPI routing. (can happen in the same LAN) developed by vasily, reviewed by miked cmr=v1.7.4:reviewer=ompi-gk1.7 This commit was SVN r29479.	2013-10-23 06:08:54 +00:00
Nathan Hjelm	280a89448f	Make btl/vader valgrind safe. cmr=v1.7.4:reviewer=samuel This commit was SVN r29464.	2013-10-22 15:33:32 +00:00
Rolf vandeVaart	0cd1e8dfd9	Add runtime support to turn off CUDA IPC support. This commit was SVN r29444.	2013-10-16 16:48:18 +00:00
Jeff Squyres	b5e2ae86ad	Remove all of our "to-do" items from the README.txt. This commit was SVN r29424.	2013-10-11 16:43:56 +00:00
Jeff Squyres	66dadbe1e7	Per RFC, remove the udapl BTL. This commit was SVN r29400.	2013-10-08 15:18:59 +00:00
Rolf vandeVaart	3bd02fbaf5	Add one more verbose debug output that prints when we are out of memory. This commit was SVN r29378.	2013-10-04 18:56:06 +00:00
Rolf vandeVaart	66725f6973	Enable some CUDA-aware support on tcp btl. Only when configured in. This commit was SVN r29364.	2013-10-04 12:50:16 +00:00
Nathan Hjelm	f3d18028e5	Fix typo in uGNI prepare source that could cause incorrect results with non-contiguous datatypes. cmr=v1.7.3 This commit was SVN r29294.	2013-09-30 16:00:58 +00:00
Dave Goodell	a42fa78da7	usnic: SEGV in OSU benchmarks Prevent frag from being freed out from under us in the case the PML callback routine calls usnic_free(). We accomplish this by delaying decrement of sf_bytes_to_ack until after the callback is performed, since sf_bytes_to_ack == 0 is condition of freeing the frag. Fixes Cisco bug CSCuj45094. Authored-by: Reese Faucette <rfaucett@cisco.com> cmr=v1.7.3 This commit was SVN r29264.	2013-09-26 21:48:04 +00:00
Ralph Castain	34fbec1f49	Sadly, the connection priorities being defined at time of variable instantiation were being overridden just before registering the param. Thus, changes people made to the relative priority of the cpc methods were being lost. Fix it be removing the duplicate initializiation, letting the value defined at instantiation be the one actually used. cmr:v1.7.4:reviewer=hjelmn This commit was SVN r29212.	2013-09-19 19:45:00 +00:00
Jeff Squyres	74d1278f48	btl_usnic_util.c:ompi_btl_usnic_util_abort() also passes in the strerror(). This commit was SVN r29188.	2013-09-17 12:35:51 +00:00
Reese Faucette	8f235e6977	usnic: wrong SG entry used to compute length for small put()s This commit was SVN r29186.	2013-09-17 08:18:02 +00:00
Reese Faucette	651d61f1a3	Clean up debugging logging a bit. MSGDEBUG2 now means "print a one-liner for all PML calls into BTL, and also when BTL calls PML with a recv completion (not send completions)" MSGDEBUG1 means print more internal gory detail MSGDEBUG is gone, replaced by MSGDEBUG1 In the process also found that PUT_DEST style fragments could potentially be leaked in usnic_free() since send_fragment tests were being applied to see if it was eligible to be freed. This commit was SVN r29185.	2013-09-17 07:29:40 +00:00
Reese Faucette	f35d9b50e3	Cisco CSCuj22803: fixes for Bsend changes required to support MPI_Bsend(). Introduces concept of attaching a buffer to a large segment that the PML can scribble into and we will send from. The reason we don't use a pinned buffer and send directly from that is that usnic_verbs does not (yes) support num_sge>1 for regular sends. This means the data gets copied twice, but that is unavoidable. changed the logic in handle_large_send to be more sensible Incorporated David's review comments This commit was SVN r29184.	2013-09-17 07:27:39 +00:00
Reese Faucette	25b5c84d0f	Cisco CSCuj13135: Data corruption in MPI_Bsend_ator_c Do not assume that the "size" passed to alloc_send() will be the same as the size of the message the resulting fragment will hold when usnic_send() is called. This means usnic_send()/usnic_put() can never trust any pre-computed size values, and are only allowed to look at the lengths and pointers of the elements in the desc SG list. This commit was SVN r29183.	2013-09-17 07:25:05 +00:00
Reese Faucette	b9103c0f66	Cisco CSCuj12524: c_put_big segfault - usnic_free() cannot free the fragment until ACK is received This commit was SVN r29182.	2013-09-17 07:23:15 +00:00
Reese Faucette	89b5f0899b	Cisco CSCuj12520: various problems running c_fence_put_1 - tag needs to be sent in our header, not the PML header - usnic_alloc() should return smaller value if too much data requested - be careful about callbacks vs removing items from lists (we need to remove from outr lists before the callback) - improve send callback handling - add some more MSGDEBUG2 logging and cleanup This commit was SVN r29181.	2013-09-17 07:20:44 +00:00
Rolf vandeVaart	096b8c022e	Also add flag to debug output. This commit was SVN r29163.	2013-09-13 19:47:05 +00:00
Rolf vandeVaart	c15b2a26b8	Fix some formatting. Move some CUDA-aware mca parameter initialization earlier. This commit was SVN r29162.	2013-09-13 17:43:41 +00:00
Rolf vandeVaart	d247c26b84	In the case that HAVE_IBV_FORK_INIT is not defined, we will need this variable so we can give the user an error if they ask for it. Also fixes compile error when HAVE_IBV_FORK_INIT is not defined. This commit was SVN r29160.	2013-09-13 14:38:49 +00:00
Rolf vandeVaart	ba9ec1b8bc	For debug builds, add the ability to view memory registrations and deregistrations in the openib BTL. This commit was SVN r29159.	2013-09-13 14:28:26 +00:00
Joshua Ladd	b3f88c4a1d	Per the RFC schedule, this commit adds Mellanox OpenSHMEM to the trunk. It does not yet run on OSX or with CM PML for an MTL other than MXM. Mellanox is aware of these issues and is in the process of resolving them. This should be added to \ncmr=v1.7.4:subject=Move OSHMEM to 1.7.4:reviewer=rhc This commit was SVN r29153.	2013-09-10 15:34:09 +00:00
Jeff Squyres	c9f05a2664	Delineate OMPI_FREE_LIST__MT separately. The FREE_LIST__MT stuff was introduced on the SVN trunk in r28722 (2013-07-04), but so far, has not been merged into the v1.7 branch yet (2013-09-06). So put it in its own #ifdef, rather than defining it based on OMPI_MAJOR_VERSION/OMPI_MINOR_VERSION. This commit was SVN r29148. The following SVN revision numbers were found above: r28722 --> open-mpi/ompi@c9e5ab9ed1	2013-09-06 19:22:56 +00:00
Jeff Squyres	e02cc0a7ec	No need for this header file. This commit was SVN r29147.	2013-09-06 19:22:28 +00:00
Jeff Squyres	c53b0890cf	Ensure that btl_usnic_compat.h is in the tarball. This commit was SVN r29140.	2013-09-06 15:53:56 +00:00
Dave Goodell	75fa28c303	usnic: v1.6<->trunk unification, trunk side The Cisco-maintained v1.6 port of the usnic BTL has diverged from the upstream trunk and v1.7 branches. This commit adjusts the trunk to more closely match the v1.6 branch to simplify future merging and cherry-picking. The usnic MCA parameters also need work on this side. Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29138. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:21:34 +00:00
Dave Goodell	a669bd01e6	usnic: revamp convertor handling. The fix for the HPL SEGV was incorrect because it assumed the prepare_src() routine was always allowed to return "bytes processed" less than the requested "bytes to send". It turns out this is only true if the convertor is what limits the size, we are not allowed to limit the data sent for our own reasons, else we break login in the upper layers. This means we need to learn the number of bytes out of the size requested the convertor will give us, no matter how big the size is. Unfortunately, this is a destructive test, and (currently) the only way to learn that number is to actually have the convertor copy the data out into buffers. This change implements this, copying the entire data out into a chain of send segments which are attached to the large send fragment. Now we can always return the proper size value to the PML. Fixes Cisco bug CSCuj08024 Authored-by: Reese Faucette <rfaucett@cisco.com> Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29137. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:21:21 +00:00
Dave Goodell	0ef8336502	new bookkeeping code should return value indicating whether packet is good or not. Authored-by: Reese Faucette <rfaucett@cisco.com> Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29136. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:19:32 +00:00
Dave Goodell	122890c2fd	usnic: "bookeeping" --> "bookkeeping" Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29135. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:19:20 +00:00
Dave Goodell	0df6ed4acc	usnic: squash warnings from perf improvements Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29134. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:19:08 +00:00
Dave Goodell	6dc54d372d	usnic: Basket of performance changes including: - round segment buffer allocation to cache-line - split some routines into an inline fast section and a called slower section - introduce receive fastpath in component_progress that: o returns immediately if there is a packet available on priority queue and fastpath is enabled o disables fastpath for 1 time after use to provide fairness to other processing o defers receive buffer posting o defers bookeeping for receive until next call to usnic_component_progress Authored-by: Reese Faucette <rfaucett@cisco.com> Should be included in usnic v1.7.3 roll-up CMR (refs trac:3760) This commit was SVN r29133. The following Trac tickets were found above: Ticket 3760 --> https://svn.open-mpi.org/trac/ompi/ticket/3760	2013-09-06 03:18:57 +00:00
Dave Goodell	9cab9777d9	usnic: properly destroy embedded small send frag Without this, an `--enable-debug` build would hit an assertion in the list code when run under valgrind with `--malloc-fill=0xff` or any other case where malloc returned non-zeroed buffers. Also allow the normal OBJ_ machinery to handle the constructor invocation ordering for us instead of doing it by hand (which could have led to future bugs). Reviewed-by: jsquyres@cisco.com cmr=v1.7.4 Depends on trunk functionality in r29095 and r29096. Refs trac:3740,#3741. This commit was SVN r29127. The following SVN revision numbers were found above: r29095 --> open-mpi/ompi@d1b5940e97 r29096 --> open-mpi/ompi@a552921171 The following Trac tickets were found above: Ticket 3740 --> https://svn.open-mpi.org/trac/ompi/ticket/3740	2013-09-04 20:59:12 +00:00
Brian Barrett	16a1166884	Remove the proc_pml and proc_bml fields from ompi_proc_t and replace with a configure-time dynamic allocation of flags. The net result for platforms which only support BTL-based communication is a reduction of 8*nprocs bytes per process. Platforms which support both MTLs and BTLs will not see a space reduction, but will now be able to safely run both the MTL and BTL side-by-side, which will prove useful. This commit was SVN r29100.	2013-08-30 16:54:55 +00:00
Rolf vandeVaart	18962d296b	This has bothered me for a while. Change MCA_BTL_TAG_BTL to MCA_BTL_TAG_IB. They are the same value so this does not change anything. (MCA_BTL_TAG_IB = MCA_BTL_TAG_BTL + 0). This just makes it more correct. This commit was SVN r29099.	2013-08-30 14:53:59 +00:00
Dave Goodell	c5a7e8a079	usnic: stomp format specifier warnings The usnic BTL now builds cleanly under `--enable-picky` when `MSGDEBUG1` is set. Reviewed-by: jsquyres cmr=v1.7.4:reviewer=jsquyres This commit was SVN r29097.	2013-08-29 23:24:14 +00:00
George Bosilca	305fa88d4b	Remove two warnings from the SM BTL. The return code can be safely ignored as the internals of the SM BTL will repost the fragment until the send operation succesfully complete. This commit was SVN r29077.	2013-08-28 06:36:01 +00:00
Dave Goodell	dd82bd3c19	usnic: fix invalid rfstart initialization endpoint_rfstart was being initialized from a value which was not yet set. Also ensure that rfstart is a valid index in the range 0..WINDOW_SIZE-1, since it is used as the index into endpoint_rcvd_segs, which has WINDOW_SIZE elements. Without this change there is significant risk of memory corruption or segfaults, resulting in hangs or crashes, if malloc ever returns us a value >=WINDOW_SIZE (4096). Right now we seem to be getting lucky that the malloc is returning zero-pages to us when we are allocating endpoint structures (possibly because the freelist performs a single large allocation for all endpoints). Fixes Cisco bug CSCui88781. Reviewed-by: rfaucett@cisco.com Reviewed-by: jsquyres@cisco.com cmr=v1.7.3:reviewer=jsquyres This commit was SVN r29075.	2013-08-27 22:43:20 +00:00
Rolf vandeVaart	96457df9bc	Fix compile errors created from changeset 29058. This commit was SVN r29061.	2013-08-22 18:25:23 +00:00
Jeff Squyres	63ac60864b	Refs trac:3730 Turns out that AC_CHECK_DECLS is one of the "new style" Autoconf macros that #defines the output to be 0 or 1 (vs. #define'ing or #undef'ing it). So don't check for "#if defined(..."; just check for "#if ...". This commit was SVN r29059. The following Trac tickets were found above: Ticket 3730 --> https://svn.open-mpi.org/trac/ompi/ticket/3730	2013-08-22 17:44:20 +00:00
Ralph Castain	a200e4f865	As per the RFC, bring in the ORTE async progress code and the rewrite of OOB: * THIS RFC INCLUDES A MINOR CHANGE TO THE MPI-RTE INTERFACE * Note: during the course of this work, it was necessary to completely separate the MPI and RTE progress engines. There were multiple places in the MPI layer where ORTE_WAIT_FOR_COMPLETION was being used. A new OMPI_WAIT_FOR_COMPLETION macro was created (defined in ompi/mca/rte/rte.h) that simply cycles across opal_progress until the provided flag becomes false. Places where the MPI layer blocked waiting for RTE to complete an event have been modified to use this macro. *************************************************************************************** I am reissuing this RFC because of the time that has passed since its original release. Since its initial release and review, I have debugged it further to ensure it fully supports tests like loop_spawn. It therefore seems ready for merge back to the trunk. Given its prior review, I have set the timeout for one week. The code is in https://bitbucket.org/rhc/ompi-oob2 WHAT: Rewrite of ORTE OOB WHY: Support asynchronous progress and a host of other features WHEN: Wed, August 21 SYNOPSIS: The current OOB has served us well, but a number of limitations have been identified over the years. Specifically: * it is only progressed when called via opal_progress, which can lead to hangs or recursive calls into libevent (which is not supported by that code) * we've had issues when multiple NICs are available as the code doesn't "shift" messages between transports - thus, all nodes had to be available via the same TCP interface. * the OOB "unloads" incoming opal_buffer_t objects during the transmission, thus preventing use of OBJ_RETAIN in the code when repeatedly sending the same message to multiple recipients * there is no failover mechanism across NICs - if the selected NIC (or its attached switch) fails, we are forced to abort * only one transport (i.e., component) can be "active" The revised OOB resolves these problems: * async progress is used for all application processes, with the progress thread blocking in the event library * each available TCP NIC is supported by its own TCP module. The ability to asynchronously progress each module independently is provided, but not enabled by default (a runtime MCA parameter turns it "on") * multi-address TCP NICs (e.g., a NIC with both an IPv4 and IPv6 address, or with virtual interfaces) are supported - reachability is determined by comparing the contact info for a peer against all addresses within the range covered by the address/mask pairs for the NIC. * a message that arrives on one TCP NIC is automatically shifted to whatever NIC that is connected to the next "hop" if that peer cannot be reached by the incoming NIC. If no TCP module will reach the peer, then the OOB attempts to send the message via all other available components - if none can reach the peer, then an "error" is reported back to the RML, which then calls the errmgr for instructions. * opal_buffer_t now conforms to standard object rules re OBJ_RETAIN as we no longer "unload" the incoming object * NIC failure is reported to the TCP component, which then tries to resend the message across any other available TCP NIC. If that doesn't work, then the message is given back to the OOB base to try using other components. If all that fails, then the error is reported to the RML, which reports to the errmgr for instructions * obviously from the above, multiple OOB components (e.g., TCP and UD) can be active in parallel * the matching code has been moved to the RML (and out of the OOB/TCP component) so it is independent of transport * routing is done by the individual OOB modules (as opposed to the RML). Thus, both routed and non-routed transports can simultaneously be active * all blocking send/recv APIs have been removed. Everything operates asynchronously. KNOWN LIMITATIONS: * although provision is made for component failover as described above, the code for doing so has not been fully implemented yet. At the moment, if all connections for a given peer fail, the errmgr is notified of a "lost connection", which by default results in termination of the job if it was a lifeline * the IPv6 code is present and compiles, but is not complete. Since the current IPv6 support in the OOB doesn't work anyway, I don't consider this a blocker * routing is performed at the individual module level, yet the active routed component is selected on a global basis. We probably should update that to reflect that different transports may need/choose to route in different ways * obviously, not every error path has been tested nor necessarily covered * determining abnormal termination is more challenging than in the old code as we now potentially have multiple ways of connecting to a process. Ideally, we would declare "connection failed" when all transports can no longer reach the process, but that requires some additional (possibly complex) code. For now, the code replicates the old behavior only somewhat modified - i.e., if a module sees its connection fail, it checks to see if it is a lifeline. If so, it notifies the errmgr that the lifeline is lost - otherwise, it notifies the errmgr that a non-lifeline connection was lost. * reachability is determined solely on the basis of a shared subnet address/mask - more sophisticated algorithms (e.g., the one used in the tcp btl) are required to handle routing via gateways * the RML needs to assign sequence numbers to each message on a per-peer basis. The receiving RML will then deliver messages in order, thus preventing out-of-order messaging in the case where messages travel across different transports or a message needs to be redirected/resent due to failure of a NIC This commit was SVN r29058.	2013-08-22 16:37:40 +00:00
Rolf vandeVaart	504fa2cda9	Fix support in smcuda btl so it does not blow up when there is no CUDA IPC support between two GPUs. Also make it so CUDA IPC support is added dynamically. Fixes ticket 3531. This commit was SVN r29055.	2013-08-21 21:00:09 +00:00
Rolf vandeVaart	96fdb060ea	Fix compile errors and warnings from changeset 29052. This commit was SVN r29054.	2013-08-21 19:01:54 +00:00
Steve Wise	67fe3f23ed	Use the HAVE_DECL_IBV_LINK_LAYER_ETHERNET macro. Commit r27211 added ifdef checks for #define HAVE_IBV_LINK_LAYER_ETHERNET, which is incorrect. The correct #define is HAVE_DECL_IBV_LINK_LAYER_ETHERNET. This broke OMPI over iWARP. This fixes trac:3726 and should be added to cmr:v1.7.3:reviewer=jsquyres This commit was SVN r29053. The following SVN revision numbers were found above: r27211 --> open-mpi/ompi@b27862e5c7 The following Trac tickets were found above: Ticket 3726 --> https://svn.open-mpi.org/trac/ompi/ticket/3726	2013-08-20 20:00:46 +00:00
Ralph Castain	45e695928f	As per the email discussion, revise the sparse handling of hostnames so that we avoid potential infinite loops while allowing large-scale users to improve their startup time: * add a new MCA param orte_hostname_cutoff to specify the number of nodes at which we stop including hostnames. This defaults to INT_MAX => always include hostnames. If a value is given, then we will include hostnames for any allocation smaller than the given limit. * remove ompi_proc_get_hostname. Replace all occurrences with a direct link to ompi_proc_t's proc_hostname, protected by appropriate "if NULL" * modify the OMPI-ORTE integration component so that any call to modex_recv automatically loads the ompi_proc_t->proc_hostname field as well as returning the requested info. Thus, any process whose modex info you retrieve will automatically receive the hostname. Note that on-demand retrieval is still enabled - i.e., if we are running under direct launch with PMI, the hostname will be fetched upon first call to modex_recv, and then the ompi_proc_t->proc_hostname field will be loaded * removed a stale MCA param "mpi_keep_peer_hostnames" that was no longer used anywhere in the code base * added an envar lookup in ess/pmi for the number of nodes in the allocation. Sadly, PMI itself doesn't provide that info, so we have to get it a different way. Currently, we support PBS-based systems and SLURM - for any other, rank0 will emit a warning and we assume max number of daemons so we will always retain hostnames This commit was SVN r29052.	2013-08-20 18:59:36 +00:00
Jeff Squyres	b30ad28276	Remove some unused variables and an unused goto label. This commit was SVN r29044.	2013-08-19 16:18:35 +00:00
Ralph Castain	611d7f9f6b	When we direct launch an application, we rely on PMI for wireup support. In doing so, we lose the de facto data compression we get from the ORTE modex since we no longer get all the wireup info from every proc in a single blob. Instead, we have to iterate over all the procs, calling PMI_KVS_get for every value we require. This creates a really bad scaling behavior. Users have found a nearly 20% launch time differential between mpirun and PMI, with PMI being the slower method. Some of the problem is attributable to poor exchange algorithms in RM's like Slurm and Alps, but we make things worse by calling "get" so many times. Nathan (with a tad advice from me) has attempted to alleviate this problem by reducing the number of "get" calls. This required the following changes: * upon first request for data, have the OPAL db pmi component fetch and decode all the info from a given remote proc. It turned out we weren't caching the info, so we would continually request it and only decode the piece we needed for the immediate request. We now decode all the info and push it into the db hash component for local storage - and then all subsequent retrievals are fulfilled locally * reduced the amount of data by eliminating the exchange of the OMPI_ARCH value if heterogeneity is not enabled. This was used solely as a check so we would error out if the system wasn't actually homogeneous, which was fine when we thought there was no cost in doing the check. Unfortunately, at large scale and with direct launch, there is a non-zero cost of making this test. We are open to finding a compromise (perhaps turning the test off if requested?), if people feel strongly about performing the test * reduced the amount of RTE data being automatically fetched, and fetched the rest only upon request. In particular, we no longer immediately fetch the hostname (which is only used for error reporting), but instead get it when needed. Likewise for the RML uri as that info is only required for some (not all) environments. In addition, we no longer fetch the locality unless required, relying instead on the PMI clique info to tell us who is on our local node (if additional info is required, the fetch is performed when a modex_recv is issued). Again, all this only impacts direct launch - all the info is provided when launched via mpirun as there is no added cost to getting it Barring objections, we may move this (plus any required other pieces) to the 1.7 branch once it soaks for an appropriate time. This commit was SVN r29040.	2013-08-17 00:49:18 +00:00
Ralph Castain	f8a72feb25	Silence unitialized var warning This commit was SVN r29036.	2013-08-16 21:39:28 +00:00
Jeff Squyres	c09ec204ad	Change usNIC BTL to always use small fragments when there is a non-contiguous converter. We can't "convert on the fly" because the # of bytes requested may not divide evenly into the convertor data type. This commit was SVN r29014.	2013-08-11 17:04:13 +00:00
George Bosilca	837b3363fe	Silence few warnings. This commit was SVN r29004.	2013-08-06 09:38:30 +00:00
Brian Barrett	2cc947513b	* Fix some compile errors * Need to subtract 1 off the size so that we stay in the bit length requirements This commit was SVN r28997.	2013-08-05 18:49:48 +00:00
Jeff Squyres	87910daf51	Fix a collection of bugs found by QA and Coverity, and make some minor improvements: * Fix minor memory leaks during component_init * Ensure that an initialization loop does not underflow an unsigned int * Improve mlock limit checking * Fix set of BTL modules created during component_init when failing to get QP resources or otherwise excluding some (but not all) usnic verbs devices * Fix/improve error messages to be consistent with other Cisco documentation * Randomize the initial sliding window sequence number so that we silently drop incoming frames from previous jobs that still have existant processes in the middle of dying (and are still transmitting) * Ensure we don't break out of add_procs too soon and create an asymetrical view of what interfaces are available This commit was SVN r28975.	2013-08-01 16:56:15 +00:00
Jeff Squyres	f7337b8f77	Correct faulty max payload and MTU computations (and update some debugging that helped us find those). This commit was SVN r28942.	2013-07-24 16:06:28 +00:00
Jeff Squyres	5323051047	Use sysfs to check MPI has enough VFs, QPs, and CQs Use the new sysfs files to check that there are enough VFs, QPs, and CQs for all the MPI processes on this server. Move the checking code into its own subroutine to make it smaller and easier to read/grok. This commit was SVN r28937.	2013-07-24 00:38:32 +00:00
Jeff Squyres	b437041aeb	Update one more comment. This commit was SVN r28908.	2013-07-22 17:29:00 +00:00
Jeff Squyres	4b6006402d	Use the RTE framework instead of calling ORTE directly. Brian (rightfully) hit me on the head with the don't-use-ORTE-use-the-rte-framework clue bat; the usnic BTL now nicely plays with the RTE framework. This commit was SVN r28907.	2013-07-22 17:28:23 +00:00
Jeff Squyres	194b285447	First commit of the Cisco usNIC BTL. This BTL accesses the Cisco usNIC Linux device via the Linux verbs API via Unreliable Datagram queue pairs. A few noteworthy points: * This BTL does most of its own fragmentation; it tells the PML that it has a very high max_send_size (much higher than the network MTU). * Since UD fragments are, by definition, unreliable, the usnic BTL handles all of its own reliability via a sliding window approach using the opal_hotel construct and many tricks stolen from the corpus of knowledge surrounding efficient TCP. * There is a fun PML latency-metric based optimization for NUMA awareness of short messages. * Note that this is ''not'' a generic UD verbs BTL; it is specific to the Cisco usNIC device. This commit was SVN r28879.	2013-07-19 22:13:58 +00:00
Jeff Squyres	3546163c48	Devices that do not support RC QP's are also intentionally skipped; don't warn about skipping them. This commit was SVN r28874.	2013-07-19 19:05:18 +00:00
Rolf vandeVaart	49663fb802	Move CUDA-aware configurary to its own file and other minor changes due to review. This commit was SVN r28832.	2013-07-17 22:12:29 +00:00
Mike Dubman	5bd2e15cbb	support for ConnectX3-Pro card. cmr:v1.7:reviewer=jsquyres cmr:v1.6:reviewer=jsquyres This commit was SVN r28787.	2013-07-14 06:44:19 +00:00
Nathan Hjelm	dfca3d4804	fix typos in the ugni and vader btls This commit was SVN r28772.	2013-07-12 17:55:33 +00:00
Nathan Hjelm	1119cd3e8a	Merge branch 'vader_fix' This commit was SVN r28764.	2013-07-11 23:30:20 +00:00
Brian Barrett	2f19fc52de	use the same multi-md workaround the rest of the Portals code is using. This commit was SVN r28761.	2013-07-11 21:00:11 +00:00
Nathan Hjelm	b5281778b0	btl/vader: improve small message performance This commit improved the small message latency and bandwidth when using the vader btl. These improvements should make performance competative with other MPI implementations. This commit was SVN r28760.	2013-07-11 20:54:12 +00:00
Brian Barrett	bea54eeeb1	First take at a BTL for Portals 4 This commit was SVN r28759.	2013-07-11 20:47:08 +00:00
Jeff Squyres	baa3182794	Per RFC (http://www.open-mpi.org/community/lists/devel/2013/07/12534.php), remove a bunch of dead code. This commit was SVN r28756.	2013-07-11 17:34:28 +00:00
Rolf vandeVaart	5051cd53fd	Use new API. This commit was SVN r28754.	2013-07-11 17:06:14 +00:00
Jeff Squyres	80145742a3	Fix typo in comment This commit was SVN r28747.	2013-07-10 15:13:08 +00:00
Jeff Squyres	ea94936531	First cut at assigning some fine-grained "levels" to MCA parameters for the SM and TCP BTLs, as well as the mca_btl_base_param_register() function (which registers MCA params for all BTLs). The guidelines in https://svn.open-mpi.org/trac/ompi/wiki/MCAParamLevels were used to pick these levels. This commit was SVN r28746.	2013-07-10 00:47:52 +00:00
Aurelien Bouteiller	e1066143a4	rename ompi_free_list operations to _mt, as per discussions at last face to face meeting This commit was SVN r28734.	2013-07-08 22:07:52 +00:00
George Bosilca	483ed8da8c	Remove an unused variable resulting from the removal of the last parameter of the OMPI_FREE_LIST_GET macro. This commit was SVN r28723.	2013-07-04 09:19:00 +00:00
George Bosilca	c9e5ab9ed1	Our macros for the OMPI-level free list had one extra argument, a possible return value to signal that the operation of retrieving the element from the free list failed. However in this case the returned pointer was set to NULL as well, so the error code was redundant. Moreover, this was a continuous source of warnings when the picky mode is on. The attached parch remove the rc argument from the OMPI_FREE_LIST_GET and OMPI_FREE_LIST_WAIT macros, and change to check if the item is NULL instead of using the return code. This commit was SVN r28722.	2013-07-04 08:34:37 +00:00
George Bosilca	b82abf6bef	Silence a compiler warning. This commit was SVN r28686.	2013-07-01 11:40:42 +00:00
Jeff Squyres	a0b27f5b28	Better comment than what was submitted in r28614. This commit was SVN r28631. The following SVN revision numbers were found above: r28614 --> open-mpi/ompi@9556310bd0	2013-06-13 20:52:44 +00:00
Mike Dubman	9556310bd0	cosmetic: add comment with rationale for malloc.h include This commit was SVN r28614.	2013-06-12 05:58:32 +00:00
Nathan Hjelm	9b1f32bf12	BTL: add flags for signaled BTL operations As per discussion in the June 2013 developer meeting these flags will be used by the PML in the future to request asynchronous progress on an operation. The naming was chosen to reflect that a BTL supports this mode (MCA_BTL_FLAG_SIGNALED) and that a descriptor should "signal" the remote side to wake up and progress the message (MCA_BTL_DES_FLAG_SIGNAL). Future commits will update OB1 to take advantage of this feature when performing the RDMA get or RDMA rendezvous protocols. This commit was SVN r28612.	2013-06-11 21:52:20 +00:00
Mike Dubman	d18b3ae1a7	fix malloc deprication error with gcc 4.6.3 on ubuntu/fedora This commit was SVN r28605.	2013-06-09 18:13:16 +00:00
Jeff Squyres	713e3aa3db	Refs trac:3626: that ticket specifically refers to the v1.6 branch; this commit is the trunk version of what is needed for #3626. Add the "ignore_device" field to the INI file. This allows us to specifically list devices that should be ignored by the openib BTL (such as the Intel Phi, at least as of May 2013 -- see #3626). Also add the Intel Phi to the ini file, and set its ignore_device=1. Finally, add the concept of counting intentionally ignored verbs devices. Devices are ignored for one of two reasons: * If the number of allowed ports on that device is 0 (i.e., if if_include/if_exclude was set such that we're intentionally ignoring this device). * If the INI ignore_device field for this device is set to 1. Once we have the count of devices that were intentionally ignored, only show the "Hey, there's verbs devices that you're not using!" show_help message if there are devices that were ''unintentionally'' ignored. This commit was SVN r28589. The following Trac tickets were found above: Ticket 3626 --> https://svn.open-mpi.org/trac/ompi/ticket/3626	2013-06-05 12:12:09 +00:00
Jeff Squyres	3019b7a3f8	Oops! Remove duplicate registration. This commit was SVN r28588.	2013-06-05 11:55:19 +00:00
Jeff Squyres	1de00b17ad	Properly check the return status from registering the MCA params. This commit was SVN r28587.	2013-06-05 11:53:18 +00:00
Rolf vandeVaart	3d1d158a80	Do not abort in BTL. Rather, callback into PML error function. Thanks George for review. This commit was SVN r28559.	2013-05-23 18:45:23 +00:00
Nathan Hjelm	721779d7ab	Per RFC: remove old MCA parameter system. This commit was SVN r28541.	2013-05-20 15:36:13 +00:00
Rolf vandeVaart	91fdb423d7	Fix warning in CUDA-aware code. This commit was SVN r28511.	2013-05-14 21:04:15 +00:00
Rolf vandeVaart	52ebb0b17f	Change some opal_output to OPAL_OUTPUT per CMR review. This commit was SVN r28510.	2013-05-14 20:49:42 +00:00
Nathan Hjelm	32a8ff5255	btl/openib: bump up udcm priority This commit was SVN r28505.	2013-05-14 20:02:40 +00:00
Rolf vandeVaart	9d569f1487	Fix warning when compiling in CUDA aware code. This commit was SVN r28476.	2013-05-10 21:29:08 +00:00
Nathan Hjelm	422331b4da	btl/openib: fix unconnected datagram connection method (udcm) The primary issue with udcm is that the immediate data in message acks were often bogus. This caused the sender to keep trying even though a message was received and acked. The fix is to use the source LID and QP to determine which message is being acked. In most cases this should work well since only one message will be in flight to any peer. This commit was SVN r28444.	2013-05-03 17:11:38 +00:00
Alex Mikheev	9e2fdc7d56	- correction of r28440 This commit was SVN r28441. The following SVN revision numbers were found above: r28440 --> open-mpi/ompi@93ce233530	2013-05-02 12:52:58 +00:00
Alex Mikheev	93ce233530	- btl_openib: changed default SRQ settings: - increase number of wqe to minimize number of RNRs - it is better to have high watermark and post relatively small number of wqes - increased TX queue size This commit was SVN r28440.	2013-05-02 12:46:35 +00:00
Alex Mikheev	f76680fbd0	- btl_openib: fix total registered memory calculation for ConnectIB and Ofed 2.0 This commit was SVN r28432.	2013-05-01 13:39:29 +00:00
Jeff Squyres	d92a8e01f8	Use the _SAFE list traversal macro so that we can remove each item from the list (just for good measure), and then free() it (without using _SAFE, we were accessing memory that was just free()'d to get to the next item). Also be a little more thorough -- DESTRUCT the list when we're all done. This commit was SVN r28429.	2013-05-01 12:26:16 +00:00
George Bosilca	8b0335380a	Fix the error messages to reference the correct function. This commit was SVN r28425.	2013-04-30 23:26:03 +00:00
George Bosilca	6a75c84fa8	Remove useless define. This commit was SVN r28424.	2013-04-30 23:24:59 +00:00
Ralph Castain	ceb4061214	Fix BTL_VERBOSE - when the MCA param change was committed, it left the base verbosity variable declared so things compiled. Sadly, the verbosity was now being set to a new variable, so debug never was output. This commit was SVN r28414.	2013-04-30 01:15:52 +00:00
Nathan Hjelm	f384263de7	btl/openib: fix typo This commit was SVN r28413.	2013-04-29 22:21:25 +00:00
Ralph Castain	8996ecb128	Add missing include This commit was SVN r28405.	2013-04-27 00:09:36 +00:00
Jeff Squyres	f55cea1a5b	If there are no BTLs, do ''not'' actually shut down the fd listener, because a) it may still be needed to shut down the CPCs, and b) it will be shut down during component_close(). This commit was SVN r28402.	2013-04-26 15:31:50 +00:00
Jeff Squyres	99b7a0f20d	Remove unused variables. This commit was SVN r28401.	2013-04-26 15:29:42 +00:00
Nathan Hjelm	2edff7f784	btl/openib: don't free string handle by MCA variable system This commit was SVN r28383.	2013-04-24 18:59:18 +00:00
Rolf vandeVaart	5e1dde419c	Fix some compile errors in CUDA-aware code that has crept in. This commit was SVN r28346.	2013-04-18 15:34:16 +00:00
Steve Wise	134baaf2fa	Add Chelsio T5 device. This fixes trac:3552 and should be added to cmr:v1.6:reviewer=jsquyres and cmr:v1.7:reviewer=jsquyres This commit was SVN r28327. The following Trac tickets were found above: Ticket 3552 --> https://svn.open-mpi.org/trac/ompi/ticket/3552	2013-04-11 19:30:53 +00:00
George Bosilca	2d33c9ee39	Stop complaining about an overwritten default parameter. This commit was SVN r28322.	2013-04-10 19:44:37 +00:00
Jeff Squyres	8405975bf6	Be a little more conservative about initializing devices and modules (i.e., ensure that more data items get zeroed out/set to NULL) so that if something goes wrong during initialization, we don't try to clean up something that isn't there (and segv). The chance of this happening on the trunk is very low (and will also be low once the verbs improvements are brought over to v1.7). But it can actually happen in the v1.6 branch (e.g., if no CPC is available, we'll try to get the length of the endpoints list, but the endpoints list is NULL). Hence, even though the real goal is to get this functionality over to v1.6, I figured I'd commit to the trunk/CMR to v1.7 just to try to keep commonality in the openib between all three where possible. This commit was SVN r28317.	2013-04-09 21:55:31 +00:00
Pavel Shamis	fed6e60131	Fixing OpenIB BTL compilation failure for a cases when BTL_OPENIB_MALLOC_HOOKS_ENABLED is disabled. This commit was SVN r28290.	2013-04-04 20:17:18 +00:00
Nathan Hjelm	f1fa290157	btl/vader: add missing return statement This commit was SVN r28252.	2013-03-27 22:16:21 +00:00
Nathan Hjelm	113fadd749	btl/vader: do not use common/sm for shared memory fragments This commit was SVN r28250.	2013-03-27 22:10:02 +00:00
Nathan Hjelm	9d4a26f47d	Update OMPI frameworks to use the MCA framework system. Notes: - This commit also eliminates the need for an available components list in use in several frameworks. None of the code in question was making use of the priority field of the priority component list item so these extra lists were removed. - Cleaned up selection code in several frameworks to sort lists using opal_list_sort. - Cleans up the ompi/orte-info functions. Expose the functions that construct the list of params so they can be used elsewhere. patches for mtl/portals4 from brian missed a few output variables in openib This commit was SVN r28241.	2013-03-27 21:17:31 +00:00
Nathan Hjelm	cf377db823	MCA/base: Add new MCA variable system Features: - Support for an override parameter file (openmpi-mca-param-override.conf). Variable values in this file can not be overridden by any file or environment value. - Support for boolean, unsigned, and unsigned long long variables. - Support for true/false values. - Support for enumerations on integer variables. - Support for MPIT scope, verbosity, and binding. - Support for command line source. - Support for setting variable source via the environment using OMPI_MCA_SOURCE_<var name>=source (either command or file:filename) - Cleaner API. - Support for variable groups (equivalent to MPIT categories). Notes: - Variables must be created with a backing store (char *, int , or bool *) that must live at least as long as the variable. - Creating a variable with the MCA_BASE_VAR_FLAG_SETTABLE enables the use of mca_base_var_set_value() to change the value. - String values are duplicated when the variable is registered. It is up to the caller to free the original value if necessary. The new value will be freed by the mca_base_var system and must not be freed by the user. - Variables with constant scope may not be settable. - Variable groups (and all associated variables) are deregistered when the component is closed or the component repository item is freed. This prevents a segmentation fault from accessing a variable after its component is unloaded. - After some discussion we decided we should remove the automatic registration of component priority variables. Few component actually made use of this feature. - The enumerator interface was updated to be general enough to handle future uses of the interface. - The code to generate ompi_info output has been moved into the MCA variable system. See mca_base_var_dump(). opal: update core and components to mca_base_var system orte: update core and components to mca_base_var system ompi: update core and components to mca_base_var system This commit also modifies the rmaps framework. The following variables were moved from ppr and lama: rmaps_base_pernode, rmaps_base_n_pernode, rmaps_base_n_persocket. Both lama and ppr create synonyms for these variables. This commit was SVN r28236.	2013-03-27 21:09:41 +00:00
Ralph Castain	317915225c	Finish the binding cleanup by removing the no-longer-used binding level scheme. This proved to be fallible as there is no guarantee that the hierarchy it used matched physical reality of the machine (e.g., is L3 "above" the socket or not). Still have to complete the ppr update, but get the rest of it correct. This commit was SVN r28223.	2013-03-26 20:09:49 +00:00
Samuel Gutierrez	8ce2041102	Cleanup in error path. Fixes CID 967211. Thanks, Jeff. This commit was SVN r28183.	2013-03-19 20:00:08 +00:00
Jeff Squyres	2513122d31	Remove extraneous semicolon. This commit was SVN r28180.	2013-03-18 23:58:11 +00:00
Rolf vandeVaart	5c761d701d	Remove tabs for spaces, fix some error messages. This commit was SVN r28141.	2013-03-01 19:13:06 +00:00
Ralph Castain	a4b6fb241f	Remove all remaining vestiges of the Windows integration This commit was SVN r28137.	2013-02-28 17:31:47 +00:00
Ralph Castain	8d2fa3693b	First cut at removing the native Windows support. Remove all the Windows-specific components, and the .windows files sprinkled around. Remove the Windows platform files and MTT scripts. Update the NEWS to point Windows users to the cygwin package. This commit was SVN r28116.	2013-02-26 20:44:56 +00:00
Ralph Castain	70a28c8a27	Now that we are using local ranks in OMPI, we need to define an ompi_local_rank_t and equate it to orte_local_rank_t. Change the sm btl to use the correct abstraction. This commit was SVN r28098.	2013-02-22 17:48:53 +00:00
Samuel Gutierrez	af5ed9b25c	OMPI_NODE_RANK_INVALID ==> OMPI_LOCAL_RANK_INVALID This commit was SVN r28096.	2013-02-21 18:28:07 +00:00
Samuel Gutierrez	4bf0134901	Remove debug. This commit was SVN r28095.	2013-02-21 18:21:22 +00:00
Samuel Gutierrez	b7791963f2	Fix sm BTL initialization for MPI_Comm_spawn and friends. Thanks to Jeff for finding the issue. This commit was SVN r28094.	2013-02-21 18:19:46 +00:00
Nathan Hjelm	55cf850eca	Add comment about r28083 This commit was SVN r28084. The following SVN revision numbers were found above: r28083 --> open-mpi/ompi@5411e28c00	2013-02-20 21:42:13 +00:00
Nathan Hjelm	5411e28c00	btl/openib: don't align fragments on 2 byte boundaries (changed to 8) cmr:v1.6,v1.7 This commit was SVN r28083.	2013-02-20 21:27:01 +00:00
Rolf vandeVaart	da3e9ff906	Add show_help.h where needed. This commit was SVN r28071.	2013-02-19 15:42:09 +00:00
Ralph Castain	ebad55b933	Apply patches from ORNL to fix compile issues - minor stuff. Thanks to Geoffroy Vallee for the patches. This commit was SVN r28065.	2013-02-15 22:14:23 +00:00
Jeff Squyres	bbddd6ea03	Add header file for opal_show_help(). This commit was SVN r28056.	2013-02-13 16:31:59 +00:00
Brian Barrett	312f37706e	In talking about this with Jeff and Ralph, we don't actually need ompi_show_help, because opal_show_help is replaced with an aggregating version when using ORTE, so there's no reason to directly call orte_show_help. This commit was SVN r28051.	2013-02-12 21:10:11 +00:00
Joshua Ladd	70ad711337	Backing out the Open SHMEM project This commit was SVN r28050.	2013-02-12 17:45:27 +00:00
Mike Dubman	ff384daab4	Added new project: oshmem. This commit was SVN r28048.	2013-02-12 15:33:21 +00:00
Jeff Squyres	c8dc1905f0	Fixes trac:3494: If we get 0 bytes back for the ACK, it doesn't necessarily mean an error -- it could (and usually does) mean that the peer realized that we both initiated a connect at the same time, and therefore it decided to hang up. I also added a friendly show_help error message for other cases where recv_blocking() fails (i.e., "Something went wrong. Kaboom! Your job will abort..."). This commit was SVN r28023. The following Trac tickets were found above: Ticket 3494 --> https://svn.open-mpi.org/trac/ompi/ticket/3494	2013-02-02 01:19:03 +00:00
Jeff Squyres	f05b7aa6d8	As the help message states, it's not an ''error'' if the specified interface is not found. It should just be skipped. This commit was SVN r28016.	2013-02-01 20:17:43 +00:00
Brian Barrett	b8442ba505	Revamp the handling of wrapper compiler flags. The user flags, main configure flags, and mca flags are kept seperate until the very end. The main configure wrapper flags should now be modified by using the OPAL_WRAPPER_FLAGS_ADD macro. MCA components should either let <framework>_<component>_{LIBS,LDFLAGS} be copied over OR set <framework>_<component>_WRAPPER_EXTRA_{LIBS,LDFLAGS}. The situations in which WRAPPER CPPFLAGS can be set by MCA components was made very small to match the one use case where it makes sense. This commit was SVN r27950.	2013-01-29 00:00:43 +00:00
George Bosilca	4defdea9f2	The shortest lifespan for a BTL. This commit was SVN r27939.	2013-01-28 03:43:23 +00:00
George Bosilca	1b7dff3f2f	A copy for posterity of the Open MPI Sicortex BTL. This commit was SVN r27938.	2013-01-28 03:42:52 +00:00
Brian Barrett	f42783ae1a	Move the RTE framework change into the trunk. With this change, all non-CR runtime code goes through one of the rte, dpm, or pubsub frameworks. This commit was SVN r27934.	2013-01-27 23:25:10 +00:00
George Bosilca	42753b4690	Make the TCP BTL really fail-safe. It now trigger the error callback on all pending fragments when the destination goes down. This allows the PML to recalibrate its behavior, either find an alternate route or just give up. This commit was SVN r27881.	2013-01-21 11:41:08 +00:00
George Bosilca	d2281cc672	Remove the CMA related warnings. This commit was SVN r27872.	2013-01-19 14:26:43 +00:00
Rolf vandeVaart	f63c88701f	Improve CUDA GPU transfers over openib BTL. Use aynchronous copies. This is RFC that was submitted in July and December of 2012. This commit was SVN r27862.	2013-01-17 22:34:43 +00:00
Rolf vandeVaart	a07a4bb3f7	Update smcuda to match recent changes in sm BTL. This commit was SVN r27803.	2013-01-14 14:42:19 +00:00
Rolf vandeVaart	34d1f0a585	Add some comments to the #ifdefs for clarity. No functional changes. This commit was SVN r27802.	2013-01-13 16:08:48 +00:00
Alex Mikheev	344d407ed4	fixed compilation warning always send signalled when BTL_OPENIB_FAILOVER is defined This commit was SVN r27801.	2013-01-13 10:11:03 +00:00
Samuel Gutierrez	4c28c8cbd0	New sm BTL initialization take two. This approach is pretty simple. Instead of using the modex or RML to share sm initialization information, have node rank 0 create a file containing initialization information in a well-known place. Then during add_procs, the rest of the node processes requiring sm BTL initialization will just read from that file to complete their initialization. This commit was SVN r27789.	2013-01-11 16:24:56 +00:00
Alex Mikheev	fe672f255f	request signal when sending over SRQ and number of SRQ sd_credits is 0 This commit was SVN r27767.	2013-01-08 14:00:29 +00:00
Samuel Gutierrez	c4acd20eb9	Backout r27739. This commit was SVN r27745. The following SVN revision numbers were found above: r27739 --> open-mpi/ompi@a159bfaf25	2013-01-05 01:54:23 +00:00
Nathan Hjelm	84e34ee0d7	Fix a bug in the uGNI btl that could cause certain descriptor callbacks to be called twice. There was a race condition in the eager get protocol where the RDMA complete message could be received before the local completion of the SMSG message that started the eager get protocol. cmr:v1.7 This commit was SVN r27740.	2013-01-03 23:11:13 +00:00
Samuel Gutierrez	a159bfaf25	sm BTL initialization via modex, as discussed at last year's meeting. This commit was SVN r27739.	2013-01-03 21:52:20 +00:00
Mike Dubman	b6d50a5733	Performance optimizations by alexm: * btl sendi(): if message can be send inline try to avoid signal * signal is requested one per 64 or when there are no send wqes when message can not be send inline any other btl method then sendi() This commit was SVN r27724.	2012-12-26 10:19:12 +00:00
George Bosilca	ed77868984	No need for event.h in the SM BTL. This commit was SVN r27718.	2012-12-23 19:33:53 +00:00
Nathan Hjelm	ba5b2b0540	btl/vader: fix bug in single copy code that could cause ob1 sends to not get marked complete. cmr:v1.7 This commit was SVN r27671.	2012-12-13 23:18:53 +00:00
Nathan Hjelm	3e1b13b13a	Re-add support for old flex (2.5.4a and earlier) while still cleaning up properly in new flex. This commit was SVN r27657.	2012-12-07 00:12:43 +00:00
Brian Barrett	702451111b	Remove Portals 3.3 support This commit was SVN r27656.	2012-12-06 20:11:27 +00:00
Jeff Squyres	c00e6a7abf	Remove the OFUD BTL. It doesn't work, and isn't included in 1.7. An upcoming BTL from Cisco used ofud as a starting point, and should probably be used as a starting point for any future UD-based BTL. And this OFUD BTL is obviously still in history if anyone ever wants to resurrect it. This commit was SVN r27655.	2012-12-06 17:43:28 +00:00
Steve Wise	176a5a9b3b	Update the Chelsio T4 openib device params. This fixes trac:3414 and should be added to cmr:v1.6:reviewer=jsquyres and cmr:v1.7:reviewer=jsquyres This commit was SVN r27648. The following Trac tickets were found above: Ticket 3414 --> https://svn.open-mpi.org/trac/ompi/ticket/3414	2012-11-30 16:32:34 +00:00
Jeff Squyres	ad15fb5437	Developer enhancement: if a BTL component returns a NULL in its array of modules, print a BTL_ERROR and exit(1) (previous behavior was to segv). This at least explicitly tells the developer that their BTL component is behaving badly. This commit was SVN r27634.	2012-11-26 21:19:02 +00:00
Nathan Hjelm	e0f5137e46	add prototypes for lex destroy functions This commit was SVN r27580.	2012-11-09 22:00:27 +00:00
Nathan Hjelm	8658bbc902	instead of relying on yyterminate to clean up the lex context call the destroy functions directly (after closing the file) This commit was SVN r27577.	2012-11-09 16:10:55 +00:00
Nathan Hjelm	7fb5caea92	Remove the finish_parsing function from various .l files. The function is incomplete (doesn't clean up the lex state) and should be replaced by *_yylex_destroy which correctly cleans up the state. Checked with the flex 2.5.35. Verified with valgrind that this fixes several "still reachable" leaks. cmr:v1.7 This commit was SVN r27571.	2012-11-06 19:26:14 +00:00
Nathan Hjelm	bdedd8b0d3	Per RFC modify the behavior of mca_base_components_close to NOT close the output. Modify frameworks to always close their output and set to -1. Reasoning: The old behavior was a little confusing. mca_base_components_open does not open an output stream so it is a little unexpected that mca_base_components_close does. To add to this several frameworks (that don't use mca_base_components_close) failed to close their output in the framework close function and others closed their output a second time. This change is an improvement to the symantics of mca_base_components_open/close as they are now symetric in their functionality. This commit was SVN r27570.	2012-11-06 19:09:26 +00:00
Nathan Hjelm	f3ce12e71a	Per RFC fix several leaks in opal and ompi. Details below. pml/v: - If vprotocol is not being used vprotocol_include_list is leaked. Assume vprotocol never takes ownership (see below) and always free the string. coll/ml: - (patch verified) calling mca_base_param_lookup_string after mca_base_param_reg_string is unnecessary. The call to mca_base_param_lookup_string causes the value returned by mca_base_param_reg_string to be leaked. - Need to free mca_coll_ml_component.config_file_name on component close. btl/openib: - calling mca_base_param_lookup_string after mca_base_param_reg_string is unnecessary. The call to mca_base_param_lookup_string causes the value returned by mca_base_param_reg_string to be leaked. vprotocol/base: - There was no way for pml/v to determine if vprotocol took ownership of vprotocol_include_list. Fix by always never ownership (use strdup). mca/base: - param_lookup will result in storage->stringval to be a newly allocated string if the mca parameter has a string value. ensure this string is always freed. cmr:v1.7 This commit was SVN r27569.	2012-11-06 18:57:46 +00:00
Mike Dubman	d47d550dfc	performance optimization: process completions in the batch manner This commit was SVN r27559.	2012-11-05 14:02:37 +00:00
Jeff Squyres	4569f77645	Remove redundant common_verbs.h include. This commit was SVN r27556.	2012-11-02 14:16:55 +00:00
Mike Dubman	5cdb3654d7	SRQ now supported in ConnectIB This commit was SVN r27552.	2012-11-01 08:13:56 +00:00
Nathan Hjelm	2acd0f83de	Revert "Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter". It appears the problem was not with the command line parser but the rsh plm. I don't know why this problem was not occuring before the command line parser changes but it appears to be resolved now. This commit was SVN r27527. The following SVN revision numbers were found above: r27451 --> open-mpi/ompi@d59034e6ef r27456 --> open-mpi/ompi@ecdbf34937	2012-10-30 19:45:18 +00:00
Ralph Castain	e6014bf2e1	Revert r27451 and r27456 - the cmd line parser is incorrectly marking the application as an MCA parameter This commit was SVN r27477. The following SVN revision numbers were found above: r27451 --> open-mpi/ompi@d59034e6ef r27456 --> open-mpi/ompi@ecdbf34937	2012-10-24 18:38:44 +00:00
Yael Dayan	d6f7e4eb73	openib: modified Mellanox ConnectIB max_inline_data param This commit was SVN r27457.	2012-10-18 15:59:18 +00:00
Nathan Hjelm	d59034e6ef	MCA: remove deprecated mca_base_param functions (mca_base_param_register_int, mca_base_param_register_string, mca_base_param_environ_variable). Remove all uses of deprecated functions. cmr:v1.7 This commit was SVN r27451.	2012-10-17 20:17:37 +00:00
Yael Dayan	0122cf6cbb	openib: added Mellanox ConnectIB device ID and params to the device parameters ini file This commit was SVN r27402.	2012-10-04 13:20:47 +00:00
Jeff Squyres	8c369224bf	More common/verbs improvements: * Add OMPI_COMMON_VERBS_FLAGS_NOT_RC, which looks for a device that does ''not'' support RC * Add ompi_common_verbs_find_max_inline(), and remove that code from the openib BTL component This commit was SVN r27393.	2012-10-03 00:57:39 +00:00
Jeff Squyres	30d9c36275	FreeBSD detection improvement. Thanks to Brooks Davis for the patch. This commit was SVN r27334.	2012-09-13 13:25:04 +00:00
Jeff Squyres	3cc8b0461a	More updates to common verbs infrastructure: * Moved "check basics" sanity check from openib BTL to common/verbs (which also allows us to have openib ''not'' include <infiniband/driver.h>, which is a Very Good Thing) * Add new ompi_common_verbs_qp_test() function, which tests to see whether a device supports RC and/or UD QPs. The openib BTL now uses this function to ensure that the device supports RC QPs. * Rename ompi_common_verbs_find_ibv_ports() to be ompi_common_verbs_find_ports() -- the "ibv" was redundant. * Re-work ompi_common_verbs_find_ports() to use ompi_common_verbs_qp_test() instead of testing for RC/UD QPs itself * Add bunches of opal_output_verbose() to the find_ports() routine (to help diagnosing connectivity problems -- imaging running with --mca btl_base_verbose 10; you'll see all the find_ports() test results) * Make ompi_common_verbs_qp_test() warn if devices/ports are supplied in the if_include/if_exclude strings that do not exists (quite similar to what the openib BTL does today). * Add ompi_common_verbs_mca_register() function, which registers common verbs MCA params. It will also register MCA param synonyms for thse MCA params to upper-level components (e.g., btl_<upper-level-component>_<the-mca-param>). * common_verbs_warn_nonexistent_if: warn if if_include/if_exclude-specified devices or ports do not exist. This commit was SVN r27332.	2012-09-12 20:47:47 +00:00
Jeff Squyres	fb2e543a57	Refs trac:3275. We ran into a case where the OMPI SVN trunk grew a new acceptable MCA parameter value, but this new value was not accepted on the v1.6 branch (hwloc_base_mem_bind_failure_action -- on the trunk it accepts the value "silent", but on the older v1.6 branch, it doesn't). If you set "hwloc_base_mem_bind_failure_action=silent" in the default MCA params file and then accidentally ran with the v1.6 branch, every OMPI executable (including ompi_info) just failed because hwloc_base_open() would say "hey, 'silent' is not a valid value for hwloc_base_mem_bind_failure_action!". Kaboom. The only problem is that it didn't give you any indication of where this value was being set. Quite maddening, from a user perspective. So we changed the ompi_info handles this case. If any framework open function return OMPI_ERR_BAD_PARAM (either because its base MCA params got a bad value or because one of its component register/open functions return OMPI_ERR_BAD_PARAM), ompi_info will stop, print out a warning that it received and error, and then dump out the parameters that it has received so far in the framework that had a problem. At a minimum, this will show the user the MCA param that had an error (it's usually the last one), and ''where it was set from'' (so that they can go fix it). We updated ompi_info to check for O???_ERR_BAD_PARAM from each from the framework opens. Also updated the doxygen docs in mca.h for this O???_BAD_PARAM behavior. And we noticed that mca.h had MCA_SUCCESS and MCA_ERR_??? codes. Why? I think we used them in exactly one place in the code base (mca_base_components_open.c). So we deleted those and just used the normal OPAL_* codes instead. While we were doing this, we also cleaned up a little memory management during ompi_info/orte-info/opal-info finalization. Valgrind still reports a truckload of memory still in use at ompi_info termination, but they mostly look to be components not freeing memory/resources properly (and outside the scope of this fix). This commit was SVN r27306. The following Trac tickets were found above: Ticket 3275 --> https://svn.open-mpi.org/trac/ompi/ticket/3275	2012-09-11 20:47:24 +00:00
Shiqing Fan	ec4cf39925	Windows doesn't need to exclude any interface by default. This will avoid tcp warnings. This commit was SVN r27291.	2012-09-11 15:39:37 +00:00
Shiqing Fan	0c4c2a5f5d	Revert r27283. A better solution is found. Thanks to Ralph anyway. This commit was SVN r27290. The following SVN revision numbers were found above: r27283 --> open-mpi/ompi@38bcd86ae4	2012-09-11 15:37:22 +00:00
Ralph Castain	38bcd86ae4	Per request by Shiqing, specifically exclude the "lo" interface from the TCP btl. Apparently, Windows sometimes fails to resolve the 127.0.0.1 to "lo", causing subsequent failures. This commit was SVN r27283.	2012-09-10 16:22:46 +00:00
Jeff Squyres	dd254cc202	OMPI_HAVE_IBV_LINK_LAYER does not exist. Instead, check defined(HAVE_IBV_LINK_LAYER_ETHERNET). This commit was SVN r27251.	2012-09-06 18:25:36 +00:00
Jeff Squyres	341ce2f9a4	Per some discussions between LANL, Cisco, ORNAL, and Mellanox, move some new common OpenFabrics functionality to ompi/mca/common/verbs. Also move everything that was in ompi/mca/common/ofautils under ompi/mca/common/verbs. * Move ofautils -> verbs * Add new functionality in ompi/mca/common/verbs (see doxygen * comments in ompi/mca/common/verbs/common_verbs.h for details): * ompi_common_verbs_find_ibv_ports() * ompi_common_verbs_port_bw() * ompi_common_verbs_mtu() * '''If you're writing verbs-based code, you should be using this common functionality''' * Adapt openib BTL to use some trivial common functionality in common/verbs * Don't use "#ifdef OMPI_HAVE_RDMAOE",use "#if defined(HAVE_IBV_LINK_LAYER_ETHERNET)" * Update the following to include/link against common/verbs * bcol/iboffload * sbgp/ibnet * btl/openib This commit was SVN r27212.	2012-09-01 01:42:37 +00:00
Jeff Squyres	e5babf830a	Fixes trac:3258: add btl_openib_abort_not_enough_reg_mem MCA parameter that causes MPI jobs to abort if there is not enough registered memory available (vs. just warning). This commit was SVN r27140. The following Trac tickets were found above: Ticket 3258 --> https://svn.open-mpi.org/trac/ompi/ticket/3258	2012-08-25 11:39:06 +00:00
Jeff Squyres	c8cee23ee7	Priorities really shouldn't be less than 0. This commit was SVN r27098.	2012-08-21 15:47:15 +00:00
Ralph Castain	dacb07000d	Turn udcm and ud oob off by default, but allow them to build and be used if someone wants to test them cmr:v1.7 This commit was SVN r27097.	2012-08-21 15:18:34 +00:00
Ralph Castain	69753c37ef	Turn off one place that won't compile if ompi progress threads enabled because it calls a non-existent function This commit was SVN r27082.	2012-08-16 22:53:14 +00:00
Yael Dayan	b3b8a2a23a	function mca_btl_openib_endpoint_post_send can return 3 statuses: - OMPI_SUCCESS - OMPI_ERROR - OMPI_ERR_RESOURCE_BUSY If an "OMPI_ERR_OUT_OF_RESOURCE" occurs, the request is added to the pending list, and will be handled later. An error message should not be printed to the user in this case. This is not an error, but rather a notification of a possible valid condition. Only in the case of "OMPI_ERROR" should it be printed to the user. This commit was SVN r27065.	2012-08-16 07:04:40 +00:00
Christopher Yeoh	cc091f4979	Adds synchronisation between main thread and service thread in btl_openib_connect_udcm when notifying not to listen to an fd to ensure that the main thread does not continue until the service thread has processed the message Adds ability to send message to openib async thread to tell it to ignore the ERR state on a specific QP. Adds this call to udcm_module_finalize so when we set the error state on the QP it doesn't cause the openib async thread to abort the mpi program prematurely Fixes trac:3161 This commit was SVN r27064. The following Trac tickets were found above: Ticket 3161 --> https://svn.open-mpi.org/trac/ompi/ticket/3161	2012-08-16 03:56:21 +00:00
Jeff Squyres	02e2c88224	Back out r26869 (i.e., put back a single per-peer QP in the default receive queues value) so that we don't break the use of RDMA CM, and therefore break RoCE. This commit was SVN r27017. The following SVN revision numbers were found above: r26869 --> open-mpi/ompi@fe0e7f81df	2012-08-13 15:57:21 +00:00
Samuel Gutierrez	6188d97e1a	Getting out of bed this morning was a bad idea... Reverting the sm update once more because it breaks direct launch. Will address this issue and commit the update once it has all been tested. Sorry everyone! This commit was SVN r27001.	2012-08-10 22:20:38 +00:00
Samuel Gutierrez	159bd2e62e	Let's try this again: sm BTL initialization via modex. This commit was SVN r26989.	2012-08-10 20:12:36 +00:00
Samuel Gutierrez	6a70063812	Yikes - that's not right! Back out 26987. I'll try again in a bit... Sorry! This commit was SVN r26988.	2012-08-10 19:57:51 +00:00
Samuel Gutierrez	2c80273246	sm BTL initialization via modex. This commit was SVN r26987.	2012-08-10 19:51:41 +00:00
Yael Dayan	7895cd1114	adding a fragmentation mechanism to the Get flow in function mca_pml_ob1_recv_request_progress_rget This commit was SVN r26956.	2012-08-07 07:15:21 +00:00
Yevgeny Kliteynik	a6458da4ba	Using 8K as a minimal CQ length - For now we'll use 8192 as a base value - We leave the adjust_cq() as is - For the long term we can work on an appropriate setting to expose through the INI file. 8K CQEs are 512K per process, which is 8MB for ppn=16 This commit was SVN r26877.	2012-07-26 21:06:18 +00:00
Nathan Hjelm	8736953c7f	btl/openib/connect improve the help message printed when a queue pair can not be created This commit was SVN r26876.	2012-07-26 20:36:46 +00:00
Shiqing Fan	204fbfe4b1	update the wv btl component. This commit was SVN r26872.	2012-07-26 15:35:01 +00:00
Nathan Hjelm	fe0e7f81df	btl/openib: as discussed remove the per-peer queue pair from the default configuration This commit was SVN r26869.	2012-07-25 22:53:58 +00:00
Jeff Squyres	5ec6a65a72	After I spent a while looking in libibverbs for ibv_get_device_list_compat() and not finding it, I finally realized that it was a function in OMPI. So let's name it with a proper ompi_ prefix, not an ibv_ prefix. This commit was SVN r26867.	2012-07-25 16:32:51 +00:00
Jeff Squyres	6f5fd6245f	Add missing %d This commit was SVN r26857.	2012-07-24 13:33:11 +00:00
Jeff Squyres	0b4a659683	Stomp some compiler warnings; use proper printf sequences for uint64_t. This commit was SVN r26856.	2012-07-24 13:03:55 +00:00
Jeff Squyres	e66d386441	Add a new missing field to the template BTL module that was causing a bunch of compiler warnings. This commit was SVN r26855.	2012-07-24 12:55:12 +00:00
George Bosilca	6ebbacb054	Complete the dump function for the SM BTL. Now we can see all fragments in all the queues as long as the BTL is dump-friendly (only SM right now). This commit was SVN r26849.	2012-07-24 00:22:22 +00:00

... 2 3 4 5 6 ...

2066 Коммитов