openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	72501f8f9c	Consistent return from all progress functions. This fix ensures that all progress functions return the number of completed events. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2020-01-28 20:16:53 +01:00
Joseph Schuchart	2c97187ee0	Harmonize return values of progress callbacks Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2020-01-28 20:15:03 +01:00
Austen Lauria	1275766037	Merge pull request #7207 from devreal/remove_shmem_seg_hdr Remove unused opal_shmem_seg_hdr_t to retain alignment	2020-01-28 13:57:55 -05:00
Aurelien Bouteiller	9f4365fef6	Merge pull request #6783 from abouteiller/export/macos-epipe Prevent EPIPE on OSX.	2020-01-28 11:18:46 -05:00
Brian Barrett	d768d82231	Merge pull request #7167 from wckzhang/reachable_netlinks Reachable documentation change	2020-01-27 15:39:14 -08:00
Brian Barrett	fc8c7a5869	Merge pull request #7134 from wckzhang/btl_tcp_interface_match btl tcp: Use reachability and graph solving for global interface matching	2020-01-27 15:38:49 -08:00
Austen Lauria	10f6a77640	Merge pull request #7315 from abouteiller/export/tcp_errors_v2 Handle error cases in TCP BTL (v2)	2020-01-27 17:03:07 -05:00
Aurelien Bouteiller	76021e35ee	Adding a description of the FIN message for future reference. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-01-27 13:32:34 -05:00
Aurelien Bouteiller	93846fd0ee	Remove the pending event when socket is TCP_FAILED Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-01-27 13:32:34 -05:00
Aurelien Bouteiller	6b3be224d4	Adding a FIN message to differentiate normal TCP closing from failures Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-01-27 13:32:34 -05:00
Aurelien Bouteiller	b7be64482a	Revert "Revert "Handle error cases in TCP BTL"" This reverts commit 51620114281720c9e375b95cadc45272b3135837. Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2020-01-27 13:31:06 -05:00
William Zhang	e958f3cf22	btl tcp: Use reachability and graph solving for global interface matching Previously we used a fairly simple algorithm in mca_btl_tcp_proc_insert() to pair local and remote modules. This was a point in time solution rather than a global optimization problem (where global means all modules between two peers). The selection logic would often fail due to pairing interfaces that are not routable for traffic. The complexity of the selection logic was Θ(n^n), which was expensive. Due to poor scalability, this logic was only used when the number of interfaces was less than MAX_PERMUTATION_INTERFACES (default 8). More details can be found in this ticket: https://svn.open-mpi.org/trac/ompi/ticket/2031 (The complexity estimates in the ticket do not match what I calculated from the function) As a fallback, when interfaces surpassed this threshold, a brute force O(n^2) double for loop was used to match interfaces. This commit solves two problems. First, the point-in-time solution is turned into a global optimization solution. Second, the reachability framework was used to create a more realistic reachability map. We switched from using IP/netmask to using the reachability framework, which supports route lookup. This will help many corner cases as well as utilize any future development of the reachability framework. The solution implemented in this commit has a complexity mainly derived from the bipartite assignment solver. If the local and remote peer both have the same number of interfaces (n), the complexity of matching will be O(n^5). With the decrease in complexity to O(n^5), I calculated and tested that initialization costs would be 5000 microseconds with 30 interfaces per node (Likely close to the maximum realistic number of interfaces we will encounter). For additional datapoints, data up to 300 (a very unrealistic number) of interfaces was simulated. Up until 150 interfaces, the matching costs will be less than 1 second, climbing to 10 seconds with 300 interfaces. Reflecting on these results, I removed the suboptimal O(n^2) fallback logic, as it no longer seems necessary. Data was gathered comparing the scaling of initialization costs with ranks. For low number of interfaces, the impact of initialization is negligible. At an interface count of 7-8, the new code has slightly faster initialization costs. At an interface count of 15, the new code has slower initialization costs. However, all initialization costs scale linearly with the number of ranks. In order to use the reachable function, we populate local and remote lists of interfaces. We then convert the interface matching problem into a graph problem. We create a bipartite graph with the local and remote interfaces as vertices and use negative reachability weights as costs. Using the bipartite assignment solver, we generate the matches for the graph. To ensure that both the local and remote process have the same output, we ensure we mirror their respective inputs for the graphs. Finally, we store the endpoint matches that we created earlier in a hash table. This is stored with the btl_index as the key and a struct mca_btl_tcp_addr_t* as the value. This is then retrieved during insertion time to set the endpoint address. Signed-off-by: William Zhang <wilzhang@amazon.com>	2020-01-21 18:24:08 +00:00
Jeff Squyres	ed753afbc0	hwloc2: advance hwloc git submodule Advance to hwloc-2.1.0rc2-33-g38433c0f, which includes a .gitignore update that we want here in Open MPI. Be warned; this is actually 33 commits beyond the hwloc v2.1.0 tag. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-01-21 09:42:39 -08:00
Nathan Hjelm	037b0bd9ee	Merge pull request #7304 from hjelmn/btl_vader_fix_max_address_on_aarch64 btl/vader: modify how the max attachment address is determined	2020-01-15 17:02:33 -07:00
Nathan Hjelm	728d51f9f3	btl/vader: modify how the max attachment address is determined This PR removes the constant defining the max attachment address and replaces it with the largest address that shows up in /proc/self/maps. This should address issues found on AARCH64 where the max address may differ based on the configuration. Since the calculated max address may differ between processes the max address is sent as part of the modex and stored in the endpoint data. Signed-off-by: Nathan Hjelm <hjelmn@google.com>	2020-01-14 15:15:36 -08:00
Nathan Hjelm	61f96b3d6d	Merge pull request #7283 from hjelmn/fix_issues_in_both_vader_and_opal_interval_tree_t_that_were_causing_issue_6524 Fix issues in both vader and opal interval tree t that were causing issue 6524	2020-01-14 14:54:51 -07:00
Jeff Squyres	25931ea8bf	Merge pull request #7200 from cpshereda/master-opal_gethostname-change Fix unsafe use of gethostname()	2020-01-13 16:07:43 -05:00
Charles Shereda	cbc6feaab2	Created opal_gethostname() as safer gethostname substitute. The opal_gethostname() function provides a more robust mechanism to retrieve the hostname than gethostname(), which can return results that are not null-terminated, and which can vary in its behavior from system to system. opal_gethostname() just returns the value in opal_process_info.nodename; this is populated in opal_init_gethostname() inside opal_init.c. -Changed all gethostname calls in opal subtree to opal_gethostname -Changed all gethostname calls in orte subtree to opal_gethostname -Changed all gethostname calls in ompi subdir to opal_gethostname -Changed all gethostname calls in oshmem subdir to opal_gethostname -Changed opal_if.c in test subdir to use opal_gethostname -Changed opal_init.c to include opal_init_gethostname. This function returns an int and directly sets opal_process_info.nodename per jsquyres' modifications. Relates to open-mpi#6801 Signed-off-by: Charles Shereda <cpshereda@lanl.gov>	2020-01-13 08:52:17 -08:00
Jeff Squyres	21bc9042e1	mtl/ofi: check for FI_LOCAL_COMM+FI_REMOTE_COMM Make sure to get an RDM provider that can provide both local and remote communication. We need this check because some providers could be selected via RXD or RXM, but can't provide local communication, for example. Add OPAL_CHECK_OFI_VERSION_GE() m4 macro to check that the Libfabric we're building against is >= a target version. Use this check in two places: 1. MTL/OFI: Make sure it is >= v1.5, because the FI_LOCAL_COMM / FI_REMOTE_COMM constants were introduced in Libfabric API v1.5. 2. BTL/usnic: It already had similar configury to check for Libfabric >= v1.1, but the usnic component was checking for >= v1.3. So update the btl/usnic configury to use the new macro and check for >= v1.3. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2020-01-13 08:19:53 -08:00
Brian Barrett	f853971bc1	Merge pull request #6821 from jsquyres/pr/make-hwloc201-tarball-a-submodule hwloc v2.1.0: use a git submodule	2020-01-08 07:41:37 -08:00
Nathan Hjelm	f86f805be1	btl/vader: fix issues with xpmem registration invalidation This commit fixes an issue discovered in the XPMEM registration cache. It was possible for a registration to be invalidated by multiple threads leading to a double-free situation or re-use of an invalidated registration. This commit fixes the issue by setting the INVALID flag on a registation when it will be deleted. The flag is set while iterating over the tree to take advantage of the fact that a registration can not be removed from the VMA tree by a thread while another thread is traversing the VMA tree. References #6524 References #7030 Closes #6534 Signed-off-by: Nathan Hjelm <hjelmn@google.com>	2020-01-07 22:50:52 -07:00
Eisuke Kawashima	d26d4e1d63	Fix typo and update URLs (https, redirection) [skip ci] Signed-off-by: Eisuke Kawashima <e-kwsm@users.noreply.github.com>	2020-01-07 03:52:25 +09:00
Jeff Squyres	a2a9a9516b	hwloc2: bump up to hwloc v2.1.0 Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-12-24 16:01:03 -08:00
Jeff Squyres	c292e759da	hwloc2: bump up to hwloc 2.0.4 Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-12-24 16:01:03 -08:00
Jeff Squyres	e5722acc37	hwloc201: replace with "hwloc2" component+git submodule Rename the component to be "hwloc2" (since it can now be any v2.x.y version of hwloc), and make the embedded copy of hwloc be a git submodule. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-12-24 16:01:03 -08:00
Jeff Squyres	18c3e1af5e	hwloc: clarify --with-hwloc behavior Clarify in README what --with-hwloc does in its different use cases. Also, ensure that the behavior when specifying `--with-hwloc` is the same as if that option is not specified at all. This is what we did in Open MPI <= v3.x; looks like we inadvertantly caused `--with-hwloc` to be synonymous with `--with-hwloc=external` in v4.0.0. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-12-19 08:38:57 -08:00
Todd Kordenbrock	1af6dbe277	Merge pull request #7066 from tkordenbrock/topic/master/portals4.fix.flowcontrol.bugs portals4: fix flow control bugs	2019-12-11 06:31:26 -06:00
Maxwell Coil	52a9cce6f3	memory/patcher: fix compiler warning syscall() returns a long, but we are invoking shmat(), which returns a void*. Signed-off-by: Maxwell Coil <mcoil@nd.edu>	2019-12-08 13:56:00 -05:00
Jeff Squyres	53ebea12aa	mpool/base: fix basic mpool_base() function The prior implementation was simply wrong. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-12-05 18:57:37 -05:00
William Zhang	a471f8749f	reachable: Update documentation on reachable function We have decided to show interfaces that are identical to itself as reachable. This is consistent with the previous netmask logic when determining reachability. Signed-off-by: William Zhang <wilzhang@amazon.com>	2019-12-02 18:04:40 +00:00
William Zhang	ce40436895	reachable/netlink: Show an interface as reachable to itself Due to the way netlinks detects reachability, it will not show an interface as reachable to itself, even if it can pass through a loopback interface. To maintain similar behavior with netmasks, we display an interface as reachable to itself. Signed-off-by: William Zhang <wilzhang@amazon.com>	2019-12-02 18:04:40 +00:00
Joseph Schuchart	ee80babe5c	Remove unused opal_shmem_seg_hdr_t to retain alignment Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2019-11-29 08:40:14 +01:00
Geoffroy Vallee	de6f130b4a	Add the missing code to check a return code Signed-off-by: Geoffroy Vallee <geoffroy.vallee@gmail.com>	2019-11-24 11:04:10 -05:00
Austen Lauria	edcd6d8aeb	Merge pull request #7146 from bgoglin/master fix typos hlwoc->hwloc	2019-11-13 15:38:00 -05:00
Nathan Hjelm	09dd383f8b	Merge pull request #7108 from devreal/btl-ugni-deadlock uGNI: Fix potential deadlock when processing outstanding transfers	2019-11-11 10:56:56 -08:00
Howard Pritchard	9d345d9aa0	btl/uct: add UCT API version check to configury related to #7128 The UCX crew is no longer guaranteeing that the UCT API is going to be frozen, so this is kind of a whack-a-mole problem trying to keep the BTL UCT working with various changing UCT APIs. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2019-11-06 14:27:58 -07:00
Brice Goglin	5c6bd7ea4e	fix typos hlwoc->hwloc Signed-off-by: Brice Goglin <Brice.Goglin@inria.fr>	2019-11-06 10:42:36 +01:00
Nathan Hjelm	a3026c016a	btl/uct: fix compilation for UCX 1.7.0 Ref #7128 Signed-off-by: Nathan Hjelm <hjelmn@google.com>	2019-11-05 12:53:26 -08:00
George Bosilca	476562752f	Correctly report TCP connect errors. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2019-10-31 18:33:15 -04:00
Joseph Schuchart	c09ca039b4	uGNI: Fix potential deadlock when processing outstanding transfers Signed-off-by: Joseph Schuchart <schuchart@hlrs.de>	2019-10-26 12:21:17 +02:00
Austen Lauria	aa8be9c12d	Merge pull request #6284 from devreal/ompi-rdma-memalign Ensure proper alignment of memory provided by MPI	2019-10-25 12:27:58 -04:00
Nathan Hjelm	b1ef5a40fa	Merge pull request #7016 from hjelmn/fix_btl_uct_from_yet_another_unannounced_api_break_in_the_openucx_uct_layer btl/uct: add support for OpenUCX v1.8 API changes	2019-10-17 06:27:18 -07:00
Jeff Squyres	b6c4d5c118	Merge pull request #7060 from jsquyres/pr/usnic-mca-updates BTL usnic MCA updates	2019-10-15 10:48:10 -04:00
Stanislav Kirillov	0e0763e006	fix ipv6 btl connection bug Signed-off-by: Stanislav Kirillov <staskirillof@yandex.ru>	2019-10-10 11:20:37 +00:00
Todd Kordenbrock	f7e74b6a3d	btl-portals4: fix a flow control configure bug This commit fixes a configure bug that caused flow control to be disabled regardless of the configure options used. Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com>	2019-10-09 17:12:56 -05:00
Geoff Paulsen	4e1e6f8972	Merge pull request #6993 from awlauria/fix_warnings_master Fix miscellaneous compiler warnings.	2019-10-09 09:17:02 -05:00
Jeff Squyres	3080033a8c	btl/usnic: set retrans_timeout back down to 5ms Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-10-08 11:17:54 -07:00
Jeff Squyres	132e4cab3b	btl/usnic: set ack_iteration_delay default to 4 It was previously accidentally set to 0. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-10-08 11:17:30 -07:00
Jeff Squyres	fe7f772f21	btl/usnic: properly size freelist items Move the prefix area from the head to the body in relevant size computations. This fixes a problem in high traffic situations where usNIC may have sent from unregistered memory. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-10-04 14:40:56 -07:00
Jeff Squyres	27e3040dfe	btl/usnic: cap the number of resends per progress iteration New MCA param: btl_usnic_max_resends_per_iteration. This is the max number of resends we'll do in a single pass through usNIC component progress. This prevents progress from getting stuck in an endless loop of retransmissions (i.e., if more retransmissions are triggered during the sending of retransmissions). Specifically: we need to leave the resend loop to allow receives to happen (which may ACK messages we have sent previously, and therefore cause pending resends to be moot). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2019-10-04 13:05:51 -07:00

1 2 3 4 5 ...

3810 Коммитов