openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	38ed70de6f	ompi_mpi_finalize: remove some dead code Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-06-01 13:37:20 -07:00
Jeff Squyres	35438ae9b5	mpi/finalized: revamp INITIALIZED/FINALIZED Per MPI-3.1:8.7.1 p361:11-13, it's valid for MPI_FINALIZED to be invoked during an attribute destruction callback (e.g., during the destruction of keyvals on MPI_COMM_SELF during the very beginning of MPI_FINALIZE). In such cases, MPI_FINALIZED must return "false". Prior to this commit, we hung in FINALIZED if it were invoked during a COMM_SELF attribute destruction callback in FINALIZE. See https://github.com/open-mpi/ompi/issues/5084. This commit converts the MPI_INITIALIZED / MPI_FINALIZED infrastructure to use a single enum (ompi_mpi_state, set atomically) to represent the state of MPI: - not initialized - init started - init completed - finalize started - finalize past COMM_SELF destruction - finalize completed The "finalize past COMM_SELF destruction" state is what allows us to return "false" from MPI_FINALIZED before COMM_SELF has been fully destroyed / all attribute callbacks have been invoked. Since this state is checked at nearly every MPI API call (to see if we're outside of the INIT/FINALIZE epoch), care was taken to use atomics to set the ompi_mpi_state value in ompi_mpi_init() and ompi_mpi_finalize(), but performance-critical code paths can simply read the variable without needing to use a slow call to an opal_atomic_*() function. Thanks to @AndrewGaspar for reporting the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-06-01 13:36:29 -07:00
Edgar Gabriel	52bd606294	fcoll/dynamic_gen2: make sure that intermediate variables can hold the offset for very large offsets, ome ariables used in the fcoll/dynamic_gen2 code base were under certain circumstances not large enough to hold intermediate values. This issue was more detected in the vulcan component but could happen in the dynamic_gen2 component as well. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-06-01 06:53:38 -05:00
Gilles Gouaillardet	9f7586465d	fortran/mpif-h: fix MPI1 compatibility Makefile appends MPI1 compatible source files instead of redefining all the source files fix a typo from open-mpi/ompi@89da9651bb Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-06-01 09:52:22 +09:00
Nathan Hjelm	b323655809	mpi: make C++ bindings compile when MPI-1 compat is disabled Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-31 09:44:19 -06:00
Nathan Hjelm	89da9651bb	ompi: disable functions removed from MPI-3.0 by default This commit adds a new configure option: --enable-mpi1-compat. Without this option we will no longer provide APIs, typedefs, and defines that were removed from the standard in MPI-3.0. This option will exist for one major release (Open MPI v4.x.x) and then the option and associated code will be removed in Open MPI v5.x.x. Open MPI has already internally prepared for this change. Please prepare your codes accordingly. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-31 09:44:19 -06:00
KAWASHIMA Takahiro	04b509b2d2	Merge pull request #5207 from t-kurita/pr/java-doc-descriptions java: Improve descriptions of `javadoc`	2018-05-31 16:12:21 +09:00
Kurita, Takehiro	11ae771b82	java: Improve descriptions of `javadoc` - Improve descriptions - Fix some typos - Remove MPI-1 functions and replace them with MPI-2 functions Signed-off-by: Kurita, Takehiro <fj6370fp@aa.jp.fujitsu.com>	2018-05-31 15:02:35 +09:00
Jeff Squyres	25f2d02c61	fcoll/dynamic_gen2: minor compiler warning stomp Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-30 10:08:19 -07:00
Jeff Squyres	2dce549df2	ompi/debuggers: stomp a compiler warning in dlopen_test.c Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-30 10:08:14 -07:00
Nathan Hjelm	e9de42544e	osc/sm: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_sm_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:44:01 -06:00
Nathan Hjelm	d0d59b1d7d	osc/rdma: add support for controlling location of backing store This commit adds a new MCA variable to set the location of the backing store: osc_rdma_backing_directory. The default on Linux has been changed to use /dev/shm to improve performance in cases where /tmp is not a tmpfs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-29 21:43:33 -06:00
Howard Pritchard	5b7c866f59	osc/pt2pt: disable when THREAD_MULTIPLE. Per discussion at https://github.com/open-mpi/ompi/issues/2614#issuecomment-392815654, do not allow for selection of the OSC PT2PT when creating an MPI RMA window when THREAD_MULTIPLE is active. Print a helpful message and return a not-supported error. Signed-off-by: Howard Pritchard <howardp@lanl.gov> Signed-off-by: Jeff Squyres <jsquyres@cisco.com> (cherry picked from commit d0ffd660841623c02d1dfa3151e7f7afd3327698) Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-29 08:59:53 -07:00
Mikhail Kurnosov	28d5837dd9	coll: reduce_scatter_block: add butterfly algorithm Implements butterfly algorithm for MPI_Reduce_scatter_block. The algorithm can be used both by commutative and non-commutative operations, for power-of-two and non-power-of-two number of processes. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-27 14:17:41 +07:00
Edgar Gabriel	6b03cee7f1	io/ompio: erroneous condition in selecting aggregator selection logic fix the logic in the decision which aggregator selection algorithm to use. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-05-24 15:52:19 -05:00
John L. Jolly	36b9e15fb7	- Build warning: stringop-overflow in get_dynamic_win_info() at osc_ucx_comm.c In file included from /usr/include/string.h:494:0, from ../../../../ompi/info/info.h:29, from ../../../../ompi/mca/osc/base/base.h:24, from osc_ucx_comm.c:13: In function 'memcpy', inlined from 'get_dynamic_win_info' at osc_ucx_comm.c:359:5, inlined from 'ompi_osc_ucx_put' at osc_ucx_comm.c:401:18: /usr/include/bits/string_fortified.h:34:10: warning: '__builtin___memcpy_chk' writing 8 bytes into a region of size 4 overflows the destination [-Wstringop-overflow=] return __builtin___memcpy_chk (__dest, __src, __len, __bos0 (__dest)); ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This is caused by a type size mismatch in a call to memcpy This fix corrects the type definition of the win_count variable. Signed-off-by: John Jolly <jjolly@suse.com>	2018-05-22 10:11:57 -06:00
Brian Barrett	09e4c40ce9	mtl: remove MXM MTL Remove the MXM MTL, which has been deprecated in preference for the Yalla PML. This was discussed at the last developers meeting and somehow I ended up with the action item to do the removal. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2018-05-21 14:18:30 -07:00
Sergey Oblomov	5ec26914a6	PML/UCX: do not set offset on ordered data recv Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-21 19:40:07 +03:00
Sergey Oblomov	19607daa32	PML/UCX: create convertor clone instead of stack reset Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 16:39:13 +03:00
Sergey Oblomov	7c5de01c57	PML/UCX: reset converter stack on unordered messages Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-05-17 13:11:02 +03:00
Jeff Squyres	9f21ea437c	java: clean up MPI Java configury The Java configury is split into two parts: 1. Determine if we want MPI Java bindings. 2. Find the Java compiler (and related). This commit does a few things: - Move the "Find the Java compiler" step from OPAL to OMPI (because there is no Java in OPAL, and there doesn't appear to be any immanent danger that there will be). - As a direct consequence, remove the --enable-java CLI option (--enable-mpi-java still remains). Enabling the MPI Java bindings and enabling Java are now considered the same thing (since there is no Java elsewhere in the code base, the different was meaningless). - Only invoke the "Find the Java compiler" step if we actually want the MPI Java bindings. - A few miscellaneous Java-related cleanups in configury (E.g., change testing "$foo" == "1" to $foo -eq 1, etc. This commit is mostly s/opal/ompi/gi in many places in configury and shifting code around. But it looks bigger than it actually is because of two reasons: 1. Some files were renamed: * ompi_setup_java.m4 -> ompi_setup_mpi_java.m4 (setup MPI Java bindings) * opal_setup_java.m4 -> ompi_setup_java.m4 (setup Java compiler) 2. Indenting level changed in (the new) ompi_setup_java.m4. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-05-15 15:15:22 -07:00
George Bosilca	7191ea120c	Fix merge conflict related to function renaming. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-05-15 11:34:20 -04:00
bosilca	2ab628b92e	Merge pull request #5074 from bosilca/topic/remove_warnings Remove warnings identified by clang.	2018-05-15 11:15:23 -04:00
bosilca	d13b9a2e25	Merge pull request #5156 from ggouaillardet/topic/reduce_scatter_block coll: reduce_scatter_block: rename and MCA parameter description fix	2018-05-15 11:13:26 -04:00
Mikhail Kurnosov	82299a9c04	coll: reduce_scatter_block: add recursive halving algorithm Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-15 08:20:32 +07:00
Gilles Gouaillardet	ce7b3113f6	coll: reduce_scatter_block: rename and MCA parameter description fix - rename ompi_coll_base_reduce_scatter_block_basic to more self descriptive ompi_coll_base_reduce_scatter_block_basic_linear - fix the description of the coll_tuned_reduce_scatter_block_algorithm MCA param this fixes and documents previous open-mpi/ompi@0e8b35b615 MPI_Reduce_scatter_block used to be implemented by the coll/basic module only. A new algo (recursive doubling) was recently introduced and can be used via the coll/tuned module, but we never intended to make it the default algo. In order to "restore" the previous default, the initial algo was moved from coll/basic to coll/base, and is now used by default by coll/tuned. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-09 08:54:48 +09:00
Nathan Hjelm	cf585d725c	osc/rdma: fix SEGV will null origin in FOP in debug build Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-08 14:10:20 -06:00
Jeff Squyres	b39bbfb3c0	Merge pull request #5142 from mkurnosov/base-reduce-remove-warnings coll/base/reduce: remove warning identified by Coverity Scan	2018-05-07 15:49:56 -04:00
Gilles Gouaillardet	0e8b35b615	coll/tuned: use basic algo for reduce_scatter_block by default Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:44 +09:00
Gilles Gouaillardet	32095be0d6	coll/{base,basic}: move reduce_scatter_block from basic to base Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-07 16:11:38 +09:00
Mikhail Kurnosov	ba968e4490	coll/base/reduce: Remove warning identified by Coverity Scan Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-04 20:48:37 +07:00
Mikhail Kurnosov	8cf8553abd	Resolve merge conflicts Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-05-03 07:28:32 +07:00
Nathan Hjelm	c22c485837	Merge pull request #5136 from hjelmn/mtl_fix mtl: reset ompi_mtl_base_selected_component on framework close	2018-05-02 15:48:54 -06:00
Joshua Ladd	32ddc6af7e	Merge pull request #5094 from xinzhao3/topic/osc-win-fix-master OMPI/OSC/UCX: fix issue in impl of MPI_Win_create_dynamic/MPI_Win_attach/MPI_Win_detach	2018-05-02 17:42:34 -04:00
Nathan Hjelm	f432d07844	mtl: reset ompi_mtl_base_selected_component on framework close Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-02 14:53:34 -06:00
Xin Zhao	3f5ac97649	OMPI/OSC/UCX: set priority to 0. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2018-05-02 21:40:06 +03:00
Yossi Itigin	66d931b7c4	Merge pull request #5116 from yosefe/topic/ucx-connect-errs ucx: improve error messages during connection establishment	2018-05-02 14:04:24 +03:00
Jeff Squyres	b2aaf28a67	Merge pull request #5113 from ThemosTsikas/patch-2 Use standard-conforming dummy names for Fortran bindings	2018-05-02 06:36:08 -04:00
Nathan Hjelm	1aa82c0e08	Merge pull request #5125 from hjelmn/romio_mpi3 io/romio314: fix two more MPI-3 compliance issues	2018-05-01 20:21:57 -06:00
Themos Tsikas	4d126c16fa	mpi/fortran: use conformant dummy names for Fortran bindings The MPI spec defines that the "mpi" and "mpi_f08" module Fortran bindings support passing by parameters by name. Hence, we need to use the MPI-spec-defined parameter names ("dummy variables", in Fortran parlance) for the "mpi" and "mpi_f08" modules. Specifically, Fortran allows calls to procedures to be written with keyword arguments, e.g., "call mpi_sizeof(x=x,size=rsize,ierror=ier)" An "explicit interface" for the procedure must be in scope for this to be allowed in a Fortran program unit. Therefore, the explicit interface blocks we provide in the "mpi" and "mpi_f08" modules must match the MPI published standard, including the names of the dummy variables (i.e., parameter names), as that is how Fortran programs may call them. Note that we didn't find this issue previously because even though the MPI spec allows for name-based parameter passing, not many people actually use it. I suspect that we might have some more incorrect parameter names -- we should probably do a full "mpi" / "mpi_f08" module parameter name audit someday. Thanks to Themos Tsikas for reporting the issue and supplying the initial fix. Signed-off-by: themos.tsikas@nag.co.uk Signed-off-by: Jeff Squyres <jsquyres@cisco.com> Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-05-02 09:42:19 +09:00
Nathan Hjelm	c6475e572f	ompitrace: MPI_Address -> MPI_Get_address Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-01 15:20:02 -06:00
Nathan Hjelm	ae17908f35	io/romio314: fix two more MPI-3 compliance issues Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-05-01 15:18:18 -06:00
Jeff Squyres	72f2c3befe	Merge pull request #5121 from blegat/bl/cartcreatebracket Remove brackets for ndims for c++ doc of Cart_create	2018-05-01 13:37:14 -04:00
Nathan Hjelm	bfd8ee7d57	Merge pull request #4898 from bosilca/topic/dist_graph Allow MPI_PROC_NULL as neighbor.	2018-05-01 09:55:27 -06:00
Nathan Hjelm	380dcb57de	Merge pull request #5072 from bosilca/topic/datatype_add_size_t Allow OPAL DDT to receive size_t count argument.	2018-05-01 09:47:45 -06:00
Nathan Hjelm	e9ef7aa256	Merge pull request #4985 from mkurnosov/spacc-scan-exscan coll/spacc: Add recursive doubling algorithm for Scan and Exscan	2018-05-01 09:21:23 -06:00
Benoît Legat	709c4f6646	Remove brackets for ndims for c++ doc of Cart_create Signed-off-by: Benoît Legat <benoit.legat@gmail.com>	2018-05-01 12:00:55 +02:00
Yossi Itigin	385f38ab4e	ucx: improve error messages during connection establishment Also, unite common code calling ucp_ep_create() Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2018-04-30 15:45:05 +03:00
Ninad Prabhukhanolkar	1518d7e003	Updated aggregate_profile.pl The files array was also storing $phase.prof. This was leading to $phase.prof's output getting dumped into itself again and again. Updated code to initialise files array with files other than $phase.prof. Signed-off-by: Ninad Prabhukhanolkar <ninadchess96@gmail.com>	2018-04-26 20:34:24 +05:30
Edgar Gabriel	300c61f2df	Merge pull request #5093 from edgargabriel/topic/io-configury Topic/io configury	2018-04-25 15:26:37 -05:00
Edgar Gabriel	19b71e4eb6	ompio/fs: add summary of supported file systems Add the list of supported file systems to the summary output add the end of configure Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-04-25 14:39:18 -05:00
Jeff Squyres	5bd02f6649	fortran/tkr: Fix Makefile.am warnings Set mpi.lo dependencies outside of AM conditionals. Per https://github.com/open-mpi/ompi/issues/5085, make mpi.lo depend on whatever we decide the sizeof source files are (which may be empty, if this compiler does not support the Right Stuff for MPI_SIZEOF, or it may be mpi-tkr-sizeof.[h\|f90]). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-04-25 10:24:01 -07:00
Gilles Gouaillardet	2abeada060	fortran: build MPI_Sizeof() interface in use-mpi-tkr bindings whenever possible. Add the missing OMPI_FORTRAN_BUILD_SIZEOF macro to Fortran and add a missing dependency. Thanks Themos Tsikas for reporting this issue. Refs open-mpi/ompi#5085 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-25 15:09:33 +09:00
Xin Zhao	53bdfd1dcb	OMPI/OSC/UCX: fix issue in impl of MPI_Win_create_dynamic/MPI_Win_attach/MPI_Win_detach Signed-off-by: Xin Zhao <xinz@mellanox.com>	2018-04-24 23:09:52 +03:00
Mikhail Kurnosov	787ec8929b	Rename function `rounddown` into `ompi_rounddown` FreeBSD 11 `/sys/param.h` has declaration of `rounddown` Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-24 07:59:49 +07:00
Mikhail Kurnosov	4cbcff7fcd	coll/base: add recursive doubling algorithm for MPI_Reduce_scatter_block Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-23 11:02:31 +07:00
raafatfeki	91e028f7fd	fcoll/dynamic_gen2: Reduce number of realloc calls keep track of the sizeof the blocklen_per_process and displs_per_process on the aggregator datastructure to minimze the number of realloc function calls required in the shuffle_init operation. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-04-20 10:13:57 -05:00
Jeff Squyres	8251139bac	status_set_cancelled: fix F08 binding The F08 bindings for MPI_STATUS_SET_CANCELLED incorrectly had the "flag" dummy parameter set as INTENT(OUT) when it really should be INTENT(IN). On the one hand, this is technically an ABI change. On the other hand, this is an incorrect MPI binding. On the other hand (that's 3 hands for you fans counting at home), this is such a rarely-used API -- even in the C bindings -- that I'm guessing no one is using this API, and therefore no one has noticed this error and it isn't worth porting back to the release branches. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-04-16 17:33:22 -07:00
Nathan Hjelm	84765001aa	io/romio: do not use removed functions This commit attempts to update the romio io component to not use functions removed in MPI-3.0 (2012). This is a first cut and will probably need to be reviewed for correctness. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-04-16 12:06:52 -06:00
Nathan Hjelm	4d876ec6fe	io/romio314: fix minmax datatypes romio assumes that all predefined datatypes are contiguous. Because of the (terribly named) composed datatypes MPI_SHORT_INT, MPI_DOUBLE_INT, MPI_LONG_INT, etc this is an incorrect assumption. The simplest way to fix this is to override the MPI_Type_get_envelope and MPI_Type_get_contents calls with calls that will work on these datatypes. Note that not all calls to these MPI functions are replaced, only the ones used when flattening a non-contiguous datatype. References #5009 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-04-16 10:46:38 -06:00
George Bosilca	6ff11267fb	Remove warnings identified by clang. Plus minor spacing and indentation issues. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-04-14 17:14:12 -04:00
George Bosilca	cd683e3eec	Allow OPAL DDT to receive size_t count argument. Fixes issue #5069, which relates a BigMPI bug with the use of MPI_Type_vectpor to construct very large datatypes (>2GB). Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-04-14 15:32:19 -04:00
Todd Kordenbrock	d646a00cd9	Merge pull request #5054 from tkordenbrock/topic/master/mtl-portals4.finalize.fix master: mtl-portals4: don't call progress() in finalize() if Portals4 was not initialized	2018-04-12 12:12:05 -05:00
Todd Kordenbrock	90659671bc	mtl-portals4: don't call progress() in finalize() if Portals4 was not initialized This commit fixes a segfault in mtl-portals4 finalize(). The segfault occurs if finalize() is called without any calls to add_procs(). This commit resolves the segfault by skipping the progress() loop in finalize() if the Portals was not initialized. Signed-off-by: Todd Kordenbrock (thkgcode@gmail.com)	2018-04-10 14:22:32 -05:00
Mikhail Kurnosov	82a3a5bdb5	Fix dynamic decision for Scan and bug in Allreduce Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-06 11:03:17 +07:00
KAWASHIMA Takahiro	5e12e0f2e5	Merge pull request #5001 from ggouaillardet/topic/javah configury: use javac vs javah whenever possible.	2018-04-06 11:30:20 +09:00
Jeff Squyres	fc8ebbb0e0	MPI_Comm_spawn_multiple.3in: update Fortran string array notes Per `0ab6b201fe`, note in the MPI_Comm_spawn_multiple.3in man page that the array_of_commands does not need to be terminated -- it just need to have exactly "count" entries. In the Fortran binding, at least, this is different than in prior released versions of Open MPI (it's not a backwards incompatibility, since prior versions of Open MPI required array_of_commands to be blank-string-terminated in Fortran -- this change makes Open MPI be less restrictive, and therefore still backwards compatible). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-04-05 06:52:46 -07:00
Jeff Squyres	0ab6b201fe	mpi/fortran: fix parsing arrays of Fortran strings MPI defines the "argv" param to Fortran MPI_COMM_SPAWN as being terminated by a blank string. While not precisely defined (except through a non-binding example, Example 10.2, MPI-3.1 p382:6-29), one can infer that the "array_of_argv" param to Fortran MPI_COMM_SPAWN_MULTIPLE is also a set of argv, each of which are terminated by a blank line. The "array_of_commands" argument to Fortran MPI_COMM_SPAWN_MULTIPLE is a little less well-defined. It is assumed to be of length "count" (another parameter to MPI_COMM_SPAWN_MULTIPLE) -- and not be terminated by a blank string. This is also given credence by the same example 10.2 in MPI-3.1. The previous code assumed that "array_of_commands" should also be terminated by a blank line -- but per the above, this is incorrect. Instead, we should just parse our "count" number of strings from "array_of_commands" and not look for a blank line termination. This commit separates these two cases: * ompi_fortran_argv_blank_f2c(): parse a Fortran array of strings out and stop when reaching a blank string. * ompi_fortran_argv_count_f2c(): parse a Fortran array of strings out and stop when "count" number of strings have been parsed. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-04-04 18:56:44 -07:00
Gilles Gouaillardet	5370586d98	configury: use javac vs javah whenever possible javah is no more available from Java 10, so try javac -h first (available since Java 8) and fallback on javah Refs. open-mpi/ompi#5000 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-05 10:37:35 +09:00
Gilles Gouaillardet	132ea1a6b0	java: cleanup the list of automatically generated header files Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-05 09:27:34 +09:00
Edgar Gabriel	ef28d941d9	Merge pull request #5002 from raafatfeki/pr/coverty-dynamic_gen2-fixes fcoll/dynamic_gen2: fix coverty warnings	2018-04-04 09:08:42 -05:00
Gilles Gouaillardet	e85fa469f3	coll/tuned: add recursive doubling algo for [ex]scan Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-04 14:56:23 +09:00
Gilles Gouaillardet	393376bbd9	coll/basic: move [ex]scan from coll/basic to coll/base Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-04 13:41:01 +09:00
Gilles Gouaillardet	65fa0b59c3	coll/tuned: add Rabenseifner algo for [all]reduce Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-04-04 13:25:41 +09:00
Mikhail Kurnosov	177c6ce51f	Move algorithms from coll/spacc to coll/base and remove coll/spacc Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-04 10:21:06 +07:00
raafatfeki	5d99af29cd	fcoll/dynamic_gen2: Formatting fixes Adjust Coding Style to match the 4 space tab rule. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-04-02 17:25:00 -05:00
raafatfeki	92822613ea	fcoll/dynamic_gen2: fix coverty warnings fix warnings for coverty CID 1433655 and CID 1433654 Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-04-02 16:18:07 -05:00
Mikhail Kurnosov	1d2d43bdf0	Fix compile error with dtype Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-04-01 08:27:34 +07:00
Edgar Gabriel	c4879ec29f	io/ompio: don't reset amode if MODE_SEQUENTIAL is set the ompio module resets the amode from WRONLY to RDWR in order to accoomodate data sieving in the two-phase fcoll componet. This leads however to an error if MPI_MODE_SEQUENTIAL has been requested by the user, since MODE_SEQUENTIAL is incompatible with MODE_RDWR. SInce the change to the amode was done after opening the file for individual file pointers but before opening the file for shared filepointers, this lead to an error message in the sharedfp component. Note, that data sieving is never necessary if MODE_SEQUENTIAL is set, so this should not be a problem for any scenario. Fixes #4991 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-30 07:56:47 -05:00
Mikhail Kurnosov	50ec214d42	Add recursive doubling algorithm for MPI_Scan and MPI_Exscan to coll/base Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-03-30 10:12:51 +07:00
raafatfeki	100677721d	fcoll/dynamic_gen2: use hindexed constructor on the sender side instead of using a temporary buffer and copy data into the temp buffer before sending, use a derived datatype to describe the data that needs to be sent during a cycle in the collective I/O operation. Signed-off-by: raafatfeki <fekiraafat@gmail.com>	2018-03-28 14:37:30 -05:00
Mikhail Kurnosov	bd12e2b1c6	Add recursive doubling algorithm for Scan and Exscan Implements recursive doubling algorithm for MPI_Scan and MPI_Exscan. The algorithm preserves order of operations so it can be used both by commutative and non-commutative operations. Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2018-03-28 16:27:11 +07:00
Nathan Hjelm	e79debc320	osc/rdma: fix overflow in offset calculation This commit fixes a bug is osc/rdma that can occur if the total size of the shared memory segment gets larger than 4 GiB. The bug was caused by a typo. The type of my_base_offset should have been size_t not int. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-27 09:33:44 -06:00
Nathan Hjelm	f7faacca4e	osc/rdma: fix 32-bit builds Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-27 09:16:04 -06:00
Jeff Squyres	06af6f1c4c	Merge pull request #4962 from jsquyres/pr/cid-fixes A bunch of CID fixes	2018-03-26 22:30:31 -04:00
Ralph Castain	f92acd735b	Merge pull request #4965 from rhc54/topic/rank Fix breakage in ranking system and silence OSC/RDMA warnings	2018-03-26 19:10:36 -05:00
Jeff Squyres	5360035995	topo/treematch: fix CID 1416327 Ensure to free things in the right order so that we don't access memory after it is freed. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:26:17 -07:00
Jeff Squyres	08ceb66a19	osc/pt2pt: fix (effectively false positive) CID 1402113 This will almost certainly never happen, but be defensive and guarantee that we never return an uninitialized variable. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:26:17 -07:00
Jeff Squyres	9de750a280	io/ompio: fix CID 1269889 Free some memory upon error conditions. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Jeff Squyres	dca66b9775	comm_join: fix CID 1323170 Enusre that the port name is always NULL-terminated. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Jeff Squyres	6319292170	fcoll/static: fix CID 1413066 local_iov_array is unconditionally allocated, so unconditionally de-allocate it, too. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Jeff Squyres	2968ffa296	fcoll/static: remove useless/dead code Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Jeff Squyres	8e925b4f17	fbtl/posix: fix CID 1419954 Ensure to initialized ret_code. This problem will likely never occur in practice, but we might as well be defensive about it. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Jeff Squyres	124208198c	osc/rdma: fix CID 1424327 Fix minor memory leak. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-26 14:21:21 -07:00
Nathan Hjelm	1c75aa82fc	use-mpi-f08: fix rma function signatures The various RMA functions need to have the asynchronous property on all buffers. This property was missing and some buffers were incorrectly marked as intent(in). This commit fixes the function signatures. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-26 15:11:07 -06:00
Ralph Castain	3a93b535ec	Silence the flood of OSC/RDMA warnings Fixes #4950 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-03-25 16:12:41 -07:00
Artem Polyakov	77ff99e9ee	Merge pull request #4933 from karasevb/timings_update timings: added new timing points	2018-03-25 00:10:49 -07:00
Jeff Squyres	871e5c76bc	Merge pull request #4960 from jsquyres/pr/warnings-fixes Coverity fix + compiler warning fixes	2018-03-23 14:47:56 -05:00
Jeff Squyres	c3adcb05eb	Miscellaneous compiler warnings fixes Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-03-23 11:45:30 -07:00
Nathan Hjelm	5f7ff5307e	fcoll/two_phase: do not use removed function (MPI_Address) Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-23 08:43:24 -06:00
Edgar Gabriel	36747cca67	io/ompio: disable the fcoll timing by default somehow the flag indicating to gather performance data on collective io operations has changed to 1 accidentally. Should be 0 ( false) by default. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-21 11:34:35 -05:00
Edgar Gabriel	aae8c6c6ad	remove addproc sharedfp component never got to move this sharedfp component into anything usable. Can easily be restored if necessary. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-21 11:27:01 -05:00
Edgar Gabriel	e703ac2da8	remove plfs components plfs components are at this point not utilized by anybody as far as I know. Easy to bring back if we want to. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-21 11:27:01 -05:00
Boris Karasev	3796307a57	timings: added new timing points Signed-off-by: Boris Karasev <karasev.b@gmail.com>	2018-03-21 05:16:25 +02:00
bosilca	bf3dd8af19	Merge pull request #4884 from bosilca/topic/fix_wtime Improve the range and accuracy of MPI_Wtime.	2018-03-16 14:09:33 +09:00
Nathan Hjelm	7f4872d483	osc/rdma: performance improvments and bug fixes This commit is a large update to the osc/rdma component. Included in this commit: - Add support for using hardware atomics for fetch-and-op and single count accumulate when using the accumulate lock. This will improve the performance of these operations even when not setting the single intrinsic info key. - Rework how large accumulates are done. They now block on the get operation to fix some bugs discovered by an IBM one-sided test. I may roll back some of the changes if the underlying bug in the original design is discovered. There appear to be no real difference (on the hardware this was tested with) in performance so its probably a non-issue. References #2530. - Add support for an additional lock-all algorithm: on-demand. The on-demand algorithm will attempt to acquire the peer lock when starting an RMA operation. The lock algorithm default has not changed. The algorithm can be selected by setting the osc_rdma_locking_mode MCA variable. The valid values are two_level and on_demand. - Make use of the btl_flush function if available. This can improve performance with some btls. - When using btl_flush do not keep track of the number of put operations. This reduces the number of atomic operations in the critical path. - Make the window buffers more friendly to multi-threaded applications. This was done by dropping support for multiple buffers per MPI window. I intend to re-add that support once the underlying performance bug under the old buffering scheme is fixed. - Fix a bug in request completion in the accumulate, get, and put paths. This also helps with #2530. - General code cleanup and fixes. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-15 14:53:53 -06:00
Benoît Legat	00600c7cbb	Fix typo in MPI_Cart_shift doc Signed-off-by: Benoît Legat <benoit.legat@gmail.com>	2018-03-13 15:25:42 +01:00
Edgar Gabriel	da640f98df	fcoll/two_phase: data sieving has to occur at offset 0 as well data sieving has to occur for any offset provided that is larger or equal zero for this implementation to work correctly. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-10 11:23:09 -06:00
George Bosilca	0f0c27a184	Allow MPI_PROC_NULL as neighbor. Allowing MPI_PROC_NULL as a neighbor in any topology allows us to add gaps on the send and recv buffers. This does make the traditional neighbor collective have a similar behavior as the V version, but in same time it allows the users to skip the step where they prepare the counts and the displacement array. For more info please take a look at issue #4675. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-03-09 12:20:26 +09:00
Edgar Gabriel	c83b47c266	io/romio314: mark datatypes of size 0 as contiguous this commit fixes an issue observed with romio314 and the hdf5 1.10.x testsuite. The ADIOI_Datatype_iscontig() routine in romio314/src/io_romio314_module.c will now return for a datatype of size 0 that it is contiguous, even if the extent of the datatype is non-zero. This avoids a segmentation fault observed in the ADIOI_Flatten routine, and fixes this particular with the hdf5 1.10.x testsuite in OpenMPI with romio314. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-03-08 09:10:09 -06:00
George Bosilca	9bced03213	Improve the range and accuracy of MPI_Wtime. As discussed on https://github.com/mpi-forum/mpi-issues/issues/77#issuecomment-369663119 the conversion to double in the MPI_Wtime decrease the range and accuracy of the resulting timer. By setting the timer to 0 at the first usage we basically maintain the accuracy for 194 days even for gettimeofday. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2018-03-08 14:26:02 +09:00
Nathan Hjelm	5ed2fc2d48	mca/base: add support for additional variable types This commit adds long, int32_t, uint32_t, int64_t, and uint64_t as possible MCA variable types. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-03-01 20:42:27 -07:00
bosilca	9944d63de1	Merge pull request #4852 from thananon/pr/ob1_oos_fix pml/ob1: fixed out of sequence bug.	2018-02-28 13:02:03 -05:00
Thananon Patinyasakdikul	09cba8b30b	pml/ob1: fixed out of sequence bug. This commit fixes #4795 - Fixed typo that sometimes causes deadlock in change of protocol. - Redesigned out of sequence ordering and address the overflow case of sequence number from uint16_t. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2018-02-27 13:49:40 -05:00
Valentin Petrov	bf4e694a96	coll/hcoll: Fix return codes Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2018-02-22 17:48:29 +02:00
Matias Cabral	0a822f8f99	Merge pull request #4821 from nrspruit/OFI_mtl_multi_event_progress MTL OFI: Added support for reading multiple CQ events in ofi progress	2018-02-20 14:59:47 -08:00
Jeff Squyres	9ef0f3d83a	ompi/monitoring: add .sh versionig to common monitoring lib Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2018-02-20 07:07:23 -08:00
Artem Polyakov	b601dd504a	ompi: ompi_mpi_init(): do not export threading level to modex. For some of our configuration this flag increases per-process contribution by ~20% while it is not being used currently. The consumer of this flag was communicator ID calculation logic, but it was changed in `0bf06de3f1`. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2018-02-18 01:40:15 +07:00
Spruit, Neil R	e7bff501cd	MTL OFI: Added support for reading multiple CQ events in ofi progress -Updated ompi_mtl_ofi_progress to use an array to read CQ events up to a threshold that can be set by the Open MPI User. -Users can adjust the number of events that can be handled in the ompi_mtl_ofi_progress by setting "--mca mtl_ofi_progress_event_cnt #". -The default value for the the number of CQ events that can be read in a single call to ofi progress is 100 which is an average based off workload usecase anaylsis showing 70-128 as the range of multiple events returned during ofi progress. Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>	2018-02-15 09:41:14 -05:00
Nathan Hjelm	0e83568466	coll/libnbc: do not take lock in progress if there are no requests This commit fixes a flaw in the progress function for libnbc. The function was unconditionally taking a lock even if there are no requests to process. This lock was showing up in vtune traces of multi-threaded benchmarks. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-02-13 09:51:01 -07:00
Edgar Gabriel	a3a734b6d2	io/ompio: correctly reset the request after performing the final OBJ_RELEASE on the request, reset the user level variable to MPI_REQUEST_NULL. Otherwise the c_2_f translation step in the fortran interface fails. Fixes issue #4807 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-02-13 09:18:25 -06:00
Jeff Squyres	e7f91f8068	Merge pull request #4527 from clementFoyer/osc-no-includes Remove inter-dependencies between OSC modules.	2018-02-09 15:49:56 -05:00
Nathan Hjelm	da9f833f4a	pml/ob1: ignore the eager limit of RDMA-only btls This commit fixes a flaw in the eager limit check in pml/ob1. The check was incorrectly checking if RDMA-only BTLs (BTLs without the send flag) has a valid eager limit. This commit fixes the check by adding an additional check for the send flag on the BTL module. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2018-02-07 12:42:44 -07:00
Clement Foyer	f5b4fc05f8	Remove inter-dependencies between OSC modules. The osc monitoring component needed to include other OSC components header in order to be able tu access communicator through the component specific ompi_osc__module_t structures. This commit remove the dependency, and resolve the issue #4523. Extend the common monitoring API. Now it's possible to translate from local rank to world rank from both the communicator and the group. * Remove useless hashtable as we directly use the w_group contained in window structure. Add automatic generation at config time. The templates are expanded at configure time. It creates a new header file that generates all the variables/functions needed. Adding this during the autogen automagicaly generates for each of the available modules the proper functions. Only keep a generated argv-style array. Following Jeff's advice, the configure.m4 file generate a simple array of module variables to be iterated over to find the proper module. Signed-off-by: Clement Foyer <clement.foyer@inria.fr>	2018-02-07 11:52:00 +00:00
Ralph Castain	7ddffc627d	Merge pull request #4776 from rhc54/topic/rte Correct abstraction break and update ignores	2018-01-31 04:34:05 -08:00
Ralph Castain	8e8a9aecc5	Correct abstraction break - direct reference to ORTE Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2018-01-30 21:19:14 -08:00
Gilles Gouaillardet	34b45cc879	osc/sm: fix the osc_free callback If component selection fails, then module->bases might be unallocated when ompi_osc_sm_free() in invoked, so test it before trying to free() module->bases[0]. Thanks Martin Binder for the report. Refs open-mpi/ompi#4770 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-31 11:23:21 +09:00
Sergey Oblomov	7a5811d0a8	request/state: update state for canceled request - fixed issue in set state for canceled request Signed-off-by: Sergey Oblomov <sergeyo@mellanox.com>	2018-01-29 18:26:20 +02:00
Edgar Gabriel	bcf26d419f	fs/ufs and fs/lustre: remove erroneous return statement an erroneous return statement has creeped in commit `1885d99` which leads to some processes not resetting stripe_size and stripe_count correctly. This can lead in 3.0.x to different fcoll modules being selected. The impact is not that dramatic on master and 3.1.x, but could lead to problems as well. Fixes #4745 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2018-01-24 14:07:21 -06:00
Xin Zhao	72ff2b1135	OMPI/OSC/UCX: adding atomic lock for fetch_and_op and compare_and_swap Signed-off-by: Xin Zhao <xinz@mellanox.com>	2018-01-18 00:36:22 +02:00
Yossi Itigin	f2851fd502	Merge pull request #4724 from alex-mikheev/topic/ucx_as_default ompi/oshmem: ucx is selected over yalla/ikrit by default	2018-01-17 17:41:49 +02:00
Alex Mikheev	640e945b9c	ompi: pml/ucx: blocking send using ucp_tag_send_nbr Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2018-01-17 15:54:18 +02:00
Alex Mikheev	ae326546f4	ompi/oshmem: ucx is selected over yalla/ikrit by default Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2018-01-17 15:08:04 +02:00
Gilles Gouaillardet	cb5dfbe5b1	man: fix indentation of MPI_Comm_spawn[_multiple] no code change. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-15 11:29:15 +09:00
Matias Cabral	8049c06a96	Merge pull request #4580 from matcabral/fix_comments_pr_425_osc_rmda osc/rmda: fix missing opal_argv_free in mtls search.	2018-01-12 09:20:11 -08:00
Matias A Cabral	009ba475e1	osc/rmda: fix missing opal_argv_free in mtls search. Use asprintf in description message to avoid missing default values Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2018-01-11 14:29:16 -08:00
Jeff Squyres	1c5664fdec	Merge pull request #4681 from nathanweeks/issue/f08_mpi_errcodes_ignore Fix type of mpi_f08 MPI_ERRCODES_IGNORE	2018-01-10 12:34:16 -05:00
Gilles Gouaillardet	02f8215b25	ompi: enhance MPI_File_set_view datatype check. Per MPI 3.1 chapter 13.3 : "Derived etypes can be constructed by using any of the MPI datatype constructor routines, provided all resulting typemap displacements are non-negative and monotonically nondecreasing." Same restriction applies to ftypes. add the OMPI_DATATYPE_CHECK_FOR_VIEW() macro that is check the underlying opal_datatype_t is monotonic, on top of all checks performed in OMPI_DATATYPE_CHECK_FOR_RECV(). Since checking monotoniciy is expensive, check is only performed when needed, but the result is cached by ompi_datatype_is_monotonic(). Thanks Wei-keng Liao for the valuable feedback. Thanks George for the guidance. Refs. open-mpi/ompi#4682 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2018-01-09 18:05:15 +09:00
Nathan T. Weeks	3158d2c5ed	Fix type of mpi_f08 MPI_ERRCODES_IGNORE Signed-off-by: Nathan T. Weeks <weeks@iastate.edu>	2018-01-07 05:36:41 -08:00
KAWASHIMA Takahiro	710080be63	Merge pull request #4667 from kawashima-fj/pr/f08-pmpi fortran: Fix PMPI interface bugs in mpi_f08 module	2018-01-05 03:45:10 -06:00
bosilca	ef38ca5663	Merge pull request #4644 from bosilca/topic/treematch Fix treematch topology assert	2018-01-02 21:21:54 -05:00
Gilles Gouaillardet	2dd345465f	ompi/communicator: optimize ompi_comm_split() set grp_local_rank as MPI_UNDEFINED before invoking ompi_comm_nexcid() in order to benefit from the optimizations introduced in open-mpi/ompi@68167ec879 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-28 15:34:04 +09:00
Yossi Itigin	697a9437e2	Merge pull request #4666 from alex-mikheev/topic/pml_ucx_recv_fix ompi: pml ucx: improve recv latency	2017-12-26 20:12:18 +02:00
Alex Mikheev	e7bf0617cf	ompi: pml ucx: improve recv latency Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2017-12-26 16:24:16 +02:00
KAWASHIMA Takahiro	bd2fe9c324	fortran: Call PMPI from PMPI_Status_set_cancelled_f08 This is a bug which was forgotten to change in `c08f97b030`. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 15:53:12 +09:00
KAWASHIMA Takahiro	00e3c7a973	fortran: Align indentation This change makes comparison of `mpi-f08-interfaces.F90` and `pmpi-f08-interfaces.F90` easier. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 14:39:09 +09:00
KAWASHIMA Takahiro	056eb39b12	fortran: Correct type of `info_used` It is incorrectly typed as `MPI_Comm` in only `pmpi` in `24f7bd327e`. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 14:39:09 +09:00
KAWASHIMA Takahiro	0c3a534b32	fortran: Use `C_PTR` for `buffer_addr` It was changed to use `C_PTR` in only `mpi` in `fc69c0be24`. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 14:39:09 +09:00
KAWASHIMA Takahiro	d4fc404dc6	fortran: Change `PMPI_Aint_{add,diff}` to functions. They were incorrectly changed to subroutines in only `pmpi` in `258d1aa160`. Strictly speaking, this change involves binary incompatibility. But nobody used these subroutines and nobody will be affected because these subroutines were useless (didn't return a calculated value). Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 14:39:09 +09:00
KAWASHIMA Takahiro	9240967b8f	fortran: Remove ASYNCHRONOUS from mpi_f08 pmpi It was removed from only `mpi` as a bug fix in `db41d749c1`. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-12-26 14:39:09 +09:00
KAWASHIMA Takahiro	4cd917a3fd	Merge pull request #4659 from yanagibashi/pr/fortran2008 Add missing Fortran 2008 binding subroutines	2017-12-22 01:53:36 -06:00
Tsubasa Yanagibashi	3f4b373856	Add missing Fortran 2008 binding subroutines added missing Fortran 2008 binding pmpi_{*} subroutines to Open MPI. Signed-off-by: Tsubasa Yanagibashi <fj2505dt@aa.jp.fujitsu.com>	2017-12-22 13:45:53 +09:00
Aravind Gopalakrishnan	fb68726baf	MTL OFI: Allow retries in MTL progress for interrupted syscalls This fixes a regression in sockets provider which could return -EINTR value from fi_cq_read() due to a syscall being interrupted. The error value is currently interpreted as fatal condition. Relax the rule so that we can retry fi_cq_read() operation. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-12-20 14:58:49 -08:00
George Bosilca	38455845db	Fix asserts. In both cases we were comparing with the wrong size, it should be either the number of local processes or the number of nodes, and not the size of the communicator. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-12-20 11:51:35 -05:00
George Bosilca	808f865e9d	Force all output to use OMPI infrastructure. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-12-20 11:50:51 -05:00
Jed Brown	533800070e	MPI_Attr_get: doc fix: MPI_Comm_create_attr -> MPI_Comm_get_attr MPI_Comm_create_attr does not exist. Signed-off-by: Jed Brown <jed@jedbrown.org>	2017-12-17 07:44:22 -07:00
Gilles Gouaillardet	794cc09d3e	mpiext: fix path to Fortran 2008 modules OMPI_FORTRAN_USEMPIF08_MOD macro was removed in open-mpi/ompi@791bcee6c0 so this macro is now manually expanded to mpi/fortran/use-mpi-f08/mod Thanks to Nathan T. Weeks for reporting Refs open-mpi/ompi#3605 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-11 11:02:43 +09:00
Matias Cabral	2c86b8723d	Merge pull request #4510 from matcabral/mtl_psm2_shadow_vars New flag for MCA parameters that allows a behaving with a default value of "unset".	2017-12-04 12:25:37 -08:00
Howard Pritchard	b160cf6339	Merge pull request #4533 from hppritcha/topic/ofi_mtl_mprobe_fixes mtl/ofi: fix problem with mprobe/mrecv	2017-12-04 09:11:47 -07:00
Howard Pritchard	2233e44848	Merge pull request #4534 from hppritcha/topic/fix_a_segv_in_request pml/cm: check for request comp. before completing bsend	2017-12-04 09:09:41 -07:00
Gilles Gouaillardet	2f5b1e9fe0	Merge pull request #4551 from ggouaillardet/topic/communicator_mutex_c_lock Make usage of ompi_communicator_t, ompi_file_t and ompi_win_t mutex consistent	2017-12-04 09:20:52 +09:00
Edgar Gabriel	1f151be6d2	io/ompio: introduce a new function to retrieve mca parameter values ompio has the unique problem, that mca parameters set in the io/ompio component have to be accessible from other frameworks as well. This is mostly done to avoid a replication in the parameter names and to reduce the number of mca parameters that and end-user has to worry about. This commit introduces a generic function to retrieve ompio mca parameters, the function pointer is stored on the file handle. It replaces two functions that used the same concept already for one parameter each. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-12-01 10:00:23 -06:00
Gilles Gouaillardet	b8e77ba759	mpi/c: use OPAL_THREAD[UN]LOCK() instead of opal_mutex_[un]lock() in order to keep consistency between ompi_communicator_t, ompi_file_t and ompi_win_t. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-01 16:06:32 +09:00
Gilles Gouaillardet	1ba4c185bc	ompi/communicator: destruct ompi_communicator_t's c_lock in the destructor Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-01 16:06:27 +09:00
Gilles Gouaillardet	5f1a967351	ompi/file: rename ompi_file_t's f_mutex into f_lock in order to use a consistent name between ompi_file_t, ompi_win_t and ompi_communicator_t Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-12-01 16:06:22 +09:00
bosilca	5cb72aa568	Merge pull request #4552 from hjelmn/asm_cleanup2 Add atomic fetch-and-op and compare-exchange functions	2017-11-30 22:29:38 -05:00
Nathan Hjelm	7893248c5a	opal/asm: add fetch-and-op atomics This commit adds support for fetch-and-op atomics. This is needed because and and or are irreversible operations so there needs to be a way to get the old value atomically. These are also the only semantics supported by C11 (there is not atomic_op_fetch, just atomic_fetch_op). The old op-and-fetch atomics have been defined in terms of fetch-and-op. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-30 10:41:23 -07:00
Nathan Hjelm	1282e98a01	opal/asm: rename existing arithmetic atomic functions This commit renames the arithmetic atomic operations in opal to indicate that they return the new value not the old value. This naming differentiates these routines from new functions that return the old value. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-30 10:41:22 -07:00
Nathan Hjelm	9d0b3fe9f4	opal/asm: remove opal_atomic_bool_cmpset functions This commit eliminates the old opal_atomic_bool_cmpset functions. They have been replaced by the opal_atomic_compare_exchange_strong functions. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-30 10:41:22 -07:00
Gilles Gouaillardet	f1778d2778	communicator: remove the USE_MUTEX_FOR_COMMS macro It should have always been #define'd in order to correctly handle the multi-threaded case. Also fix indentation in ompi/mpi/c/comm_get_errhandler.c Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-30 14:29:11 +09:00
Gilles Gouaillardet	7c3e675479	fix communicator's c_lock usage - initialize c_lock in the ompi_communicator_t constructor - USE_OPAL_THREAD_[UN]LOCK(c_lock) - #ifdef USE_MUTEX_FOR_COMMS protect c_lock access Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-30 14:27:59 +09:00
Nathan Hjelm	45db3637af	osc/rdma: bug fixes This commit fixes the following bugs: - Allow a btl to be used for communication if it can communicate with all non-self peers and it supports global atomic visibility. In this case CPU atomics can be used for self and the btl for any other peer. - It was possible to get into a state where different threads of an MPI process could issue conflicting accumulate operations to a remote peer. To eliminate this race we now update the peer flags atomically. - Queue up and re-issue put operations that failed during a BTL callback. This can occur during an accumulate operation. This was an unhandled error case. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-29 12:43:58 -07:00
Nathan Hjelm	67e26b6e5a	Merge pull request #4482 from matcabral/osc_rdma_skip_mtls osc/rdma: mca parameter to list MTLs that lower osc rdma priority	2017-11-29 09:34:33 -07:00
Nathan Hjelm	647b40f3f2	Merge pull request #4442 from bosilca/topic/ob1_pvar Topic/ob1 pvar	2017-11-29 09:31:07 -07:00
Ralph Castain	5e109fba6a	Take the smallest change approach to fixing the hang on disconnect of connect/accept operations by replacing the call to pmix.disconnect with a simple pmix.fence as this is all OMPI currently requires Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-29 03:30:40 -08:00
Ralph Castain	7ad6886a30	Add a new OMPI rte component to support direct-launch using PMIx. Cleanup several places where abstraction violations crept into OMPI layer (direct reference of ORTE). Add some missing includes that were exposed by this change. Note that this compiles, but I haven't tested it for execution yet. Handing it over to Noah Evans for completion Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-28 12:05:01 -08:00
bosilca	c4c5df4d1c	Merge pull request #4476 from hjelmn/errhandler_fixes ompi/errhandler: make set/get actually thread safe	2017-11-27 19:43:42 -05:00
Nathan Hjelm	6b68d1cfc8	ompi/errhandler: make set/get actually thread safe The current versions of these functions have a fatal flaw. If a errhandler set and free call is made by another thread while the thread calling get is between the cmpset and retain then we will retain an invalid object. Fixing this by just using locking. This is not a critical path so this should be ok. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-11-27 15:14:17 -07:00
Howard Pritchard	3285325884	pml/cm: check for request comp. before completing bsend. Turns out there are edge cases where an MTL's isend method may end up marking a send request complete prior to returning to the CM code. The would end up causing problems in the bsend path since the ompi_request_complete would end up getting invoked a second time on this request. This ended up causing segfaults, etc. in ompi_request_complete . Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-11-27 12:27:43 -07:00
Ralph Castain	3906aaf41a	Silence warnings Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-11-25 11:50:18 -08:00
Howard Pritchard	cd48eccbae	mtl/ofi: fix problem with mprobe/mrecv At least with some providers (sockets and GNI), the mprobe/mrecv ofi mtl methods were incorrect. For these two providers at least one must supply the original tag and mask bits used with the prior FI_PEEK \| FI_CLAIM request that had been used to probe for the message. These providers take a strict interpretation of the following sentence from the libfabric fi_tagged man page: ``` Claimed messages can only be retrieved using a subsequent, paired receive operation with the FI_CLAIM flag set. ``` Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-11-24 08:11:18 -07:00
Edgar Gabriel	75ab006ec0	io/ompio: add a new option to disable amode overwriting ompio has historically changed the WRONLY flag provided by the applicaiton to RDWR to allow for the data sieving optimization within the two-phase I/O fcoll component. This change did not have a performance impact on regular UNIX file systems, but seems to hurt performance on NFS (and maybe Lustre?) So provide an option that allows to keep the WRONLY option, and raise an error if tha fcoll/two-phase would actually like to use the data sieving. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-11-17 13:13:38 -06:00
Matias A Cabral	1fad59465f	New flag for MCA parameters that allows a behaving with a default value of "unset". mtl/psm2: Update some shadow mca parameters to use the default "unset". mtl/psm2: Add new shadow parameter to allow specifying the service level. Signed-off-by: Matias A Cabral <matias.a.cabral@intel.com>	2017-11-16 16:28:50 -08:00
Matias Cabral	d1869a725a	Merge pull request #4467 from matcabral/master mtl/ofi: Set data and control progress options default values to FI_PROGRESS_UNSPEC	2017-11-13 07:35:39 -08:00
Jeff Squyres	a8686a6813	mtl ofi: squelch compiler warnings gcc 5.2 complains: ``` mtl_ofi_component.c: In function ‘ompi_mtl_ofi_finalize’: mtl_ofi_component.c:613:5: warning: suggest parentheses around assignment used as truth value [-Wparentheses] if (ret = fi_close((fid_t)ompi_mtl_ofi.fabric)) { ^ ``` Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-11 05:07:11 -08:00
Jeff Squyres	5a6ddf42d6	mtl ofi: it is not an error to return no data from fi_getinfo() Before this commit, the presence of usNIC devices -- which will (currently) return no data when fi_getinfo() is queried for tagged matching providers -- would cause an error message to be displayed. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-11 05:07:11 -08:00
Jeff Squyres	f910f554f7	mtl ofi: show the positive value of the error The value of ret is negative (e.g., -61), but it is displayed in the help message as `%zd`, which renders as unsigned (i.e., a giant positive value). So make sure to negate the negative value before rendering it (e.g., so we display "61", not "4294967235"). Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-11 05:07:11 -08:00
Jeff Squyres	e8c13ef286	mtl ofi: fix trivial comment whitespace No code or logic changes. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-11 05:07:10 -08:00
Jeff Squyres	bed1930df8	mtl ofi: fix formatting of help message No code or logic changes. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-11 05:07:05 -08:00
bosilca	3f43e2c8df	Merge pull request #4419 from thananon/pr_newob1 pml/ob1: match callback will now queue wrong sequence frag and return.	2017-11-10 20:10:53 -05:00
Matias A Cabral	80c8858c5a	osc/rdma: add an mca parameter to list MTLs for which osc pt2pt should have higher priority than rdma and default to psm2. Context: the Intel Omni-path driver (hfi1) has verbs support, so the openib btl is available to use. However, at a bad performance. Without this change osc rdma using btl openib is the default choice when running on Intel Omni-path, with a lower performance than osc pt2pt over mtl psm2. Signed-off-by: Matias A Cabral <matias.a.cabral@intel.com>	2017-11-09 11:54:11 -08:00
George Bosilca	409638bdf4	Keep the out-of-sequence fragment ordered. Rework the logic to handle the out-of-sequence fragments on the receiver side. A large number of OOS messages are still arriving even in single threaded scenarios. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-11-08 14:27:13 -05:00
Matias Cabral	b76bb42ac1	mtl/ofi: Set data and control progress options default values to FI_PROGRESS_UNSPEC so each provider will use its default. Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2017-11-08 08:24:33 -08:00
Thomas Naughton	86bb6f8bac	Merge pull request #4444 from naughtont3/tjn-fix-plm-monitoring-configury configury: single quote to avoid trouble with BSD	2017-11-08 10:56:37 -05:00
Jeff Squyres	5ac1ea4512	ompi_info: add ipv6 config param Allow ompi_info to show whether IPv6 support was enabled in Open MPI or not. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-11-08 07:02:27 -08:00
Edgar Gabriel	0c3aa44ba3	Merge pull request #4454 from edgargabriel/topic/large-file-bug-fix io/ompio: fix a bug in handling large write/read operations	2017-11-07 09:46:47 -06:00
Gilles Gouaillardet	18d3897479	fs/ufs: add some more error codes on file_open set proper error codes in mca_fs_ufs_file_open by mapping the errno value to the MPI error code. Refs. open-mpi/ompi#4443 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-11-07 11:09:32 +09:00
Edgar Gabriel	c9bb049d00	io/ompio: fix a bug in handling large write/read operations This is a bug fix based on a problem reported on the mailing list. For very large read/write operations, ompio breaks the operation down into multiple cycles. The problem was that one of the variables required to maintain its values across the different cycles did not do that, and because of that the calculations of the memory offsets was wrong. Fixes issue #4453 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-11-06 11:48:13 -06:00
Edgar Gabriel	91881c4fdc	fs/lustre: update component to match fs/ufs the fs/lustre component has missed out on a number of updates to the fs/ufs component. This commit tries to import all the changes performed on the fs/ufs component w.r.t to the file_open operation, including updates on how the amode is set, error is propegated and setting the fs_block_size value (which is required for locking purposes). Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-11-03 16:11:46 -05:00
Edgar Gabriel	1885d99ac7	fs/ufs: set proper error codes on file_open set proper error codes in mca_fs_ufs_file_open by mapping the errno value to the MPI error code. Fixes an issue reported on the mailing by Wei-keng Liao Fixes Issue #4443 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-11-03 16:06:31 -05:00
Thomas Naughton	c5dc41ee1a	configury: single quote to avoid trouble with BSD Signed-off-by: Thomas Naughton <naughtont@ornl.gov>	2017-11-03 11:34:28 -04:00
George Bosilca	d261282029	OB1 pvars should be linked with the OB1 component. If not the pvars will remain valid after the OB1 PML is unloaded, and any access will segfault (the callbacks associated with the pvar will point to the memory of the dlclosed module). Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-11-03 01:18:26 -04:00
bosilca	3e1f771045	Merge pull request #4403 from naughtont3/tjn-fix-plm-monitoring-configury fix PML monitoring configury to compile DSOs	2017-11-02 19:08:20 -04:00
bosilca	63e8a8c608	Merge pull request #4431 from hjelmn/asm_cleanup opal: rename opal_atomic_cmpset* to opal_atomic_bool_cmpset*	2017-11-02 18:45:56 -04:00
Matias Cabral	c8aa22ee22	Merge pull request #4304 from aravindksg/master Fix OFI MTL to recognize correct CQ empty scenario and improve error reporting	2017-11-02 11:56:57 -07:00
George Bosilca	b2b3da3046	Do not access the frag after returning it. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-10-31 16:39:23 -04:00
Nathan Hjelm	3ff34af355	opal: rename opal_atomic_cmpset* to opal_atomic_bool_cmpset* This commit renames the atomic compare-and-swap functions to indicate the return value. This is in preperation for adding support for a compare-and-swap that returns the old value. At the same time the return type has been changed to bool. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-10-31 12:47:23 -06:00
Ralph Castain	27f3d417ca	Revert the MPI_Init fence operations to use volatile bool instead of thread macros. The problem is that the waiting thread is cycling using OMPI_LAZY_WAIT_FOR_COMPLETION so it can exercise opal_progress. This probably isn't as critical for the modex step, but definitely necessary for the barrier at the end of mpi_init. The problem this creates is that the lazy macro exits as soon as "active" becomes false, and then we destruct the lock. However, wakeup_thread sets "active" to false - and then calls the condition broadcast to wakeup any waiting threads. So there is a race condition between that broadcast and the lock destruct. Add OPAL_ACQUIRE_OBJECT and OPAL_POST_OBJECT memory barriers to help protect against thread race conditions on some platforms Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-31 08:09:02 -07:00
Aravind Gopalakrishnan	285fc42b4e	Fix OFI MTL to recognize correct CQ empty scenario Currently, the progress function is incorrectly interpreting any error value other than a positive value or -FI_EAVAIL to mean CQ is empty. CQ is empty only if fi_cq_read() call returned -EAGAIN error code. Fix that here. While at it, fix help text output for calls made to OFI API. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-10-30 12:13:44 -07:00
Thananon Patinyasakdikul	e3b267a8fe	pml/ob1: match callback will now queue wrong sequence frag and return. In multithreaded case, it is expensive to release the lock, call the slow match and retake the lock again just to queue the frag. This patch will eliminate number of lock taken by queueing the frag right away and return. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2017-10-30 12:01:50 -04:00
Aravind Gopalakrishnan	bea4503f95	Move help text output regarding PSM2_CUDA envvar to component init phase The messages should be printed only in the event of CUDA builds and in the presence of supporting hardware and when PSM2 MTL has actually been selected for use. To this end, move help text output to component init phase. Also use opal_setenv/unsetenv() for safer setting, unsetting of the environment variable and sanitize the help text message. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-10-26 16:01:01 -07:00
Thomas Naughton	86d282d6dd	fix PML monitoring configury to compile DSOs Signed-off-by: Thomas Naughton <naughtont@ornl.gov>	2017-10-26 15:53:11 -04:00
Ralph Castain	cf3bc4f55b	Merge pull request #4346 from matcabral/psm2_mtl_mq_thread_fix MTL PSM2: add a thread lock while peeking and completing the psm2 requests.	2017-10-24 16:41:29 -05:00
Ralph Castain	0353be9704	Update MPI init to properly skip barriers Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-23 19:28:34 -07:00
Ralph Castain	6ea3c8a0bd	Update the interlib example to show an alternative method for model declaration. Add a missing range value to the OPAL layer. Make it easier to see OMPI model callbacks Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-10-23 11:27:42 -07:00
Matias Cabral	b81bcd4b0d	MTL PSM2: add a thread lock while peeking and completing the psm2 requests. Reviewed-by: Gopalakrishnan, Aravind <aravind.gopalakrishnan@intel.com> Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2017-10-20 14:46:48 -07:00
Edgar Gabriel	be0de21e6f	fs/ufs and fbtl/posix: cleanup lock management This commit looks large, but its really mostly a cleanup step. 1. introduce proper error handling for the return values of fcntl and the fbtl_posix_lock function 2. rename a parameter to more accurately reflect what it does 3. introduce an mca parameter in the fs/ufs component that allows to control what the level of locking the user would like to enforce 4. move the initialization of the fs_block_size parameter from fs/ufs into the common/ompio component. An fs component might be allowed to overwrite this value, but none of the actual fs components do that. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 14:56:28 -05:00
Edgar Gabriel	e62f9d2e52	fs/ufs: ensure that the never-lock flag is set if not on NFS Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 13:32:40 -05:00
Edgar Gabriel	f66c55f77a	fbtl/posix: fixes in the offset calculation and for aio operations our own internal testsuite passes now correctly. More testing to follow. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 13:32:39 -05:00
Edgar Gabriel	a3c638bc38	fbtl/posix: add support for file locking for the non-blocking operations Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 13:32:38 -05:00
Edgar Gabriel	415e76514d	fbtl/posix: make the code compile Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 13:32:37 -05:00
Edgar Gabriel	f5e158c869	fbtl/posix: first cut in adding locking support Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-19 13:32:37 -05:00
Gilles Gouaillardet	9771c575f5	Merge pull request #4352 from edgargabriel/pr/sem_close_fix sharedfp/sm: close the named semaphore	2017-10-19 17:04:43 +09:00
Edgar Gabriel	4d995bd4eb	sharedfp/sm: close the named semaphore in case a named semaphore is used, it is necessary to close the semaphore to remove all sm segments. sem_unlink just removes the name references once all proceeses have closed the sem. Fixes issue: #4336 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu> sharedfp/sm: unlink only needs to be called by one process Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-10-18 10:37:30 -05:00
Aurelien Bouteiller	3ef23f41a3	Bugfix a crash when a comm cannot be initialized Signed-off-by: Aurelien Bouteiller <bouteill@icl.utk.edu>	2017-10-18 11:32:37 -04:00
Valentin Petrov	1e311b2619	coll/hcoll: dtype fallback optimization If hcoll fails to create mpi derived type let's set zero_dte on this dtype. This will save cycles on subsequent collective calls with the same derived type since we will not try to create hcoll type again. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2017-10-06 10:29:29 +03:00
Valentin Petrov	06ef344630	coll/hcoll: extends dtypes support Adds support for legacy MPI_UB/LB types (old apps may use it) as well as for BOOL/WCHAR. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2017-10-06 10:29:29 +03:00
Geoff Paulsen	be7b0af5d9	Merge pull request #3609 from markalle/pr/single_type_with_LB_UB single_predefined_type with MPI_LB/UB	2017-10-04 15:13:09 -05:00
Mark Allen	e24d5ccb7e	single_predefined_type with MPI_LB/UB The ompi_datatype_get_single_predefined_type_from_args() recurses down into a constructed type to identify what base datatype it's built from if it's built from a single type. But if the type has MPI_LB/MPI_UB, for example lens[0] = 1; lens[1] = 1; disps[0] = 0; disps[1] = 0; types[0] = MPI_LB; types[1] = MPI_INT; MPI_Type_create_struct(2, lens, disps, types, &mydt); then this function will see the base type MPI_LB as differing from MPI_INT and will identify mydt as not being constructed from a single base type, so the type will be rejected for calls like MPI_Accumulate. I think those "meta data" types shouldn't result in rejection like that, and the above mydt should still be identified as having a single base type of MPI_INT. Addition: boslica wanted another change discussed here https://github.com/open-mpi/ompi/pull/3609 relating to the calculation for "count" after identifying the predefined_type that was being used. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-10-03 19:08:18 -04:00
George Bosilca	bdbea63a1c	Update the MPI standard reference. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-10-03 16:48:50 -04:00
George Bosilca	a3ac67be0d	Remove double include. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-10-02 21:33:40 -04:00
Gilles Gouaillardet	9492766dbd	romio: disable Fortran support romio314 is a just a component that does not require Fortran bindings, so simply disable Fortran support to prevent warnings about deprecated flags Fixes open-mpi/ompi#4281 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-10-02 21:44:33 +09:00
Matias Cabral	d9b2c94d4a	Merge pull request #4286 from aravindksg/master Use opal_show_help to warn about PSM2_CUDA envvar setting	2017-10-01 11:10:19 -07:00
George Bosilca	2a2db13b32	Gracefully deal with a get returning 1 (complete right away). Kudos to @EmmanuelBRELLE for spotting it. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-10-01 02:24:02 -04:00
Aravind Gopalakrishnan	f8a2b7f6bf	Use opal_show_help to warn about PSM2_CUDA envvar setting If Open MPI is configured with CUDA, then user also should be using a CUDA build of PSM2 and therefore be setting PSM2_CUDA environment variable to 1 while using CUDA buffers for transfers. If we detect this setting to be missing, force set it. If user wants to use this build for regular (Host buffer) transfers, we allow the option of setting PSM2_CUDA=0, but print a warning message to user that it is not a recommended usage scenario. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-09-29 17:04:10 -07:00
George Bosilca	92e4ecb618	Fix the offset computation. Also ensure the marked array is correctly freed in all cases. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-27 11:42:21 -04:00
George Bosilca	0ceed71368	Fix Coverity warnings in treematch topo. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-26 22:24:01 -04:00
Edgar Gabriel	4c0d347412	Merge pull request #4230 from edgargabriel/topic/no-smart-fview io/ompio: add a new grouping option avoiding communication	2017-09-26 10:56:06 -05:00
bosilca	f44e674992	Merge pull request #4074 from bosilca/topic/coverity Fix coverity complaints.	2017-09-25 15:05:31 -04:00
Guillaume Mercier	4e7c130c31	Add correct reordering computation in partially distributed case. Replaced matching array with k and bcast with scatter. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Guillaume Mercier <mercier@labri.fr>	2017-09-25 13:10:11 -04:00
George Bosilca	3dd1d8cb53	Delay the first check for the HWLOC topology. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-25 13:09:57 -04:00
George Bosilca	28046b37df	Always succesfully return. As the reordering is an optional step, if any operation during the reorder fails we can return the duplicata of the original communicator associated with the topology information. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-25 12:53:20 -04:00
George Bosilca	219a96fa69	Prevent memory leaks. Reorder the code to simplify the memory management. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-25 12:53:20 -04:00
George Bosilca	64bff0e326	Disable monitoring if we compile statically. Protect all components against compilation on static builds. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-25 12:18:23 -04:00
George Bosilca	458ccc12e1	Move the profiling library in common/monitoring Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-25 12:18:23 -04:00
Clément FOYER	f334607c34	Simplify the communicator's name caching management (#6 ) Signed-off-by: Clement Foyer <clement.foyer@inria.fr>	2017-09-25 12:18:23 -04:00
bosilca	a680b3ac6d	Merge pull request #3853 from clementFoyer/master OMPI monitoring: Simplify the communicator's name caching management + misc test changes	2017-09-25 12:14:36 -04:00
yohann	1f8cabc890	mtl/ofi: Fix provider selection. This allows mtl_ofi_provider_include to work with layered providers as well. e.g. --mca mtl_ofi_provider_include "providerX;ofi_rxm" Signed-off-by: yohann <yohann.burette@intel.com>	2017-09-20 16:00:50 -07:00
Gilles Gouaillardet	b9315edb85	configury: remove the --disable-mpi-io option Fixes open-mpi/ompi#2185 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-20 14:39:09 +09:00
Edgar Gabriel	76a8c67575	io/ompio: add a new grouping option avoiding communication the new grouping option simple+ performs all calculations used for the aggregator selection as if the default file view would be used, thus avoiding communication in file_set_view all together. This mode is useful for applications that do not set a file view, but use explicit offset operations on the default file view. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-09-18 12:30:34 -05:00
Ralph Castain	ed508010b4	Remove stale tools Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-18 07:30:47 -07:00
Ralph Castain	3c914a7a97	Complete the fix of the ORTE DVM. We will now use "prun" instead of "orterun -hnp foo" to execute jobs. This provides the feature of automatic discovery of the orte-dvm so you don't need to manually enter URI's or contact file locations. All IO is forwarded to prun. Still in the "needs to be done" category: * mapping/ranking/binding options aren't correctly supported * if the DVM encounters some errors (e.g., not enough resources for the job), the resulting error is globally set and impacts any subsequent job submission Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-16 13:13:07 -07:00
Ralph Castain	3f8908871b	Since the DVM is now tied to prun, don't build the DVM either unless prun can be built Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-09-13 11:55:10 -07:00
Brian Barrett	637ebf60f9	atomics: Remove requirement of 64 bit atomics Remove two of the three instances of components requiring 64 bit atomics, even on 32 bit systems. The SM OSC component also uses 64 bit atomics, but is a more complicated fix that will follow this one. Currently, no one is testing on platforms that don't provide 64 bit atomics (even in 32 bit mode), but with the removal of the non-inline assembly for IA32, the older compilers on Absoft's test systems now result in no practical way to call cmpxchg8 in 32 bit mode. At that point, these failures started popping up. Signed-off-by: Brian Barrett <bbarrett@amazon.com>	2017-09-11 19:50:10 -07:00
Nathan Hjelm	7cdda24206	osc/sm: do not require 64-bit atomic math This commit fixes a compile issue on 32-bit systems that do not support 64-bit atomic math. The active target path was using 64-bit atomics exclusively to support PSCW. This commit updates the code to use either 32 or 64-bit atomic math depending on what is available. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-09-11 14:10:38 -10:00
Nathan Hjelm	4bba8774f4	monitoring: fix MPI_T regression The monitoring code causes MPI_T based tools to segfault when monitoring is disabled. This happens because the performance variables remain registered after the common/monitoring component is dlclosed due to a missing variable registration flag. This commit adds the necessary flag to all the registered performance variables. The issue on github is #4162. Close when applied to master. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-09-06 14:24:35 -06:00
bosilca	dc538e9675	Merge pull request #1177 from bosilca/topic/large_msg Topic/large msg	2017-09-05 13:30:19 -04:00
Gilles Gouaillardet	ecb6b81a05	mpi: correctly handle MPI_IN_PLACE by memchecker in neighborhood collectives MPI_IN_PLACE is not a valid send buffer for neighborhood collectives, so do not invoke memchecker in this case. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:21:32 +09:00
Gilles Gouaillardet	66c9485e77	MPI_Isend: memchecker do not mark send buffer as unaccessible after pml isend invokation Today's MPI standard mandates the send buffer remains accessible during the send operation. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:21:32 +09:00
Gilles Gouaillardet	af8242a121	pml/ob1: have memchecker make recv buffer defined again when mca_pml_ob1_recv completes Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:18:05 +09:00
Gilles Gouaillardet	6ee9366243	MPI_Wait: correctly handle MPI_STATUS_IGNORE in MEMCHECKER Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:18:05 +09:00
Aravind Gopalakrishnan	2e83cf15ce	Add support for GPU buffers for PSM2 MTL PSM2 enables support for GPU buffers and CUDA managed memory and it can directly recognize GPU buffers, handle copies between HFIs and GPUs. Therefore, it is not required for OMPI to handle GPU buffers for pt2pt cases. In this patch, we allow the PSM2 MTL to specify when it does not require CUDA convertor support. This allows us to skip CUDA convertor init phases and lets PSM2 handle the memory transfers. This translates to improvements in latency. The patch enables blocking collectives and workloads with GPU contiguous, GPU non-contiguous memory. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-09-01 16:59:03 -07:00
George Bosilca	866899e836	Always abide to the RDMA pipeline limit. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-01 18:52:48 -04:00
George Bosilca	050bd3b6d7	Make the pipeline depth an int instead of a size_t. While they are supposed to be unsigned, casting them to a signed value for all atomic operations is as errorprone as handling them as signed entities. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-01 18:52:48 -04:00
Clement Foyer	9a8fc1b9f1	Simplify the communicator's name caching management Remove useless over-initialization Signed-off-by: Clement Foyer <clement.foyer@inria.fr>	2017-08-29 12:52:47 +02:00
Yossi Itigin	14a93a5992	pml_ucx: fix tag/context_id layout and upper bounds. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2017-08-27 17:15:48 +03:00
Josh Hursey	ad87aa2674	Merge pull request #4121 from jjhursey/explore/dlopen-local mca: Dynamic components link against project lib	2017-08-25 13:15:51 -05:00
Joshua Hursey	49c40f05d4	mpi/java: Remove dlopen() workaround * See discussion on Issue #3705 regarding why this is no longer needed. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:17 -04:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Ralph Castain	e02c39385a	Merge branch 'master' into topic/modex	2017-08-22 20:06:35 -07:00
George Bosilca	50f471e31e	Cleanup a set of warnings reported by Ralph. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-22 23:00:18 -04:00
Ralph Castain	d80b0c7990	If the HWLOC shared memory system is unable to connect, then fallback to providing the topology via XML. Do not automatically provide the XML to every process as that defeats the purpose of the shared memory system. Instead, use PMIx_Query_info_nb to get the info from the server when required. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-22 18:12:26 -07:00
Jeff Squyres	ea5093fc14	mpi/info_delete: fix return code Per MPI-3.1, ensure to raise an MPI exception with value MPI_ERR_INFO_NOKEY if we try to MPI_INFO_DELETE a key that does not exist. Thanks to @dalcinl (Lisando Dalcin) for raising the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-08-22 08:56:40 -07:00
Gilles Gouaillardet	a3e31fa8d0	ompi/communicator: plug a memory leak in ompi_comm_init() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-21 11:47:11 +09:00
Edgar Gabriel	99c7482dd8	Merge pull request #3739 from cniethammer/sharedfp_sm_file_dir Create file for file backed shared memory in process job session dir.	2017-08-15 11:53:30 -05:00
Edgar Gabriel	8fe1c63e25	io/ompio: change the increment for cost based aggr. selection - change the increment used to test various no. of aggregators to avoid using only power of two numbers - convert some paratemers in the cost function from integers to to floats for providing smoother and more consistent results - set the FVIEW_IS_SET flag on the file only if the user has set anything else than the default file view. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	f258036e06	fcoll/two_phase: adjust aggregator selection to new mapby flag on MPI_COMM_WORLD adjust how the aggregator nodes are selected depending on whether processes have been mapped by node or anything else. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	92eff9050c	communicator/comm_init.c: add a new flag indicating binding policy Check for the binding policy used. We are only interested in whether mapby-node has been set right now (could be extended later) and only on MPI_COMM_WORLD, since for all other sub-communicators it is virtually impossible to identify their layout across nodes in the most generic sense. This is used by OMPIO for deciding which ranks to use for aggregators Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	b3f59c76e1	io/ompio: new simple aggr. selection algorithm add a new aggregator selection algorithm based on the performance model described in: Shweta Jha, Edgar Gabriel, 'Performance Models for Communication in Collective I/O Operations' Proceedings of the 17th IEEE/ACM Symposium on Cluster, Cloud and Grid Computing, Workshop on Theoretical Approaches to Performance Evaluation, Modeling and Simulation, 2017. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Jeff Squyres	791bcee6c0	ompi/fortran: remove proof-of-concept mpi_f08 module This module was always intended to be a proof of concept, and was far from complete. If/when someone implemented F08 descriptor support for the mpi_f08 module, this commit can either be restored or used as reference material. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-08-10 06:19:17 -07:00
Gilles Gouaillardet	dfe7b2be3f	fortran/use-mpi-f08-desc: add a missing include file Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-09 13:19:22 +09:00
Gilles Gouaillardet	2c71c27882	fortran2008: fix mpiext example in order to solve an egg and the chicken problem, in which mpiext need mpi-f08-types.mod and/but use-mpi-f08[-desc] needs mpiext, add an extra step - build fortran 2008 modules only - build fortran 2008 mpi extensions - and then build fortran 2008 bindings Fixes open-mpi/ompi#3605 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-09 13:19:22 +09:00
bosilca	9b43de112c	Merge pull request #4014 from bosilca/topic/treematch Topic/treematch	2017-08-08 11:28:22 -04:00
Nathan Hjelm	76320a8ba5	opal: rename opal_atomic_init to opal_atomic_lock_init This function is used to initalize and opal atomic lock. The old name was confusing. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-08-07 14:15:11 -06:00
Joshua Ladd	c27beea3a1	Merge pull request #3962 from karasevb/ucx_detect configure: detect UCX support by default	2017-08-03 16:33:57 -04:00
Nathan Hjelm	29b059e4eb	Merge pull request #3971 from plesn/yield_srun fix srun latency, change default yield_when_idle=0	2017-08-03 07:49:00 -06:00
George Bosilca	3d27e0d3a4	Add support for hwloc 2.0 API. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 02:47:04 -04:00
Guillaume Mercier	569239ec44	Check if topo weighted in case of partially distrib case Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:46 -04:00
George Bosilca	1d7cca75a1	Fix a typo in the copyright. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:10 -04:00
George Bosilca	e4db9e574f	Fix all warnings. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:02 -04:00
George Bosilca	c2927d7e91	Update to the latest version provided by Guillaume. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:48 -04:00
George Bosilca	6c8ea09cc5	Use OPAL random generator. This fix is related to issue #1877, and prevents the OMPI library from messing the user level random values. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:37 -04:00
George Bosilca	5542559130	Cleaning and optimizations. Including variable renaming and loop merging. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:28 -04:00
George Bosilca	bc634dbcb0	Make sure the gather is called in all cases, and not simply based on some local state. This is the second part of the patch proposed for open-mpi/ompi#1183. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:17 -04:00
Brian Barrett	1ec3fd38be	Revert "Topic/treematch"	2017-08-02 14:40:55 -07:00
bosilca	d6048af915	Merge pull request #3960 from bosilca/topic/treematch Update OMPI support for topologies and reordering.	2017-08-02 12:47:23 -04:00
Ralph Castain	f39ce67982	Merge pull request #3951 from rhc54/topic/hwloc2 Update to hwloc 2.0.0a	2017-08-01 15:18:31 -06:00
KAWASHIMA Takahiro	3eac4b0c9a	communicator: Refine `ompi_comm_set` error check The `ompi_comm_set` function never sets `NULL` to its first argument `ncomm`. So `NULL` check is unnecessary in its callers. Furthermore, `NULL` check may obscure a real return code when an error occurs if the variable is initialized to a `NULL` value. Also, `NULL` check is added in the `ompi_comm_set` function to avoid segmentation fault in an out-of-memory condition. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-07-31 20:26:51 +09:00
KAWASHIMA Takahiro	ebc4eb347c	Merge pull request #3701 from kawashima-fj/pr/non-pml-persistent ompi/request: Support non-PML persistent requests	2017-07-31 02:36:17 -05:00
Edgar Gabriel	d93dae326e	Merge pull request #3959 from edgargabriel/topic/performance-fixes Topic/performance fixes	2017-07-27 09:51:57 -05:00

... 4 5 6 7 8 ...

10183 Коммитов