openmpi

Автор	SHA1	Сообщение	Дата
Gilles Gouaillardet	af8242a121	pml/ob1: have memchecker make recv buffer defined again when mca_pml_ob1_recv completes Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:18:05 +09:00
Gilles Gouaillardet	6ee9366243	MPI_Wait: correctly handle MPI_STATUS_IGNORE in MEMCHECKER Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-09-04 11:18:05 +09:00
Aravind Gopalakrishnan	2e83cf15ce	Add support for GPU buffers for PSM2 MTL PSM2 enables support for GPU buffers and CUDA managed memory and it can directly recognize GPU buffers, handle copies between HFIs and GPUs. Therefore, it is not required for OMPI to handle GPU buffers for pt2pt cases. In this patch, we allow the PSM2 MTL to specify when it does not require CUDA convertor support. This allows us to skip CUDA convertor init phases and lets PSM2 handle the memory transfers. This translates to improvements in latency. The patch enables blocking collectives and workloads with GPU contiguous, GPU non-contiguous memory. Signed-off-by: Aravind Gopalakrishnan <Aravind.Gopalakrishnan@intel.com>	2017-09-01 16:59:03 -07:00
George Bosilca	866899e836	Always abide to the RDMA pipeline limit. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-01 18:52:48 -04:00
George Bosilca	050bd3b6d7	Make the pipeline depth an int instead of a size_t. While they are supposed to be unsigned, casting them to a signed value for all atomic operations is as errorprone as handling them as signed entities. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-09-01 18:52:48 -04:00
Clement Foyer	9a8fc1b9f1	Simplify the communicator's name caching management Remove useless over-initialization Signed-off-by: Clement Foyer <clement.foyer@inria.fr>	2017-08-29 12:52:47 +02:00
Yossi Itigin	14a93a5992	pml_ucx: fix tag/context_id layout and upper bounds. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2017-08-27 17:15:48 +03:00
Josh Hursey	ad87aa2674	Merge pull request #4121 from jjhursey/explore/dlopen-local mca: Dynamic components link against project lib	2017-08-25 13:15:51 -05:00
Joshua Hursey	49c40f05d4	mpi/java: Remove dlopen() workaround * See discussion on Issue #3705 regarding why this is no longer needed. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:17 -04:00
Joshua Hursey	e1d079544b	mca: Dynamic components link against project lib * Resolves #3705 * Components should link against the project level library to better support `dlopen` with `RTLD_LOCAL`. * Extend the `mca_FRAMEWORK_COMPONENT_la_LIBADD` in the `Makefile.am` with the appropriate project level library: ``` MCA components in ompi/ $(top_builddir)/ompi/lib@OMPI_LIBMPI_NAME@.la MCA components in orte/ $(top_builddir)/orte/lib@ORTE_LIB_PREFIX@open-rte.la MCA components in opal/ $(top_builddir)/opal/lib@OPAL_LIB_PREFIX@open-pal.la MCA components in oshmem/ $(top_builddir)/oshmem/liboshmem.la" ``` Note: The changes in this commit were automated by the script in the commit that proceeds it with the `libadd_mca_comp_update.py` script. Some components were not included in this change because they are statically built only. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-08-24 11:56:16 -04:00
Ralph Castain	e02c39385a	Merge branch 'master' into topic/modex	2017-08-22 20:06:35 -07:00
George Bosilca	50f471e31e	Cleanup a set of warnings reported by Ralph. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-22 23:00:18 -04:00
Ralph Castain	d80b0c7990	If the HWLOC shared memory system is unable to connect, then fallback to providing the topology via XML. Do not automatically provide the XML to every process as that defeats the purpose of the shared memory system. Instead, use PMIx_Query_info_nb to get the info from the server when required. Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-08-22 18:12:26 -07:00
Jeff Squyres	ea5093fc14	mpi/info_delete: fix return code Per MPI-3.1, ensure to raise an MPI exception with value MPI_ERR_INFO_NOKEY if we try to MPI_INFO_DELETE a key that does not exist. Thanks to @dalcinl (Lisando Dalcin) for raising the issue. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-08-22 08:56:40 -07:00
Gilles Gouaillardet	a3e31fa8d0	ompi/communicator: plug a memory leak in ompi_comm_init() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-21 11:47:11 +09:00
Edgar Gabriel	99c7482dd8	Merge pull request #3739 from cniethammer/sharedfp_sm_file_dir Create file for file backed shared memory in process job session dir.	2017-08-15 11:53:30 -05:00
Edgar Gabriel	8fe1c63e25	io/ompio: change the increment for cost based aggr. selection - change the increment used to test various no. of aggregators to avoid using only power of two numbers - convert some paratemers in the cost function from integers to to floats for providing smoother and more consistent results - set the FVIEW_IS_SET flag on the file only if the user has set anything else than the default file view. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	f258036e06	fcoll/two_phase: adjust aggregator selection to new mapby flag on MPI_COMM_WORLD adjust how the aggregator nodes are selected depending on whether processes have been mapped by node or anything else. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	92eff9050c	communicator/comm_init.c: add a new flag indicating binding policy Check for the binding policy used. We are only interested in whether mapby-node has been set right now (could be extended later) and only on MPI_COMM_WORLD, since for all other sub-communicators it is virtually impossible to identify their layout across nodes in the most generic sense. This is used by OMPIO for deciding which ranks to use for aggregators Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Edgar Gabriel	b3f59c76e1	io/ompio: new simple aggr. selection algorithm add a new aggregator selection algorithm based on the performance model described in: Shweta Jha, Edgar Gabriel, 'Performance Models for Communication in Collective I/O Operations' Proceedings of the 17th IEEE/ACM Symposium on Cluster, Cloud and Grid Computing, Workshop on Theoretical Approaches to Performance Evaluation, Modeling and Simulation, 2017. Signed-off-by: Edgar Gabriel <gabriel@cs.uh.edu>	2017-08-15 09:50:41 -05:00
Jeff Squyres	791bcee6c0	ompi/fortran: remove proof-of-concept mpi_f08 module This module was always intended to be a proof of concept, and was far from complete. If/when someone implemented F08 descriptor support for the mpi_f08 module, this commit can either be restored or used as reference material. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-08-10 06:19:17 -07:00
Gilles Gouaillardet	dfe7b2be3f	fortran/use-mpi-f08-desc: add a missing include file Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-09 13:19:22 +09:00
Gilles Gouaillardet	2c71c27882	fortran2008: fix mpiext example in order to solve an egg and the chicken problem, in which mpiext need mpi-f08-types.mod and/but use-mpi-f08[-desc] needs mpiext, add an extra step - build fortran 2008 modules only - build fortran 2008 mpi extensions - and then build fortran 2008 bindings Fixes open-mpi/ompi#3605 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-08-09 13:19:22 +09:00
bosilca	9b43de112c	Merge pull request #4014 from bosilca/topic/treematch Topic/treematch	2017-08-08 11:28:22 -04:00
Nathan Hjelm	76320a8ba5	opal: rename opal_atomic_init to opal_atomic_lock_init This function is used to initalize and opal atomic lock. The old name was confusing. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-08-07 14:15:11 -06:00
Joshua Ladd	c27beea3a1	Merge pull request #3962 from karasevb/ucx_detect configure: detect UCX support by default	2017-08-03 16:33:57 -04:00
Nathan Hjelm	29b059e4eb	Merge pull request #3971 from plesn/yield_srun fix srun latency, change default yield_when_idle=0	2017-08-03 07:49:00 -06:00
George Bosilca	3d27e0d3a4	Add support for hwloc 2.0 API. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 02:47:04 -04:00
Guillaume Mercier	569239ec44	Check if topo weighted in case of partially distrib case Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:46 -04:00
George Bosilca	1d7cca75a1	Fix a typo in the copyright. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:10 -04:00
George Bosilca	e4db9e574f	Fix all warnings. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:47:02 -04:00
George Bosilca	c2927d7e91	Update to the latest version provided by Guillaume. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:48 -04:00
George Bosilca	6c8ea09cc5	Use OPAL random generator. This fix is related to issue #1877, and prevents the OMPI library from messing the user level random values. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:37 -04:00
George Bosilca	5542559130	Cleaning and optimizations. Including variable renaming and loop merging. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:28 -04:00
George Bosilca	bc634dbcb0	Make sure the gather is called in all cases, and not simply based on some local state. This is the second part of the patch proposed for open-mpi/ompi#1183. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-08-03 00:46:17 -04:00
Brian Barrett	1ec3fd38be	Revert "Topic/treematch"	2017-08-02 14:40:55 -07:00
bosilca	d6048af915	Merge pull request #3960 from bosilca/topic/treematch Update OMPI support for topologies and reordering.	2017-08-02 12:47:23 -04:00
Ralph Castain	f39ce67982	Merge pull request #3951 from rhc54/topic/hwloc2 Update to hwloc 2.0.0a	2017-08-01 15:18:31 -06:00
KAWASHIMA Takahiro	3eac4b0c9a	communicator: Refine `ompi_comm_set` error check The `ompi_comm_set` function never sets `NULL` to its first argument `ncomm`. So `NULL` check is unnecessary in its callers. Furthermore, `NULL` check may obscure a real return code when an error occurs if the variable is initialized to a `NULL` value. Also, `NULL` check is added in the `ompi_comm_set` function to avoid segmentation fault in an out-of-memory condition. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-07-31 20:26:51 +09:00
KAWASHIMA Takahiro	ebc4eb347c	Merge pull request #3701 from kawashima-fj/pr/non-pml-persistent ompi/request: Support non-PML persistent requests	2017-07-31 02:36:17 -05:00
Edgar Gabriel	d93dae326e	Merge pull request #3959 from edgargabriel/topic/performance-fixes Topic/performance fixes	2017-07-27 09:51:57 -05:00
Piotr Lesnicki	3fa7aabf89	fix srun latency, change default yield_when_idle=0 This changes the default to 0, to avoid yields during progress in srun. In mpirun, ompi_mpi_yield_when_idle is set to 1 if oversubscribed otherwise 0. But the default is 1 though, and it is used in srun. Now srun and mpirun have the same latency in non-oversubscribed cases. Signed-off-by: Piotr Lesnicki <piotr.lesnicki@atos.net>	2017-07-27 09:41:48 +02:00
Guillaume Mercier	a66dc811b2	Check if topo weighted in case of partially distrib case	2017-07-26 11:54:24 -04:00
George Bosilca	8a7f0baee0	Fix call to opal_hwloc_base_get_topology. Make sure the HWLOC topology is available as early as possible, so that we can fail graciously. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:24 -04:00
George Bosilca	6061454055	Fix a typo in the copyright. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:24 -04:00
George Bosilca	911850d82e	Fix all warnings. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:24 -04:00
George Bosilca	2c00c4209a	Update to the latest version provided by Guillaume. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:23 -04:00
George Bosilca	fc21ffadc9	Cleaning and optimizations. Including variable renaming and loop merging. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:23 -04:00
George Bosilca	081f9bc8db	Use OPAL random generator. This fix is related to issue #1877, and prevents the OMPI library from messing the user level random values. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:54:23 -04:00
George Bosilca	fbe6c22b90	Make sure the gather is called in all cases, and not simply based on some local state. This is the second part of the patch proposed for open-mpi/ompi#1183. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-07-26 11:52:47 -04:00

... 2 3 4 5 6 ...

9824 Коммитов