openmpi

Автор	SHA1	Сообщение	Дата
Gilles Gouaillardet	68ac95003f	coll/base: fix zero size datatype handling in mca_coll_base_alltoallv_intra_basic_inplace() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-06-20 14:36:35 +09:00
Nathan Hjelm	0c8c7e50d0	Merge pull request #3682 from hjelmn/comm_assertions ompi: add support for new communicator info assertions	2017-06-19 09:49:59 -06:00
Edgar Gabriel	70107b3e52	Merge pull request #3703 from edgargabriel/pr/cart-comm-file-open-fix Pr/cart comm file open fix	2017-06-15 15:03:38 -05:00
Edgar Gabriel	3b0b8fa12c	io/ompio: update cartesian based grouping strategy update the cartesian communicator based grouping strategy to match the other algorithms used in the aggregator selection process. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-06-15 14:05:54 -05:00
Edgar Gabriel	bd6b430798	common/ompio: remove function call to cart_based_grouping the cart_based_grouping aggregator strategy was not correctly updated during the last major rewrite of the aggregator selection algorithm. It is also not supposed to be called from file_open (but from file_set_view). Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-06-15 14:04:03 -05:00
George Bosilca	e9d533e62e	Fix warnings from non-debug mode. Thanks Ralph for the report.	2017-06-13 16:57:42 -04:00
Joshua Hursey	80a91dc244	io/romio314: Add work around support for missing MPI_File ops * Add work around support for the following missing ops in ROMIO 3.1.4 - `MPI_File_iread_at_all` - `MPI_File_iwrite_at_all` - `MPI_File_iread_all` - `MPI_File_iwrite_all` Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-06-09 14:42:59 -05:00
Nathan Hjelm	db2204f2f3	ompi: add support for new communicator info assertions This commit adds code to allow support for the info assertions added by mpi-forum/mpi-issues#11. The assertions added are: mpi_assert_no_any_tag, mpi_assert_no_any_source, mpi_assert_exact_length, and mpi_assert_allow_overtaking. This commit also adds support for the mpi_assert_no_any_source and mpi_assert_allow_overtaking info keys to the ob1 pml. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-06-08 15:52:12 -06:00
KAWASHIMA Takahiro	0cbdbe32f7	ompi/request: Support non-PML persistent requests This commit adds the `req_start` member to the `ompi_request_t` struct. The `MPI_START` and `MPI_STARTALL` routines call this callback function instead of `MCA_PML_CALL(start(...))`. So components that return persistent request must set this member to their request objects. `mca_pml_base_module_t::pml_start` is not deleted because `MCA_PML_CALL(start(...))` is still used elsewhere across OMPI. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-06-02 13:08:17 +09:00
Nathan Hjelm	d10e6455a0	osc/sm: fix SEGV in new info usage This commit moves the info subscribe for the blocking_fence to after the global_state is allocated and moves setting win->w_osc_module to before the info subscribe for alloc_shared_contig. This fixes a SEGV caught by MTT. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-06-01 12:32:30 -06:00
Gilles Gouaillardet	5e9be7667b	Merge pull request #3600 from ggouaillardet/topic/osc_rdma_get_segment osc/rdma: fix osc_rdma_get_remote_segment() length parameter	2017-06-01 13:09:14 +09:00
Nathan Hjelm	e1a997c0cb	Merge pull request #3593 from hjelmn/bug_3575 osc/rdma: fix typo in ompi_osc_rdma_lock_acquire_exclusive	2017-05-31 08:54:40 -06:00
Ralph Castain	ed4078e2dd	Protect against the condition where the port string is actually NULL Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-28 20:51:09 -07:00
Gilles Gouaillardet	e622ca8c1c	osc/rdma: fix osc_rdma_get_remote_segment() length parameter a buffer defined by (buf, count, dt) will have data starting at buf+offset and ending len bytes later with len = opal_datatype_span(&dt.super, count, &offset); Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-05-29 11:08:03 +09:00
Ralph Castain	9f60cd0fe7	Update the connect/accept support so we check to see if we have the proper infrastructure and RTE support, including whether we have ompi-server available if the connect/accept spans multiple applications. Print pretty help messages in all cases where we do not have support Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-27 10:47:08 -07:00
Nathan Hjelm	b83c5dbee5	osc/rdma: fix typo in ompi_osc_rdma_lock_acquire_exclusive Fixes #3575 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-05-26 14:21:08 -06:00
Josh Hursey	4bfb0fcddd	Merge pull request #3577 from markalle/pr/osc_rdma_rangecheck fix for buffer length check (rdma osc w/ odd datatypes)	2017-05-26 10:44:33 -05:00
Nathan Hjelm	7d5cc8ebca	Merge pull request #3572 from ggouaillardet/topic/ompi_osc_rdma_rget_accumulate_internal osc/rdma: fix datatype extent usage in ompi_osc_rdma_rget_accumulate_…	2017-05-26 09:37:51 -06:00
Gilles Gouaillardet	47ebfaa60d	Merge pull request #3451 from mkurnosov/reduce-allreduce-rebenseifner coll: Add Rabenseifner's algorithm for Reduce and Allreduce	2017-05-26 21:00:30 +09:00
Mikhail Kurnosov	f6e2d4ab04	coll: Add Rabenseifner's algorithm for Reduce and Allreduce A component with implementation of R. Rabenseifner's algorithm for Reduce and Allreduce. This algorithm is a combination of a reduce-scatter implemented with recursive vector halving and recursive distance doubling, followed either by a gather or an allgather. Current limitations: -- count >= 2^{\floor{\log_2 p}} -- commutative operations only -- intra-communicators onl Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com> coll/spacc: Modify implementation to use `ompi_coll_base_sendrecv()` Replace irecv() + isend() + ompi_request_wait() to ompi_coll_base_sendrecv(). Signed-off-by: Mikhail Kurnosov <mkurnosov@gmail.com>	2017-05-26 14:33:35 +07:00
Gilles Gouaillardet	0f79259b94	osc/rdma: use extent of the appropriate datatype in ompi_osc_rdma_rget_accumulate_internal() origin_datatype and target_datatype might be different and hence have different extent, so use either origin_extent or target_extent when appropriate. Refs open-mpi/ompi#3569 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-05-26 13:59:38 +09:00
Geoff Paulsen	93078ad824	Merge pull request #3551 from markalle/1sided_some_single 1sided with some hosts single rank -- Fixes #3548	2017-05-25 13:59:48 -05:00
Mark Allen	df14cbf039	fix for buffer length check (rdma osc w/ odd datatypes) The osc_rdma_get_remote_segment() has the 3rd and 4th args as * target_disp * length which it uses to determine if the rdma falls within the bounds of the window or not (actually it only checks the upper bound, but I'm okay with that). Anyway the caller previously was passing in the length argument as target_datatype->super.size * target_count which which doesn't really represent the number of bytes after target_disp for which data exists. In particular I could create a datatype as { disp -4, len 4 } and use target_disp 4 and that would be bytes 0-3 of the window where the original code would think it was bytes 4-7 and could abort at the range check. Ive changed it to use the opal_datatype_span() function. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-05-24 19:10:39 -04:00
Mark Allen	36f51bca26	yalla with irregular contig datatype -- Fixes 3566 Yalla has a macro PML_YALLA_INIT_MXM_REQ_DATA that checks if a datatype is contiguous via opal_datatype_is_contiguous_memory_layout(dt,count) and if so it selects a size and lb that presumably is what will rdma, as ompi_datatype_type_size(_dtype, &size); \ ompi_datatype_type_lb(_dtype, &lb); \ This failed when I gave it a datatype constructed as [ ...] with extent 4. What I mean by that datatype is lens[0] = 3; disps[0] = 1; types[0] = MPI_CHAR; MPI_Type_struct(1, lens, disps, types, &tmpdt); MPI_Type_create_resized(tmpdt, 0, 4, &mydt); So there are 3 chars at offset 1, and the LB is 0 and the UB is 4. So that macro decides that size=4 and lb=0 and later I suppose size is getting updated to 3 for the final rdma, and so a send of a buffer [ 0 1 2 3 ] gets recved as [ 0 1 2 _ ]. I think it should use the true lb and the true extent. For "regular" contig datatypes it would be the same, and for the irregular ones that are still deemed contiguous by that utility function it should still be the right thing to use. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-05-23 20:56:12 -04:00
Mark Allen	c9f31a8d39	fix for 1sided with some hosts single rank See bug report https://github.com/open-mpi/ompi/issues/3548 If a 1sided test is launched -host hostA:2,hostB:1 some of the ranks call allocate_state_single() and others call allocate_state_shared(). These functions were producing different values for module->state_size but that's used when they lookup peer info from each other in ompi_osc_rdma_peer_setup() so they need to all have matching module->state_offset values. This change adds a few unused bytes in the memory allocate_state_single() creates so it matches. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-05-22 15:10:49 -04:00
Geoff Paulsen	50f9287c03	Merge pull request #2941 from markalle/pr/mpi-info-update2 Finally Merging this in. MPI_*_get_info/set_info(). Targeting v3.1 release. @hjelmn were you interested in switching some internal pieces to begin using this? Should we target v3.1 (or whatever we call the Oct 15th release?)	2017-05-22 09:22:04 -05:00
Mark Allen	482d84b6e5	fixes for Dave's get/set info code The expected sequence of events for processing info during object creation is that if there's an incoming info arg, it is opal_info_dup()ed into the obj at obj->s_info first. Then interested components register callbacks for keys they want to know about using opal_infosubscribe_infosubscribe(). Inside info_subscribe_subscribe() the specified callback() is called with whatever matching k/v is in the object's info, or with the default. The return string from the callback goes into the new k/v stored in info, and the input k/v is saved as __IN_<key>/<val>. It's saved the same way whether the input came from info or whether it was a default. A null return from the callback indicates an ignored key/val, and no k/v is stored for it, but an __IN_<key>/<val> is still kept so we still have access to the original. At MPI__set_info() time, opal_infosubscribe_change_info() is used. That function calls the registered callbacks for each item in the provided info. If the callback returns non-null, the info is updated with that k/v, or if the callback returns null, that key is deleted from info. An __IN_<key>/<val> is saved either way, and overwrites any previously saved value. When MPI__get_info() is called, opal_info_dup_mpistandard() is used, which allows relatively easy changes in interpretation of the standard, by looking at both the <key>/<val> and __IN_<key>/<val> in info. Right now it does 1. includes system extras, eg k/v defaults not expliclty set by the user 2. omits ignored keys 3. shows input values, not callback modifications, eg not the internal values Currently the callbacks are doing things like return some_condition ? "true" : "false" that is, returning static strings that are not to be freed. If the return strings start becoming more dynamic in the future I don't see how unallocated strings could support that, so I'd propose a change for the future that the callback()s registered with info_subscribe_subscribe() do a strdup on their return, and we change the callers of callback() to free the strings it returns (there are only two callers). Rough outline of the smaller changes spread over the less central files: comm.c initialize comm->super.s_info to NULL copy into comm->super.s_info in comm creation calls that provide info OBJ_RELEASE comm->super.s_info at free time comm_init.c initialize comm->super.s_info to NULL file.c copy into file->super.s_info if file creation provides info OBJ_RELEASE file->super.s_info at free time win.c copy into win->super.s_info if win creation provides info OBJ_RELEASE win->super.s_info at free time comm_get_info.c file_get_info.c win_get_info.c change_info() if there's no info attached (shouldn't happen if callbacks are registered) copy the info for the user The other category of change is generally addressing compiler warnings where ompi_info_t and opal_info_t were being used a little too interchangably. An ompi_info_t* contains an opal_info_t*, at &(ompi_info->super) Also this commit updates the copyrights. Signed-off-by: Mark Allen <markalle@us.ibm.com>	2017-05-17 01:12:49 -04:00
Todd Kordenbrock	27ee862964	mtl-portals4: in rendezvous, reissue PtlGet() if it fails This commit fixes a race condition in the rendezvous protocol. The race occurs because the sender does not wait for the link event on the send buffer. Even though this has not been seen in the wild, it is possible for the receiver to issue the PtlGet() before the ME is linked which causes a NAK at the receiver. This commit resolves this race by reissuing the PtlGet() when a NAK occurs. Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com>	2017-05-15 13:11:13 -05:00
David Solt	50aa143ab6	Major structural changes to data types: .super infosubscriber ompi_communicator_t, ompi_win_t, ompi_file_t all have a super class of type opal_infosubscriber_t instead of a base/super type of opal_object_t (in previous code comm used c_base, but file used super). It may be a bit bold to say that being a subscriber of MPI_Info is the foundational piece that ties these three things together, but if you object, then I would prefer to turn infosubscriber into a more general name that encompasses other common features rather than create a different super class. The key here is that we want to be able to pass comm, win and file objects as if they were opal_infosubscriber_t, so that one routine can heandle all 3 types of objects being passed to it. MPI_INFO_NULL is still an ompi_predefined_info_t type since an MPI_Info is part of ompi but the internal details of the underlying information concept is part of opal. An ompi_info_t type still exists for exposure to the user, but it is simply a wrapper for the opal object. Routines such as ompi_info_dup, etc have all been moved to opal_info_dup and related to the opal directory. Fortran to C translation tables are only used for MPI_Info that is exposed to the application and are therefore part of the ompi_info_t and not the opal_info_t The data structure changes are primarily in the following files: communicator/communicator.h ompi/info/info.h ompi/win/win.h ompi/file/file.h The following new files were created: opal/util/info.h opal/util/info.c opal/util/info_subscriber.h opal/util/info_subscriber.c This infosubscriber concept is that communicators, files and windows can have subscribers that subscribe to any changes in the info associated with the comm/file/window. When xxx_set_info is called, the new info is presented to each subscriber who can modify the info in any way they want. The new value is presented to the next subscriber and so on until all subscribers have had a chance to modify the value. Therefore, the order of subscribers can make a difference but we hope that there is generally only one subscriber that cares or modifies any given key/value pair. The final info is then stored and returned by a call to xxx_get_info. The new model can be seen in the following files: ompi/mpi/c/comm_get_info.c ompi/mpi/c/comm_set_info.c ompi/mpi/c/file_get_info.c ompi/mpi/c/file_set_info.c ompi/mpi/c/win_get_info.c ompi/mpi/c/win_set_info.c The current subscribers where changed as follows: mca/io/ompio/io_ompio_file_open.c mca/io/ompio/io_ompio_module.c mca/osc/rmda/osc_rdma_component.c (This one actually subscribes to "no_locks") mca/osc/sm/osc_sm_component.c (This one actually subscribes to "blocking_fence" and "alloc_shared_contig") Signed-off-by: Mark Allen <markalle@us.ibm.com> Conflicts: AUTHORS ompi/communicator/comm.c ompi/debuggers/ompi_mpihandles_dll.c ompi/file/file.c ompi/file/file.h ompi/info/info.c ompi/mca/io/ompio/io_ompio.h ompi/mca/io/ompio/io_ompio_file_open.c ompi/mca/io/ompio/io_ompio_file_set_view.c ompi/mca/osc/pt2pt/osc_pt2pt.h ompi/mca/sharedfp/addproc/sharedfp_addproc.h ompi/mca/sharedfp/addproc/sharedfp_addproc_file_open.c ompi/mca/topo/treematch/topo_treematch_dist_graph_create.c ompi/mpi/c/lookup_name.c ompi/mpi/c/publish_name.c ompi/mpi/c/unpublish_name.c opal/mca/mpool/base/mpool_base_alloc.c opal/util/Makefile.am	2017-05-12 14:41:05 -04:00
Matias A Cabral	644641d06f	PSM and PSM2 MTLs check on the max message size allowed by API. OMPI send and receive mesages use size_t for the lenght while PSM and PSM2 psm(2)mq_send/receive use uint32_t. Type size_t is 64 bits in 64 bits arch. Therefore, this patch adds a sanity check on the lenght of the message and fails gracefully. Signed-off-by: Matias Cabral <matias.a.cabral@intel.com>	2017-05-10 12:45:11 -07:00
Gilles Gouaillardet	a66909b8b4	Merge pull request #3488 from ggouaillardet/topic/romio314_ad_nfs romio314: ad_nfs fixes for large files from upstream mpich	2017-05-09 16:58:02 +09:00
Gilles Gouaillardet	26f44da429	coll/base: fix mca_coll_base_alltoallv_intra_basic_inplace() correctly handle the case when a MPI task has no data to send/recv Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-05-09 15:19:14 +09:00
Gilles Gouaillardet	eaf050cfe1	romio314: adio/ad_nfs: fix buffer overflows in ADIOI_NFS_{Read,Write}Strided Refs: models/mpich#2338 Refs: models/mpich#2617 Signed-off-by: Rob Latham <robl@mcs.anl.gov> (back-ported from upstream commit pmodels/mpich@642db57648) Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-05-09 11:11:12 +09:00
Gilles Gouaillardet	02af10ce6e	romio314: update NFS read/write routines for large xfers When we updated UFS and others we left NFS alone. HDF group would like a fix, so here we go. Signed-off-by: Ken Raffenetti <raffenet@mcs.anl.gov> (back-ported from upstream commit pmodels/mpich@684df9f4c9) Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-05-09 11:07:47 +09:00
Jeff Squyres	7185567d50	Merge pull request #3455 from jsquyres/pr/fix-lustre-configure Lustre configure fixes	2017-05-08 16:49:23 -04:00
Ralph Castain	ef0e0171c9	Implement the changes required to support cross-library coordination. Update PMIx to support intra-process notifications and ensure that we always notify ourselves for events. Add a new ompi/interlib directory where cross-lib coordination code can go, and put the code to declare ourselves there (called from ompi_mpi_init.c). Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-05-08 10:04:50 -07:00
Jeff Squyres	c81bc50198	fs/lustre: remove redundant/dead code We check for liblustreapi.h in OMPI_CHECK_LUSTRE, so this code was commented out here. Might as well fully delete it, since it's redundant and dead. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-05-05 05:28:33 -07:00
Yossi	f56847542e	Merge pull request #3347 from alinask/topic/ucx-sync-send PML UCX: handle a synchronous send.	2017-04-26 18:02:09 +03:00
Alina Sklarevich	49913c692a	PML UCX: unite the code for all the sending modes. Signed-off-by: Alina Sklarevich <alinas@mellanox.com>	2017-04-26 13:17:06 +03:00
Howard Pritchard	462342d148	Merge pull request #3311 from hppritcha/topic/libfabric_moves_to_ofi common/libfabric: move libfabric to ofi	2017-04-21 07:50:38 -06:00
Howard Pritchard	841192645b	common/libfabric: move libfabric to ofi This PR renames the common library for OFI libfabric from libfabric to ofi. There are a number of reasons this is good to do: 1) its shorter and replaces 9 characters with three for function names for what may eventually be a fairly extensive interface 2) OFI is the term used for MTL and RML components that use the OFI libfabric interface 3) A planned OSC component will also use the OFI term. 4) Other HPC libraries that can use OFI libfabric tend to use the term "ofi" internally and also in their configure options relevant to OFI libfabric (i.e. MPICH/CH4, Intel MPI, Sandia SHMEM) There seem to be comments in places in the Open MPI source code that indicate that this common library will be going away. Far from it as we will want to be able to share things like AV objects between OMPI and possibly OSHMEM components that use the OFI libfabric interface. This PR also adds a synonym to the --with-libfabric(-libdir) configury options: --with-ofi and with-ofi-libdir. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-04-20 13:07:16 -06:00
Gilles Gouaillardet	ded63c5e0c	ompi: use ompi_coll_base_sendrecv_actual() whenever possible Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-20 10:01:28 +09:00
Gilles Gouaillardet	52551d96c1	Merge pull request #3285 from ggouaillardet/topic/coll_zerobyte_messages coll/base: always send/recv zero-byte messages	2017-04-20 09:22:47 +09:00
Gilles Gouaillardet	fa5cd0dbe5	use ptrdiff_t instead of OPAL_PTRDIFF_TYPE since Open MPI now requires a C99, and ptrdiff_t type is part of C99, there is no more need for the abstract OPAL_PTRDIFF_TYPE type. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-19 13:41:56 +09:00
Yossi	9ebcafd6d6	Merge pull request #3260 from derbeyn/fix_yalla Fix yalla PML: MPI_Recv does not return MPI_ERR_TRUNCATE upon overflow	2017-04-18 11:37:48 +03:00
Alina Sklarevich	d93b67257b	PML UCX: handle a synchronous send. MCA_PML_BASE_SEND_SYNCHRONOUS Signed-off-by: Alina Sklarevich <alinas@mellanox.com>	2017-04-13 18:11:55 +03:00
Alina Sklarevich	eec310c99c	PML/UCX/YALLA: Fix the message release call. Set message to MPI_MESSAGE_NULL. Signed-off-by: Alina Sklarevich <alinas@mellanox.com>	2017-04-13 14:41:13 +03:00
Nathan Hjelm	12b52b2b2c	osc/pt2pt: fix infinite frag allocation loop Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-04-10 16:30:47 -06:00
Howard Pritchard	f5942ff23c	Merge pull request #3304 from hppritcha/topic/de-ortization-of-ompi de-ORTEfy the ompi tree	2017-04-07 14:14:41 -06:00
Noah Evans	ef29fb13cb	de-ORTEfy the ompi tree The ompi tree should be runtime independent, but over time a few ORTE depedent definitions and functions have escaped into the ompi tree. I'm working on my own runtime so I've used this as an opportunity to get rid of ORTE dependencies in the ompi/ tree. I still need to go back and change orte to conform to the new world and these changes are untested, but I can now compile (but not link) without orte so I'm commiting this changeset. Signed-off-by: Noah Evans <noah.evans@gmail.com>	2017-04-07 12:35:58 -06:00
Nadia Derbey	f918d88c3e	Fix yalla PML: Update previous commit after Yossofe's review Signed-off-by: Nadia Derbey <Nadia.Derbey@atos.net>	2017-04-06 07:58:26 +02:00
Gilles Gouaillardet	f3581c8259	coll/base: have alltoallv send/recv zero-bytes messages Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-05 13:44:17 +09:00
Gilles Gouaillardet	5492edd71e	coll/base: have ompi_coll_base_sendrecv() send/recv zero-bytes messages Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-05 13:44:05 +09:00
Nathan Hjelm	1322e5dee8	Merge pull request #3274 from hjelmn/osc_rdma_fix osc/rdma: fix typo in atomic code	2017-04-04 00:20:42 -06:00
Gilles Gouaillardet	5dfd4ab6ca	coll/tuned: remove set-but-not-used variables Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-04-04 13:18:11 +09:00
Nathan Hjelm	fad0803920	osc/rdma: fix typo in atomic code Fixes #3267 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-04-03 15:54:28 -06:00
Nadia Derbey	b6de94e449	Fix yalla PML: MPI_Recv does not return MPI_ERR_TRUNCATE upon overflow Signed-off-by: Nadia Derbey <Nadia.Derbey@atos.net>	2017-03-30 15:18:31 +02:00
Xin Zhao	ee952fcccd	Passing estimated_num_procs to UCX init in PML and SPML. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2017-03-27 20:36:52 +03:00
Nathan Hjelm	c72fb30eb5	osc/pt2pt: fix typo Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2017-03-23 09:00:21 -06:00
Xin Zhao	6a99c60fbd	Add multithreading support in PML UCX framework. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2017-03-20 19:55:00 +02:00
Jeff Squyres	760db0d5ce	osc/pt2pt: fix compiler warning Remove unused variable. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-03-16 05:46:11 -07:00
Jeff Squyres	1947280865	topo/treematch: squash some compiler warnings Only define MIN/MAX if they are not already defined. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-03-16 05:44:26 -07:00
Nathan Hjelm	37214eda09	Merge pull request #3164 from hjelmn/ob1_pinned pml/ob1: do not cache leave_pinned	2017-03-14 13:22:18 -06:00
Nathan Hjelm	3e7ef48c13	pml/ob1: do not cache leave_pinned This commit fixes a bug that disabled both the RDMA pipeline and RDMA protocols in ob1. ob1 was internally caching the values of opal_leave_pinned and opal_leave_pinned_pipeline at init time. This is no longer valid as opal_leave_pinned may be set by any call to a btl's add_procs. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-03-14 09:00:40 -06:00
Valentin Petrov	fe069c9570	Fixes the coll_allgather usage bug One should use the correct module object when calling c_coll.coll_allgather. Otherwise there will be a segfault in the case, for example, when hcoll is used. In that case c_coll.coll_allgather = mca_coll_hcoll_allgather while c_coll.coll_gather_module = tuned. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2017-03-14 09:47:39 +02:00
Alex Mikheev	c081239f88	ompi: pml ucx: fix persistant request init CR changes Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2017-03-08 13:26:29 +02:00
Alex Mikheev	c113c37a7a	ompi: pml ucx: fix persistant request initialization Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2017-03-08 10:59:41 +02:00
Nathan Hjelm	0195d15401	osc/pt2pt: flush pending fragments on lock ack This commit addresses an issue that can occur in cases where a lot of fragments are outstanding. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-03-06 13:58:46 -07:00
Edgar Gabriel	607dc2c039	Merge pull request #3103 from edgargabriel/pr/sharedfp-name-collision-fix sharedfp/lockedfile and sm: fix the namecollision	2017-03-05 14:46:20 -06:00
Edgar Gabriel	2d462b3b80	sharedfp/lockedfile and sm: fix name collision this fixes the issue reported by Nicolas Joly on the mailing: the sharedfp/lockedfile component does not support right now a scenario where multiple jobs read from the same input file, due to a collision of the filenames utilized for the sharedfp handle. Although not part of the oroginal report, the same occurs for the sharedfp/sm component. Add therefore the jobid to be part of the lockedfilename/sm file name. use the OMPI_CAST_RTE_NAME macro to determine jobid Fixes: #3098 Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-03-05 11:28:28 -06:00
Artem Polyakov	9448814c40	ompi/pml/ucx: Fix uninitialized UCX request field. Signed-off-by: Artem Polyakov <artpol84@gmail.com>	2017-03-05 03:06:30 +07:00
Edgar Gabriel	d1fed77781	Merge pull request #3094 from edgargabriel/pr/master-lustre-priority io/ompio: adjust the priority of the OMPIO component on lustre	2017-03-03 09:29:14 -06:00
KAWASHIMA Takahiro	39294caf04	Merge pull request #3086 from kawashima-fj/pr/coll-base-defs coll: Update `ompi/mca/coll/base/coll_base_functions.h`	2017-03-03 18:53:00 +09:00
Edgar Gabriel	9e19834327	io/ompio: adjust the priority of the OMPIO component on lustre this commit brings over the behavior from the 2.x series to master, mostly with the fork for the 3.x series in mind. Also, use strncasecmp instead of two strncmps Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-03-02 12:10:11 -06:00
KAWASHIMA Takahiro	c4ca5e703d	coll: Update `ompi/mca/coll/base/coll_base_functions.h` - Support MPI-2.2 and MPI-3.0 COLL features. * `MPI_REDUCE_SCATTER_BLOCK` * neighborhood collective communication * nonblocking collective communication - Add `_BASE_ARGS` and `_BASE_ARG_NAMES` for convenience. - Use parameter names used in the MPI Standard. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-03-02 17:58:02 +09:00
KAWASHIMA Takahiro	96aa0d90c1	pml/bfo: Correct a function name and header filenames These lines were incorrectly modified in `90f2940`. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2017-03-02 16:02:53 +09:00
Alex Mikheev	152f77df59	ompi: pml ucx: fix datatype packing error in bsend Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2017-03-01 16:18:19 +02:00
Yossi Itigin	33471c44ee	pml_yalla/mtl_mxm/hcoll: open memory component to activate memory hooks. Memory hooks are now set-up on demand. pml/yalla, mtl/mxm and coll/hcoll need the memory hooks, so make sure those are installed. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2017-03-01 12:12:20 +02:00
Jeff Squyres	d5266aba90	Merge pull request #2955 from jsquyres/pr/hwloc-external-fixes Fix --with-hwloc=external	2017-02-28 14:57:07 -05:00
Josh Hursey	0006f0d7c5	Merge pull request #2773 from jjhursey/topic/hook-fwk Add a 'hook' framework	2017-02-28 12:29:50 -06:00
Jeff Squyres	fec519a793	hwloc: rename opal/mca/hwloc/hwloc.h -> hwloc-internal.h Per a prior commit, the presence of "hwloc.h" can cause ambiguity when using --with-hwloc=external (i.e., whether to include opal/mca/hwloc/hwloc.h or whether to include the system-installed hwloc.h). This commit: 1. Renames opal/mca/hwloc/hwloc.h to hwloc-internal.h. 2. Adds opal/mca/hwloc/autogen.options to tell autogen.pl to expect to find hwloc-internal.h (instead of hwloc.h) in opal/mca/hwloc. 3. s@opal/mca/hwloc/hwloc.h@opal/mca/hwloc/hwloc-internal.h@g in the rest of the code base. Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-02-28 07:48:42 -08:00
Jeff Squyres	0cd3b6c235	treematch: do not include <hwloc.h> Instead, include "opal/mca/hwloc/hwloc.h" Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2017-02-28 07:45:23 -08:00
Josh Hursey	b1c4e50500	Merge pull request #2934 from jjhursey/topic/coll-comm-restructure Move coll structure outside of the communicator	2017-02-28 08:45:18 -06:00
Nathan Hjelm	032bcf915a	osc/rdma: fix compile warning Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-02-27 16:26:00 -07:00
George Bosilca	366d64b7e5	Move the collective structure outside the communicator. As we changed the ABI (forcing a major release), we can limit the size of the predefined communicators by moving the collective structure outside the communicator. This might have a minimal, but unnoticeable, impact on performance. This approach has been discussed during the January 2017 devel meeting. Signed-off-by: George Bosilca <bosilca@icl.utk.edu> Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-27 11:54:17 -06:00
Joshua Hursey	c10bbfded6	ompi/hook: Add the hook/license framework * Include a 'demo' component that shows some of the features. * Currently has hooks for: - MPI_Initialized - top, bottom - MPI_Init_thread - top, bottom - MPI_Finalized - top, bottom - MPI_Init - top (pre-opal_init), top (post-opal_init), error, bottom - MPI_Finalize - top, bottom * Other places in ompi can 'register' to hook into any one of these places by passing back a component structure filled with function pointers. * Add a `MCA_BASE_COMPONENT_FLAG_REQUIRED` flag to the MCA structure that is checked by the `hook` framework. If a required, static component has been excluded then the `hook` framework will fail to initialize. - See note in `opal/mca/mca.h` as to why this is checked in the `hook` framework and not in `opal/mca/base/mca_base_component_find.c` Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-27 12:05:53 -05:00
Nathan Hjelm	581bff9871	Merge pull request #3034 from hjelmn/osc_rdma_atomic osc/rdma: make locking code more robust	2017-02-27 08:46:52 -07:00
Nathan Hjelm	4707c7c5e0	osc/rdma: make locking code more robust Under heavy load the locking code could fail if the underlying btl module started to return OPAL_ERR_OUT_OF_RESOURCE on atomic operations. This commit updates the code to gracefully handle btl errors. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-02-27 00:01:26 -07:00
Gilles Gouaillardet	af0b5cffb4	asm: rename the AMD64 into X86_64 in this context, AMD64 really means amd64 or em64t, so let's rename this into X86_64 in order to avoid any confusion Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-27 15:10:50 +09:00
Yossi	fb67c966a8	Merge pull request #2944 from alex-mikheev/topic/pml_ucx_bsend ompi: pml ucx: add support for the buffered send	2017-02-22 12:21:03 +02:00
Alex Mikheev	b015c8bb48	ompi: pml ucx: add support for the buffered send Signed-off-by: Alex Mikheev <alexm@mellanox.com>	2017-02-21 17:19:22 +02:00
Gilles Gouaillardet	4184c01be5	Merge pull request #2393 from bosilca/topic/no_predefined_ddt_refcount Don't refcount the predefined datatypes.	2017-02-21 09:38:11 +09:00
Todd Kordenbrock	048f757d9f	osc-portals4: add support for noncontiguous datatypes This commit implements onesided operations for noncontiguous datatypes using two different algorithms. * If the result and/or origin datatype is noncontiguous and the target datatype is contiguous, then an iovec MD is created for the result and origin. The operation is performed using a single Portals4 call (unless it exceeds the max message size). * If the target datatype is noncontigous, then an algorithm similar to the one in osc-rdma is used to loop over the contiguous blocks of each datatype. The operation is performed using multiple Portals4 calls. This commit ensures that individual operations do not exceed the max atomic size or the max message size supported by the device. Signed-off-by: Todd Kordenbrock <thkgcode@gmail.com>	2017-02-15 16:17:13 -06:00
Gilles Gouaillardet	cd4537193c	osc/sm: fix MPI_Win_allocate_shared() alignment add padding so the memory allocated by MPI_Win_allocate_shared() is 64 bytes aligned. Thanks Joseph Schuchart for the bug report Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-15 13:40:48 +09:00
Josh Hursey	0b273c2561	Merge pull request #2808 from jjhursey/fix/ibm/reduce-local-to-coll coll: Move reduce_local into the coll framework	2017-02-14 15:54:15 -06:00
Nathan Hjelm	cc4a0fabcf	Merge pull request #2727 from hjelmn/osc_rdma osc/rdma: fix typo in check for MPI_MODE_NOCHECK	2017-02-14 10:50:33 -07:00
Joshua Hursey	78006f93a4	coll: Move reduce_local into the coll framework * Since we are adding a new function to `mca_coll_base_module_2_1_0_t` we need to increase the version of the module structure to `2_2_0`. * Add a comment just above the PREDEFINED_COMMUNICATOR_PAD describing it's purpose and when it should change. To help future developers trying to answer the question noted in the comment. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-14 08:56:07 -06:00
Gilles Gouaillardet	e70a30cca4	coll/libnbc: optimize zero size ialltoall{v,w} with MPI_IN_PLACE and incidentally avoids malloc(0) Thanks Lisandro Dalcin for the report Fixes open-mpi/ompi#2945 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Gilles Gouaillardet	12949547f4	coll/libnbc: fix a2aw_sched_linear() with zero size datatype or zero count Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-13 15:21:28 +09:00
Joshua Hursey	383330a50d	coll/basic: Expand check for negative input values * Negative values are parameter errors for neighborhood collectives - Add checks to the mpi/c interface `MPI_PARAM_CHECK` * Fix a success check for neighbor_alltoallw with dist_graph Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-02-08 14:26:32 -06:00
Geoff Paulsen	4917e44a7d	Merge pull request #2832 from jjhursey/topic/ibm/osc-base-dt-abort osc/base: Detect unsupported data types and abort	2017-02-05 04:26:04 -06:00
Howard Pritchard	f4ad119693	Merge pull request #2914 from hppritcha/topic/nbc_compiler_warning swat some compiler warnings	2017-02-04 11:56:52 -05:00
Howard Pritchard	acaecb2448	swat some compiler warnings Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2017-02-03 08:28:15 -07:00
Gilles Gouaillardet	e879d2910a	coll/tuned: make coll_tuned_gather_algorithms MCA settable Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-02 11:00:38 +09:00
Nathan Hjelm	362ac8b87e	osc/pt2pt: fix threading issues This commit fixes a number of threading issues discovered in osc/pt2pt. This includes: - Lock the synchronization object not the module in osc_pt2pt_start. This fixes a race between the start function and processing post messages. - Always lock before calling cond_broadcast. Fixes a race between the waiting thread and signaling thread. - Make all atomically updated values volatile. - Make the module lock recursive to protect against some deadlock conditions. Will roll this back once the locks have been re-designed. - Mark incoming complete after completing an accumulate not before. This was causing an incorrect answer under certain conditions. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-02-01 10:33:01 -07:00
Gilles Gouaillardet	02558134ef	coll/base: remove unused local variable Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-01 11:54:17 +09:00
Gilles Gouaillardet	ad44ecb2ba	pml/base: initialize global variables Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-02-01 11:49:47 +09:00
bosilca	c331e6794c	Allow all tuned MCA parameters to be modified programatically. (#2829 ) Fix a comment in the MCA header. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-01-31 21:47:36 -05:00
Josh Hursey	5fcd69da52	Merge pull request #2831 from jjhursey/topic/ibm/pml-bsend pml/base: Expose some bsend varaibles so PMLs may reference them	2017-01-31 10:31:42 -06:00
Gilles Gouaillardet	9bcadbd51b	coll/libnbc: fix the red_schain algo of ireduce with MPI_IN_PLACE this fixes a regression introduced in open-mpi/ompi@045d0c5f4c Fixes open-mpi/ompi#2879 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-30 14:19:45 +09:00
Yossi Itigin	13c3bf0dd7	yalla: fix memory leak with blocking non-contig send. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2017-01-29 18:51:43 +02:00
Josh Hursey	0408c116eb	Merge pull request #2805 from jjhursey/fix/ibm/base-allgatherv coll/base: Allgatherv MPI_IN_PLACE Bug	2017-01-26 14:21:57 -06:00
Geoffrey Paulsen	d2527cff46	Fixing comment only in MPI_IN_PLACE case for ireduce in libnbc. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-26 10:58:51 -08:00
Geoffrey Paulsen	045d0c5f4c	Fix for Ireduce + MPI_IN_PLACE. Fixes a wrong answer from MPI_Ireduce when the red_sched_chain() path was taken (which only happens for np<=4 and mesgsize>=64k). The way libnbc treats MPI_IN_PLACE is to set sbuf == rbuf, and whether an algorithm will work cleanly or not after that depends on the details. In this case the last steps of the algorithm amounted to (right neighbor is sending us reduction results from ranks 1..n-1) recv into rbuf from right neighbor add the contribution from our sbuf into rbuf this would be fine in general, but if sbuf==rbuf, that recv overwrites the sbuf. I changed it to recv into a tmpbuf if MPI_IN_PLACE was used. Signed-off-by: Geoffrey Paulsen <gpaulsen@us.ibm.com>	2017-01-25 18:08:08 -08:00
Nysal Jan K.A	94f92f6b49	osc/base: Detect unsupported data types and abort Using MPI_MINLOC or MPI_MAXLOC with the following data types leads to data corruption: * MPI_DOUBLE_INT * MPI_LONG_INT * MPI_SHORT_INT * MPI_LONG_DOUBLE_INT Detect this print a error message and abort. This workaround should be removed once the following issue is resolved: * https://github.com/open-mpi/ompi/issues/1666 Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-01-25 15:28:28 -06:00
Sameh S. Sharkawi	320ab3b84f	pml/base: Expose some bsend varaibles so PMLs may reference them Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-01-25 15:21:53 -06:00
Mark Allen	a3452adfa9	coll/base: Allgatherv MPI_IN_PLACE Bug MPI_Allgatherv with MPI_IN_PLACE reads data from wrong location. They were locating the MPI_IN_PLACE send buffer as ```c send_buf = (char)rbuf; for (i = 0; i < rank; ++i) { send_buf += ((ptrdiff_t)rcounts[i] extent); } ``` when it should be ```c send_buf = (char)rbuf; send_buf += ((ptrdiff_t)disps[rank] extent); ``` because disps[] specifies where things are in the v-style buffers. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2017-01-24 15:52:36 -06:00
Edgar Gabriel	cbb3cb9745	fs/ufs: avoid using the exclusive flag with shared file pointer when a file is opened a second time for shared file pointer operations, avoid setting the create and exclusive flag. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-01-24 12:11:29 -06:00
Edgar Gabriel	f5289a1803	common/ompio: store correctly the SHAREDFP_IS_SET flag it looks like disabling the lazy_open flag for sharedfp components revealead a bug that lead to a crash in file_close in some tests. Make sure the SHAREDFP_IS_SET flag is correctly set (and not overwritten again), and we use that to avoid a double-free of the communicator. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-01-24 12:09:56 -06:00
Gilles Gouaillardet	501eb8dc7e	ompio: plug misc memory leaks Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-24 09:13:19 +09:00
Gilles Gouaillardet	d0629f18c2	coll/libnbc: optimize size one communicators simply "return" with ompi_request_empty if the communicator size is 1 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-24 09:12:47 +09:00
Edgar Gabriel	4dc09de3b8	common/ompio: update comment based on the previsou commit. No source code changed. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-01-23 13:38:05 -06:00
Edgar Gabriel	3eae0eecd0	io/ompio: change default for sharedfp_lazy_open parameter Revert the logic of io_ompio_sharedfp_lazy_open. The user now has to explicitely disable shared fp in order for the structures not to be allocated. Otherwise, resetting the shared fp e.g. in case the file was opened in append mode will not work correctly, the code could deadlock. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-01-23 08:59:22 -06:00
Edgar Gabriel	d3a8d38cc6	common/ompio: correctly position shared fp in append mode Fixes a bug reported on the mailing list. ompio did only reposition the individual file pointer when the file was opened in append mode. Set the shared file pointer also to point to the end of the file, similarly to the individual file pointer. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2017-01-23 08:59:05 -06:00
Nathan Hjelm	0497ec0b70	osc/rdma: fix typo in check for MPI_MODE_NOCHECK This commit fixes two typos in the lock_all path that inverted the MPI_MODE_NOCHECK flag. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2017-01-12 11:28:11 -07:00
George Bosilca	c2cd717f82	Don't refcount the predefined datatypes. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2017-01-11 16:48:59 -05:00
Gilles Gouaillardet	1daa80d78f	mtl/psm2: plug a memory leak in ompi_mtl_psm2_component_open() Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2017-01-06 09:28:32 +09:00
Joshua Ladd	57c0c847d0	Merge pull request #2603 from xinzhao3/topic/revert-ucx-mt Revert "PML/SPML/UCX: add UCX MT support to PML and SPML."	2017-01-04 11:50:37 -05:00
Ralph Castain	66131b4183	Remove the bcol, coll/ml, and sbgp code as stale and lacking a maintainer Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2017-01-03 19:32:48 -08:00
Ralph Castain	dadc6fbaf6	Merge pull request #2448 from thananon/remove_request_lock Completely removed ompi_request_lock and ompi_request_cond	2017-01-03 19:31:46 -08:00
Jeff Squyres	33d2988985	Merge pull request #2647 from OMGtechy/master Fixed -Wmisleading-indentation in ad_read_coll.c	2017-01-03 12:24:22 -05:00
Ralph Castain	fe68f23099	Only instantiate the HWLOC topology in an MPI process if it actually will be used. There are only five places in the non-daemon code paths where opal_hwloc_topology is currently referenced: * shared memory BTLs (sm, smcuda). I have added a code path to those components that uses the location string instead of the topology itself, if available, thus avoiding instantiating the topology * openib BTL. This uses the distance matrix. At present, I haven't developed a method for replacing that reference. Thus, this component will instantiate the topology * usnic BTL. Uses the distance matrix. * treematch TOPO component. Does some complex tree-based algorithm, so it will instantiate the topology * ess base functions. If a process is direct launched and not bound at launch, this code attempts to bind it. Thus, procs in this scenario will instantiate the topology Note that instantiating the topology on complex chips such as KNL can consume megabytes of memory. Fix pernode binding policy Properly handle the unbound case Correct pointer usage Do not free static error messages! Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-12-29 10:33:29 -08:00
Joshua Gerrard	94e87654c6	Fixed -Wmisleading-indentation in ad_read_coll.c Signed-off-by: Joshua Gerrard <joshuagerrard+ompi-commit@protonmail.com>	2016-12-28 20:14:13 +00:00
Xin Zhao	2d77912c19	Revert "PML/SPML/UCX: add UCX MT support to PML and SPML." This reverts commit `0ecf3c951c`. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2016-12-19 18:57:48 +02:00
Mark Allen	eec1d5bf2e	osc/pt2pt: Fix hang with Put and Win_lock_all * When using `MPI_Put` with `MPI_Win_lock_all` a hang is possible since the `put` is waiting on `eager_send_active` to become `true` but that variable might not be reset in the case of `MPI_Win_lock_all` depending on other incoming events (e.g., `post` or ACKs of lock requests. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-12-16 11:52:53 -05:00
Mark Allen	0d1336b4a8	osc/pt2pt: Fix Lock/Unlock and Get wrong answer * When using `MPI_Lock`/`MPI_Unlock` with `MPI_Get` and non-contiguous datatypes is is possible that the unlock finishes too early before the data is actually present in the recv buffer. * We need to wait for the irecv to complete before unlocking the target. This commit waits for the outgoing fragment counts to become equal before unlocking. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-12-16 11:52:51 -05:00
Mark Allen	1ebf9fd3a4	osc/pt2pt: Fix PSCW after Fence wrong answer. * If the user uses PSCW synchronization after a Fence then the previous epoch is not reset which can cause the PSCW to transfer data before it is ready leading to wrong answers. * This commit resets the `eager_send_active` in the start call. Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-12-16 11:52:49 -05:00
Xin Zhao	0ecf3c951c	PML/SPML/UCX: add UCX MT support to PML and SPML. Signed-off-by: Xin Zhao <xinz@mellanox.com>	2016-12-15 23:59:15 +02:00
Ralph Castain	585540bcee	Reduce the flood of warnings due to uninitialized variables, mismatched types, and unused things to a more bearable trickle Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-12-14 16:33:50 -08:00
Yossi	fa6e263821	Merge pull request #2537 from alinask/topic/pml-spml-ucx-api PML/SPML/UCX: Adapt to the API changes in the UCX lib.	2016-12-13 20:01:47 +02:00
KAWASHIMA Takahiro	6510800c16	ompi/request: Fix a persistent request creation bug According to the MPI-3.1 p.52 and p.53 (cited below), a request created by `MPI_*_INIT` but not yet started by `MPI_START` or `MPI_STARTALL` is inactive therefore `MPI_WAIT` or its friends must return immediately if such a request is passed. The current implementation hangs in `MPI_WAIT` and its friends in such case because a persistent request is initialized as `req_complete = REQUEST_PENDING`. This commit fixes the initialization. Also, this commit fixes internal requests used in `MPI_PROBE` and `MPI_IPROBE` which was marked wrongly as persistent. MPI-3.1 p.52: We shall use the following terminology: A null handle is a handle with value MPI_REQUEST_NULL. A persistent request and the handle to it are inactive if the request is not associated with any ongoing communication (see Section 3.9). A handle is active if it is neither null nor inactive. An empty status is a status which is set to return tag = MPI_ANY_TAG, source = MPI_ANY_SOURCE, error = MPI_SUCCESS, and is also internally configured so that calls to MPI_GET_COUNT, MPI_GET_ELEMENTS, and MPI_GET_ELEMENTS_X return count = 0 and MPI_TEST_CANCELLED returns false. We set a status variable to empty when the value returned by it is not significant. Status is set in this way so as to prevent errors due to accesses of stale information. MPI-3.1 p.53: One is allowed to call MPI_WAIT with a null or inactive request argument. In this case the operation returns immediately with empty status. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2016-12-08 21:42:05 +09:00
Alina Sklarevich	e9d2d029c6	PML/SPML/UCX: Adapt to the API changes in the UCX lib. Signed-off-by: Alina Sklarevich <alinas@mellanox.com>	2016-12-08 11:33:29 +02:00
Joshua Ladd	59f40e7cc5	Merge pull request #2500 from vspetrov/hcoll_ctx_free_detection Detect hcoll_context_free at config	2016-12-05 22:39:40 -05:00
Jeff Squyres	40d94fdc5a	Merge pull request #2422 from edgargabriel/pr/cycle-buf-default-val io/ompio: change the default value of mca parameter	2016-12-05 15:33:52 -05:00
Valentin Petrov	e13e264185	Detect hcoll_context_free at config Needed for better flexibility with versioning Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-12-02 22:09:20 +02:00
Jeff Squyres	1504ffb18d	ompi_file_delete: output a better error message Signed-off-by: Jeff Squyres <jsquyres@cisco.com>	2016-12-02 11:08:04 -05:00
Gilles Gouaillardet	fe4c4e95eb	coll/libnbc: fix MPI_IN_PLACE handling in i{gather,scatter}[v] MPI_IN_PLACE is only relevant on the root task, so only test is there Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:25 +09:00
Gilles Gouaillardet	1a8a276914	coll/libnbc: use zero-size messages in ibarrier and silence a valgrind warning Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:25 +09:00
Gilles Gouaillardet	2eec6a08b5	coll/base: fix ompi_coll_base_reduce_scatter_intra_nonoverlapping() with MPI_IN_PLACE invoke underlying scatterv with MPI_IN_PLACE when appropriate Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:24 +09:00
Gilles Gouaillardet	8b7999469b	coll/base: fix MPI_IN_PLACE in ompi_coll_base_reduce_generic() avoid copying data to itself when MPI_IN_PLACE is used Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:59:24 +09:00
Gilles Gouaillardet	3f1486a508	pml/ob1: initialize one more field in mca_pml_ob1_recv_request_progress_rget() always initialize recvreq->req_rdma_offset to zero. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-12-01 13:14:23 +09:00
Gilles Gouaillardet	15098161a3	coll/libnbc: add some comments on how locks are used no code change Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-30 17:29:51 +09:00
Ralph Castain	d5fd635efe	Bring forward the debugger-related changes Refs https://github.com/open-mpi/ompi/pull/2425 Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-11-29 13:15:20 -08:00
Valentin Petrov	4cdb8ecaad	coll/hcoll: hcoll_context_free Adds the new API hcoll_conetxt_free that resolves the issues observed with the ctx cache and group_destroy_notify. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-11-29 07:33:05 +02:00
Jeff Squyres	34ea3ce25a	Merge pull request #1946 from thananon/romio-add-notes romio: update REFRESH_NOTES to accommodate the random() patch.	2016-11-28 16:37:23 -05:00
KAWASHIMA Takahiro	9bfca8b274	pml/ob1: Reduce per-rank memory footprint slightly `sturct mca_pml_ob1_comm_proc_t`, which is allocated per connected rank in a communicator, had two paddings after `expected_sequence` and `send_sequence` by alignments. By changing the order of the members, the size of `mca_pml_ob1_comm_proc_t` is reduced by 8 bytes on 64-bit architectures. Signed-off-by: KAWASHIMA Takahiro <t-kawashima@jp.fujitsu.com>	2016-11-28 19:20:48 +09:00
Edgar Gabriel	b10558c3da	fcoll/dynamic_gen2: fix bug exposed by uneven distribution of data This fixes a bug reported in-house occuring with this component. It is triggered if the data assigned to different aggregators is highly differing, leading to different number of internal iterations required to handle it. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2016-11-24 13:02:19 -06:00
Ralph Castain	1e2019ce2a	Revert "Update to sync with OMPI master and cleanup to build" This reverts commit `cb55c88a8b`.	2016-11-22 15:03:20 -08:00
Thananon Patinyasakdikul	b25a8c3fa5	Completely removed ompi_request_lock and ompi_request_cond as we dont need them anymore. Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2016-11-22 17:58:31 -05:00
Ralph Castain	cb55c88a8b	Update to sync with OMPI master and cleanup to build Signed-off-by: Ralph Castain <rhc@open-mpi.org>	2016-11-22 14:24:54 -08:00
Gilles Gouaillardet	2c94a3a6f3	coll/libnbc: fix race condition with multi threaded apps protect the mca_coll_libnbc_component.active_requests list with the new mca_coll_libnbc_component.lock mutex. Thanks Jie Hu for the report Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-21 10:21:47 +09:00
Jijo Varghese	25e138ea1d	error correction to the MPI_file operations thread safety lock Signed-off-by: Jijo Varghese <jijo733@gmail.com>	2016-11-17 08:18:49 -05:00
Edgar Gabriel	26e9210b15	io/ompio: change the default value of mca parameter change the default value of the mca_io_ompio_cycle_buffer_size parameter in order to avoid accidental truncation of a file for very large individual operations. Thanks to @cniethammer for reporting it. Signed-off-by: Edgar Gabriel <egabriel@central.uh.edu>	2016-11-15 10:09:43 -06:00
Gilles Gouaillardet	bd364d29f7	osc/sm: plug an other memory leak in ompi_osc_sm_free Fixes open-mpi/ompi@f1b473ee63 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-14 23:19:07 -07:00
Gilles Gouaillardet	f1b473ee63	osc/sm: plug a memory leak in ompi_osc_sm_free Thanks Joseph Schuchart for the report. Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-14 22:22:43 -07:00
Joshua Hursey	5a8b2f7431	topo/base: Fix module reference in collective call Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-11-14 11:34:54 -06:00
Gilles Gouaillardet	fc776e3fa5	coll: code cleanup - instead of coll_base_comm_get_reqs(2) for irecv/isend, use only one request allocated in the stack and do a irecv/send - instead of ompi_request_wait_all(2), simpy ompi_request_wait Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-13 22:35:33 -07:00
Gilles Gouaillardet	99d30353af	coll: Don't allocate space for zero requests Refs #2402 Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-13 22:20:58 -07:00
George Bosilca	725277bc26	Don't allocate space for the requests if the underlying topology has no neighbors. This commit fixes issue #2402. Signed-off-by: George Bosilca <bosilca@icl.utk.edu>	2016-11-12 18:01:09 -05:00
Gilles Gouaillardet	023d18abae	pml/ob1: mca_pml_ob1_recv must have memchecker mark the buffer as defined upon success this is generally done in mca_pml_ob1_recv_request_free(), but this is not invoked in via mca_pml_ob1_recv(), so do it manually Thanks Yvan Fournier for the report Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-11-07 13:10:15 +09:00
Jeff Squyres	f11b0c7edf	Merge pull request #2330 from jjhursey/topic/ibcast-non-uniform-dt-wa coll/libnbc: Work around for non-uniform data types in ibcast	2016-11-05 10:26:04 -04:00
Joshua Hursey	350ef67fe0	coll/libnbc: Work around for non-uniform data types in ibcast * If (legal) non-uniform data type signatures are used in ibcast then the chosen algorithm may fail on the request, and worst case it could produce wrong answers. * Add an MCA parameter that, by default, protects the user from this scenario. If the user really wants to use it then they have to 'opt-in' by setting the following parameter to false: - `-mca coll_libnbc_ibcast_skip_dt_decision f` * Once the following Issues are resolved then this parameter can be removed. - https://github.com/open-mpi/ompi/issues/2256 - https://github.com/open-mpi/ompi/issues/1763 Signed-off-by: Joshua Hursey <jhursey@us.ibm.com>	2016-11-01 13:33:23 -05:00
Yossi Itigin	17c8f76411	pml_ucx: fix uninitialized field req_status->_cancelled. Signed-off-by: Yossi Itigin <yosefe@mellanox.com>	2016-11-01 17:02:22 +02:00
Thananon Patinyasakdikul	ea2d38de14	romio: update REFRESH_NOTES to accommodate the random() patch. From patch: open-mpi/ompi@23b27c510c Signed-off-by: Thananon Patinyasakdikul <tpatinya@utk.edu>	2016-10-31 16:08:08 -04:00
Joshua Ladd	d27b680de2	Merge pull request #2305 from vspetrov/hcoll_fortran_pair_types coll/hcoll fortran pair types	2016-10-28 12:05:00 -04:00
Gilles Gouaillardet	af67183e2f	pml/v: fix a memory leak close the framework if no more component should be used Signed-off-by: Gilles Gouaillardet <gilles@rist.or.jp>	2016-10-28 09:32:30 +09:00
Valentin Petrov	2b7e362e56	coll/hcoll fortran pair types Adds mapping of the MPI Fortran pair types (2INTEGER, 2REAL, 2DBLPREC) to the corresponding hcoll dtypes. Signed-off-by: Valentin Petrov <valentinp@mellanox.com>	2016-10-27 18:24:07 +03:00
Edgar Gabriel	2076622924	Merge pull request #2238 from edgargabriel/pr/delete-error-codes update the error codes reported by file_delete	2016-10-25 12:38:03 -05:00
George Bosilca	028e747470	Do not alter ompi_coll_tuned_use_dynamic_rules. This is set globally as an MCA parameter and should be never altered based on a single communicator setting.	2016-10-25 12:17:25 -04:00
George Bosilca	253eb80e26	Code cleaning of the tuned module.	2016-10-25 12:17:25 -04:00
Edgar Gabriel	74441b960b	update the error codes reported by file_delete	2016-10-25 10:15:14 -05:00
Gilles Gouaillardet	8e788b5aee	pml/ob1: refactor append_recv_req_to_queue() to improve readability and fix a typo in a comment Thanks George for the patch	2016-10-25 10:50:40 +09:00
Gilles Gouaillardet	4a886ac4cc	pml/ob1: correctly reset receive request type before init recvreq->req_recv.req_base.req_type should always be set before invoking MCA_PML_OB1_RECV_REQUEST_INIT(recvreq, ...) otherwise, the previous type might be set, and you could end up with MPC_PML_REQUEST_IMPROBE when MCA_PML_REQUEST_RECV is expected. Thanks Chris Pattison for the report and test case. Fixes open-mpi/ompi#2275	2016-10-24 16:50:23 +09:00
Gilles Gouaillardet	6714f6aee7	coll/libnbc: fix MPI_Ialltoallv with MPI_IN_PLACE and without MPI param check	2016-10-24 09:29:06 +09:00
Josh Hursey	d1ecc83e14	Merge pull request #2245 from jjhursey/topic/libnbc-error-path coll/libnbc: Fix error path on internal error	2016-10-21 13:27:17 -05:00
Joshua Hursey	8748e54c11	coll/libnbc: Fix error path on internal error * If an error is detected internal to libnbc (e.g., PML truncation error) this patch makes sure that the request is completed and the `MPI_ERROR` field is set approprately. * Make an attempt to cleanup outstanding requests before returning. - This is a "best attempt" since not all PMLs support canceling requests.	2016-10-21 11:41:08 -04:00
Gilles Gouaillardet	45336d0bea	libnbc: fix iallgather[v] In order to optimize for MPI_IN_PLACE, data is sent from the receive buffer. consequently, it should be sent with the receive type and count. Thanks Josh Hursey for the report and test case Refs open-mpi/ompi#2256	2016-10-21 10:24:25 +09:00
Gilles Gouaillardet	e78fcc4db9	coll/base: fix ompi_coll_base_{gather,scatter}_intra_binomial receive type is only relevant for root with gather, send type is only relevant for root with scatter, so do not access these types on a non root task	2016-10-19 14:05:22 +09:00
Gilles Gouaillardet	1e3191115b	Merge pull request #2172 from ggouaillardet/topic/ialltoall_in_place support MPI_IN_PLACE in MPI_Ialltoall*	2016-10-17 17:00:47 +09:00
Joshua Ladd	64a15188bd	Merge pull request #2199 from vspetrov/coll_hcoll_ialltoallv coll/hcoll: ialltoallv interface	2016-10-14 07:59:23 -06:00
Gilles Gouaillardet	9389de4199	topo/treematch: fix displacements in mca_topo_treematch_dist_graph_create()	2016-10-14 17:16:49 +09:00
Joshua Ladd	b661307e6f	Merge pull request #2218 from yosefe/topic/ucx-pml-spml-update ucx: adapt pml_ucx and spml_ucx to new UCX APIs	2016-10-13 09:23:37 -04:00
Gilles Gouaillardet	958e29f929	osc/rdma: silence a warning declare a local variable volatile and silence CID 1372692	2016-10-13 16:10:07 +09:00
Yossi Itigin	05ca466c6b	ucx: adapt pml_ucx and spml_ucx to new UCX APIs - pass field_mask to ucp_init(). - use non-blocking disconnect. - recv() with pre-allocated request. - call opal_progress() from iprobe() and improbe(). - use shift pattern in connect/disconnect.	2016-10-12 23:45:45 +03:00
Nathan Hjelm	e8ef503bee	osc/rdma: fix warnings Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2016-10-12 10:17:25 -06:00
Nathan Hjelm	432d79046b	Merge pull request #2197 from tkordenbrock/topic/master/osc-rdma.put.use.true_extent osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master()	2016-10-11 10:42:02 -06:00
Valentin Petrov	9747a9ea9b	coll/hcoll: ialltoallv interface	2016-10-10 15:09:07 +03:00
Todd Kordenbrock	05f86b5df7	osc-rdma: fix datatype lower bound errors in ompi_osc_rdma_master() Instead of ompi_datatype_get_extent(), use ompi_datatype_get_true_extent() to get the local and remote lower bound. For derived types like subarray, true_lb is the correct offset for RDMA operations.	2016-10-10 06:45:28 -05:00
Todd Kordenbrock	cc863ff9fb	osc-portals4: fix datatype errors in put() Instead of ompi_datatype_get_extent(), use ompi_datatype_get_true_extent() to get the origin and target lower bound. For derived types like subarray, true_lb is the correct offset for RDMA operations. Also, instead of the extent use the size of the datatype.	2016-10-10 06:45:14 -05:00
Gilles Gouaillardet	1e0f591811	coll/libnbc: implement support for MPI_IN_PLACE in MPI_Ialltoall* Thanks Chris Ward for the report Many thanks to George for the guidance	2016-10-08 19:44:01 +09:00

... 2 3 4 5 6 ...

6449 Коммитов