openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	40b7643119	osc_sm_passive_target.c: ensure ret is always defined Fixes a compiler warning	2015-04-13 11:31:43 -04:00
Ralph Castain	3e44d3c9e3	Enable singletons to run without any active OOB module until they attempt to comm_spawn	2015-04-10 14:06:42 -07:00
Nathan Hjelm	eb56117405	Merge pull request #513 from hjelmn/mca_bug_fixes opal: fix multiple bugs in MCA and opal	2015-04-08 10:29:44 -06:00
Nathan Hjelm	9cd955badf	opal: fix multiple bugs in MCA and opal This commit fixes the following bugs: - opal_output_finalize did not properly set internal state. This caused problems when calling the sequence opal_output_init (), opal_output_finalize (), opal_output_init (). - opal_info support called mca_base_open () but never called the matching mca_base_close (). mca_base_open () and mca_base_close () have been updated to use a open count instead of an open flag to allow mca_base_open to be called through multiple paths (as may be the case when MPI_T is in use). - orte_info support did not register opal variables. This can cause orte-info to not return opal variables. - opal_info, orte_info, and ompi_info support have been updated to use a register count. - When opening the dl framework the reference count was added to ensure the framework stuck around. The framework being closed prematurely was a bug in the MCA base that has since been corrected. The increment (and associated decrement) have been removed. - dl/dlopen did not set the value of mca_dl_dlopen_component.filename_suffixes_mca_storage on each call to register. Instead the value was set in the component structure. This caused the value to be lost when re-loading the component. Fixed by setting the default value in register. - Reset shmem framework state on close to avoid returning a stale component after reloading opal/shmem. - MCA base parameters were not properly deregistered when the MCA base was closed. This commit may fix #374. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-07 19:13:20 -06:00
Howard Pritchard	5ee18f4f00	Merge pull request #514 from hppritcha/topic/mpi_win_lock_all_man man pages: fix problem with MPI_Win_lock_all	2015-04-07 17:17:30 -06:00
Howard Pritchard	291c775e74	man pages: fix problem with MPI_Win_lock_all thanks to Thomas Jahns for pointing this out - http://www.open-mpi.org/community/lists/users/2015/04/26633.php Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-07 16:29:00 -06:00
Nathan Hjelm	2409715fc3	Merge pull request #511 from hjelmn/osc_pt2pt_fix osc/pt2pt: fix synchronization bugs	2015-04-07 09:14:00 -06:00
Howard Pritchard	fc3a0f60c5	Merge pull request #512 from hppritcha/topic/java_better_dlopen_error ompi/java: better error message if dlopen fails	2015-04-06 14:08:10 -06:00
Howard Pritchard	18039b34b4	ompi/java: better error message if dlopen fails The error message emitted by ompi/java when dlopen fails is misleading and not very informative. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-06 13:35:09 -06:00
Nathan Hjelm	80ed805a16	osc/pt2pt: fix synchronization bugs The fragment flush code tries to send the active fragment before sending any queued fragments. This could cause osc messages to arrive out-of-order at the target (bad). Ensure ordering by alway sending the active fragment after sending queued fragments. This commit also fixes a bug when a synchronization message (unlock, flush, complete) can not be packed at the end of an existing active fragment. In this case the source process will end up sending 1 more fragment than claimed in the synchronization message. To fix the issue a check has been added that fixes the fragment count if this situation is detected. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-06 08:39:19 -06:00
Jithin Jose	9c937d44ae	Inline MTL-OFI Signed-off-by: Jithin Jose <jithin.jose@intel.com> Conflicts: ompi/mca/mtl/ofi/mtl_ofi_recv.c	2015-04-03 15:19:30 -07:00
Jithin Jose	50304dfe05	Inline mtl-datatype pack/unpack Signed-off-by: Jithin Jose <jithin.jose@intel.com>	2015-04-03 15:19:21 -07:00
Jithin Jose	c09582a3ff	- CM blocking send/recv optimizations This patch tries to do as little as possible in the PML CM blocking send/receive routines. Basically, avoid creating and filling in an entire request object. An OMPI-level request is still needed, but we can create that on the stack instead of going to a free list. Signed-off-by: Andrew Friedley <andrew.friedley@intel.com> Signed-off-by: Jithin Jose <jithin.jose@intel.com>	2015-04-03 15:19:08 -07:00
Howard Pritchard	05324e32ff	fcoll/static: coverity fixes Fix CIDs 72138, 72139, 72143 Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-02 14:51:44 -06:00
Devendar Bureddy	6ddc7ac35c	HCOLL: Fix assertion hcoll context may not be destroyed if it is cached.	2015-04-01 20:33:28 +03:00
Nathan Hjelm	e91084e20b	Merge pull request #492 from hjelmn/subarray_fix ompi/datatype: fix subarray datatype	2015-03-27 10:50:46 -06:00
Devendar Bureddy	71c28cea65	HCOLL: hcoll dte fixes - hcoll currently do not support datatype with gaps around it (i.e dtsize != dtextent) - check for user defined Ops.	2015-03-25 16:04:11 +02:00
Nathan Hjelm	88072f9b8b	ompi/datatype: fix subarray datatype The subarray datatype was not packing/unpacking correctly. This was leading to wrong results whenever the lb of the subarray datatype was non-zero. I tracked the issue to the use of ompi_datatype_create_resized. This function simply duplicates the old datatype and sets the lb and extent. This is unfortunately insufficent for the pack/unpack functions which use the loop end first element offset NOT the lb. This offset is 0 in the resized datatype. Once ompi_datatype_create_resized has been fixed this commit should be reverted. Fixes #380. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-03-24 14:33:10 -06:00
Nathan Hjelm	f588ad7af0	Merge pull request #485 from hjelmn/info_enums ompi/info: add support for getting info key value based on variable enumerator	2015-03-23 11:43:22 -06:00
Nathan Hjelm	9299cd5cd7	ompi/info: add support for getting info key value based on variable enumerator This commit adds a function that will return an integer value for an info key based on the value returned by a variable enumerator. This feature should greatly simplify code using the info keys (osc for example). Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-03-23 11:20:37 -06:00
Howard Pritchard	edf9e8ba8f	mtl/psm: coverity fixes Fix CIDS 1270176 - 1270179 Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-03-18 11:02:01 -06:00
Nathan Hjelm	ccba8ce856	Merge pull request #457 from hjelmn/mpit_fixes mca/base: fix bugs in framework deregistration/re-registration	2015-03-18 08:37:49 -06:00
Ralph Castain	0cfb4f29aa	Silence compiler warning	2015-03-16 09:59:21 -07:00
Todd Kordenbrock	515d9e8cc9	mtl-portals4: fix compiler warnings	2015-03-12 20:34:04 -05:00
adrianreber	714d9aa67e	Merge pull request #348 from adrianreber/topic/orte_cr_continue_like_restart Topic/orte cr continue like restart	2015-03-12 14:54:02 +01:00
Alina Sklarevich	28586caecf	MTL_MXM/PML_YALLA: fix coverity issues.	2015-03-12 11:49:22 +02:00
Howard Pritchard	da85d5fc0a	Merge pull request #467 from hppritcha/topic/minor_fcoll_static_coverity_fix fcoll/static: minor fix for coverity	2015-03-11 10:28:05 -06:00
Nathan Hjelm	ce6caab2a7	Merge pull request #463 from hjelmn/cuda_async btl/openib: cuda: fix CUDA-aware support with async copy	2015-03-11 09:52:48 -06:00
Howard Pritchard	66fee3bd18	fcoll/static: minor fix for coverity Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-03-11 09:11:49 -06:00
Adrian Reber	c08e234af7	FT: fix compilation using --with-ft (5/5) Enabling the FT code breaks compilation (again). This series tries to fix the compiler errors. This is again only fixing the compiler errors without any warranty that the result might actually support FT again. With the changes introduced in the previous patches in this series some goto constructs for cleanup are no longer necessary and removed.	2015-03-11 14:23:33 +01:00
Adrian Reber	9b84fe45d3	FT: fix compilation using --with-ft (3/5) Enabling the FT code breaks compilation (again). This series tries to fix the compiler errors. This is again only fixing the compiler errors without any warranty that the result might actually support FT again. Follow-up of `552c9ca5a0`. This patch implements the necessary changes in mentioned commit in the FT code.	2015-03-11 14:23:33 +01:00
Adrian Reber	1c5a8df724	FT: fix compilation using --with-ft (2/5) Enabling the FT code breaks compilation (again). This series tries to fix the compiler errors. This is again only fixing the compiler errors without any warranty that the result might actually support FT again. The FT code used barrier mechanisms which have been removed with `aec5cd08bd`. This patch replaces all those different barriers with opal_pmix.fence(NULL, 0); I am not sure this is completely correct but at least a starting point for a review.	2015-03-11 14:23:33 +01:00
Adrian Reber	f45dd069bd	FT: fix compilation using --with-ft (1/5) Enabling the FT code breaks compilation (again). This series tries to fix the compiler errors. This is again only fixing the compiler errors without any warranty that the result might actually support FT again. This first patch moves orte_cr_continue_like_restart from ORTE to opal_cr_continue_like_restart in OPAL. This only leaves three calls from OPAL to ORTE in the FT code. As it is not yet 100% clear how to handle these calls the code orte_sstore.set_attr() has been #ifdef'd out for now.	2015-03-11 14:23:33 +01:00
Alina Sklarevich	f9a9b936a1	PML_YALLA: fix compilation warnings.	2015-03-11 10:58:54 +02:00
Nathan Hjelm	3d32dbd793	btl/openib: cuda: fix CUDA-aware support with async copy This commit should resolve an issue seen with CUDA-aware support. The problem came in with BTL 3.0. Before 3.0 the size of the copy was stored in the incoming segment's des_remote_count field. This field does not exist in BTL 3.0 so I stored the value in the des_segment_count field. This caused problems with the cuda support code. To fix the issue the endpoint pointer is now stored in the in fragment's endpoint pointer which free's up the segment's des_cbdata pointer for storing the transfer size. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-03-10 14:38:12 -06:00
Nathan Hjelm	d929137768	osc/pt2pt: need to unlock self before waiting for unlock acks This commit fixes a bug in osc/pt2pt which causes MPI_Win_unlock_all to hang. The problem was caused by code refactoring that moved the unlock of the local process to after the loop that waits for unlock acks. This will cause the code to loop forever waiting on the self ack. Fixes #444	2015-03-10 14:10:37 -06:00
Yohann Burette	d48a8ab8f0	mtl/ofi: Use fi_allocinfo().	2015-03-10 12:50:55 -07:00
Jeff Squyres	2e8ee003b0	ofi: endpoint type hint moved to a sub-struct, BUFFERED went away Update to match new libfabric API/structure change.	2015-03-10 09:55:45 -07:00
Howard Pritchard	b73d566d57	Merge pull request #454 from hppritcha/topic/coverity_fixes fcoll/dynamic: coverity fixes	2015-03-10 07:59:56 -06:00
Mike Dubman	6f91a007e1	Merge pull request #458 from yosefe/topic/pml-yalla-fix-segv keep mxm context alive as long as pml_yalla component is open.	2015-03-10 13:38:14 +02:00
yosefe	976144dca7	keep mxm context alive as long as pml_yalla component is open. pml_yalla_del_comm may be called after yalla module is finalized, which leads to invalid memory access if mxm context is already destroyed in this point.	2015-03-10 11:52:44 +02:00
Nathan Hjelm	005c6022e2	mca/base: fix bugs in framework deregistration/re-registration There were a number of bugs in the framework/variable code that affected deregistration: - Frameworks could be erroneously closed if seperately registered and opened then subsequently closed. This was a bug in the original design which only reference counted opens but not registrations. This would cause undefined behavior if MPI_T_finalize actually calls ompi_info_close_components as intended. Now both registrations and opens are reference counted and frameworks/components are not torn down until the matching number of close calls have been made. - group_find_by_name did not pass the invalidok flags down to mca_base_var_group_get_internal correctly. - Group deregistration caused the group to be completely reset. This does not match the behavior required by MPI_T as it could reduce the number of variables/subgroups in a group. This commit also updates MPI_T_finalize to call ompi_info_close_components as originally intended. Closes #374 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-03-09 16:52:53 -06:00
Howard Pritchard	fba88360a8	fcoll/dynamic: more coverity fixes Okay coverity seems to get one stuck in a loop where by fixing one set of resource allocation problems, it starts finding more. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-03-09 15:01:05 -07:00
Howard Pritchard	2d61a652c8	fcoll/dynamic: coverity fixes okay, hopefully really fix CIDS 72325-72328, and 72330-72332. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-03-09 13:53:52 -07:00
Nathan Hjelm	0d80bfb391	Merge pull request #443 from hjelmn/mpit_31 Add new error code introduced in MPI-3.1.	2015-03-09 13:03:15 -06:00
Jeff Squyres	a026456bef	(orte\|ompi\|oshmem)info tools: convert to opal_dl interface Noe that this commit removes option:lt_dladvise from the various "info" tools output. This technically breaks our CLI "ABI" because we're not deprecating it / replacing it with an alias to some other "into" tool output. Although the dl/libltdl component contains an "have_lt_dladvise" MCA var that contains the same information, the "option:lt_dladvise" output from the various "info" tools is not* an MCA var, and therefore we can't alias it. So it just has to die.	2015-03-09 08:18:13 -07:00
Jeff Squyres	c683500a29	debuggers: convert to opal_dl interface	2015-03-09 08:16:55 -07:00
Gilles Gouaillardet	6de973daae	coll/sm: remove unused value as reported by Coverity with CID 1269962	2015-03-09 17:31:32 +09:00
Gilles Gouaillardet	1896d4fba7	bcol/basesmuma: fix misc memory leak as reported by Coverity with CID 715762	2015-03-09 17:22:25 +09:00
Gilles Gouaillardet	341bdd1fc3	ompi/group: refactor ompi_group_incl and fixes CID 70478	2015-03-09 17:07:11 +09:00

1 2 3 4 5 ...

7975 Коммитов