openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	33a3ace874	Minimize the alignments. We only do it when we need to pack data that must be aligned (aka the displacement). All other cases do not require special alignments, and are treated normally. Fix the comment regarding the alignment requirements.	2015-04-30 22:06:50 -04:00
George Bosilca	015d3f56cf	Fix the INDEXED_BLOCK issue identified by IBM.	2015-04-30 14:43:19 -04:00
Gilles Gouaillardet	697a866b6e	ddt: correctly align next datatype description This bug can be evidenced by the test/datatype/ddt_pack test case on sparc architecture.	2015-04-30 15:04:54 +09:00
Ryan Grant	6ab91a6781	Merge pull request #561 from tkordenbrock/topic/mtl.fix.datatype.overflow mtl-portals4: fix datatype overflow in ompi_mtl_portals4_long_isend()	2015-04-28 15:15:59 -06:00
Ryan Grant	cc3da91700	Merge pull request #562 from tkordenbrock/topic/mtl.expand.source.bits.to.24 mtl-portals4: expand the source field of the match bits to 24 bits	2015-04-28 14:31:06 -06:00
Rolf vandeVaart	91a8ec52ca	Fix possible unintialized warnings	2015-04-28 16:25:35 -04:00
Todd Kordenbrock	8a4616f724	mtl-portals4: fix datatype overflow in ompi_mtl_portals4_long_isend() The length parameter of ompi_mtl_portals4_long_isend() was declared as "int", which may not be big enough depending on the platform and compiler options used. This commit changes the type to size_t to prevent overflow.	2015-04-28 14:40:25 -05:00
Todd Kordenbrock	3e437f6184	mtl-portals4: expand the source field of the match bits to 24 bits The source field was 16 bits which is not sufficient for many current and future machines. This commit expands the source field to 24 bits and reduces the tag field from 32 bits to 24 bits.	2015-04-28 14:25:30 -05:00
Gilles Gouaillardet	18b75bd40d	io/base: check the MCA version matches	2015-04-28 17:48:23 +09:00
Ralph Castain	3d46850c4d	Per patch from Marco Atzeri, have the fortran wrapper links go directly to opal_wrapper to avoid breaks in the chain in some environments.	2015-04-25 17:09:06 -07:00
Yohann Burette	1be185ed87	mtl/ofi: Remove use of MR.	2015-04-24 15:55:21 -07:00
Nathan Hjelm	2716b8b1da	osc/pt2pt: correct flush expected counts Signed-off-by: Nathan Hjelm <hjelmn@me.com>	2015-04-24 13:34:21 -06:00
Nathan Hjelm	f1d09e55ec	osc/pt2pt: silence warnings Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-23 15:35:47 -06:00
Nathan Hjelm	29b435a5a4	osc/pt2pt: fix bugs that caused incorrect fragment counting This commit fixes a bug identified by MTT that occurred when mixing passive and active target synchronization. The bugs fixed in this commit are: - Do not update incoming fragment counts for any type of unbuffered control message. These messages are out-of-band and should not be considered towards the signal counts. - Complete a change from using received counts to expected counts for lock, unlock, and flush acks. Part of the change made it into master before the rest was ready. This was preventing wakeups in some cases. - Turn the passive_target_access_epoch module member into a counter. As long as at least one peer is locked we are in a passive-target epoch and not an active target one. This fix will ensure that fragment flags are set appropriately. fixes #538 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-23 13:22:24 -06:00
Jeff Squyres	63c7520273	use-mpi-f08/Makefile.am: also link in libmpi_mpifh.la Per mail from Macro Atzeri, we also need to link in libmpi_mpifh.la, lest we exhaust relative offset addressing (e.g., in 32 bit builds). See http://www.open-mpi.org/community/lists/devel/2015/04/17304.php.	2015-04-22 14:22:36 -07:00
Ryan Grant	1436417488	Merge pull request #547 from tkordenbrock/topic/mtl.add.logical.mode mtl-portals4: add the option to use the Portals4 logical to physical mapping	2015-04-22 15:06:13 -06:00
Nathan Hjelm	7c95ecf859	mca/base: provide functions to determine if a framework is registered/open This commit also fixes a problem with the lazy opening of topo components. The topo framework incorrectly: 1) checked if the topo framework was open by checking the length of the components list, and 2) called the framework open directly instead of using mca_base_framework_open. fixes #544 Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-21 13:54:25 -06:00
Todd Kordenbrock	8e56002ec7	mtl-portals4: add missing return to portals4_init_interface()	2015-04-21 11:30:33 -05:00
Todd Kordenbrock	34c50fa963	mtl-portals4: move MD cleanup closer to failure PtlMDRelease() was called if read_msg() returned a failure code. This commit moves the PtlMDRelease() inside read_msg() so that it doesn't get called in cases where the failure happens before or at the PtlMDBind().	2015-04-21 11:30:33 -05:00
Todd Kordenbrock	422be76770	mtl-portals4: add a debug message for thread multiple mode	2015-04-21 11:30:33 -05:00
Todd Kordenbrock	35e5ffd001	mtl-portals4: add the option to use the Portals4 logical to physical table This commit adds an MCA variable to select Portals4 logical addressing, populates the logical-to-physical mapping table and initializes the NI in this mode.	2015-04-21 11:30:33 -05:00
Yohann Burette	19607d2ce7	mtl/ofi: Remove memset() from progress path.	2015-04-20 14:12:39 -07:00
Yohann Burette	d2eda04801	mtl/ofi: Use fi_tinject for small messages.	2015-04-20 14:12:39 -07:00
Nathan Hjelm	033894b493	Merge pull request #541 from hjelmn/c99_components C99 component initialization	2015-04-20 10:45:39 -06:00
Yohann Burette	ba1bc00df1	mtl/ofi: Remove FI_CANCEL.	2015-04-20 09:40:37 -07:00
Devendar Bureddy	19f5a3eff4	HCOLL: skip hcoll if enable_mpi_threads is true reasons: 1) default OCOMS is not configured with --enable-ocoms-multi-threads 2) locking overheads	2015-04-20 19:39:49 +03:00
Devendar Bureddy	dd8e9fa176	HCOLL: enable by defaut	2015-04-20 19:39:30 +03:00
Nathan Hjelm	d251fa1525	pml/ob1: fix heterogenous build Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-20 09:27:00 -06:00
Howard Pritchard	3339274136	Merge pull request #542 from hppritcha/topic/coverity_714118 fcoll/two_phase: coverity fix	2015-04-20 05:42:12 -06:00
Howard Pritchard	de215addc6	fcoll/two_phase: coverity fix fix CID 714118 Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-18 14:34:48 -06:00
Nathan Hjelm	df75d0382f	ompi: use C99 subobject naming for component initialization This commit helps future-proof ompi components by initializing each component member by name. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-18 10:29:58 -06:00
Yohann Burette	9392bb5ede	mtl/ofi: Implement Probe/Mprobe/Mrecv using FI_PEEK/FI_CLAIM.	2015-04-17 16:42:13 -07:00
Nadezhda Kogteva	116169c38a	opal timing: added ability to choose the timer type	2015-04-17 11:15:55 +03:00
Mangala Jyothi Bhaskar	c4de46e284	Fix number of aggregators used in two phase fcoll	2015-04-16 10:39:10 -05:00
Nathan Hjelm	3436f2917d	Merge pull request #449 from hjelmn/mca_base_update mca/base update	2015-04-16 08:41:48 -06:00
Nathan Hjelm	d5b52d3141	ompi/communicator: make comm_request internal variables static	2015-04-15 10:05:21 -06:00
Ralph Castain	a4b1225892	Don't register the PSM errhandler until it is certain that the PSM component can be used. This doesn't matter on the master, but it does matter on the 1.8 branch as the MTL select logic is different over there.	2015-04-14 07:54:53 -07:00
Nathan Hjelm	113c890ccf	Merge pull request #520 from hjelmn/valgrind_cleanness fix memory leaks and valgrind errors	2015-04-13 10:09:34 -06:00
Jeff Squyres	49f52a5356	osc_sm_passive_target.c: update the check for lock types Based on some on-list and IM discussion with @hjelmn about open-mpi/ompi@40b7643119, change the testing to a switch/case. If we fall into the default case, assert() error (because it's an OMPI developer programming error).	2015-04-13 12:02:15 -04:00
Jeff Squyres	40b7643119	osc_sm_passive_target.c: ensure ret is always defined Fixes a compiler warning	2015-04-13 11:31:43 -04:00
Nathan Hjelm	a7b0c00ab6	fix memory leaks and valgrind errors This commit fixes several vagrind errors. Included: - installdirs did not correctly reinitialize all pointers to NULL at close. This causes valgrind errors on a subsequent call to opal_init_tool. - several opal strings were leaked by opal_deregister_params which was setting them to NULL instead of letting them be freed by the MCA variable system. - move opal_net_init to AFTER the variable system is initialized and opal's MCA variables have been registered. opal_net_init uses a variable registered by opal_register_params! - do not leak ompi_mpi_main_thread when it is allocated by MPI_T_init_thread. - do not overwrite ompi_mpi_main_thread if it is already set (by MPI_T_init_thread). - mca_base_var: read_files was overwritting mca_base_var_file_list even if it was non-NULL. - mca_base_var: set all file global variables to initial states on finalize. - btl/vader: decrement enumerator reference count to ensure that it is freed. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-11 09:28:35 -06:00
Ralph Castain	3e44d3c9e3	Enable singletons to run without any active OOB module until they attempt to comm_spawn	2015-04-10 14:06:42 -07:00
Nathan Hjelm	eb56117405	Merge pull request #513 from hjelmn/mca_bug_fixes opal: fix multiple bugs in MCA and opal	2015-04-08 10:29:44 -06:00
Nathan Hjelm	9cd955badf	opal: fix multiple bugs in MCA and opal This commit fixes the following bugs: - opal_output_finalize did not properly set internal state. This caused problems when calling the sequence opal_output_init (), opal_output_finalize (), opal_output_init (). - opal_info support called mca_base_open () but never called the matching mca_base_close (). mca_base_open () and mca_base_close () have been updated to use a open count instead of an open flag to allow mca_base_open to be called through multiple paths (as may be the case when MPI_T is in use). - orte_info support did not register opal variables. This can cause orte-info to not return opal variables. - opal_info, orte_info, and ompi_info support have been updated to use a register count. - When opening the dl framework the reference count was added to ensure the framework stuck around. The framework being closed prematurely was a bug in the MCA base that has since been corrected. The increment (and associated decrement) have been removed. - dl/dlopen did not set the value of mca_dl_dlopen_component.filename_suffixes_mca_storage on each call to register. Instead the value was set in the component structure. This caused the value to be lost when re-loading the component. Fixed by setting the default value in register. - Reset shmem framework state on close to avoid returning a stale component after reloading opal/shmem. - MCA base parameters were not properly deregistered when the MCA base was closed. This commit may fix #374. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-07 19:13:20 -06:00
Howard Pritchard	5ee18f4f00	Merge pull request #514 from hppritcha/topic/mpi_win_lock_all_man man pages: fix problem with MPI_Win_lock_all	2015-04-07 17:17:30 -06:00
Howard Pritchard	291c775e74	man pages: fix problem with MPI_Win_lock_all thanks to Thomas Jahns for pointing this out - http://www.open-mpi.org/community/lists/users/2015/04/26633.php Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-07 16:29:00 -06:00
Nathan Hjelm	2409715fc3	Merge pull request #511 from hjelmn/osc_pt2pt_fix osc/pt2pt: fix synchronization bugs	2015-04-07 09:14:00 -06:00
Howard Pritchard	fc3a0f60c5	Merge pull request #512 from hppritcha/topic/java_better_dlopen_error ompi/java: better error message if dlopen fails	2015-04-06 14:08:10 -06:00
Howard Pritchard	18039b34b4	ompi/java: better error message if dlopen fails The error message emitted by ompi/java when dlopen fails is misleading and not very informative. Signed-off-by: Howard Pritchard <howardp@lanl.gov>	2015-04-06 13:35:09 -06:00
Nathan Hjelm	80ed805a16	osc/pt2pt: fix synchronization bugs The fragment flush code tries to send the active fragment before sending any queued fragments. This could cause osc messages to arrive out-of-order at the target (bad). Ensure ordering by alway sending the active fragment after sending queued fragments. This commit also fixes a bug when a synchronization message (unlock, flush, complete) can not be packed at the end of an existing active fragment. In this case the source process will end up sending 1 more fragment than claimed in the synchronization message. To fix the issue a check has been added that fixes the fragment count if this situation is detected. Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>	2015-04-06 08:39:19 -06:00

1 2 3 4 5 ...

8016 Коммитов