This commit fixes several threading bugs:
- Add an additional lock to the btl_base_endpoint_t structure to lock
  the list of pending frags. This allows the progress function to
  attempt to send pending frags without needing to drop/reacquire the
  lock. This should provide a small improvement in performance and
  fixes a potential race between adding and removing items from the
  pending list.
- Ensure fast boxes are only set up once by updating the send count
  with atomics when needed, and do not set the fast box buffer pointer
  until the fast box is fully set up (see the sketch below).
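A minimal sketch of that publish-last pattern, using C11 atomics rather
than opal's wrappers; the endpoint_t/fbox names are illustrative, not
the actual btl/sm structures:

    /* Set up a fast box exactly once: the buffer pointer is published only
     * after the buffer is fully initialized, so a peer that sees a non-NULL
     * pointer also sees initialized contents. */
    #include <stdatomic.h>
    #include <stdbool.h>
    #include <stdlib.h>

    typedef struct {
        atomic_uint     send_count;   /* bumped atomically when threads are in use */
        _Atomic(void *) fbox_buffer;  /* NULL until the fast box is set up          */
    } endpoint_t;

    static bool try_setup_fbox(endpoint_t *ep, size_t size)
    {
        void *expected = NULL;
        void *buf = calloc(1, size);
        if (NULL == buf) {
            return false;
        }

        /* only one thread wins the exchange; the loser frees its buffer */
        if (!atomic_compare_exchange_strong_explicit(&ep->fbox_buffer, &expected,
                                                     buf, memory_order_release,
                                                     memory_order_relaxed)) {
            free(buf);
            return false;
        }
        return true;
    }

    static void record_send(endpoint_t *ep, bool using_threads)
    {
        if (using_threads) {
            atomic_fetch_add_explicit(&ep->send_count, 1, memory_order_relaxed);
        } else {
            /* single-threaded fast path: no atomic read-modify-write needed */
            unsigned int v = atomic_load_explicit(&ep->send_count, memory_order_relaxed);
            atomic_store_explicit(&ep->send_count, v + 1, memory_order_relaxed);
        }
    }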
Closes open-mpi/ompi#1408
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
* provide a more reliable way of determining that a process is a singleton by leveraging the schizo framework. Add new components for slurm, alps, and orte to detect when we are in a managed environment, and if we have been launched by mpirun or a native launcher. Set the correct envars to control ess and pmix selection in each case.
* change the relative priority of the pmix120 and pmix112 components to make pmix120 the default
* fix singleton comm-spawn by correctly setting the num_apps field of the orte_job_t created by the daemon - this fixes a segfault in register_nspace on newly created daemons
* ensure orterun doesn't propagate any ess or pmix directives in its environment
* Cleanup a few valgrind issues and memory leaks
* Fix a race condition that prevented the client from completing notification registrations (missing thread shift)
* Ensure the schizo/alps component detects launch by mpirun
This commit removes the --with-mpi-thread-multiple option and forces
MPI_THREAD_MULTIPLE support. This cleans up an abstraction violation
in opal where OMPI_ENABLE_THREAD_MULTIPLE determined whether
opal_using_threads() was meaningful. To reduce the performance hit on
MPI_THREAD_SINGLE programs, an OPAL_UNLIKELY is used for the check on
opal_using_threads() in the OPAL_THREAD_* macros.
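A minimal sketch of that pattern, not the literal OPAL macro;
opal_using_threads() is assumed to be provided elsewhere, and the
SKETCH_* names are illustrative:

    #include <stdbool.h>
    #include <stdint.h>

    extern bool opal_using_threads(void);  /* provided by opal in the real code */

    #define SKETCH_UNLIKELY(x) __builtin_expect(!!(x), 0)

    /* atomic only when threads are actually in use; the branch is predicted
     * not taken, so MPI_THREAD_SINGLE programs pay very little for it */
    #define SKETCH_THREAD_ADD_FETCH32(addr, delta)                        \
        (SKETCH_UNLIKELY(opal_using_threads())                            \
             ? __atomic_add_fetch((addr), (delta), __ATOMIC_SEQ_CST)      \
             : (*(addr) += (delta)))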
This commit does not clean up the arguments to the various functions
that take a flag indicating whether multi-threading support is
enabled. That should be done at a later time.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit fixes a bug that can occur when communicating via XRC to
peers on the same node. UDCM was not saving the SRQ numbers on the
loopback endpoint (which shares its ib_addr info with all local peers)
so any messages to local peers used an invalid SRQ number.
Fixes open-mpi/ompi#1383
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
This commit fixes two issues with the ib_addr lock:
- The ib_addr lock must always be obtained regardless of
  opal_using_threads() as the CPC is run in a separate thread.
- The ib_addr lock is held in mca_btl_openib_endpoint_connected when
  calling back into the CPC start_connect on any pending
  connections. This will attempt to obtain the ib_addr lock
  again. Since this is not a performance-critical part of the code,
  the lock has been changed to be recursive (see the sketch below).
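A minimal sketch of the recursive-lock change, using plain pthreads in
place of opal's mutex type; the structure and function names here are
illustrative stand-ins, not the openib btl code:

    #include <pthread.h>

    typedef struct {
        pthread_mutex_t lock;   /* stands in for the ib_addr lock */
    } ib_addr_t;

    static void ib_addr_lock_init(ib_addr_t *addr)
    {
        pthread_mutexattr_t attr;
        pthread_mutexattr_init(&attr);
        pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_RECURSIVE);
        pthread_mutex_init(&addr->lock, &attr);
        pthread_mutexattr_destroy(&attr);
    }

    static void start_connect(ib_addr_t *addr)
    {
        /* called back with addr->lock already held by endpoint_connected();
         * the recursive type makes this second acquisition legal */
        pthread_mutex_lock(&addr->lock);
        /* ... start the pending connection ... */
        pthread_mutex_unlock(&addr->lock);
    }

    static void endpoint_connected(ib_addr_t *addr)
    {
        /* taken unconditionally: the CPC runs in a separate thread, so
         * opal_using_threads() cannot be used to skip the lock */
        pthread_mutex_lock(&addr->lock);
        start_connect(addr);
        pthread_mutex_unlock(&addr->lock);
    }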
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit fixes a bug that occurs when attempting a get or put
operation on an endpoint that is not already connected. In this case
the remote_srqn may be set to an invalid value as the rem_srqs array
on the endpoint is not populated. This commit moves the usage of the
rem_srqs array into the internal put/get functions, where the array is
guaranteed to be populated.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit fixes a memory corruption bug when parsing lines of the
form:
-x FOO=bar
The code was making changes to the size of the buffer allocated for
key_buffer without making the appropriate changes to
key_buffer_len. This was causing subsequent calls to save_param_name
to write to invalid memory.
This commit makes the following changes:
- Fix the above bug by modifying trim_name to move the string within
the buffer instead of re-allocating space for the trimmed string.
- Cleaned up both trim_name and save_param_name. Both functions took
  a prefix and a suffix to trim. The problem was that the prefix was
  not treated as a prefix: the "prefix" was located anywhere in the
  string using strstr, and the trimmed value started after that
  substring (even in the middle of the string). To allow trimming
  both -x and --x (as well as -mca and --mca), trim_name is now
  called once per prefix (see the sketch below).
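A minimal sketch of that trimming behaviour (illustrative only; the
exact prefix/suffix strings and buffer handling in the real code may
differ): the prefix is matched only at the start of the string, and the
trimmed value is moved inside the existing buffer with memmove, so the
buffer and its recorded length never change.

    #include <string.h>

    static void trim_name(char *buffer, const char *prefix, const char *suffix)
    {
        size_t plen = strlen(prefix);

        /* strip the prefix only if the string actually starts with it */
        if (0 == strncmp(buffer, prefix, plen)) {
            memmove(buffer, buffer + plen, strlen(buffer + plen) + 1);
        }

        /* strip a trailing suffix, if present */
        size_t len  = strlen(buffer);
        size_t slen = strlen(suffix);
        if (len >= slen && 0 == strcmp(buffer + len - slen, suffix)) {
            buffer[len - slen] = '\0';
        }
    }

    /* a caller can then trim both spellings, e.g.:
     *   trim_name(key_buffer, "--x", "=");
     *   trim_name(key_buffer, "-x",  "=");
     */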
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit ensures ib_addr->remote_xrc_rcv_qp_num value is set when
creating the loopback queue pair. This is needed when communicating
with any other local peer.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit fixes two bugs in XRC support:
- When dynamic add_procs support was added to master, the remote
  process name was added to the non-XRC request structure. The same
  value was not added to the XRC xconnect structure. This error was
  not caught because the send/recv code was using the wrong structure
  member. This commit adds the member and ensures the xconnect code
  uses the correct structure.
- XRC loopback QP support has been fixed. It was 1) not setting the
  correct fields on the endpoint structure, 2) calling
  udcm_xrc_recv_qp_connect, and 3) not initializing the endpoint
  data.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
mca_btl_openib_put incorrectly checks the qp inline max before
allowing an inline put. This check will always fail for an endpoint
that has not been connected. The commit changes the check to use the
btl_put_local_registration_threshold instead.
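A minimal sketch of the revised check, with a simplified stand-in for
the btl module structure (the real module carries this field in its
base module); only the comparison itself is taken from the commit:

    #include <stdbool.h>
    #include <stddef.h>

    /* simplified stand-in for the btl module */
    struct btl_module {
        size_t btl_put_local_registration_threshold;
    };

    /* valid even before the endpoint's QP is connected, unlike the per-QP
     * inline max that the old check relied on */
    static bool put_can_go_inline(const struct btl_module *btl, size_t size)
    {
        return size <= btl->btl_put_local_registration_threshold;
    }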
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
Commit open-mpi/ompi@400af6c52d
introduced a regression in XRC support. The commit reversed the
ordering of shared receive queue (SRQ) and completion queue (CQ)
creation. CQ creation must always precede SRQ creation when using
XRC, as the CQs are needed to create the SRQs. This commit fixes the
ordering so that CQs are always created before SRQs.
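A minimal sketch of the required ordering using the libibverbs XRC API
(error handling and most attributes omitted; this is not the openib btl
code itself): the CQ must exist first because ibv_create_srq_ex() takes
it as part of the XRC SRQ attributes.

    #include <infiniband/verbs.h>

    static struct ibv_srq *create_xrc_srq(struct ibv_context *ctx,
                                          struct ibv_pd *pd,
                                          struct ibv_xrcd *xrcd,
                                          int depth)
    {
        /* 1. create the completion queue first */
        struct ibv_cq *cq = ibv_create_cq(ctx, depth, NULL, NULL, 0);
        if (NULL == cq) {
            return NULL;
        }

        /* 2. the XRC SRQ references that CQ at creation time */
        struct ibv_srq_init_attr_ex attr = {
            .attr      = { .max_wr = depth, .max_sge = 1 },
            .comp_mask = IBV_SRQ_INIT_ATTR_TYPE | IBV_SRQ_INIT_ATTR_PD |
                         IBV_SRQ_INIT_ATTR_XRCD | IBV_SRQ_INIT_ATTR_CQ,
            .srq_type  = IBV_SRQT_XRC,
            .pd        = pd,
            .xrcd      = xrcd,
            .cq        = cq,
        };
        return ibv_create_srq_ex(ctx, &attr);
    }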
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
* Clean up the DVM so it continues to run even when applications error out and we would ordinarily abort the daemons.
* Create a new errmgr component for the DVM to handle the differences.
* Cleanup the DVM state component.
* Add ORTE bindings directory and brief README
* Pass a local tool index around to match jobs.
* Pass the jobid on job completion.
* Fix initialization logic.
* Add framework for python wrapper.
* Fix terminate-with-non-zero-exit behavior so it properly terminates only the indicated procs, notifies orte-submit, and orte-dvm continues executing.
* Add some missing options to orte-dvm
* Fix a bug in -host processing that caused us to ignore the #slots designator. Add a new attribute to indicate "do not expand the DVM" when submitting job spawn requests.
* It actually makes no sense that we treat the termination of all children differently than terminating the children of a specific job - it only creates confusion over the difference in behavior. So terminate children the same way regardless.
Extend the cmd_line utility to easily allow layering of command line definitions
Catch up with ORTE interface change and make build more generic.
Disable "fixed dvm" logic for now.
Add another cmd_line function to merge a table of cmd line options with another one, reporting as errors any duplicate entries. Use this to allow orterun to reuse the orted_submit code
Fix the "fixed_dvm" logic by ensuring we reset num_new_daemons to zero. Also ensure that the nidmap is sent with the first job so the downstream daemons get the node info. Remove a duplicate cmd line entry in orterun.
Revise the DVM startup procedure to pass the nidmap only once, at the startup of the DVM. This reduces the overhead on each job launch and ensures that the nidmap doesn't get overwritten.
Add new commands to get_orted_comm_cmd_str().
Move ORTE command line options to orte_globals.[ch].
Catch up with extra orte_submit_init parameter.
Add example code.
Add documentation.
Bump version.
The nidmap and routing data must be updated prior to propagating the xcast or else the xcast will fail.
Fix the return code so it is something more expected when an error occurs. Ensure we get an error returned to us when we fail to launch for some reason. In this case, we will always get a launch_cb as we did indeed attempt to spawn it. The error code will be returned in the complete_cb.
Fix the return code from orte_submit_job - it was returning the tracker index instead of "success". Take advantage of ORTE's pretty-print capabilities to provide a nice error output explaining why we failed to launch. Ensure we always get a launch_cb when we fail to launch, but no complete_cb as the job never launched.
Extend the error reporting capability to job completion as well.
Add index parameter to orte_submit_job().
Add orte_job_cancel and implement ORTE_DAEMON_TERMINATE_JOB_CMD.
Factor out dvm termination.
Parse the terminate option at tool level.
Add error string for ORTE_ERR_JOB_CANCELLED.
Add some safeguards.
Cleanup and/or comments.
Enable the return.
Properly ORTE_DECLSPEC orte_submit_halt.
Add orte_submit_halt and orte_submit_cancel to interface.
Use the plm interface to terminate the job
Cleanup the configury so we properly check for Singularity under the various typical use-cases
Bring the Singularity support online. We have to turn "off" the sm BTL as it segfaults from inside the container - root cause remains unclear. Also turned "off" the various OPAL shmem components in case they are involved and someone else tries to use them. Happily, the vader BTL works just fine!
These changes fix issue https://github.com/open-mpi/ompi/issues/1336:
- Improve abstractions: the opal/memory/linux component should be the
  single place that operates on Memory Allocation Hooks.
- Avoid collisions in the case of dynamic component open/close: this is
  safe because the component is linked statically.
- Do not change the original behaviour.
As reported by @marksantcroos, this substitution in opal.pc was
incorrect -- it left @{libdir} in the string (vs. ${libdir}). The fix
is simple: use the proper substitution variable in opal.pc (it was
never updated to reflect the new/correct name that was created just
for the pkg-config files).
Fixes open-mpi/ompi#1343.
0715802f52c24c236700ac085090d5441524644c missed that there is a call
to a common/verbs_usnic symbol in the common/verbs component. This
call needs to be compiled out when the common/verbs_usnic component is
not built.
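A hedged sketch of the compile-out described above; the macro, header,
and function names here are hypothetical stand-ins, not the actual
configure output or common/verbs_usnic symbols. The point is only that
the reference to the component's symbol must disappear when the
component is not built, so libopen-pal does not pick up an unresolvable
dependency.

    #if COMMON_VERBS_USNIC_BUILT        /* hypothetical configure-set flag */
    #include "common_verbs_usnic.h"     /* hypothetical component header   */
    #endif

    static void register_usnic_workaround(void)
    {
    #if COMMON_VERBS_USNIC_BUILT
        /* only reference the common/verbs_usnic symbol when it is compiled in */
        common_verbs_usnic_workaround();  /* hypothetical component entry point */
    #endif
    }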
This component is a workaround to a bug in libibverbs that prints a
dire warning that usNIC devices are not supported (of course not --
usNIC devices provide functionality through libfabric, not
libibverbs). This component was written before a better workaround
was created: a "no op" libibverbs plugin for usNIC devices
(https://github.com/cisco/libusnic_verbs, and is also available in
binary form on cisco.com).
Hence, this component no longer builds by default. It's still
available if a user specifically asks for it (e.g., if they do not
want to install the "no op" libibverbs plugin), but it's not the
default. This component also has the side-effect of making
libopen-pal.so depend on libibverbs.so, which can be annoying for
packagers (which is another reason it isn't built by default any
more).
The send code in the ugni btl has an optimization that enables it to
return 1 (fragment gone) in some cases. This optimization involved
removing the btl ownership and callback flags to ensure the fragment
stuck around long enough for its completion flag to be checked. This
works fine for the single-threaded case but not in the multi-threaded
case. It is possible that a fragment will be completed by another
thread while a thread is in mca_btl_ugni_send. This competition can
lead to a leaked fragment, a missed callback, or both. To fix the issue
without removing the optimization, a reference count has been added to
the fragment. The callback and fragment release are not performed until
the fragment reference count has reached 0. The count is incremented
before sending the frag and decremented after the completion flag has
been checked. The fix has been verified to work using a multi-threaded
RMA benchmark with the osc/pt2pt component.
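A minimal sketch of that reference-count pattern, using C11 atomics;
the frag_t layout and function names are illustrative, not the actual
ugni fragment code. The thread that drops the count to zero performs
the callback/release, so the frag cannot go away while the sender is
still inspecting its completion flag.

    #include <stdatomic.h>
    #include <stdbool.h>

    typedef struct {
        atomic_int  ref_count;
        atomic_bool send_complete;
    } frag_t;

    static void frag_callback_and_release(frag_t *frag)
    {
        /* run the callback (if still required) and return the frag */
    }

    static void frag_unref(frag_t *frag)
    {
        if (1 == atomic_fetch_sub(&frag->ref_count, 1)) {
            frag_callback_and_release(frag);   /* last reference */
        }
    }

    static int btl_send(frag_t *frag)
    {
        atomic_store(&frag->ref_count, 2);     /* sender + completion handler */
        atomic_store(&frag->send_complete, false);

        /* ... post the send; completion may fire on another thread ... */

        bool done = atomic_load(&frag->send_complete);
        frag_unref(frag);                      /* drop the sender's reference */
        return done ? 1 : 0;                   /* 1 == "fragment gone"        */
    }

    /* called from the device completion path, possibly on another thread */
    static void on_send_complete(frag_t *frag)
    {
        atomic_store(&frag->send_complete, true);
        frag_unref(frag);                      /* drop the completion reference */
    }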
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
This commit fixes a race condition that can cause an endpoint to be
added to the wait list multiple times. To fix the issue, an additional
check has been added to ensure the endpoint is not on the wait list
after the wait list lock is held. The wait list processing code has
also been updated to keep the wait list lock until all wait-listed
endpoints have been handled. This reduces the chance that an endpoint
being processed by the wait list code is re-added to the list by a
competing send.
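A minimal sketch of the double-check described above, with pthreads
used in place of opal's lock and an illustrative endpoint layout: the
"already on the wait list" test is repeated once the lock is held, and
the processing loop keeps the lock while draining the list.

    #include <pthread.h>
    #include <stdbool.h>
    #include <stddef.h>

    static pthread_mutex_t wait_list_lock = PTHREAD_MUTEX_INITIALIZER;

    typedef struct endpoint {
        struct endpoint *next;
        bool             wait_listed;
    } endpoint_t;

    static endpoint_t *wait_list_head;

    static void add_to_wait_list(endpoint_t *ep)
    {
        pthread_mutex_lock(&wait_list_lock);
        /* re-check under the lock: a competing send may have added it already */
        if (!ep->wait_listed) {
            ep->wait_listed = true;
            ep->next = wait_list_head;
            wait_list_head = ep;
        }
        pthread_mutex_unlock(&wait_list_lock);
    }

    static void progress_wait_list(void)
    {
        pthread_mutex_lock(&wait_list_lock);
        /* keep the lock until every wait-listed endpoint has been handled so
         * a concurrent send cannot re-add an endpoint being processed */
        while (NULL != wait_list_head) {
            endpoint_t *ep = wait_list_head;
            wait_list_head  = ep->next;
            ep->wait_listed = false;
            /* ... try to make progress on ep ... */
        }
        pthread_mutex_unlock(&wait_list_lock);
    }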
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>