openmpi

Author	SHA1	Message	Date
George Bosilca	a4d99ddef6	More synchronizations for the Windows version. The problem came from multiple threads accessing the OOB/registry asynchronously via the callbacks. The quickest solution (but definitely not the cleanest) is to serialize these callbacks in such a way that at any given time only one thread can execute a callback. This commit was SVN r15086.	2007-06-14 22:35:38 +00:00
George Bosilca	fb9ff5cc75	Don't remove the tcp events from the list, they will remove themselves in the destructor. This commit was SVN r15085.	2007-06-14 22:33:09 +00:00
George Bosilca	95a607b945	A more Windows-friendly version. As the socket events will be generated through the win dll using multiple threads, we have to ensure that the oob callbacks happen only in a synchronous way, or really bad things happen with the current design (blocking messages from a receive callback). This commit was SVN r15069.	2007-06-14 04:38:06 +00:00
Brian Barrett	84d1512fba	Add the potential for doing some basic error checking on mutexes during single threaded builds. In its default configuration, all this does is ensure that there's at least a good chance of threads building based on non-threaded development (since the variable names will be checked). There is also code to make sure that a "mutex" is never "double locked" when using the conditional macro mutex operations. This is off by default because there are a number of places in both ORTE and OMPI where this alarm spews mega bytes of errors on a simple test. So we have some work to do on our path towards thread support. Also removed the macro versions of the non-conditional thread locks, as the only places they were used, the author of the code intended to use the conditional thread locks. So now you have upper-case macros for conditional thread locks and lowercase functions for non-conditional locks. Simple, right? :). This commit was SVN r15011.	2007-06-12 16:25:26 +00:00
Ralph Castain	983fd3432a	Fix singleton comm_spawn. Ensure that singleton's start the RML receive function so they can receive RML updates during xconnect procedures once any comm_spawn'd children start. Since singleton's only use the RMGR/URM component, update that component to also hold us until xconnect is completed (if it is invoked) before returning to the caller. This commit was SVN r14914.	2007-06-06 17:39:23 +00:00
Brian Barrett	e4b369c93e	Properly handle case where user instructs the oob to not use all non-localhost interfaces This commit was SVN r14815.	2007-05-31 02:29:44 +00:00
George Bosilca	f8f71b9ba0	Correct a threaded problem and make sure we only free what was allocated. This commit was SVN r14803.	2007-05-30 18:50:29 +00:00
Jeff Squyres	379a4ec5e2	While we're editing MCA params in the oob tcp component, ditch the use of the deprecated MCA param API for registering MCA parameters and update to the current API. This commit was SVN r14747.	2007-05-24 13:01:55 +00:00
Jeff Squyres	839c1db95c	Fix something that has been bugging me for a while: Rename the oob_tcp_include and oob_tcp_exclude MCA parameters to be oob_tcp_if_include and oob_tcp_if_exclude (to match the convention with btl_tcp_if_[in\|ex]clude). Keep "hidden" synonyms oob_tcp_include and oob_tcp_exclude in case anyone is actually using them (and some users undoubtedly are), but do not have them show up in ompi_info --param output. Instead, the new "oob_tcp_if_*" names will show up in ompi_info output. This commit was SVN r14746.	2007-05-24 12:52:26 +00:00
Ralph Castain	02f6e6ab3e	Slight touchup to make it pretty This commit was SVN r14734.	2007-05-23 16:39:18 +00:00
Ralph Castain	3fc227286f	Be sure to NULL terminate the list of keys... This commit was SVN r14733.	2007-05-23 16:35:03 +00:00
Ralph Castain	5b0abf520b	Don't update our own contact info This commit was SVN r14718.	2007-05-22 13:28:23 +00:00
Ralph Castain	4fff584a68	Commit the orted-failed-to-start code. This correctly causes the system to detect the failure of an orted to start and allows the system to terminate all procs/orteds that did start. The primary change that underlies all this is in the OOB. Specifically, the problem in the code until now has been that the OOB attempts to resolve an address when we call the "send" to an unknown recipient. The OOB would then wait forever if that recipient never actually started (and hence, never reported back its OOB contact info). In the case of an orted that failed to start, we would correctly detect that the orted hadn't started, but then we would attempt to order all orteds (including the one that failed to start) to die. This would cause the OOB to "hang" the system. Unfortunately, revising how the OOB resolves addresses introduced a number of additional problems. Specifically, and most troublesome, was the fact that comm_spawn involved the immediate transmission of the rendezvous point from parent-to-child after the child was spawned. The current code used the OOB address resolution as a "barrier" - basically, the parent would attempt to send the info to the child, and then "hold" there until the child's contact info had arrived (meaning the child had started) and the send could be completed. Note that this also caused comm_spawn to "hang" the entire system if the child never started... The app-failed-to-start helped improve that behavior - this code provides additional relief. With this change, the OOB will return an ADDRESSEE_UNKNOWN error if you attempt to send to a recipient whose contact info isn't already in the OOB's hash tables. To resolve comm_spawn issues, we also now force the cross-sharing of connection info between parent and child jobs during spawn. Finally, to aid in setting triggers to the right values, we introduce the "arith" API for the GPR. This function allows you to atomically change the value in a registry location (either divide, multiply, add, or subtract) by the provided operand. It is equivalent to first fetching the value using a "get", then modifying it, and then putting the result back into the registry via a "put". This commit was SVN r14711.	2007-05-21 18:31:28 +00:00
Brian Barrett	33a5758521	Some IPv6 improvements: * Move ipv6comat.h code into opal_config_bottom.h and change into some more intelligent testing of structures * Change opal's if interface to use sockaddr instead of sockaddr_storage, as the RFCs suggest we do * Move the networking code in opal that isn't directly related to if detection into net.h * Add quicky function to get the port out of either a sockaddr_in or sockaddr_in6, saving a bunch of code in the oob. * Update TCP oob and btl with new interface This commit was SVN r14679.	2007-05-17 01:17:59 +00:00
Josh Hursey	596062d34b	Seems that the recent changes in the sds and oob exposed some invalid assumptions in the FT restart code for the ORTE layer. This fixes those problems by having the RML completely shutdown and restart the OOB framework (instead of just the module as before). This makes it much easier to manage, and maintainable as the OOB changes in the future. The SDS now does communication as part of its startup procedure, so we need to make sure we restart the RML before the SDS so that it can communicate properly. OOB base [close\|open] used a static bool to determine if they have been called previously or not. I needed to expose this boolean so that I can close() then open() the oob base in the restart procedure. The functionality has not changed, we just now have the ability to open/close the framework as many times as we need to as long as we always call them in that order. (So calling open twice in a row is not allowed as before, it is only allowed if you open(), close(), then open() again). Things seem to be working now. This commit was SVN r14515.	2007-04-25 19:51:52 +00:00
Brian Barrett	4b8bb70afb	A couple cleanups for the IPv6 support: - make opal_sockaddr2str() take a sockaddr_storage instead of a sockaddr_in6 so that it works for IPv4 and IPv6 addresses, and remove a whole bunch of #ifs in the OOB code. - Fix a compiler warning in the TCP BTL due to run-time determined array size by making it a dynamically allocated array. - Fix the unpacking code of IPv4 addresses when using IPv6 support, so that the address is in the correct location (instead of in an IPv6 structure, use an IPv4 structure). Refs trac:1005. This commit was SVN r14514. The following Trac tickets were found above: Ticket 1005 --> https://svn.open-mpi.org/trac/ompi/ticket/1005	2007-04-25 19:08:07 +00:00
Adrian Knoth	35fce38f43	Don't know why this line was here. This commit was SVN r14509.	2007-04-25 12:31:13 +00:00
Ralph Castain	8517a5a3a6	cleanup a few compiler warnings This commit was SVN r14507.	2007-04-25 11:51:18 +00:00
Jeff Squyres	c4c68e666a	Merge in the ipv6 work from /tmp/ipv6-merge. This commit was SVN r14503.	2007-04-25 01:55:40 +00:00
George Bosilca	cad93a7693	Add more output. Fix some typos, and some small cleanups. This commit was SVN r14327.	2007-04-12 05:01:29 +00:00
Brian Barrett	13a4bba13f	Yet another dumb thing that shouldn't have been in r14261. This commit was SVN r14263. The following SVN revision numbers were found above: r14261 --> open-mpi/ompi@8a55c84d0b	2007-04-07 23:23:23 +00:00
Brian Barrett	8a55c84d0b	Fix a number of OOB issues: * Remove the connect() timeout code, as it had some nasty race conditions when connections were established as the trigger was firing. A better solution has been found for the cluster where this was needed, so just removing it was easiest. * When a fatal error (too many connection failures) occurs, set an error on messages in the queue even if there isn't an active message. The first message to any peer will be queued without being active (and so will all subsequent messages until the connection is established), and the orteds will hang until that first message completes. So if an orted can never contact its peer, it will never exit and just sit waiting for that message to complete. * Cover an interesting RST condition in the connect code. A connection can complete the three-way handshake, the connector can even send some data, but the server side will drop the connection because it can't move it from the half-connected to fully-connected state because of space shortage in the listen backlog queue. This causes a RST to be received the first time that recv() is called, which will be when waiting for the remote side of the OOB ack. In this case, transition the connection back into a CLOSED state and try to connect again. * Add levels of debugging, rather than all or nothing, each building on the previous level. 0 (default) is hard errors. 1 is connection error debugging info. 2 is all connection info. 3 is more state info. 4 includes all message info. * Add some hopefully useful comments This commit was SVN r14261.	2007-04-07 22:33:30 +00:00
Galen Shipman	48d1fa830d	A race condition exists on the free list of pending connections because OPAL_FREE_LIST_WAIT/RETURN will not use locks in a non-threaded build. Conditionally using locks (if non-threaded) around the OPAL_FREE_LIST_WAIT/RETURN seems to fix the issue. Tested at 4K processes and seems to work. This commit was SVN r14135.	2007-03-23 15:19:03 +00:00
Brian Barrett	d454395b51	Need to fall back on the event listen mode if the MCA parameter said use the listen thread, but we're not the HNP. This is better than not starting up any listen mode, which is what we were doing before :/ This commit was SVN r14133.	2007-03-23 13:29:18 +00:00
Josh Hursey	dadca7da88	Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD). This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158	2007-03-16 23:11:45 +00:00
Brian Barrett	f6a5d58885	Rather than set the connect event timeout number to something big and hoping its bigger than the timeout for the connect() call, just don't register the handler by default and fall back to connect() timing out. Should give much happier performance on big clusters. This commit was SVN r13639.	2007-02-13 18:36:50 +00:00
Brian Barrett	262cbbc5c9	Back out r13593, which contained a change that shouldn't be committed. This commit was SVN r13594. The following SVN revision numbers were found above: r13593 --> open-mpi/ompi@81472363ea	2007-02-09 20:13:02 +00:00
Brian Barrett	81472363ea	Allow the OOB to connect between all MPI applications during MPI_INIT without also establishing MPI connectivity. This commit was SVN r13593.	2007-02-09 20:11:40 +00:00
George Bosilca	1e38810c2d	Correctly close the sockets on a generic way. This commit was SVN r13254.	2007-01-23 03:17:23 +00:00
Brian Barrett	03112254e7	Increase connection timeout to 600 seconds, which should always be higher than the connect() timeout, so that we'll use that rather than our own timeout by default. The timeout was set low for Big Red, but that causes problems for very large clusters, as there's no way to wire them up in 10 seconds most of the time. This commit was SVN r13062.	2007-01-10 04:53:21 +00:00
Ralph Castain	6101050ea6	Remove an abstraction barrier I thought was gone long-ago. The OOB subscription really shouldn't be defined as an OMPI subscription. I know it's just a technicality, but it is time to address such things rather than just letting them continue to propagate. :-) This commit was SVN r12954.	2007-01-02 16:16:50 +00:00
Brian Barrett	38c2e43ac2	Print out error string rather than errno for TCP-related errors, making it easier for both the user and us to debug issues with BTL and OOB issues... This commit was SVN r12852.	2006-12-14 18:20:43 +00:00
Ralph Castain	0a5d41857a	Complete next round of message size reduction: "strip" the descriptive info from the returned values. I have now added a flag to the gpr address mode (ORTE_GPR_STRIPPED) that instructs the gpr to not include segment names or tokens in the returned gpr_value_t objects. I found only two places that were looking at the tokens: 1. the odls - we used the tokens to separately process the globals container data from everything else. In this case, I left the subscription that returned the globals data alone, but "stripped" the subscription that returned the launch data for the procs. These subscriptions have nothing to do with the xcast message. 2. the pml_base_modex - the callback function was getting process names from the returned tokens. Actually, this function was doing a very bad thing - it was assuming that the first token returned was always the process name. This is currently true, but is one of those assumptions that someone could have easily changed - and suddenly found the system inexplicably failing. I modified the function to (a) get the name sent back to us, (b) "stripped" the value structures of tokens and segment strings, and (c) correctly obtained process names from the returned values. I also reindented the heck out of the code so it was legible (at least, to my old eyes). This commit was SVN r12813.	2006-12-09 23:10:25 +00:00
Ralph Castain	6d6cebb4a7	Bring over the update to terminate orteds that are generated by a dynamic spawn such as comm_spawn. This introduces the concept of a job "family" - i.e., jobs that have a parent/child relationship. Comm_spawn'ed jobs have a parent (the one that spawned them). We track that relationship throughout the lineage - i.e., if a comm_spawned job in turn calls comm_spawn, then it has a parent (the one that spawned it) and a "root" job (the original job that started things). Accordingly, there are new APIs to the name service to support the ability to get a job's parent, root, immediate children, and all its descendants. In addition, the terminate_job, terminate_orted, and signal_job APIs for the PLS have been modified to accept attributes that define the extent of their actions. For example, doing a "terminate_job" with an attribute of ORTE_NS_INCLUDE_DESCENDANTS will terminate the given jobid AND all jobs that descended from it. I have tested this capability on a MacBook under rsh, Odin under SLURM, and LANL's Flash (bproc). It worked successfully on non-MPI jobs (both simple and including a spawn), and MPI jobs (again, both simple and with a spawn). This commit was SVN r12597.	2006-11-14 19:34:59 +00:00
Galen Shipman	68d9922f44	enable/disable connection sleep in oob_tcp.c via mca param.. on by default.. This commit was SVN r12444.	2006-11-06 18:00:46 +00:00
Brian Barrett	fce5130333	Delay opening the listen socket until module init, so that we can have the seed value have something set to true. Allow selection of the listen type to thread if (and only if) the process is the HNP... This commit was SVN r12105.	2006-10-11 21:29:29 +00:00
Brian Barrett	8f7ab1c584	num_procs can be zero if something went partly wrong before. This will cause a math exception on some platforms, so don't let that happen. This commit was SVN r11929.	2006-10-02 01:27:22 +00:00
Brian Barrett	d00a0de716	* It appears that in their infinite wisdom, Apple removed the __DARWIN_ALIGN_POWER define from the last release of the OS X compiler toolchain. The bug in net/if.h, however, is still there. So look for the hints that we're on a 64 bit Apple PowerPC instead. * If we don't find a buffer size that works by 10MB, we're never going to. So add some code to limit the buffer size we'll try so that we don't fall into an infinite loop * Detect errors in opal_ifcount in the oob init code Refs trac:420 This commit was SVN r11825. The following Trac tickets were found above: Ticket 420 --> https://svn.open-mpi.org/trac/ompi/ticket/420	2006-09-26 16:37:04 +00:00
Andrew Friedley	1b6231a9b5	Fix for running jobs that span multiple 's' partitions on IU BigRed. Each 's' partition has its own TCP network. It's fine to use this network for jobs that fit inside the partition, but the TCP OOB errors when trying to connect across two partitions, because there are two disjoint networks. Each node also has another TCP network connecting ALL nodes together. So the solution is to actually try all the available TCP interfaces on a node, instead of erroring when the first one fails. Also, the default TCP connect() timeout is way too long (5 minutes) - use our own timeout mechanism, with the timeout value expressed as an MCA parameter. This commit was SVN r11718.	2006-09-19 19:33:49 +00:00
Ralph Castain	37dfdb76eb	Here is the major MAD-cure commit. I have written plenty about it, so I refer you here to those messages for a description of everything that was done. This commit was SVN r11661.	2006-09-14 21:29:51 +00:00
George Bosilca	f52c10d18e	And ORTE is ready for prime-time. All Windows tricks are in: - use the OPAL functions for PATH and environment variables - make all headers C++ friendly - no unnamed structures - no implicit cast. Plus a full implementation for the orte_wait functions. This commit was SVN r11347.	2006-08-23 03:32:36 +00:00
Ralph Castain	5dfd54c778	With the branch to 1.2 made.... Clean up the remainder of the size_t references in the runtime itself. Convert to orte_std_cntr_t wherever it makes sense (only avoid those places where the actual memory size is referenced). Remove the obsolete oob barrier function (we actually obsoleted it a long time ago - just never bothered to clean it up). I have done my best to go through all the components and catch everything, even if I couldn't test compile them since I wasn't on that type of system. Still, I cannot guarantee that problems won't show up when you test this on specific systems. Usually, these will just show as "warning: comparison between signed and unsigned" notes which are easily fixed (just change a size_t to orte_std_cntr_t). In some places, people didn't use size_t, but instead used some other variant (e.g., I found several places with uint32_t). I tried to catch all of them, but... Once we get all the instances caught and fixed, this should once and for all resolve many of the heterogeneity problems. This commit was SVN r11204.	2006-08-15 19:54:10 +00:00
Ralph Castain	d2912f03e0	Cleanup a historical naming convention problem. Move the socket_errno definitions to the OPAL layer and change the name accordingly. This cleans up some interrelationship issues as well as removing a name confusion. This commit was SVN r11186.	2006-08-14 20:14:44 +00:00
Brian Barrett	c744f650ba	* really didn't mean for this patch (the threaded accept() code) to come in with r10841, so revert it (and its fixes) out. Will bring back once cleaned up from the code used in the tbird experiment This commit was SVN r10991. The following SVN revision numbers were found above: r10841 --> open-mpi/ompi@dfa1221c3b	2006-07-25 22:32:01 +00:00
Jeff Squyres	bdab8d744c	Send a pointer to the data, not the data itself. Otherwise, we could get a segv in some cases. This commit was SVN r10984.	2006-07-25 21:42:44 +00:00
George Bosilca	33a7634009	Silence the compiler. This commit was SVN r10851.	2006-07-17 17:13:28 +00:00
Brian Barrett	dfa1221c3b	* AC_CONFIG_LINKS has a minor problem in that it always uses ln -s, rather than $(LN_S). This causes problems with Windows and probably elsewhere (re: #200). So use a slightly different trick to get the right header selected for the MEMCPY and TIMER components. * Using the same trick used to solve the AC_CONFIG_LINKS problem, stop using a separate header file for direct calling in the PML and MTL. This lets me remove some icky code in ompi_mca.m4 that was more fragile than I really liked. This commit was SVN r10841.	2006-07-16 04:23:52 +00:00
Ralph Castain	cef1ce19d6	Restore the "sleep" delay during startup. Since Jeff and I are going to a branch for T-bird, we have restored the trunk to its prior state to avoid any possibility of disturbing it. This commit was SVN r10774.	2006-07-12 22:18:53 +00:00
Ralph Castain	9102b5af3b	Remove the "sleep" delay in the oob connection procedure. This shouldn't cause any problems, especially for launches of less than 1000 processes. Please report any abnormal behavior during launch, though, as we would like to understand what (if any) impact is seen. I couldn't see any on small jobs (the modulo functions render this number down pretty low). This commit was SVN r10763.	2006-07-12 20:31:30 +00:00
Brian Barrett	4b70bb92db	* Per ticket #112 , localhost checks should check against 127.0.0.1/8, rather than just 127.0.0.1. This commit was SVN r10750.	2006-07-11 20:54:49 +00:00
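
The localhost check described in r10750 (match the whole 127.0.0.0/8 block rather than the single address 127.0.0.1) can be sketched roughly as follows. This is only an illustration under stated assumptions: `is_ipv4_loopback` is a hypothetical helper name, not a function from the Open MPI source, and the actual change may be structured differently.

```c
#include <stdbool.h>
#include <stdint.h>
#include <sys/socket.h>
#include <netinet/in.h>
#include <arpa/inet.h>

/*
 * Hypothetical helper (not part of Open MPI): returns true when the
 * dotted-quad string names an address inside 127.0.0.0/8.
 * Strings that do not parse as IPv4 addresses are treated as non-loopback.
 */
static bool is_ipv4_loopback(const char *dotted)
{
    struct in_addr addr;

    /* inet_pton() returns 1 only when the string is a valid IPv4 address. */
    if (inet_pton(AF_INET, dotted, &addr) != 1) {
        return false;
    }

    /* s_addr is in network byte order; convert before masking. */
    uint32_t host = ntohl(addr.s_addr);

    /* A /8 prefix means only the top 8 bits (the first octet) must match 127. */
    return (host & 0xFF000000u) == 0x7F000000u;
}
```

Comparing against the masked prefix instead of the literal 127.0.0.1 means an address such as 127.0.1.1, which some systems assign to the local hostname, would also be recognized as local.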


77 Commits