openmpi

Автор	SHA1	Сообщение	Дата
Jeff Squyres	d2d06008a0	Change the default value of mpi_leave_pinned to -1, meaning that we'll figure it out at runtime (really meaning: we'll still default to "0" unless something explicitly overrides to 1, such as the openib BTL). This way, ompi_info doesn't confusingly report mpi_leave_pinned==0 for mpi_leave_pinned, but we end up running with mpi_leave_pinned==1. Fixes trac:1502. This commit was SVN r19571. The following Trac tickets were found above: Ticket 1502 --> https://svn.open-mpi.org/trac/ompi/ticket/1502	2008-09-16 22:06:14 +00:00
Jeff Squyres	270f482fea	Addendum to r19561: also remove a comment that is no longer true and some code that is commented out. This commit was SVN r19564. The following SVN revision numbers were found above: r19561 --> open-mpi/ompi@17e65369be	2008-09-16 13:02:10 +00:00
George Bosilca	acd3406aa7	Never drop messages. No never no more. This is supposed to fix the ticket #1460. This commit was SVN r19562.	2008-09-15 23:04:18 +00:00
George Bosilca	17e65369be	Fix the deadlock when we run out of resources on the BTLs. Move the progress function from the BML into the PML. The BTL progress functions are now directly registered with the event library. This commit was SVN r19561.	2008-09-15 22:56:23 +00:00
George Bosilca	2499112d1c	Fix indentation. This commit was SVN r19313.	2008-08-17 20:10:54 +00:00
Rainer Keller	e84f1f6fdf	- Mark the variable bytes_delivered as being unused (it is just set within MCA_PML_OB1_RECV_REQUEST_UNPACK) Iff Coverity's prevent makes usage of __attribute__(unused), this should get rid of warning. Relates to CID1060 Would then apply to a many int _rc; definitions, that are used in other macros in similar fashion... This commit was SVN r19179.	2008-08-06 13:46:23 +00:00
George Bosilca	3dafa58b32	Fix coverty issue 1044. This commit was SVN r19178.	2008-08-06 13:38:21 +00:00
Rainer Keller	c1f2b8e476	- Fix resource leak in case of error. Coverity CID1067 This commit was SVN r19168.	2008-08-06 08:04:27 +00:00
Jeff Squyres	0af7ac53f2	Fixes trac:1392, #1400 * add "register" function to mca_base_component_t * converted coll:basic and paffinity:linux and paffinity:solaris to use this function * we'll convert the rest over time (I'll file a ticket once all this is committed) * add 32 bytes of "reserved" space to the end of mca_base_component_t and mca_base_component_data_2_0_0_t to make future upgrades [slightly] easier * new mca_base_component_t size: 196 bytes * new mca_base_component_data_2_0_0_t size: 36 bytes * MCA base version bumped to v2.0 * '''We now refuse to load components that are not MCA v2.0.x''' * all MCA frameworks versions bumped to v2.0 * be a little more explicit about version numbers in the MCA base * add big comment in mca.h about versioning philosophy This commit was SVN r19073. The following Trac tickets were found above: Ticket 1392 --> https://svn.open-mpi.org/trac/ompi/ticket/1392	2008-07-28 22:40:57 +00:00
Jeff Squyres	e3e79c0881	Fixes trac:1379: * Use synonym/deprecated MCA param API for some mca base params * In openib BTL, if we have appropriate memory hooks support, and if mpi_leave_pinned and mpi_leave_pinned_pipeline were not set by the user, set mpi_leave_pinned to 1. * Defer checking mpi_leave_pinned_* until as late as possible (i.e., until after the btl's have had a chance to set mpi_leave_pinned to 1): * in ob1 pml * in rdma mpool This commit was SVN r19022. The following Trac tickets were found above: Ticket 1379 --> https://svn.open-mpi.org/trac/ompi/ticket/1379	2008-07-24 22:51:26 +00:00
Aurelien Bouteiller	086cb6190e	Use the generic version number instead of hardcoded ones This commit was SVN r18983.	2008-07-22 21:10:51 +00:00
George Bosilca	3ba0a8c0c1	In the case where the environment is homogeneous we can ALWAYS create the receiver convertor when we create the request (as we know all architectures are identical). This commit was SVN r18934.	2008-07-17 04:57:55 +00:00
George Bosilca	902a2892b6	Fix typo. This commit was SVN r18933.	2008-07-17 04:55:23 +00:00
George Bosilca	939fa3001d	Small cleanups. Remove some switch cases that cannot be reached. Rename a struct field. This commit was SVN r18931.	2008-07-17 04:50:39 +00:00
George Bosilca	319a8b3219	Once matched the proc attached to the request should be the source of the message and not the first on the list. This fix the ticket #1386. This commit was SVN r18929.	2008-07-17 03:04:28 +00:00
Aurelien Bouteiller	66463cb258	Fix the annoying message from showing up when not using PML V. The underlying bug is not fixed though, but at least people not involved in FT dev should not see it anymore. Fix ticket https://svn.open-mpi.org/trac/ompi/ticket/1328 Aurelien This commit was SVN r18917.	2008-07-15 22:05:40 +00:00
George Bosilca	3de0488410	Fix the truncation problem. This close the #211 . This commit was SVN r18850.	2008-07-09 17:38:41 +00:00
Ralph Castain	6af8a73dc0	Modify the checking logic to look for NULL return This commit was SVN r18749.	2008-06-26 14:08:36 +00:00
Ralph Castain	af8c167861	May be picky, but cleanup before returning in error conditions This commit was SVN r18748.	2008-06-26 13:31:36 +00:00
Ralph Castain	3631a60181	Update the PML selection logic to detect when a modex is required, and in those cases to only have rank=0 report its selected module. This is per the email thread on the devel list: http://www.open-mpi.org/community/lists/devel/2008/06/4223.php This commit was SVN r18747.	2008-06-26 13:22:48 +00:00
George Bosilca	bc9b950162	Honor ^ for the PML selection. This commit was SVN r18683.	2008-06-19 16:50:46 +00:00
George Bosilca	dc0ab0d0a8	Enable the sendi path. This commit was SVN r18633.	2008-06-09 23:03:56 +00:00
Aurelien Bouteiller	ebe6df4c06	Moving the pml_v_output global variable inside the pml_v structure. This should avoid one of the missing symbols when visibility is enabled. This commit was SVN r18627.	2008-06-09 20:38:44 +00:00
Galen Shipman	dbd282fcad	doh.. fix GET protocol.. This commit was SVN r18623.	2008-06-09 19:45:44 +00:00
Ralph Castain	9613b3176c	Effectively revert the orte_output system and return to direct use of opal_output at all levels. Retain the orte_show_help subsystem to allow aggregation of show_help messages at the HNP. After much work by Jeff and myself, and quite a lot of discussion, it has become clear that we simply cannot resolve the infinite loops caused by RML-involved subsystems calling orte_output. The original rationale for the change to orte_output has also been reduced by shifting the output of XML-formatted vs human readable messages to an alternative approach. I have globally replaced the orte_output/ORTE_OUTPUT calls in the code base, as well as the corresponding .h file name. I have test compiled and run this on the various environments within my reach, so hopefully this will prove minimally disruptive. This commit was SVN r18619.	2008-06-09 14:53:58 +00:00
George Bosilca	2aec094d56	The PML V is a component so it should use OMPI_MODULE_DECLSPEC. This commit was SVN r18610.	2008-06-06 17:43:57 +00:00
George Bosilca	4d8cbbc167	Add Pasha's patch as it correctly solve the issues. In fact in the current incarnation these functions do not need the inline keyword anymore. This commit was SVN r18558.	2008-06-03 16:03:36 +00:00
Ralph Castain	c992e99035	Remove the tags from orte_output_open and the filtering operation from orte_output - this will be handled differently to improve the XML output interface This commit was SVN r18557.	2008-06-03 14:24:01 +00:00
George Bosilca	e361bcb64c	Send optimizations. 1. The send path get shorter. The BTL is allowed to return > 0 to specify that the descriptor was pushed to the networks, and that the memory attached to it is available again for the upper layer. The MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag can be used by the PML to force the BTL to always trigger the callback. Unmodified BTL will continue to work as expected, as they will return OMPI_SUCCESS which force the PML to have exactly the same behavior as before. Some BTLs have been modified: self, sm, tcp, mx. 2. Add send immediate interface to BTL. The idea is to have a mechanism of allowing the BTL to take advantage of send optimizations such as the ability to deliver data "inline". Some network APIs such as Portals allow data to be sent using a "thin" event without packing data into a memory descriptor. This interface change allows the BTL to use such capabilities and allows for other optimizations in the future. All existing BTLs except for Portals and sm have this interface set to NULL. This commit was SVN r18551.	2008-05-30 03:58:39 +00:00
Galen Shipman	4da4c44210	Receive side changes, basically uses multiple active message callbacks rather than using a single receive callback followed by a switch on the header. Also fast pathed the matching for small fragments. This commit was SVN r18549.	2008-05-30 01:29:09 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already make the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 1. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not thei opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()). * Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so you if explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent view the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing to messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active. = New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Gleb Natapov	31d2797a2f	If RDMA PUT is received before ACK and registration of memory fails don't start sending fragment by copy in/out before ACK is received as we don't know pointer to receive request yet. Pipeline protocol sometimes doesn't send ACK though, so this case is still broken. This commit was SVN r18423.	2008-05-11 12:40:55 +00:00
Josh Hursey	da2f1c58e2	Some checkpoint/restart cleanup. * Remove the opal_only option. This was suffering from bit rot, and no one uses it. It can be added back fairly easily if wanted. * Cleanup metadata interactions at the local level. * Touch up some of the INC funcitonality (fix typos and a minor ordering issue) This commit was SVN r18416.	2008-05-08 18:47:47 +00:00
Shiqing Fan	8393fb5d47	Use the new memchecker_call function for memory checking of non-blocking communication. This commit was SVN r18399.	2008-05-07 12:28:51 +00:00
Shiqing Fan	f35a06119c	Use memchecker_convertor_call function instead the old one. Move the function to the place that we can use convertor. This commit was SVN r18370.	2008-05-05 13:57:27 +00:00
Josh Hursey	dcd21d7d07	Some checkpoint/restart fixes in response to r18338 (changes in modex). Things should be working now. This commit was SVN r18348. The following SVN revision numbers were found above: r18338 --> open-mpi/ompi@3e55fe6f6d	2008-05-01 17:48:13 +00:00
Ralph Castain	3e55fe6f6d	Fold in the revised modex scheme. Move the ompi_proc_t modex portions to the RTE level since the daemons already have that info. Provide each process with the equivalent of a "nidmap" - both a map of what nodes are in the job, and a map of which node each process is on. This enables the use of static ports, though that hasn't been turned "on" in this commit. Update the rsh tree spawn capability so we spawn the next wave of daemons before launching our own local procs. Add an ability to encode nodenames for large clusters with contiguous node name numbering schemes - this allows communication of all node names in a few bytes instead of tens-of-bytes/node. This commit was SVN r18338.	2008-04-30 19:49:53 +00:00
George Bosilca	6e6c370917	Rollback r18274 as its legal to have a sequence number smaller than the expected one. It doesn't necessarily means the message is duplicated, it can simply signify the message is out of sequence and the counter overflowed. This commit was SVN r18323. The following SVN revision numbers were found above: r18274 --> open-mpi/ompi@73c9de3af9	2008-04-27 18:35:54 +00:00
Aurelien Bouteiller	c20b020ea6	Fix ticket #1275 . The pml v can now be correctly deactivated on the configure command line. Also fix a dist target under some unusual circumpstances. This commit was SVN r18291.	2008-04-24 21:42:54 +00:00
Josh Hursey	2c736873bb	Fix a checkpoint/restart bug that causes a restarted application to occasionally throw a SIGSEGV or SIGPIPE due to invalid socket descriptors. The problem was caused by a bad ordering between the restart of the ORTE level tcp connections (in the OOB - out-of-band communication) and the Open MPI level tcp connections (BTLs). Before this commit ORTE would shutdown and restart the OOB completely before the OMPI level restarted its tcp connections. What would happen is that a socket descriptor used by the OMPI level on checkpoint was assigned to the ORTE level on restart. But the OMPI level had no knowledge that the socket descriptor it was previously using has been recycled so it closed it on restart. This caused the ORTE level to break as the newly created socket descriptor was closed without its knowledge. The fix is to have the OMPI level shutdown tcp connections, allow the ORTE level to restart, and then allow the OMPi level to restart its connections. This seems obvious, and I'm surprised that this bug has not cropped up sooner. I'm confident that this specific problem has been fixed with this commit. Thanks to Eric Roman and Tamer El Sayed for their help in identifying this problem, and patience while I was fixing it. * Add a new state {{{OPAL_CRS_RESTART_PRE}}}. This state identifies when we are on the down slope of the INC (finalize-like) which is useful when you want to close, but not reopen a component set for fear of interfering with a lower level. * Use this new state in OMPI level coordination. Here we want to make sure to play well with both the OMPI/BTL/TCP and ORTE/OOB/TCP components. * Update ft_event functions in PML and BML to handle the new restart state. * Add an additional flag to the error output in OOB/TCP so we can see what the socket descriptor was on failure as this can be helpful in debugging. This commit was SVN r18276.	2008-04-24 17:54:22 +00:00
George Bosilca	3ccac4f803	Oops ... This commit was SVN r18275.	2008-04-24 15:54:52 +00:00
George Bosilca	73c9de3af9	Bark if we got a wrong sequence number. Here wrong means that the seq number if smaller than what we expect. This commit was SVN r18274.	2008-04-24 15:48:43 +00:00
Josh Hursey	cc83d41ad9	Merge in tmp/jjh-scratch {{{ svn merge -r 18218:18240 https://svn.open-mpi.org/svn/ompi/tmp/jjh-scratch . }}} Contains: * Primarily a fix for a user reported problem where a cached file descriptor is causing a SIGPIPE on restart. * Cleanup some small memory leaks from using mca_base_param_env_var() - Thanks Jeff * Cleanup ORTE FT tool compilation in non-FT builds - Thanks Tim P. * Cleanup mpi interface with missplaced {{{OPAL_CR_ENTER_LIBRARY}}} - Thanks Terry * Some other sundry cleanup items all dealing with C/R functionality in the trunk. This commit was SVN r18241.	2008-04-23 00:17:12 +00:00
Ralph Castain	fa082cafa9	Shift the architecture calculation from the ompi/datatype engine to the opal/util area. This allows us to compute the architecture earlier in the launch and communicate it outside of the modex. Note: this is an early preliminary step in the movement of portions of the datatype engine to the opal layer. This commit was SVN r18198.	2008-04-17 20:43:56 +00:00
Tim Prins	3582e11200	cleanup some warnings on 32 bit systems This commit was SVN r18187.	2008-04-17 12:25:05 +00:00
Ralph Castain	3a0d09300b	Fully implement the inbound binomial allgather for daemon-based collectives. Supports both modex and barrier operations. Comm_spawn still uses the rank=0 method - shifting that algo to the daemons is under study. This commit was SVN r18115.	2008-04-09 22:10:53 +00:00
Shiqing Fan	28746bbcdb	Remove the memchecker macro in pml base request, used in req_wait.c, which actually is in the wrong place. Instead, one simple call from send_request_free and recv_request_free(already done) will do all the work, fast and clean. This commit was SVN r18095.	2008-04-07 17:46:50 +00:00
Shiqing Fan	a1e5df1cc9	Use the new memchecker function call which is based on convertor. Remove one unnecessary call. This commit was SVN r18085.	2008-04-07 07:52:04 +00:00
George Bosilca	b4f828f389	We need a newline at the nd of the file, or some compiler bark. This commit was SVN r18023.	2008-03-30 19:05:56 +00:00
Aurelien Bouteiller	77653ac787	Missing .h file in makefile breaked nightly tarball distcheck... This commit was SVN r18006.	2008-03-28 14:36:56 +00:00

1 2 3 4 5 ...

717 Коммитов