openmpi

Автор	SHA1	Сообщение	Дата
Brian Barrett	25d48e22fa	Implementation of the MPI-3 Matched Probe functionality. Currently only implemented in the OB1 PML, will return NOT_SUPPORTED in other PMLs. This commit was SVN r25865.	2012-02-06 17:35:21 +00:00
Nathan Hjelm	bb1fec0407	added put/get btl descriptor flags This commit was SVN r25553.	2011-11-30 21:37:23 +00:00
Nathan Hjelm	8962ce25b0	fixed some compiler errors caused by seg_key changes. osc/rdma may need to be updated to use btls that use 128 bit segment keys This commit was SVN r25448.	2011-11-06 20:19:14 +00:00
Ralph Castain	2eaadcfab9	Remove unused variable This commit was SVN r25284.	2011-10-14 15:32:18 +00:00
George Bosilca	3241bea696	Apply a patch provided by Sébastien Boisvert fixing an issue with the probe fairness. This commit was SVN r25265.	2011-10-11 20:28:33 +00:00
George Bosilca	4fd78c4683	Keep track of the last probe on each communicator, so we can probe all peers in a round-robin fashion. A little bit more fair ... This commit was SVN r25264.	2011-10-11 20:24:54 +00:00
Rolf vandeVaart	20c5e6e0d6	Fix a few more cases where we are using a function as an argument to a macro which could result in it being called twice. I did not observe any issues, but it should be fixed. Also did some minor refactoring for clarity and following code convention. This commit was SVN r23886.	2010-10-12 20:11:48 +00:00
Rolf vandeVaart	0331889495	Some more spaces, tabs, include file ordering changes. No real code changes here. This commit was SVN r23789.	2010-09-22 13:48:22 +00:00
Abhishek Kulkarni	afbe3e99c6	* Wrap all the direct error-code checks of the form (OMPI_ERR_* == ret) with (OMPI_ERR_* = OPAL_SOS_GET_ERR_CODE(ret)), since the return value could be a SOS-encoded error. The OPAL_SOS_GET_ERR_CODE() takes in a SOS error and returns back the native error code. * Since OPAL_SUCCESS is preserved by SOS, also change all calls of the form (OPAL_ERROR == ret) to (OPAL_SUCCESS != ret). We thus avoid having to decode 'ret' to get the native error code. This commit was SVN r23162.	2010-05-17 23:08:56 +00:00
George Bosilca	321213e779	Fix segmentation fault on heterogeneous architectures. Don't mess with the ompi_ptr_t by translating into void*. Instead keep it as an ompi_ptr_t all the way. Thanks to Timur Magomedov for helping to track down this issue and test the patch. cmr:v1.4 cmr:v1.5 This commit was SVN r23030.	2010-04-23 15:14:55 +00:00
Rolf vandeVaart	c82e468ede	Undo revision r21767 - sorry folks This commit was SVN r21769. The following SVN revision numbers were found above: r21767 --> open-mpi/ompi@41f38110ff	2009-08-05 22:23:26 +00:00
Rolf vandeVaart	41f38110ff	HCA failover support in openib BTL This commit was SVN r21767.	2009-08-05 21:53:02 +00:00
George Bosilca	cf8bd2142a	Various cleanups and typos. This commit was SVN r21765.	2009-08-05 03:12:33 +00:00
Rainer Keller	6c5532072a	- Split the datatype engine into two parts: an MPI specific part in OMPI and a language agnostic part in OPAL. The convertor is completely moved into OPAL. This offers several benefits as described in RFC http://www.open-mpi.org/community/lists/devel/2009/07/6387.php namely: - Fewer basic types (int* and float* types, boolean and wchar - Fixing naming scheme to ompi-nomenclature. - Usability outside of the ompi-layer. - Due to the fixed nature of simple opal types, their information is completely known at compile time and therefore constified - With fewer datatypes (22), the actual sizes of bit-field types may be reduced from 64 to 32 bits, allowing reorganizing the opal_datatype structure, eliminating holes and keeping data required in convertor (upon send/recv) in one cacheline... This has implications to the convertor-datastructure and other parts of the code. - Several performance tests have been run, the netpipe latency does not change with this patch on Linux/x86-64 on the smoky cluster. - Extensive tests have been done to verify correctness (no new regressions) using: 1. mpi_test_suite on linux/x86-64 using clean ompi-trunk and ompi-ddt: a. running both trunk and ompi-ddt resulted in no differences (except for MPI_SHORT_INT and MPI_TYPE_MIX_LB_UB do now run correctly). b. with --enable-memchecker and running under valgrind (one buglet when run with static found in test-suite, commited) 2. ibm testsuite on linux/x86-64 using clean ompi-trunk and ompi-ddt: all passed (except for the dynamic/ tests failed!! as trunk/MTT) 3. compilation and usage of HDF5 tests on Jaguar using PGI and PathScale compilers. 4. compilation and usage on Scicortex. - Please note, that for the heterogeneous case, (-m32 compiled binaries/ompi), neither ompi-trunk, nor ompi-ddt branch would successfully launch. This commit was SVN r21641.	2009-07-13 04:56:31 +00:00
Greg Koenig	60485ff95f	This is a very large change to rename several #define values from OMPI_* to OPAL_*. This allows opal layer to be used more independent from the whole of ompi. NOTE: 9 "svn mv" operations immediately follow this commit. This commit was SVN r21180.	2009-05-06 20:11:28 +00:00
George Bosilca	b5deb228f3	Allow the BTL to release the descriptor. In fact the only thing the PML needs is to be involved in the RMA completion process, which is insured by the MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag. Fixes trac:1875. This commit was SVN r20983. The following Trac tickets were found above: Ticket 1875 --> https://svn.open-mpi.org/trac/ompi/ticket/1875	2009-04-13 23:41:50 +00:00
George Bosilca	527540aeb1	Rename req_bytes_delivered to req_bytes_expected for the receive requests to really reflect what this field means. This commit was SVN r20971.	2009-04-10 16:36:20 +00:00
George Bosilca	dfc7cea329	Fix the deadlock issues on the osu_bw. The problem is that the PML is event driver, and if there are no event generated by the BTLs ... well nothing happens (i.e there is no progress at the PML level and all pending fragments remain pending). By forcing the BTL to trigger the callbacks for all ACK and FIN, we give more opportunities to the PML to do real progress, but we pay this in terms of performance. This commit was SVN r20953.	2009-04-07 16:56:37 +00:00
George Bosilca	daba352af4	As the request is not yet updated (i.e. _MATCHED cannot be called as we don't yet know the expected length of the message) we should use the source and tag from the message header instead of the value from the status structure attached to the request. -This line, and those below, will be ignored-- M pml_ob1_recvreq.c This commit was SVN r20844.	2009-03-23 20:25:53 +00:00
Shiqing Fan	abd21b6d17	- An update for memchecker : 1. fix a bug in pml_ob1_recvreq/sendreq.c, buffer was made defined where the request has already been released. 2. complete memchecker support for collective functions. 3. change the wrongly spelled function name of memchecker, i.e. '_isaddressible' should be '_isaddressable' This commit was SVN r20043.	2008-11-27 16:34:02 +00:00
George Bosilca	00d24bf8ab	Scalability patch, or slim-fast effect #1 . All BML structures just got a whole lot smaller, decreasing the memory footprint of the running application. How much it's a good question. Here is a breakdown: - in mca_bml_base_endpoint_t: 3 size_t + 1 uint32_t - in mca_bml_base_btl_t: 1 * int + 1 * double - 1 * float + 6 * size_t + 9 * (void) The decrease in mca_bml_base_endpoint_t is for each peer and the decrease in mca_bml_base_btl_t is for each BTL for each peer. So, if we consider the most convenient case where there is only one network between all peers, this decrease the memory foot print per peer by 9size_t + 9(void) + 2 * int32_t + 1 * double - 1 * float. On a 64 bits machine this will be 156 bytes per peer. Now we access all these fields directly from the underlying BTL structure, and as this structure is common to multiple BML endpoint, we are a lot more cache friendly. Even if this do not improve the latency, it makes the SM performance graph a lot smoother. This commit was SVN r19659.	2008-09-30 21:02:37 +00:00
Rainer Keller	e84f1f6fdf	- Mark the variable bytes_delivered as being unused (it is just set within MCA_PML_OB1_RECV_REQUEST_UNPACK) Iff Coverity's prevent makes usage of __attribute__(unused), this should get rid of warning. Relates to CID1060 Would then apply to a many int _rc; definitions, that are used in other macros in similar fashion... This commit was SVN r19179.	2008-08-06 13:46:23 +00:00
George Bosilca	3ba0a8c0c1	In the case where the environment is homogeneous we can ALWAYS create the receiver convertor when we create the request (as we know all architectures are identical). This commit was SVN r18934.	2008-07-17 04:57:55 +00:00
George Bosilca	939fa3001d	Small cleanups. Remove some switch cases that cannot be reached. Rename a struct field. This commit was SVN r18931.	2008-07-17 04:50:39 +00:00
George Bosilca	319a8b3219	Once matched the proc attached to the request should be the source of the message and not the first on the list. This fix the ticket #1386. This commit was SVN r18929.	2008-07-17 03:04:28 +00:00
George Bosilca	3de0488410	Fix the truncation problem. This close the #211 . This commit was SVN r18850.	2008-07-09 17:38:41 +00:00
Galen Shipman	dbd282fcad	doh.. fix GET protocol.. This commit was SVN r18623.	2008-06-09 19:45:44 +00:00
Ralph Castain	9613b3176c	Effectively revert the orte_output system and return to direct use of opal_output at all levels. Retain the orte_show_help subsystem to allow aggregation of show_help messages at the HNP. After much work by Jeff and myself, and quite a lot of discussion, it has become clear that we simply cannot resolve the infinite loops caused by RML-involved subsystems calling orte_output. The original rationale for the change to orte_output has also been reduced by shifting the output of XML-formatted vs human readable messages to an alternative approach. I have globally replaced the orte_output/ORTE_OUTPUT calls in the code base, as well as the corresponding .h file name. I have test compiled and run this on the various environments within my reach, so hopefully this will prove minimally disruptive. This commit was SVN r18619.	2008-06-09 14:53:58 +00:00
George Bosilca	e361bcb64c	Send optimizations. 1. The send path get shorter. The BTL is allowed to return > 0 to specify that the descriptor was pushed to the networks, and that the memory attached to it is available again for the upper layer. The MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag can be used by the PML to force the BTL to always trigger the callback. Unmodified BTL will continue to work as expected, as they will return OMPI_SUCCESS which force the PML to have exactly the same behavior as before. Some BTLs have been modified: self, sm, tcp, mx. 2. Add send immediate interface to BTL. The idea is to have a mechanism of allowing the BTL to take advantage of send optimizations such as the ability to deliver data "inline". Some network APIs such as Portals allow data to be sent using a "thin" event without packing data into a memory descriptor. This interface change allows the BTL to use such capabilities and allows for other optimizations in the future. All existing BTLs except for Portals and sm have this interface set to NULL. This commit was SVN r18551.	2008-05-30 03:58:39 +00:00
Galen Shipman	4da4c44210	Receive side changes, basically uses multiple active message callbacks rather than using a single receive callback followed by a switch on the header. Also fast pathed the matching for small fragments. This commit was SVN r18549.	2008-05-30 01:29:09 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already make the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 1. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not thei opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()). * Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so you if explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent view the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing to messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active. = New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Shiqing Fan	8393fb5d47	Use the new memchecker_call function for memory checking of non-blocking communication. This commit was SVN r18399.	2008-05-07 12:28:51 +00:00
Ralph Castain	fa082cafa9	Shift the architecture calculation from the ompi/datatype engine to the opal/util area. This allows us to compute the architecture earlier in the launch and communicate it outside of the modex. Note: this is an early preliminary step in the movement of portions of the datatype engine to the opal layer. This commit was SVN r18198.	2008-04-17 20:43:56 +00:00
Shiqing Fan	a1e5df1cc9	Use the new memchecker function call which is based on convertor. Remove one unnecessary call. This commit was SVN r18085.	2008-04-07 07:52:04 +00:00
Gleb Natapov	cf40674369	Decide if sends should be throttled at the receiver and pass this to the sender in an ACK message. The decision can't be done reliably at the sender. This commit was SVN r17987.	2008-03-27 08:56:43 +00:00
George Bosilca	8943ae0b4e	Cleanup plus some typos. This commit was SVN r17858.	2008-03-18 03:03:33 +00:00
Ralph Castain	d70e2e8c2b	Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer This commit was SVN r17632.	2008-02-28 01:57:57 +00:00
George Bosilca	fa31ec81d0	Add the ownership flags to the PML/BTL interface. The layer owning the descriptor is responsible for releasing it once the descriptor is not in use anymore. This commit was SVN r17497.	2008-02-18 17:39:30 +00:00
Gleb Natapov	354c5bc5e1	Don't call progress() from OB1 fragment scheduling functions. They don't serve any purpose and case recursion calls to progress engine. This commit was SVN r17478.	2008-02-17 12:42:32 +00:00
Gleb Natapov	0a1fa2cb56	req_match_received is set inside MCA_PML_OB1_RECV_REQUEST_MATCHE(). This commit was SVN r17442.	2008-02-13 08:34:39 +00:00
Shiqing Fan	54c7b71cfd	Use the correct way of including memchecker.h, which will work with '--with-devel-headers'. This commit was SVN r17435.	2008-02-12 18:01:17 +00:00
Shiqing Fan	f5792bbda5	merging the memchecker into trunk. This commit was SVN r17424.	2008-02-12 08:46:27 +00:00
Gleb Natapov	b37ff74a24	Make function that is used only in one file static. Remove static functions declaration. This commit was SVN r17080.	2008-01-09 09:54:35 +00:00
Ethan Mallove	f32dcb1636	The Sun Studio 12 compilers need to have `inline` specified as `static` in cases where a function is not part of a separate compilation unit (such as `append_recv_req_to_queue`). This commit was SVN r17069.	2008-01-08 18:45:51 +00:00
George Bosilca	b58dae00db	Allow PERUSE to compile correctly. This commit was SVN r17008.	2007-12-21 06:18:19 +00:00
Gleb Natapov	35bf8c7c46	Rewrite OB1 matching logic. Get rid of macros, make the code shorter. This commit was SVN r16993.	2007-12-19 09:16:20 +00:00
Gleb Natapov	5cd38b8b06	Better encapsulate heterogeneous arch handling in ob1. This commit was SVN r16970.	2007-12-16 08:45:44 +00:00
Gleb Natapov	e2e211f23b	Add flags parameter to btl_alloc() and btl_prepare_src() functions. If BTL knows at the time of allocation priority of a descriptor it may do some optimizations. This commit was SVN r16901.	2007-12-09 14:08:01 +00:00
Gleb Natapov	2d784752dd	Remove descriptor caching form BML. With descriptor caching some optimizations are impossible. This commit was SVN r16897.	2007-12-09 13:58:17 +00:00
Gleb Natapov	097b17d30e	Prevent a receive request from been freed while other thread holds a reference to it or there is an outstanding completion for the request. This commit was SVN r16153.	2007-09-18 16:18:47 +00:00

1 2 3 4

153 Коммитов