openmpi

Автор	SHA1	Сообщение	Дата
Ralph Castain	9613b3176c	Effectively revert the orte_output system and return to direct use of opal_output at all levels. Retain the orte_show_help subsystem to allow aggregation of show_help messages at the HNP. After much work by Jeff and myself, and quite a lot of discussion, it has become clear that we simply cannot resolve the infinite loops caused by RML-involved subsystems calling orte_output. The original rationale for the change to orte_output has also been reduced by shifting the output of XML-formatted vs human readable messages to an alternative approach. I have globally replaced the orte_output/ORTE_OUTPUT calls in the code base, as well as the corresponding .h file name. I have test compiled and run this on the various environments within my reach, so hopefully this will prove minimally disruptive. This commit was SVN r18619.	2008-06-09 14:53:58 +00:00
Ralph Castain	c992e99035	Remove the tags from orte_output_open and the filtering operation from orte_output - this will be handled differently to improve the XML output interface This commit was SVN r18557.	2008-06-03 14:24:01 +00:00
George Bosilca	e361bcb64c	Send optimizations. 1. The send path get shorter. The BTL is allowed to return > 0 to specify that the descriptor was pushed to the networks, and that the memory attached to it is available again for the upper layer. The MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag can be used by the PML to force the BTL to always trigger the callback. Unmodified BTL will continue to work as expected, as they will return OMPI_SUCCESS which force the PML to have exactly the same behavior as before. Some BTLs have been modified: self, sm, tcp, mx. 2. Add send immediate interface to BTL. The idea is to have a mechanism of allowing the BTL to take advantage of send optimizations such as the ability to deliver data "inline". Some network APIs such as Portals allow data to be sent using a "thin" event without packing data into a memory descriptor. This interface change allows the BTL to use such capabilities and allows for other optimizations in the future. All existing BTLs except for Portals and sm have this interface set to NULL. This commit was SVN r18551.	2008-05-30 03:58:39 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already make the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 1. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not thei opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()). * Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so you if explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent view the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing to messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active. = New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Ralph Castain	48e5840c50	Restore a placeholder to make non-SVN SCM's happy. This commit was SVN r17648.	2008-02-28 20:19:22 +00:00
George Bosilca	678e6c7f0d	This is a Mercurial file. This commit was SVN r17635.	2008-02-28 05:18:06 +00:00
Ralph Castain	d70e2e8c2b	Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer This commit was SVN r17632.	2008-02-28 01:57:57 +00:00
Ralph Castain	b4ec81a9fd	Fix the Panasas support in ROMIO so it builds without complaints. Required a patch from Brian, plus a few edits by me to remove warnings. NOTE: the code provided by PANASAS includes a "switch" that they left incomplete - it doesn't cover all possibilities. Since the value being switched is an enum, this causes problems for the compiler. I added the missing values, but - since Panasas felt they could be ignored - had the switch generate an error if those cases ever occurred. This commit was SVN r17543.	2008-02-21 20:35:34 +00:00
Jeff Squyres	5bb1e5151f	Suggestions/patches from Brian to make stuff better: * Include all the stuff that is necessary for running autogen.sh in a distribution tarball. * Remove from config/Makefile.am's EXTRA_DIST that which is automatically included in the tarball in recent versions of Automake (i.e., all the m4 files that are acincluded). * Make ROMIO's configure script look for something that is actually included in the tarball. Fixes trac:1025. This commit was SVN r17505. The following Trac tickets were found above: Ticket 1025 --> https://svn.open-mpi.org/trac/ompi/ticket/1025	2008-02-19 01:49:52 +00:00
Jeff Squyres	17ede97ef8	Two fixes to revert some long-ago decisions that seemed like a good idea at the time, but led to logistical difficulties in importing new versions of ROMIO: * We are effectively eliminating the ROMIO file prefix rule hacks in the ROMIO component, which create symlinks from foo.c to io_romio_foo.c. In reality, the file name conflict potential will be small. * Additionally, we are effectively eliminating the ROMIO function prefix rule in the ROMIO component. This is another place where there are generally problems with the merge up new versions of ROMIO and/or patches from the user community (for their own local builds). In reality, since other major MPI implementations provides the same exact symbols, it won't cause any practical problems for users. In return, we make it ''much'' simpler to apply ROMIO patches to Open MPI. The problem right now is that any patch will have filenames such as ad_panfs.c, but Open MPI will only have io_romio_ad_panfs.c, making things extremely difficult for users. I believe, for example, that this would make it possible for LANL to have applied their patches without too much hassle on either their part or our part. It will also make things easier for OMPI when we/they want to do the next ROMIO upgrade (this was one of the sources of problems on each upgrade). This commit was SVN r17436.	2008-02-12 18:55:17 +00:00
Jeff Squyres	213b5d5c6e	Per long threads on the mailing list and much confusion discussion about linkers, have all OPAL, ORTE, and OMPI components '''not'' link against the OPAL, ORTE, or OMPI libraries. See ttp://www.open-mpi.org/community/lists/users/2007/10/4220.php for details (or https://svn.open-mpi.org/trac/ompi/wiki/Linkers for a better-formatted version of the same info). This commit was SVN r16968.	2007-12-15 13:32:02 +00:00
Adrian Knoth	037a533752	Reformatted r16691 to OMPI style. Re #733 This commit was SVN r16693. The following SVN revision numbers were found above: r16691 --> open-mpi/ompi@8dca19cb3b	2007-11-08 12:54:48 +00:00
Adrian Knoth	8dca19cb3b	upstream patch, provided by Jiri Polach. Re #733 This commit was SVN r16691.	2007-11-08 12:44:10 +00:00
Rich Graham	27a748e7eb	change all instances of ompi_free_list_init to ompi_free_list_init_new. Header and payload data are specified separately at this stage. This commit was SVN r16633.	2007-11-01 23:38:50 +00:00
Brian Barrett	2b8af283de	Add ability to completely turn off MPI one-sided support, so that users can experiment with using ROMIO directly. This commit was SVN r15922.	2007-08-18 21:35:51 +00:00
Brian Barrett	9184db7239	Update ROMIO release to the one included with MPICH2-1.0.5p4, tagged in vendor/romio as mpich2-1.0.5p4. This commit was SVN r15544.	2007-07-21 22:08:27 +00:00
Brian Barrett	872623a527	Fix situation where we were not propogating error codes from ROMIO into non-blocking request status fields, so it was never being relayed to the user This commit was SVN r15296.	2007-07-05 22:30:42 +00:00
Brian Barrett	b71b2b4b0d	Make the aio detection work with cross compiling. The tests no longer even look at the status code and basically guarantee that the aio function was never called, so there's really no point in AC_TRY_RUN over AC_COMPILE_IFELSE... This commit was SVN r15033.	2007-06-13 03:16:32 +00:00
Brian Barrett	84d1512fba	Add the potential for doing some basic error checking on mutexes during single threaded builds. In its default configuration, all this does is ensure that there's at least a good chance of threads building based on non-threaded development (since the variable names will be checked). There is also code to make sure that a "mutex" is never "double locked" when using the conditional macro mutex operations. This is off by default because there are a number of places in both ORTE and OMPI where this alarm spews mega bytes of errors on a simple test. So we have some work to do on our path towards thread support. Also removed the macro versions of the non-conditional thread locks, as the only places they were used, the author of the code intended to use the conditional thread locks. So now you have upper-case macros for conditional thread locks and lowercase functions for non-conditional locks. Simple, right? :). This commit was SVN r15011.	2007-06-12 16:25:26 +00:00
Brian Barrett	21e00f6f0c	Clean up a couple of configure things: * Require Autoconf 2.60 or higher and remove some cruft required for AC 2.59 or the AC 2.59 / AC 2.60 mix * Remove a bunch of now unnecessary AC_SUBST calls * Use the libtool-provided variables for the -I and library to use when compiling against ltdl Fixes trac:1000 This commit was SVN r14652. The following Trac tickets were found above: Ticket 1000 --> https://svn.open-mpi.org/trac/ompi/ticket/1000	2007-05-15 04:23:48 +00:00
Sven Stork	a04c8eb39a	- Bring over the visibility feature, for a finer symbol export control via the visibility feature that is provided by some compilers. Per default this feature is disabled, to enable it you need to configure with --enable-visibility and obviously you need a compiler with visibility support. Please refer to the wiki for more information. https://svn.open-mpi.org/trac/ompi/wiki/Visibility This commit was SVN r14582.	2007-05-04 09:03:37 +00:00
Jeff Squyres	82caceda08	A minor change to ROMIO's configure script: make it use exactly the same "restrict" check as the top-level OMPI configure.ac script so that it will guarantee to always get the same result. Therefore, the #define for restrict will always have the same value in both opal_config.h and romioconf.h, and we get 7 less warnings (6 in the IO ROMIO component, 1 in ROMIO itself) when compiling with icc on Linux (because PAC_C_RESTRICT and AC_C_RESTRICT would get different values for the "restrict" #define in this case). This commit was SVN r14387.	2007-04-17 03:10:06 +00:00
Jeff Squyres	51f286d737	Just like r14289 on the ORTE trunk: Per discussions with Brian and Ralph, make a slight correction in where components are installed. Use $pkglibdir, not $libdir/openmpi, so that when compiled in the orte trunk, components are installed to the right directory (because the component search patch is checking $pkglibdir). This commit was SVN r14345. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r14289	2007-04-12 11:19:42 +00:00
Galen Shipman	ebca0bb34e	fix for aggregated writes This commit was SVN r14314.	2007-04-11 22:07:19 +00:00
Jeff Squyres	85d7678350	Revert r14286; it worked for icc, but not for gcc. #$%@#$% Sorry for configure changes during the day; I totally forgot about that. :-( This commit was SVN r14288. The following SVN revision numbers were found above: r14286 --> open-mpi/ompi@0083eba18e	2007-04-10 15:42:59 +00:00
Jeff Squyres	0083eba18e	Comment out the PAC_C_RESTRICT test from ROMIO's configure.in script. The top-level OMPI configure script already checks for "restrict" and will issue a #define for it. PAC_C_RESTRICT would also check for restrict, but sometimes come up with a different answer than the top-level OMPI configure script, thereby resulting in conflicting #define's for "restrict" (e.g., icc 9.0/9.1 on linux x86-64). So it's easiest just to remove this test from ROMIO's configure.in script. This commit was SVN r14286.	2007-04-10 14:50:47 +00:00
Josh Hursey	dadca7da88	Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD). This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158	2007-03-16 23:11:45 +00:00
Brian Barrett	9660bb6ccc	These symbols aren't actually created in ROMIO with Open MPI's configure, so no need to have them in here. This commit was SVN r13933.	2007-03-05 22:55:17 +00:00
Rainer Keller	0889ebd59f	- Eliminate warnings, that PGI-6.2.5 issues with -Minform=inform This commit was SVN r13840.	2007-02-28 08:36:34 +00:00
Sven Stork	d8a369936e	- Fix more symbols that should be exported. This commit was SVN r13824.	2007-02-27 15:17:17 +00:00
Brian Barrett	a34e67d743	Remove unneeded PARAM_INIT_FILE variable in configure.params files used by components that use configure.m4 for configuration or are always built. The macro has not been needed since moving to configure types other than configure.stub Fixes trac:590 This commit was SVN r13031. The following Trac tickets were found above: Ticket 590 --> https://svn.open-mpi.org/trac/ompi/ticket/590	2007-01-08 03:44:22 +00:00
Brian Barrett	6f8b366acb	Rename liborte to libopen-rte and libopal to libopen-pal per telecon today and bug #632. Refs trac:632 This commit was SVN r12762. The following Trac tickets were found above: Ticket 632 --> https://svn.open-mpi.org/trac/ompi/ticket/632	2006-12-05 18:27:24 +00:00
Jeff Squyres	e02114dcf3	Fixes trac:529. * Create a new request type: NOOP (described below) * For all MPI__INIT functions, OBJ_NEW an ompi_request_t and set its type to NOOP Ensure that the NOOP requests are OBJ_RELEASE'd when they are done * MPI_START looks at the request type; if NOOP, just return success. If not, call the PML start() function * MPI_STARTALL always pass the entire array of requests back to the PML (see next point) * Make the PMLs only process PML requests (i.e., ignore/skip anything that isn't of type PML -- such as the NOOP requests) * Add a little more param error checking in STARTALL This commit was SVN r12338. The following Trac tickets were found above: Ticket 529 --> https://svn.open-mpi.org/trac/ompi/ticket/529	2006-10-27 12:32:36 +00:00
Jeff Squyres	9ccbab464d	Remove unused variable. This commit was SVN r11918.	2006-10-01 11:51:41 +00:00
Brian Barrett	a546b6834b	Only add the ROMIO source directory into DIST_SUBDIRS and SUBDIRS if the component is configured successfully. Otherwise, we can end up trying to run make in the romio directory without any Makefiles. This really only happens on the targets that recurse into DIST_SUBDIRS - ie dist, maintainer-clean, and distclean refs trac:411 This commit was SVN r11807. The following Trac tickets were found above: Ticket 411 --> https://svn.open-mpi.org/trac/ompi/ticket/411	2006-09-25 23:51:13 +00:00
Brian Barrett	ddffc8748f	* remove unneeded files that were included in the mpich2 tarball This commit was SVN r11726.	2006-09-20 18:27:28 +00:00
Brian Barrett	d1402cf8f5	* Update ROMIO release to the one included with MPICH2-1.0.4p1, tagged in vendor/romio as mpich2-1.0.4p1. This commit was SVN r11715.	2006-09-19 16:13:46 +00:00
Rainer Keller	611f3ba408	- Fix buglet found by Feng Sheng's IO tests; Correctly initialize the status for non-blocking io. This commit was SVN r11672.	2006-09-15 12:37:29 +00:00
Jeff Squyres	6173e952a2	Remove these files from SVN; they should not be committed. This will force everyone to re-autogen. Sorry! This commit was SVN r11616.	2006-09-11 22:43:20 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Brian Barrett	df9273587f	* romio_cb_write should also be forced to enable when optimizations are requested This commit was SVN r10584.	2006-06-30 15:06:10 +00:00
Brian Barrett	0031e39d72	* fix for dumb memory bug introduced in romio performance fixup code This commit was SVN r10528.	2006-06-27 19:58:18 +00:00
Brian Barrett	9a65a7ca97	* re-add -Is necessary for VPATH builds. This commit was SVN r10524.	2006-06-27 14:10:34 +00:00
Brian Barrett	970d858f30	* Add performance code requested by LANL, per ticket #128 . Must be explicitly enabled at run-time with the mca parameter io_romio_enable_parallel_optimizations set to something non-zero. This will enable some magic flags in Panasas if the user didn't set them (either on or off) and do some slightly better things with strided collective writes. This commit was SVN r10516.	2006-06-26 22:26:36 +00:00
Galen Shipman	218a438509	finished the ompi_free_list_t class nightmare.. This commit was SVN r10314.	2006-06-12 22:09:03 +00:00
Galen Shipman	18dda70fd0	make ompi_free_list_item_t a class.. This will go to the 1.1 branch but will probably require a few changes as ompi_free_list_t is different in the branch.. This commit was SVN r10306.	2006-06-12 16:44:00 +00:00
Brian Barrett	cc99a63169	* fix issue with PANFS not building properly - we didn't add PANFS_LIB to the list of libraries This commit was SVN r10279.	2006-06-09 20:41:12 +00:00
Galen Shipman	83ff3201b5	don't use rank or nprocs in error messages when we don't have them.. This should hit 1.1 and 1.0 branches.. Reviewed by Brian This commit was SVN r10164.	2006-06-01 14:24:11 +00:00
Brian Barrett	4904e34a52	set datarootdir, necessary for Autoconf-2.60 which will define some variables based upon this value (e.g., datadir, docdir). Submitted by: Ralf Wildenhues Reviewed by: Brian Barrett This commit was SVN r10133.	2006-05-31 03:43:55 +00:00
Brian Barrett	6026fc98f6	* Fix M4 quoting so that AC 2.60 won't complain Submitted by: Ralf Wildenhues Reviewed by: Brian Barrett This commit was SVN r10129.	2006-05-31 03:39:18 +00:00

1 2 3

113 Коммитов