openmpi

Author	SHA1	Message	Date
Rainer Keller	d81443cc5a	- On the way to get the BTLs split out and lessen dependency on orte: Often, orte/util/show_help.h is included, although no functionality is required -- instead, most often opal_output.h, or orte/mca/rml/rml_types.h Please see orte_show_help_replacement.sh committed next. - Local compilation (Linux/x86_64) w/ -Wimplicit-function-declaration actually showed two missing #include "orte/util/show_help.h" in orte/mca/odls/base/odls_base_default_fns.c and in orte/tools/orte-top/orte-top.c Manually added these. Let MTT have the last word. This commit was SVN r20557.	2009-02-14 02:26:12 +00:00
Ralph Castain	4cdf91a8d4	Per the RFC, extend the current use of the ompi_proc_t flags field (without changing the field itself). The prior ompi_proc_t structure had a uint8_t flag field in it, where only one bit was used to flag that a proc was "local". In that context, "local" was constrained to mean "local to this node". This commit provides a greater degree of granularity on the term "local", to include tests to see if the proc is on the same socket, PC board, node, switch, CU (computing unit), and cluster. Add #define's to designate which bits stand for which local condition. This was added to the OPAL layer to avoid conflicting with the proposed movement of the BTLs. To make it easier to use, a set of macros have been defined - e.g., OPAL_PROC_ON_LOCAL_SOCKET - that test the specific bit. These can be used in the code base to clearly indicate which sense of locality is being considered. All locations in the code base that looked at the current proc_t field have been changed to use the new macros. Also modify the orte_ess modules so that each returns a uint8_t (to match the ompi_proc_t field) that contains a complete description of the locality of this proc. Obviously, not all environments will be capable of providing such detailed info. Thus, getting a "false" from a test for "on_local_socket" may simply indicate a lack of knowledge. This commit was SVN r20496.	2009-02-10 02:20:16 +00:00
Shiqing Fan	a5281f0434	- 1/4 commit for Windows Visual Studio and CCP support: CMakeLists and .windows files. In contribs preconfigured and precompiled parts. This commit was SVN r20108.	2008-12-10 20:59:20 +00:00
Jeff Squyres	eeabae49b9	Per http://www.open-mpi.org/community/lists/devel/2008/09/4648.php , remove the unconditional opal_output's when mmap() fails, and instead, conditionally output the failure message via btl_base_verbose settings. This commit was SVN r19547.	2008-09-11 19:02:33 +00:00
Ralph Castain	9613b3176c	Effectively revert the orte_output system and return to direct use of opal_output at all levels. Retain the orte_show_help subsystem to allow aggregation of show_help messages at the HNP. After much work by Jeff and myself, and quite a lot of discussion, it has become clear that we simply cannot resolve the infinite loops caused by RML-involved subsystems calling orte_output. The original rationale for the change to orte_output has also been reduced by shifting the output of XML-formatted vs human readable messages to an alternative approach. I have globally replaced the orte_output/ORTE_OUTPUT calls in the code base, as well as the corresponding .h file name. I have test compiled and run this on the various environments within my reach, so hopefully this will prove minimally disruptive. This commit was SVN r18619.	2008-06-09 14:53:58 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already made the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 1. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not the opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()).
* Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so if you explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent via the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing of messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active.
= New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Gleb Natapov	b42234461a	Cleanup shared file creation on unix/linux. This commit was SVN r18021.	2008-03-30 13:41:47 +00:00
Ralph Castain	dc7f45dafd	Remove the obsolete and largely unused orte_system_info structure. The only fields that were used in that struct were nodeid and nodename - these have been transferred to the orte_process_info structure. Only one place used the user name field - session_dir, when formulating the name of the top-level directory. Accordingly, the code for getting the user's id has been moved to the session_dir code. This commit was SVN r17926.	2008-03-23 23:10:15 +00:00
Gleb Natapov	f488b94899	More SM BTL initialization cleanups. This commit was SVN r17833.	2008-03-16 10:01:56 +00:00
Gleb Natapov	90c70e37b9	Clean up SM btl startup code. Remove no longer needed code leftovers from two BTL times. Remove old and no longer correct comment. This commit was SVN r17805.	2008-03-11 14:39:10 +00:00
Ralph Castain	d70e2e8c2b	Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer This commit was SVN r17632.	2008-02-28 01:57:57 +00:00
Jeff Squyres	213b5d5c6e	Per long threads on the mailing list and much confusion and discussion about linkers, have all OPAL, ORTE, and OMPI components '''not''' link against the OPAL, ORTE, or OMPI libraries. See http://www.open-mpi.org/community/lists/users/2007/10/4220.php for details (or https://svn.open-mpi.org/trac/ompi/wiki/Linkers for a better-formatted version of the same info). This commit was SVN r16968.	2007-12-15 13:32:02 +00:00
Sven Stork	9e2263f29f	- fix a small memory leak This commit was SVN r15768.	2007-08-06 13:35:32 +00:00
Rolf vandeVaart	333357f4cc	This fixes the initialization of the usable size of the shared memory. The original code was not compensating for the space used by the header. When memory got tight, the allocator would return a pointer to memory that did not exist resulting in a SEGV for the application. This is a partial fix for ticket #929. Reviewed by Rich Graham. This commit was SVN r13950.	2007-03-07 13:28:06 +00:00
George Bosilca	1c7c39b32b	I missed these warnings in my last commit. This commit was SVN r13431.	2007-02-01 19:34:21 +00:00
George Bosilca	79ea6d471b	Even fewer warnings. This commit was SVN r13429.	2007-02-01 19:27:11 +00:00
Rainer Keller	061ba05439	- Fixes uncovered with the format attribute to opal_output and opal_output_verbose This commit was SVN r13371.	2007-01-30 20:56:31 +00:00
George Bosilca	d19a4f4740	Cast it to make cl happy. This commit was SVN r13267.	2007-01-24 00:51:01 +00:00
Brian Barrett	a34e67d743	Remove unneeded PARAM_INIT_FILE variable in configure.params files used by components that use configure.m4 for configuration or are always built. The macro has not been needed since moving to configure types other than configure.stub Fixes trac:590 This commit was SVN r13031. The following Trac tickets were found above: Ticket 590 --> https://svn.open-mpi.org/trac/ompi/ticket/590	2007-01-08 03:44:22 +00:00
Rich Graham	8a9da02063	change code to conform with coding standard. Handle error condition where shared memory file is not created. This commit was SVN r12964.	2007-01-03 00:06:02 +00:00
Rich Graham	6cb2377015	Change the allocation of the shared memory backing file. The file is allocated on a per comm_world instance, with the lowest rank in comm_world on the given host creating and initializing the file, and then notifying the remaining files via the OOB. Reviewed: Ralph Castain, Brian Barrett Addressing ticket #674. This commit was SVN r12949.	2007-01-01 02:39:02 +00:00
George Bosilca	392178b1a8	Move it in the right place. This commit was SVN r11276.	2006-08-21 04:05:19 +00:00
George Bosilca	ebd7a49428	Create shared memory "files" on Windows. These "files" will be allocated directly by the OS in the paging file (the HUGE file that cannot be defragmented with any tools). Unlike UNIX, they do not have physical existence as files. This commit was SVN r11273.	2006-08-20 19:45:28 +00:00
Sven Stork	da7ad0e8b8	- update function name inside debug statement This commit was SVN r9933.	2006-05-16 14:33:41 +00:00
Brian Barrett	1da22f9099	* silence a bunch of compiler warnings on Solaris when using the Sun compilers. This should go to the v1.1 branch This commit was SVN r9693.	2006-04-23 21:15:09 +00:00
Brian Barrett	9befdc7d9f	* Ensure that mca_common_sm_mmap_seg_alloc() always returns a word-aligned pointer. Otherwise, we can end up segfaulting when the memory area is used by the caller. Fixes a bug reported by Alex Spiegel. This commit was SVN r9692.	2006-04-23 21:14:03 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
George Bosilca	81de5e7a47	Get a meaningful error message on Windows when the shared memory file cannot be created. This commit was SVN r8747.	2006-01-19 07:07:47 +00:00
Jeff Squyres	268a5f5716	Ensure that seg is always initialized, even in the return_error case. This commit was SVN r8646.	2006-01-04 23:01:50 +00:00
George Bosilca	29dd1a6f6b	Unmap from map_addr not map_seg. This commit was SVN r8626.	2005-12-31 16:11:58 +00:00
George Bosilca	e1fbf7b0df	The last commit removed the initialization of map_addr ... This commit was SVN r8624.	2005-12-31 15:18:58 +00:00
George Bosilca	3baf36c6d9	Add the Windows implementation of shared memory files. As the common part is the one that creates the shared memory file, it seems logical to make it destroy the file as well. Therefore, the code for unmapping the file is in a common place. This commit was SVN r8622.	2005-12-31 15:06:24 +00:00
George Bosilca	94c109081b	Protection plus beginning of shared memory on Windows. This commit was SVN r8464.	2005-12-12 20:24:07 +00:00
George Bosilca	e65e98ca58	Small typo. This commit was SVN r8424.	2005-12-09 05:10:51 +00:00
Jeff Squyres	42ec26e640	Update the copyright notices for IU and UTK. This commit was SVN r7999.	2005-11-05 19:57:48 +00:00
Brian Barrett	ed56e743b7	* update configure.ac to use the modern version of AC_INIT and AM_INIT_AUTOMAKE, instead of the deprecated version. * Work around dumbness in modern AC_INIT that requires the version number to be set at autoconf time (instead of at configure time, as it was before). Set the version number, minus the subversion r number, at autoconf time. Override the internal variables to include the r number (if needed) at configure time. Basically, the right thing should always happen. The only place it might not is the version reported as part of configure --help will not have an r number. * Since AM_INIT_AUTOMAKE takes a list of options, no need to specify them in all the Makefile.am files. * Adds support for subdir-objects, meaning that object files are put in the directory containing source files, even if the Makefile.am is in another directory. This should start making it feasible to reduce the number of Makefile.am files we have in the tree, which will greatly reduce the time to run autogen and configure. This commit was SVN r7211.	2005-09-07 05:54:53 +00:00
Jeff Squyres	b080d7360c	Remove useless compiler warning This commit was SVN r6886.	2005-08-15 16:48:43 +00:00
Jeff Squyres	cf16a521c8	Ensure to get ompi/include/constants.h This commit was SVN r6845.	2005-08-12 21:42:07 +00:00
Jeff Squyres	ba31fbf132	A better solution than r6672. If the caller passes in a data segment alignment of 0, then assume there will be no data segment and don't do the checks to see if it will be beyond the end of the file. This commit was SVN r6773. The following SVN revision numbers were found above: r6672 --> open-mpi/ompi@8b56769307	2005-08-08 21:38:27 +00:00
Jeff Squyres	1c5382deac	- Fix a minor problem in alignment logic in sm common component - Adjust btl sm to allocate just a few bytes extra to allow the common sm component to assume that there will be a data segment (even though the sm btl doesn't use the data segment in that portion of code) This commit was SVN r6772.	2005-08-08 21:29:05 +00:00
Tim Woodall	c4bc9aa740	return a NULL registration This commit was SVN r6494.	2005-07-14 19:10:46 +00:00
Brian Barrett	a13166b500	* rename ompi_output to opal_output This commit was SVN r6329.	2005-07-03 23:31:27 +00:00
Brian Barrett	9da0b4fe1d	* rename all the atomic functions from ompi to opal This commit was SVN r6325.	2005-07-03 21:38:51 +00:00
Brian Barrett	761402f95f	* rename ompi_list to opal_list This commit was SVN r6322.	2005-07-03 16:22:16 +00:00
Brian Barrett	499e4de1e7	* rename ompi_object and ompi_class to opal_object and opal_class This commit was SVN r6321.	2005-07-03 16:06:07 +00:00
Jeff Squyres	aa056f7bfd	First cut of OMPI Makefile.am's, plus a few more catchup updates in orte This commit was SVN r6286.	2005-07-02 15:06:47 +00:00
Jeff Squyres	4ab17f019b	Rename src -> ompi This commit was SVN r6269.	2005-07-02 13:43:57 +00:00

47 Commits