1
1

129 Коммитов

Автор SHA1 Сообщение Дата
Brian Barrett
57ca18ce88 * move ompi_event_fini() from mpirun/MPI_Finalize to ompi_rte_finalize to
match where ompi_event_init() lived
* initialize and shutdown the code to allow child process wait callbacks
* add comment about few() ussage in rte-enabled jobs (short answer:
  don't).

This commit was SVN r2864.
2004-09-27 19:38:23 +00:00
Jeff Squyres
9264a4772c Fix a few more rogue global symbols
This commit was SVN r2820.
2004-09-23 12:56:03 +00:00
Jeff Squyres
bfa3a46453 These symbols should be static
This commit was SVN r2810.
2004-09-23 00:00:01 +00:00
Ralph Castain
64db75b43e Fix create_dirpath to shut valgrind up.
This commit was SVN r2800.
2004-09-22 15:56:02 +00:00
Brian Barrett
41e17e2758 * rename pack.{c,h} to bufpack.{c,h} because there was already a pack.c in
src/mpi/c and you can't have two object files with the same name in
  the same library

This commit was SVN r2782.
2004-09-20 19:55:01 +00:00
Ralph Castain
e7135621fe Okay, with this minor change, mpirun2 and mpi_init can now both rapidly determine that they don't want to join an existing universe and create their very own session directory tree - and even clean up correctly IF the program terminates normally.
Still have to work on that abnormal termination stuff...

This commit was SVN r2731.
2004-09-16 21:42:48 +00:00
George Bosilca
f5502e731d Work around differences between several versions of flex. Newer versions (3.5.21) seems to define YY_CURRENT_BUFFER_LVALUE as the correct value to be used on the left side assignments when older version can use the YY_CURRENT_BUFFER.
This commit was SVN r2704.
2004-09-16 09:14:01 +00:00
George Bosilca
4177ed8ab9 On left side of an assgnment YY_CURRENT_BUFFER should became YY_CURRENT_BUFFER_LVALUE.
This commit was SVN r2701.
2004-09-16 08:48:31 +00:00
Ralph Castain
70dae461e4 MPI_Init will now detect and join a persistent universe - hooray! Fixed the session_dir cleanup process so it is kinder to the universe-setup file (i.e., leaves it alone), thus allowing persistent universes to retain their contact info on the session_dir tree. Adjusted mpirun2, ompid, and ompiconsole accordingly.
Put some error protection in ompi_rte_monitor.

This commit was SVN r2678.
2004-09-15 16:33:36 +00:00
Ralph Castain
c5008b71bf Grrr...get rid of another pesky diagnostic output.
This commit was SVN r2663.
2004-09-14 14:31:39 +00:00
Ralph Castain
069682e046 A bunch of minor changes, mostly adding diagnostics. Just wanted to checkpoint so I can start fresh since there now seem to be problems in the tree with mpirun2.
Fixed ompid so it reissues the non-blocking receive - should now be close to ready for primetime. Fixed some logic in the svc framework that wasn't checking properly for action flags. 

This commit was SVN r2660.
2004-09-14 14:21:04 +00:00
Ralph Castain
f3256a1140 Discovered a very, very interesting bug, courtesy of Jeff point out some unexpected behavior under Linux. It turns out that the Linux "dirname" function has a bug in it - namely, it modifies the argument passed to it, in violation of its documentation. This was apparently fixed in the Mac library - it definitely does NOT modify its argument on the Mac.
After spending hours tracking this down, I finally did manage to isolate it to the dirname function. I have installed a "fix" for the problem (using an intermediate temporary copy of the string being passed to the function) that solves the problem without messing us up on the Mac.

Hopefully, we won't find too many more such "cute" errors!

This commit was SVN r2643.
2004-09-14 03:59:17 +00:00
Jeff Squyres
d9ae96c259 Update output streams and associated docs.
IMPORTANT NOTE: when sendout ompi_output() strings to files, note that
the file goes into the session directory.  If the session directory
does not exist yet when you call ompi_output() and you are sending the
string to a file, THE STRING WILL BE DROPPED (in terms of the file --
it will still be sent to all other locations that were specified).
This should really only be a factor for a fairly small portion of
things during initialization of a process -- basically, until the
session directory exists, nothing will be written to a file.

ompi_output() will automatically take care of creating the file and
writing to it once the session directory has been created.  So it is
valid to:

id = ompi_output(...something with file set...)
ompi_output(id, "blah!");
/* ... process session directory is created ... */
ompi_output(id, "wazzlebort!");

In this case, the output file will be:

-----
[WARNING: 1 lines lost because the Open MPI process session directory did
 not exist when ompi_output() was invoked]
wazzlebort!
-----

This commit was SVN r2637.
2004-09-13 20:19:01 +00:00
Ralph Castain
6c8654d2b8 Update the utility to a kinder, gentler sort that leaves Jeff's ompi_output files alone. :-)
This commit was SVN r2636.
2004-09-13 19:02:43 +00:00
Ralph Castain
c6cbe33d50 Some of these didn't really change - I was just in/out of them for diagnostics while chasing a bug. Got caught by my good buddy Tim again :) on his parse_contact_info function, which requires that the space for the answer be allocated in advance. Sigh. Anyway, mpirun2 now works again. My apologies if you tried it in the last few hours and found it didn't.
Also removed the mpirun3 directory since we are basically dragging mpirun2 along with us - no need to create a new version after all.

Made a few changes to the universe info structure, eliminating the "webserver" and "socket" fields since we will do those contacts through the oob channel. Also changed the "silent_mode" field to "console" since silent mode is the default - the flag needs to tell you to turn the console on, not off.

Parse environ function now gets the ns and gpr replica contact info and loads it in the proper places to hand it off to the respective components, thus allowing me to check connection to them as part of determining if the named universe already exists. Changed the local_universe_exists function accordingly and gave it a new name (since the replicas may not be local). This name will shortly be changed to "ompi_rte_join_universe" as I complete the logic for doing that function.

Please let me know if you see any problems. I successfully ran some trivial multi-process functions in both mpirun2 and singleton modes, and ran the seed daemon as well, so I think it should all be okay.

This commit was SVN r2611.
2004-09-11 12:56:52 +00:00
Ralph Castain
106e07f759 Some reorganization of the startup process functions that is transparent to anyone using mpirun2 and/or running as a singleton. Please note that the old mpirun script may well not work any more - I have not been trying to keep that one running.
For those of you looking into the guts of these functions, the most visible changes are:

- raising the assignment of the process name to a higher level, taking it out of the "hole" it had fallen into. We've been having problems with multiple functions assigning the process name. This is understandable - lots of workarounds were implemented in the early development stages. However, it was becoming hard to determine WHEN the name was being defined - it was being hidden under too many layers of function calls. Hence, it is now assigned in the three primary programs in a very visible fashion. Hopefully, we can now chase down all the other places and get rid of them.

- similarly, I raised the visibility of when the session directory gets constructed to ensure it doesn't get done at the wrong time and/or multiple times.

- created a new function that parses all the non-mca level environmental variables and assigns the info into the corresponding structures. I have also included notes in this function and in the various ompi_rte_init_stage functions about proper ordering.

- modified the rte cmd line parsers to store the options they find into the environment so they can be passed along later

That about does it.

This commit was SVN r2589.
2004-09-10 03:21:03 +00:00
Jeff Squyres
86e3cfa85d Remove all remaining traces of the common command line utility. It
doesn't do what it was designed for, and therefore wasn't useful, so
per discussion with Ralph last night, we decided to scrap it.

This commit was SVN r2539.
2004-09-08 15:02:35 +00:00
Jeff Squyres
4b30eddd23 Ensure to increment i if we didn't find the option. Fixed a problem
Ralph reported.

This commit was SVN r2535.
2004-09-08 07:46:28 +00:00
Ralph Castain
e8c36d02c9 Not as bad as this all may look. Tim and I made a significant change to the way we handle the startup of the oob, the seed, etc. We have made it backwards-compatible so that mpirun2 and singleton operations remain working. We had to adjust the name server and gpr as well, plus the process_info structure.
This also includes a checkpoint update to openmpi.c and ompid.c. I have re-enabled the ompid compile.

This latter raises an important point. The trunk compiles the programs like ompid just fine under Linux. It also does just fine for OSX under the dynamic libraries. However, we are seeing errors when compiling under OSX for the static case - the linker seems to have trouble resolving some variable names, even though linker diagnostics show the variables as being defined. Thus, a warning to Mac users that you may have to locally turn things off if you are trying to do static compiles. We ask, however, that you don't commit those changes that turn things off for everyone else - instead, let's try to figure out why the static compile is having a problem, and let everyone else continue to work.

Thanks
Ralph

This commit was SVN r2534.
2004-09-08 03:59:06 +00:00
Jeff Squyres
3498f7a283 First cut of the Show Help Subsystem (SHS)
- see src/util/show_help.h for details (doxygen); main function call
  is ompi_show_help()

- text message files are expected to be located in $pkgdatadir
  (usually $prefix/share/openmpi).  Anyone can install a text file in
  $pkgdatadir with their message(s) in it and then have them displayed
  via ompi_show_help().  "pkgdata_DATA" is the keyword to use in
  Makefile.am's, for example (from src/mca/base/Makefile.am):

    pkgdata_DATA = help-mca-base.txt

- added a few examples in the code base using ompi_show_help(), but
  not too many -- can convert more "show_help" comments in the code
  over time; no huge rush.  :-)
- no i18n-like support yet; waiting for advice and consensus from
  other developers

This commit was SVN r2519.
2004-09-05 16:05:37 +00:00
Ralph Castain
36c6cf574e Fix bad initialization in proc_info, add a pid field into the universe info and save it in the universe info file.
This commit was SVN r2491.
2004-09-03 19:26:49 +00:00
Ralph Castain
3c94af6021 Bring tree back to ability to cleanly compile. These changes should be transparent to virtually everyone, but the silly tree insists on compiling everything. Mostly modified things in preparation for releasing the openmpi program, and to begin knitting the next (hopefully final) version of mpirun.
Let me know if it has any impact on you - it shouldn't.

This commit was SVN r2487.
2004-09-03 16:26:15 +00:00
Tim Woodall
a5cf56c425 add support for packing/unpacking ompi_process_name_t as OMPI_NAME
This commit was SVN r2437.
2004-09-01 23:09:41 +00:00
Ralph Castain
57d820e34d Correct the name on daemon_init to ompi_daemon_init. Fix the setup file i/o interface to correctly write/read universe setup files.
This commit was SVN r2398.
2004-08-30 19:25:14 +00:00
Ralph Castain
d4c059a942 Get rid of a couple of warning messages in the nightly builds...
This commit was SVN r2380.
2004-08-29 19:03:23 +00:00
Ralph Castain
5b7e17908d Okay, now for a *slightly* less aggressive cleanup of the session dir. Whereas the previous version was a tad unfriendly to files from other procs sharing the tree, this one is somewhat more respectful....
BTW, in case anyone is trying to use threads, be aware that much of the RTE is NOT thread-safe at this time. We'll work on that soon.

:-)

This commit was SVN r2379.
2004-08-29 16:37:02 +00:00
Jeff Squyres
d2ea3eea6d Only unlock the mutex if we actually locked it.
This commit was SVN r2373.
2004-08-29 07:50:07 +00:00
Ralph Castain
ee75ce35cd Fix a logic error and typo that were keeping files in the session dir tree alive - now can truly "woo-hoo".
This commit was SVN r2371.
2004-08-29 06:50:00 +00:00
Ralph Castain
671a3de019 Add session directory finalize routines - session directories are now cleaned up! Woo-hoo!
This commit was SVN r2370.
2004-08-29 02:46:31 +00:00
Jeff Squyres
e2aaf64239 Add a few checks to ensure that MPI_INIT(NULL, NULL) and
MPI_INIT_THREAD(NULL, NULL, ...) will work properly.

This commit was SVN r2362.
2004-08-28 16:30:29 +00:00
Jeff Squyres
41592650f2 - Silence useless compiler warning
- Make the code conform better to the coding guidelines (I think some
  of this was written long before the guidelines were established...?)

This commit was SVN r2357.
2004-08-28 11:56:33 +00:00
Tim Woodall
16d250b376 - integration of gpr/ns/oob w/ mpirun2
This commit was SVN r2344.
2004-08-28 01:15:19 +00:00
Jeff Squyres
dace5cd6d9 Better/clearer memory management; fix some memory leaks
This commit was SVN r2332.
2004-08-27 16:44:12 +00:00
Jeff Squyres
d767e7822f Forgot to destroy the mutex
This commit was SVN r2331.
2004-08-27 16:43:35 +00:00
Jeff Squyres
b053a53823 - In an attempt for better compile times (i.e., less dependencies),
strip down cmd_line.h; remove necessity of including
  include/constants.h and util/argv.h.
- This means that a bunch of other files in the source tree that
  depended on those files having already been included needed to be
  updated to include the relevant file(s)
- In fixing these (^^), added both /* $HEADER$ */ and #include
  "ompi_config.h" in a few files that were missing them.
- change from ompi_cmd_line_create() / ompi_cmd_line_free() to
  OBJ_NEW() / OBJ_RELEASE()
- update mpirun2.cc to allow "-np"
- fix minor problem in mpirun2.cc where two command line params were
  accepting "-h"
- slightly revamped cmd_line.c to use the OBJ_NEW/OBJ_RELEASE
  interface.  Added new function ompi_cmd_line_make_opt3() that allows
  "single dash" parameters.
- Greatly expanded the ompi_cmd_line_t doxygen docs.
- The diff for ompi_cmd_line_t is artificially large because I also
  took the opprotunity to reformat to 4 space tabs.  Sorry.  :-\

This commit was SVN r2308.
2004-08-26 07:56:45 +00:00
Jeff Squyres
b5b873bb59 Eliminate some compiler warnings
This commit was SVN r2295.
2004-08-25 11:46:56 +00:00
Ralph Castain
83db216e40 Reserves a spot for the console program - no code in it yet. Attempts to fix earlier problem when compiling/linking the universe_setup_file_io routines. Will test further on new checkout - appears to work fine under OSX right now.
This commit was SVN r2291.
2004-08-25 02:01:35 +00:00
Ralph Castain
3007a3270b Update that defines a default value for the "my_universe" field.
This commit was SVN r2278.
2004-08-24 14:20:49 +00:00
Jeff Squyres
7476707d72 Temporarily #if 0 the body of these functions because they cause
linker errors.  Will follow up in an email to the relevant developers
(from svn blame).

This commit was SVN r2277.
2004-08-24 11:12:05 +00:00
Jeff Squyres
de28be5768 - remove useless doxtext @file comment
- add missing header file

This commit was SVN r2275.
2004-08-24 08:44:37 +00:00
Ralph Castain
b7b0ee1905 Checkpointing the openmpi program. Added ability to write/read universe setup files in session directory. Added logic for connecting to existing universe.
This commit was SVN r2274.
2004-08-24 03:04:41 +00:00
Ralph Castain
b2868d0158 Begin knitting the openmpi program using the name server, registry, etc. Changes include:
- modified rte_init to utilize name server, interface to session directory system
- added fields to system_info and proc_info
- created a utility for converting process into daemon
- modified openmpi to interface to various functions, spawn universe, record contact info, etc. Program almost complete.
- began building ompid daemon

This commit was SVN r2270.
2004-08-23 21:39:22 +00:00
David Daniel
b1ff3e50c2 Closing in on compilation with Solaris (and other non-GNU) compilers.
This commit was SVN r2244.
2004-08-19 23:34:46 +00:00
David Daniel
5c4c277266 Numerous niggles related to building on Solaris
This commit was SVN r2234.
2004-08-19 19:30:53 +00:00
Graham Fagg
720156b50d Added new pack and unpack string functions. These are mainly used by RTE (NS/GPR etc).
- unpack string allocates memory for user removing need to know/set max string lengths
- fixed missing 'break' in case statement thanks to unit test yet again!
- updated unit test ompi_pack (test8)
- will remove old OMPI_STRING type and support shortly after checking for usage

This commit was SVN r2219.
2004-08-18 23:18:25 +00:00
David Daniel
1d3e06bb8a Changes to output.c etc. to use asprintf
This commit was SVN r2218.
2004-08-18 23:15:48 +00:00
Tim Woodall
08de49cc4a allow null string
This commit was SVN r2185.
2004-08-17 16:10:42 +00:00
Jeff Squyres
11c9295d6c Move the ompi_few() docs to the .h file.
This commit was SVN r2176.
2004-08-17 03:06:39 +00:00
Rich Graham
f75b88faa5 fix a bug, so that the fifo size is rounded up to a power of 2.
Add some support code to determine nearest power of 2.

This commit was SVN r2170.
2004-08-16 23:06:33 +00:00
David Daniel
a1688d5b9f more atomic.h changes
This commit was SVN r2157.
2004-08-16 01:13:25 +00:00