1
1

67 Коммитов

Автор SHA1 Сообщение Дата
Ralph Castain
5a2c6a0b87 Fix the trunk so it runs again:
1. Tell the oob subscription not to report on changes until AFTER the trigger fires (signifying that all procs have started)

2. Make sure tokens returned to the subscriber are from the container that housed the data being returned (previous mistake - tokens were from subscription, not from container)

Tested on the Mac running up to 10 processes

This commit was SVN r6025.
2005-06-10 03:35:36 +00:00
Tim Woodall
95f4def89a more corrections to ping
This commit was SVN r5765.
2005-05-19 16:16:19 +00:00
Tim Woodall
3528471f22 correct debug output
addition error checking

This commit was SVN r5742.
2005-05-18 15:31:23 +00:00
Jeff Squyres
a28b5ae43b Fix for a bunch of size_t issues; reviewed by George and Ralph.
- Change all uses of *printf'ing a size_t to use an explicit cast to
  (unsigned long) and the %lu escape
- change ORTE_GPR_REPLICA_MAX_SIZE to INT_MAX until bug 1345 is fixed
  (i.e., until we allow size_t in MCA params)
- ns_base_local_fns.c:orte_ns_base_get_proc_name_string(): changed
  from %0X -> %lu
- ORTE_NAME_ARGS added explicit (unsigned long) casts, and changed all
  usages of ORTE_NAME_ARGS to use %lu's

This commit was SVN r5644.
2005-05-08 13:22:55 +00:00
Tim Woodall
754dc71177 revised oob ping so that it doesnt attempt to setup/use a persistent connection
This commit was SVN r5601.
2005-05-05 16:31:40 +00:00
Jeff Squyres
462adee81a Commit 1 of 4 to bring in the hetero branch to the trunk. Merged in
from:

svn merge -r5440:5448 https://svn.open-mpi.org/svn/ompi/tmp/hetero .

This commit was SVN r5549.

The following SVN revisions from the original message are invalid or
inconsistent and therefore were not cross-referenced:
  r5440
  r5448
2005-05-01 00:47:35 +00:00
Josh Hursey
d1b12501fb usleep is not defined in Windows land.
sleep is the equlivant command where:
sleep(10) sleeps for 10 milliseconds.

This commit was SVN r5433.
2005-04-19 04:38:48 +00:00
Tim Woodall
f2acb6d66f restore subscribe for connect/accept
This commit was SVN r5280.
2005-04-12 21:25:51 +00:00
Jeff Squyres
3ec8c614f7 Remove some redundant code
This commit was SVN r5278.
2005-04-12 21:11:40 +00:00
Jeff Squyres
3f5541349a Add UC copyright
This commit was SVN r5009.
2005-03-24 12:43:37 +00:00
Brian Barrett
30af9a7b90 * More changes from the tim branch. Still has problems with ABORTed procs,
but now tells you when it can't find orted.  Also includes memory leak
  plugs, bproc fixes, and gm repairs.

This commit was SVN r4937.
2005-03-18 23:58:36 +00:00
Brian Barrett
a1d11d9b54 * sync with close to the trunk, but right before the gm changes that we
actually need.  More, interesting, stuff coming soon.

This commit was SVN r4933.
2005-03-18 23:40:08 +00:00
Brian Barrett
6822a519bb * results from initial merge of the tim branch into the trunk. Compiles and
ompi_info works, but that's all that has been tested.

This commit was SVN r4827.
2005-03-14 20:57:21 +00:00
Brian Barrett
0d82642b40 * Split thread support build conditionals into MPI threads and progress
threads (defaults to use MPI threads, disable progress threads).  This
  allows us to have MPI threaded support, but without progress threads
  and all that fun stuff.

This commit was SVN r4443.
2005-02-16 17:42:07 +00:00
Prabhanjan Kambadur
15acf6b07a Moving the comments to the right places
This commit was SVN r4073.
2005-01-20 04:44:07 +00:00
Prabhanjan Kambadur
9ac9f15537 These are the changes after the review with Jeff. Mostly are fixes for OOB and TCP
This commit was SVN r4070.
2005-01-20 00:03:23 +00:00
Tim Woodall
6e37862637 added parameter to control i/fs used for oob messaging
This commit was SVN r3989.
2005-01-13 21:44:58 +00:00
Ralph Castain
95510d3684 Remove the diagnostic messages from the xcast. Move one step closer on comm_spawn....
This commit was SVN r3759.
2004-12-09 06:32:44 +00:00
Ralph Castain
ed197f0186 More minor changes that continue to make progress on comm_spawn. Nothing significant - no impact on other operations.
PLEASE NOTE: there are some diagnostic messages in oob_xcast that will print out. Please don't have a cow about them - they won't hurt nor injure anyone, and it's just there for a little while to help Tim and I debug a problem. Just didn't want to create yet another MCA parameter to debug 10 lines of code. :-) 

This commit was SVN r3756.
2004-12-09 04:54:37 +00:00
Ralph Castain
be3facae87 This still doesn't fix the comm_spawn problem, but does fix an error in the proxy-to-replica communication when requesting startup/shutdown messages.
This commit was SVN r3725.
2004-12-07 03:57:15 +00:00
Jeff Squyres
616269a9be Add HLRS copyright
This commit was SVN r3665.
2004-11-28 20:09:25 +00:00
Jeff Squyres
e9ed717748 First cut at copyrights: IU, UTK, and some OSU. LANL and HLRS still
pending.

This commit was SVN r3655.
2004-11-22 01:38:40 +00:00
Ralph Castain
bf9087d9d1 The merged main trunk and gpr integration branch. Tested on Mac only so far - will check out and test on Linux. If that has a problem, will back all changes out (again), but I think we have this one correct. Will send out a more complete change notice once testing is complete.
This commit was SVN r3644.
2004-11-20 19:12:43 +00:00
Brian Barrett
23a6d5bb60 * roll back r3584 (gpr changes to reduce floods) as it appears to cause
some instability on Linux

This commit was SVN r3587.

The following SVN revision numbers were found above:
  r3584 --> open-mpi/ompi@52add381d0
2004-11-17 02:30:07 +00:00
Brian Barrett
52add381d0 * Merge over the gpr changes Ralph has made on the gpr-integration branch.
This may trigger a complete rebuild :(.  Short overview of changes:

  - reduce number of network slams at startup
  - prevent gpr from hanging when doing process death code
  - general gpr cleanups

This commit was SVN r3584.
2004-11-16 22:53:33 +00:00
Prabhanjan Kambadur
4c8c62204d some additional changes that were made to the code yesterday.
This commit was SVN r3467.
2004-11-02 13:14:34 +00:00
Prabhanjan Kambadur
650b04c4b4 changes:
--------
1. malloc casts to the right pointers
2. function parameter casts in the components (eg., recv requires a (char *) typecast 
   else cL compiler barfs)
3. added my own errno indirection. this is only in oob/tcp module. ompi_errno is #defined
   ro errno in unix land and to a function ompi_get_error which returns the equivalent
   error code.
4. implemented our own fcntl to prevent spaghetti coding. this currently only takes
   F_GETFL and F_SETFL arguments, does nothing on F_GETFL and sets the nonblocking 
   option on F_SETFL
5. Moved some extern declarations to global scope since the CL compiler does not do 
   the right things if they are declared and used in static inline functions.
6. Protection around some header files. changed sys/errno to errno.
7. defined in_proto_t (unsigned uint16_t) to DWORD ... comments are welcome

This commit was SVN r3394.
2004-10-28 18:13:43 +00:00
Prabhanjan Kambadur
4257467fec this is the big windows commit. there are more things which have gone into this than i can remember. but basically, we are looking for
1. header file and source file protections using #ifdef WIN32
2. new files and directories to support windows functionality
3. appropritate linkage symbols added (OMPI_DECLSPEC) for windows
4. some functions are unimplemented on the windows side. this is mostly
because there might not be need to implement it in windows land. eg., forking
a daemon off
5. Introduced locking mechanisms for windows

This commit was SVN r3286.
2004-10-22 16:06:05 +00:00
Jeff Squyres
d324a7725c - Add #if protection around non-portable system .h files
- Add #include "ompi_config.h" to all .c files, and ensure that it's
  the first #included file
- remove a few useless #if HAVE_CONFIG_H checks

This commit was SVN r3229.
2004-10-20 01:03:09 +00:00
Tim Woodall
0b3c452cde use ns routines to generate jobid string
This commit was SVN r3141.
2004-10-14 21:04:12 +00:00
Tim Woodall
da4b57d6cf - use loopback for oob/ptl if no other i/fs are present
- force init to fail if no i/fs are available

This commit was SVN r2898.
2004-09-30 21:23:10 +00:00
Tim Woodall
9dab6ab95b fix for publish/subscribe
This commit was SVN r2892.
2004-09-30 16:34:48 +00:00
Tim Woodall
a222c702ec cleanup of finalize code
- unregister all event handlers from event library
- cancel pending non-blocking receives with oob

This commit was SVN r2887.
2004-09-30 15:09:29 +00:00
Tim Woodall
ab05652f9e changed output to debug
This commit was SVN r2575.
2004-09-09 21:57:45 +00:00
Tim Woodall
92ae84828f changed debug macro name
removed debug output

This commit was SVN r2566.
2004-09-09 19:21:34 +00:00
Tim Woodall
32fa7b9b03 added support for an oob ping command - to check availability of a given process
This commit was SVN r2541.
2004-09-08 17:02:24 +00:00
Ralph Castain
e8c36d02c9 Not as bad as this all may look. Tim and I made a significant change to the way we handle the startup of the oob, the seed, etc. We have made it backwards-compatible so that mpirun2 and singleton operations remain working. We had to adjust the name server and gpr as well, plus the process_info structure.
This also includes a checkpoint update to openmpi.c and ompid.c. I have re-enabled the ompid compile.

This latter raises an important point. The trunk compiles the programs like ompid just fine under Linux. It also does just fine for OSX under the dynamic libraries. However, we are seeing errors when compiling under OSX for the static case - the linker seems to have trouble resolving some variable names, even though linker diagnostics show the variables as being defined. Thus, a warning to Mac users that you may have to locally turn things off if you are trying to do static compiles. We ask, however, that you don't commit those changes that turn things off for everyone else - instead, let's try to figure out why the static compile is having a problem, and let everyone else continue to work.

Thanks
Ralph

This commit was SVN r2534.
2004-09-08 03:59:06 +00:00
Tim Woodall
db2e993a3d initialize/cleanup event library in ompi_rte_init/ompi_rte_fini
This commit was SVN r2486.
2004-09-03 16:12:45 +00:00
Tim Woodall
f43f072673 - further integration w/ gpr - use synchro/subscribe to asynchronously populate
a cache of peer addresses as they come online
- resolved issues with the event library

This commit was SVN r2434.
2004-09-01 23:07:40 +00:00
Tim Woodall
7bd0f1f4b3 resolved several race conditions
This commit was SVN r2402.
2004-08-31 02:57:39 +00:00
Tim Woodall
16d250b376 - integration of gpr/ns/oob w/ mpirun2
This commit was SVN r2344.
2004-08-28 01:15:19 +00:00
Tim Woodall
3d754c4941 - modifications to oob to support bringing up oob before anything else (ns)
- changed ns/gpr to register callback in component init (and ignore errors if not supported)

This commit was SVN r2303.
2004-08-25 17:39:08 +00:00
Tim Woodall
e05e7ca942 gpr proxy i/f (put) seems to be working w/ tcp oob!
This commit was SVN r2235.
2004-08-19 19:34:37 +00:00
Tim Woodall
9c9037ef2b added support for querying oob contact info and setting
contact info for seed daemon 

This commit was SVN r2167.
2004-08-16 19:39:54 +00:00
Tim Woodall
8468273a8a - more on cleanup
This commit was SVN r2087.
2004-08-12 13:29:37 +00:00
Tim Prins
3db2f05dc3 Properly set address reuse on the oob tcp listen socket.
Remove some temporary code.

This commit was SVN r2040.
2004-08-11 16:02:59 +00:00
Tim Woodall
d1dd1ddaf2 dont enable this by default
This commit was SVN r2014.
2004-08-10 21:04:09 +00:00
Tim Woodall
debf4842eb cleanup close/finalize
This commit was SVN r2013.
2004-08-10 21:02:36 +00:00
Tim Prins
593b0542ae Comments and doxygen updates to the oob.
This commit was SVN r1993.
2004-08-09 23:07:53 +00:00
Tim Prins
259b2d9c35 A temporary workaround to avoid segfaults when the tcp oob is cleaning up.
This simply removes the event handler deletions from the clean up code.

This commit was SVN r1929.
2004-08-06 19:12:26 +00:00