Commit graph

3850 commits

Author SHA1 Message Date
Tim Woodall
f64bcf3a52 unlock on return
This commit was SVN r5764.
2005-05-19 15:47:22 +00:00
Ralph Castain
34e437a9af Cleanup a compiler warning on orteprobe.
Split the finalize process into two parts: one that finalizes the orte subsystems, and another that finalizes (what will become) the opal subsystems. Needed to properly restart the workstation process once remote launch accomplished.

This commit was SVN r5758.
2005-05-19 13:33:55 +00:00
Jeff Squyres
2ec635b07c Reset certain signal handlers after the fork but before the exec when
launching new processes so that they get all the default unix
behaviors (i.e., become killable, and don't accidentally have some
signals blocked -- left over from the event library).

This commit was SVN r5757.
2005-05-19 13:33:28 +00:00
George Bosilca
7cf25d2758 Indentation ...
This commit was SVN r5755.
2005-05-19 06:55:13 +00:00
George Bosilca
c3e5b01328 Rework the section algorithm
This commit was SVN r5754.
2005-05-19 06:54:45 +00:00
George Bosilca
a5fc2762dd 1. Do not try to get the remote proc if we will not use it
2. Always release the request lock before calling FREE ...

This commit was SVN r5753.
2005-05-19 06:53:57 +00:00
Thara Angskun
e9cc404c33 - just checkpoint
This commit was SVN r5752.
2005-05-18 22:46:17 +00:00
George Bosilca
b41444ddcb Less talkative ....
This commit was SVN r5751.
2005-05-18 21:23:48 +00:00
Ralph Castain
d3029fbdb9 Fix a few bugs.
Remote launch of daemon now working. Bunch of forced diagnostic messages in it, though, which I'll leave until I release this for actual use.

This commit was SVN r5750.
2005-05-18 18:39:50 +00:00
Ralph Castain
bb9c90b6ae Fix a couple of minor bugs that prevented the clean handshake of contact info.
This commit was SVN r5749.
2005-05-18 18:24:14 +00:00
Ralph Castain
91c75cb7d8 Checkpoint the remote launch work.
Orted has been modified to take a new parameter - a file descriptor used as a pipe to pass the daemon's contact info back to the probe when the daemon is remotely launched.

This commit was SVN r5748.
2005-05-18 17:56:51 +00:00
Ralph Castain
d1fe8b6b90 Move an output message so it only comes out under specific conditions.
You will now receive a message indicating that an existing universe was detected, but connection to it was refused. The system will tell you the name it created for the new universe it will now be using.

This commit was SVN r5747.
2005-05-18 16:36:11 +00:00
Ralph Castain
1b42e973d5 Only orted was saving the universe contact info - this should actually be saved by whomever believes they are the "seed". Modified files to ensure this happens. Also includes a checkpoint of the probe and remote launch functions.
This commit was SVN r5746.
2005-05-18 16:31:03 +00:00
Thara Angskun
c71f3f7152 - just checkpoint
This commit was SVN r5745.
2005-05-18 15:55:34 +00:00
George Bosilca
81d1d0322a Remove all useless includes. They get included anyway from other includes ... The way we
manage the dependencies still needs some rework ...

This commit was SVN r5744.
2005-05-18 15:54:24 +00:00
Tim Woodall
ea8c91627d correct typo
This commit was SVN r5743.
2005-05-18 15:32:05 +00:00
Tim Woodall
3528471f22 correct debug output
additional error checking

This commit was SVN r5742.
2005-05-18 15:31:23 +00:00
Tim Woodall
75a43e84ba fixed ping
This commit was SVN r5741.
2005-05-18 15:12:04 +00:00
Ralph Castain
ef07d67946 Checkpoint
This commit was SVN r5740.
2005-05-17 20:21:59 +00:00
Ralph Castain
3232200e51 Probe now functional! At least, it now runs. Next need to complete its operations.
This commit was SVN r5739.
2005-05-17 20:19:38 +00:00
Ralph Castain
56475f64a3 Checkpoint for transfer
This commit was SVN r5738.
2005-05-17 19:56:54 +00:00
Ralph Castain
20c325b6bd Checkpoint for transfer to remote machine
This commit was SVN r5737.
2005-05-17 19:35:49 +00:00
Ralph Castain
3e982534cd Make sure all string-to-name field conversions are done in decimal now.
This commit was SVN r5736.
2005-05-17 16:04:50 +00:00
Ralph Castain
3092e0f6cf Fix a minor compiler squawk....
This commit was SVN r5735.
2005-05-17 13:11:48 +00:00
George Bosilca
36fc07e93e A TAB managed to get inside ...
This commit was SVN r5734.
2005-05-17 07:19:20 +00:00
George Bosilca
8ea51d6491 Typo ...
This commit was SVN r5733.
2005-05-17 07:16:28 +00:00
George Bosilca
ff2c9d5f9e The stack now holds the type of the last converted element.
This commit was SVN r5732.
2005-05-17 07:15:44 +00:00
George Bosilca
b4314b0426 Improved case for basic datatype with gaps around.
This commit was SVN r5731.
2005-05-17 07:14:30 +00:00
George Bosilca
7381c1f9ee This function comes from DDT version 2. Remove it for now!
This commit was SVN r5730.
2005-05-17 07:10:31 +00:00
George Bosilca
f68e255933 The DDT engine has reached a fairly stable state. All of the IBM tests pass; a few of the Intel tests fail
(mostly UB/LB related and doing questionable things). Otherwise a lot of changes:
- cleaner code + more comments
- stronger bound checker (DDT error messages if we exceed the buffer capacity)
- differentiate the 3 internal types: loop, basic element and end_loop (using union)
- more MACROS to solve the repetitive problems
- more output in debug mode (and if requested by the user).
- correct some mismatches between usage of true_extent and extent
- improve the special cases (contiguous data, contiguous with gaps, no conversion ...)
- in order to allow gdb to break in the pack/unpack functions, they become real functions (defined
  in dt_pack.c and dt_unpack.c) if OMPI_ENABLE_DEBUG is defined. Otherwise they are just macros.
- a new approach for the computation of the lower bound and upper bound. At the same time, true_lb and
  true_ub have been modified to match the new algorithm.
- handle specific cases in the datatype optimization. In some cases the datatype definition grows,
  but the complexity decreases.

This commit was SVN r5729.
2005-05-17 06:54:41 +00:00
George Bosilca
f712270f8b Do not correct the lb. The inner layer should compute the correct one.
This commit was SVN r5728.
2005-05-17 06:42:11 +00:00
George Bosilca
10a07e0eff More options allowing more specific datatype tests.
This commit was SVN r5727.
2005-05-17 06:40:45 +00:00
George Bosilca
b70ae8a2bb This one is pretty strange. The prototype is clear: the 3rd argument should be a bool. But it was a char*
and icc didn't complain ... only gcc 4.1 ...

This commit was SVN r5726.
2005-05-17 01:10:44 +00:00
George Bosilca
dde75fe032 One of the includes was missing. That's a big problem on 64-bit architectures, where the default return type
is int (mostly 4 bytes) and strdup normally returns a char* (usually 8 bytes). The result: stack corruption
and other weird things ...

This commit was SVN r5725.
2005-05-17 01:02:40 +00:00
Ralph Castain
89b6a97f0f Bring the resource discovery system's resource file component online so I can find the node I need to launch upon. I removed all reference to the xml library that was causing trouble, and wrote my own limited xml parser instead, so this will now compile just fine anywhere.
Need to do some refining of the component, but it meets basic requirements right now. Nobody else should notice any change - system basically ignores it unless you tell it to do something.

This commit was SVN r5723.
2005-05-16 21:01:09 +00:00
George Bosilca
2c4209f4cb Add the output utilities include.
This commit was SVN r5722.
2005-05-16 19:36:12 +00:00
Ralph Castain
a0393e9cb9 Fix a malloc/free problem when user doesn't specify a name for the universe. No impact on what George is seeing - still looking into that with Tim.
This commit was SVN r5721.
2005-05-16 18:58:22 +00:00
Brian Barrett
09057fe311 * massive cleanup of debugging output to make it much easier to match
messages
* use different event queues for send / recv, part of moving towards dealing
  with dropped fragments

This commit was SVN r5719.
2005-05-15 21:05:00 +00:00
Brian Barrett
ac7b97a0d9 * convert to an array of event handles - still of size 1 - as prep work for
adding event queues for dropped fragments and retransmit requests

This commit was SVN r5718.
2005-05-13 18:36:15 +00:00
Brian Barrett
5d57956a02 * rename event queue flag to prepare for more event queues
This commit was SVN r5717.
2005-05-13 18:27:27 +00:00
Brian Barrett
66a0c49e2b Run the event loop once after adding a new signal event to the event library. Events are only processed at the start of an event loop (not at event add), so there was a window of time between event_add() and event_loop() for the signal event in which the event existed, but was not active. During this window, signals that should have triggered a callback could be lost.
Reviewed by Jeff and Tim.

This commit was SVN r5715.
2005-05-13 17:59:36 +00:00
Jeff Squyres
72f86297c8 Somehow this functionality got lost over time: when a process aborts,
orterun should abort the rest.  Reviewed by Brian.

This commit was SVN r5713.
2005-05-13 17:52:50 +00:00
Josh Hursey
46810fd155 Some fixes to get the subset of mca directories compile under Windows.
Added a special case under the win_makefile for the gpr/replica directory
since it contains multiple dependent layers of directories.

Added a couple of OMPI_DECLSPECs. Change a conflicting variable name in
gpr_replica_dict_tl.c from 'new' to 'new_dict'.

This commit was SVN r5712.
2005-05-13 16:55:14 +00:00
Josh Hursey
f176f85e55 Fix a couple of library checks, and some Windows related code
This commit was SVN r5711.
2005-05-13 15:05:07 +00:00
Jeff Squyres
c12d5c8c88 While waiting for fortran compiles...
Fixes for orterun in handling different MCA params for different
processes (reviewed by Brian):
- By design, if you run the following:
    mpirun --mca foo aaa --mca foo bbb a.out
  a.out will get a single MCA param for foo with value "aaa,bbb".
- However, if you specify multiple apps with different values for the
  same MCA param, you should expect to get the different values for
  each app.  For example:
    mpirun --mca foo aaa a.out : --mca foo bbb b.out
  Should yield a.out with a "foo" param with value "aaa" and b.out
  with a "foo" param with a value "bbb".  
- This did not work -- both a.out and b.out would get a "foo" with
  "aaa,bbb".
- This commit fixes this behavior -- now a.out will get aaa and b.out
  will get bbb.
- Additionally, if you mix --mca and an app file, you can have
  "global" params and per-line-in-the-appfile params.  For example:
    mpirun --mca foo zzzz --app appfile
  where "appfile" contains:
    -np 1 --mca bar aaa a.out
    -np 1 --mca bar bbb b.out
  In this case, a.out will get foo=zzzz and bar=aaa, and b.out will
  get foo=zzzz and bar=bbb.
Spiffy.

Ok, fortran build is done... back to Fortran... sigh...

This commit was SVN r5710.
2005-05-13 14:36:36 +00:00
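Editor's sketch of the joining rule described in r5710: repeated values for the same MCA param within one app are combined into a comma-separated list, while each app keeps its own list. The helper `param_append` is a hypothetical illustration and is not orterun's actual code.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not from orterun): append one value to the
   comma-separated list stored in dst, which has room for cap bytes.
   Returns 0 on success, -1 if the result would not fit. */
static int param_append(char *dst, size_t cap, const char *value)
{
    size_t have = strlen(dst);
    size_t need = have + (have ? 1 : 0) + strlen(value) + 1;

    if (need > cap) {
        return -1;
    }
    if (have) {
        dst[have++] = ',';   /* separator only between values */
    }
    strcpy(dst + have, value);
    return 0;
}
```

With one buffer per app, `--mca foo aaa --mca foo bbb a.out` would append both values to a.out's buffer and yield "aaa,bbb", whereas `--mca foo aaa a.out : --mca foo bbb b.out` would append to two separate buffers and yield "aaa" and "bbb".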
Jeff Squyres
0fe7168823 Add missing header file.
This commit was SVN r5709.
2005-05-13 12:36:05 +00:00
Brian Barrett
f64e52a28d * more refactoring to reduce duplicate code
This commit was SVN r5708.
2005-05-13 04:04:08 +00:00
Brian Barrett
a242d5ad4f * Add OMPI_OUTPUT_VERBOSE macro that (like OMPI_OUTPUT) is a no-op when
debugging is disabled.
* convert one more output to only happen when debugging is enabled

This commit was SVN r5707.
2005-05-13 03:01:02 +00:00
Brian Barrett
a7fd494448 * start cleaning up output statements
* start refactoring duplicate code into inline functions (probably will
  have to become macros, but not until debugging is done)
* general code cleanup

This commit was SVN r5706.
2005-05-13 02:54:06 +00:00
Ralph Castain
fdfe457578 Bring in the remote launch changes. This still isn't fully functional, but impacted a few other places that were worth fixing.
1. Added a new function to launch head node processes on remote nodes.

2. Added new tool "orteprobe" that checks to see if a daemon is running on a node. If so, it reports the contact info back to the requestor. If not, it will (eventually - but not now) fork/exec a daemon on the node, report the contact info back to requestor, and then die.

3. Modified orted to handle universe name parameters, and added separate command line flags for debugging the daemon and saving daemon debugging output in a file. The "debug" flag now turns on the runtime debug info instead of the daemon debug - thus, you can now just get daemon debug info if you like.

4. Fix the dps to handle zero length strings correctly.

5. Modify the fork and rsh launchers to pass required environmental variables to the daemons and processes

6. Pulled the redirection of stdin/stdout/stderr for the daemon out of orted and put it into the daemon_init function to simplify orted logic.

7. Modified sys_info to correctly deal with passed mca param

8. Modified univ_info to parse incoming universe location information.

This commit was SVN r5705.
2005-05-12 21:44:23 +00:00
Brian Barrett
0c6eaaebe3 * start cleaning up debugging output (still much to do)
* make buffers really big so that we pass allocmem until we figure out
  why we're not flow controlling as I expected
* set event queue to invalid initially and use that as the enabled test
  rather than a separate bool - shrinks the module a bit
* add dropped count checks, with a panic if one occurs.  Still need to
  implement some type of retransmit logic.

This commit was SVN r5704.
2005-05-12 21:28:48 +00:00
Brian Barrett
e2c2c72b84 Changes to pass allocmem IBM test
- don't free the send buffer unless the converter tells us we need to
- properly do the math to determine when the receive buffer has been
  fully used and unlinked itself

This commit was SVN r5703.
2005-05-12 19:52:51 +00:00
Jeff Squyres
f5657fb8ee For the rsh pls, if the launch is on the local node, just exec it --
don't bother using the launching agent (typically rsh or ssh).

This commit was SVN r5702.
2005-05-12 19:12:53 +00:00
Jeff Squyres
544f9dd780 Fix silly string error (missing +2 in the len calculation, so just
replace it with asprintf).  Reviewed by Brian.

This commit was SVN r5700.
2005-05-12 18:56:05 +00:00
Brian Barrett
189a536685 * Fix incorrect logic in orted so that --no-daemonize works as intended
* Minor formatting fixes in XGrid RAS component
* Code cleanup in XGrid PLS component:
  - If we can't get daemon contact information, kill the job at the XGrid
    level
  - Add MCA parameter pls_xgrid_delete_job that will delete the job from
    XGrid when complete (this seems like standard behavior, so it's the
    default)
  - Remove compiler warning about getting the name of an XGrid object
  - Properly populate the daemon information for the killing code

This commit was SVN r5697.
2005-05-12 16:48:41 +00:00
Josh Hursey
4b60235383 remove unnecessary exclusion for Windows which was killing the Windows nightly build
This commit was SVN r5695.
2005-05-12 14:37:40 +00:00
Brian Barrett
decc74d15c * Enable the XGrid components. Only do anything if the XGrid contact info
variables are set.
* show the RAS priority in ompi_info

This commit was SVN r5694.
2005-05-12 03:33:59 +00:00
George Bosilca
4ef1d70034 snprintf does not really do what we expect. In some situations it will write
more than we have asked for (on my G5). Anyway, now I hope I have enough memory to print out
the full description of the datatype.

This commit was SVN r5690.
2005-05-11 21:30:56 +00:00
Brian Barrett
c477907166 * ignore UNLINK messages earlier in the chain (if Portals supports them)
* process long message fragments properly

This commit was SVN r5689.
2005-05-11 20:22:18 +00:00
Josh Hursey
cc6cb5cac5 Checkpoint on Windows build.
Many changes to headers for OMPI_DECLSPEC, and 
proper placement of c_plusplus defines in those files.

mca/gpr/replica and tools are the two sets of directories
that still need work for the Windows build for this pass.

This commit was SVN r5688.
2005-05-11 20:21:10 +00:00
George Bosilca
6714dfac4e Remove all useless checks that size_t is greater or equal to zero.
This commit was SVN r5687.
2005-05-11 14:19:48 +00:00
George Bosilca
e940ab43b8 Optionally disable the tests.
This commit was SVN r5686.
2005-05-11 14:17:01 +00:00
Thara Angskun
55538e100d - another check point
- able to launch job (sort of)...but not correctly clean up, mpirun hang, blowup terminal etc.

This commit was SVN r5685.
2005-05-11 09:09:55 +00:00
George Bosilca
1a9cef70fb checkpoint
This commit was SVN r5684.
2005-05-11 06:30:10 +00:00
George Bosilca
9bd4110bb5 If we check for errors then let's check all of them.
This commit was SVN r5683.
2005-05-11 04:26:14 +00:00
George Bosilca
f9ae5a282e We are here in a macro. The arguments of the macro should be protected. Otherwise the compiler
will get confused with the precedence of the operators.

This commit was SVN r5682.
2005-05-11 04:23:47 +00:00
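Editor's sketch of the precedence problem that r5682 refers to. The macro names are illustrative assumptions; the commit does not say which macro was fixed.

```c
/* Unprotected: the argument is pasted as-is, so SQUARE_UNSAFE(1 + 2)
   expands to (1 + 2 * 1 + 2), which is 5. */
#define SQUARE_UNSAFE(x) (x * x)

/* Protected: every use of the argument is parenthesized, so
   SQUARE_SAFE(1 + 2) expands to ((1 + 2) * (1 + 2)), which is 9. */
#define SQUARE_SAFE(x) ((x) * (x))
```

Parenthesizing each argument use and the whole macro body keeps the expansion independent of the operators in the caller's expression.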
George Bosilca
f0adb8b4fd Adapt to the new PML interface.
This commit was SVN r5681.
2005-05-11 03:59:57 +00:00
Jeff Squyres
bc6f5a83c4 Fix a few more header installation directories
This commit was SVN r5679.
2005-05-10 23:56:23 +00:00
Jeff Squyres
6bda1ed699 Fix installation directory for the header files
This commit was SVN r5678.
2005-05-10 23:52:35 +00:00
Brian Barrett
caf8551001 * checkpoint - long messages are causing segfaults in the PML, but need to
stop for a bit

This commit was SVN r5675.
2005-05-10 20:42:57 +00:00
Brian Barrett
2ec27c0927 * add ability to respond to RNDV packets with ACKs. short MPI_Ssends now
work properly.  Still need to implement second fragment support

This commit was SVN r5674.
2005-05-10 19:39:21 +00:00
George Bosilca
956782670a One more printf with size_t solved.
This commit was SVN r5673.
2005-05-10 17:25:56 +00:00
Jeff Squyres
f8b1e19076 - Add a few help messages
- app->num_procs changed to a size_t, which hosed the initialization
  of its value to -1 (not sure why the compiler didn't complain
  #$%@#$%), which was there to catch the case when the user forgot to
  specify -np (or some other equivalent).  Fixed.

This commit was SVN r5672.
2005-05-10 17:14:53 +00:00
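Editor's sketch of why the `-1` initialization in r5672 broke once the field became a `size_t`: the value wraps to `SIZE_MAX`, so a test such as `n < 0` can never be true. The sentinel name `NUM_PROCS_UNSET` and the helper are hypothetical and only show one way to keep an "unset" marker for an unsigned field.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical sentinel: the largest size_t marks "user gave no -np". */
#define NUM_PROCS_UNSET SIZE_MAX

/* A signed-style check (n < 0) is always false for size_t, so compare
   against the sentinel explicitly instead. */
static bool num_procs_is_set(size_t n)
{
    return n != NUM_PROCS_UNSET;
}
```

Assigning `-1` to a `size_t` is well defined in C and produces `SIZE_MAX`, which is why the compiler may accept it silently.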
Josh Hursey
5d1e2c53b0 fix some library path issues
This commit was SVN r5671.
2005-05-10 16:16:36 +00:00
Brian Barrett
0cd4d15824 * Update to match Tim's changes to the PML
* Couple of improvements towards handling ACKs properly

This commit was SVN r5670.
2005-05-10 15:53:41 +00:00
Brian Barrett
eeba1b9a72 * re-enable making DSOs of the TEG PML
This commit was SVN r5669.
2005-05-10 14:56:45 +00:00
Jeff Squyres
6d8ad2b1bd When the argv add functions were changed to nosize, we forgot to
update num_env.  Doh!

This commit was SVN r5668.
2005-05-10 14:50:55 +00:00
Jeff Squyres
5d1685edbc Add unignore for jsquyres@mauve.lanl.gov
This commit was SVN r5665.
2005-05-10 12:02:36 +00:00
Jeff Squyres
98f00ce592 Add missing .h file
This commit was SVN r5663.
2005-05-10 11:45:42 +00:00
Jeff Squyres
f374559d14 Add a proper test for socklen_t
This commit was SVN r5662.
2005-05-10 11:25:22 +00:00
Tim Woodall
6a565b3fb7 rollback
This commit was SVN r5660.
2005-05-09 21:07:53 +00:00
Tim Woodall
d54c6fafe0 cleanup for data structure reorg
This commit was SVN r5659.
2005-05-09 21:05:54 +00:00
George Bosilca
876dfc4cfc Don't add the fragment to the endpoint before we have received some data.
This commit was SVN r5658.
2005-05-09 20:20:29 +00:00
Tim Woodall
6ab861945f corrections for data structure reorg
This commit was SVN r5657.
2005-05-09 20:03:58 +00:00
Tim Woodall
10c8338a04 moved up
This commit was SVN r5656.
2005-05-09 19:42:32 +00:00
Tim Woodall
3d3151abb3 reorg to support new pml - move ptl/teg specific data
from pml_base_* data structures
*** NOTE ***
this will break gm/ib - I will fix these shortly

This commit was SVN r5655.
2005-05-09 19:37:10 +00:00
Josh Hursey
276cf4dbd5 donno how that one creeped in. rollback this file
This commit was SVN r5653.
2005-05-09 18:03:29 +00:00
Josh Hursey
f3c537598b Export the rest of the datatypes.
This commit was SVN r5652.
2005-05-09 17:47:31 +00:00
Brian Barrett
1af69c389d * merge xgrid pls/ras components from bwbw-xgrid tmp branch into the trunk
This commit was SVN r5651.
2005-05-09 16:28:05 +00:00
Josh Hursey
82b407061f windows checkpoint
This commit was SVN r5650.
2005-05-09 16:23:00 +00:00
Brian Barrett
55310951f6 * some fixes to make SPARC asm compile. Still fails to execute, unfortunately
This commit was SVN r5649.
2005-05-09 16:10:13 +00:00
Josh Hursey
2164397a99 minor addition for dps code in Windows
This commit was SVN r5646.
2005-05-09 14:12:46 +00:00
Thara Angskun
74180750b7 - just check point
This commit was SVN r5645.
2005-05-09 07:14:58 +00:00
Jeff Squyres
a28b5ae43b Fix for a bunch of size_t issues; reviewed by George and Ralph.
- Change all uses of *printf'ing a size_t to use an explicit cast to
  (unsigned long) and the %lu escape
- change ORTE_GPR_REPLICA_MAX_SIZE to INT_MAX until bug 1345 is fixed
  (i.e., until we allow size_t in MCA params)
- ns_base_local_fns.c:orte_ns_base_get_proc_name_string(): changed
  from %0X -> %lu
- ORTE_NAME_ARGS added explicit (unsigned long) casts, and changed all
  usages of ORTE_NAME_ARGS to use %lu's

This commit was SVN r5644.
2005-05-08 13:22:55 +00:00
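Editor's sketch of the convention adopted in r5644 (and earlier in r5568): cast a `size_t` to `unsigned long` and print it with `%lu`. The helper name `format_size` is an assumption for illustration. C99 also provides `%zu`, and on platforms where `unsigned long` is narrower than `size_t` the cast can truncate large values.

```c
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper: write value into buf as decimal text using the
   cast-to-unsigned-long convention. Returns what snprintf returns, i.e.
   the number of characters needed (excluding the terminator). */
static int format_size(char *buf, size_t cap, size_t value)
{
    return snprintf(buf, cap, "%lu", (unsigned long) value);
}
```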
George Bosilca
1d87c3a87b Take in account the current index.
This commit was SVN r5643.
2005-05-08 05:52:00 +00:00
Thara Angskun
3f63c4db97 - nothing but checkpoint.
This commit was SVN r5642.
2005-05-08 03:06:36 +00:00
Thara Angskun
e951cbde90 Fixed poe priority bug.
This commit was SVN r5641.
2005-05-07 22:48:29 +00:00
Ralph Castain
84889eb9fd Missed one more memory cleanup...
This commit was SVN r5640.
2005-05-07 15:01:21 +00:00
Ralph Castain
fdbc12f412 Fix the missing segment name correctly.
As an FYI: the pack/unpack routines should be happy with a NULL string (and appear to be so). Issue here was that the constructor was not called, which means that the string pointer was not initialized to NULL as it ordinarily would have been.

This commit was SVN r5639.
2005-05-07 14:59:35 +00:00
Jeff Squyres
d9a4a30be7 Make ROMIO play nice and recognize OMPI
This commit was SVN r5638.
2005-05-07 10:30:30 +00:00
Jeff Squyres
f8a22760c6 @%#$%# Another one slipped through (static builds)
This commit was SVN r5637.
2005-05-07 10:19:28 +00:00
Jeff Squyres
02d64ed82b Oops -- missed this one (caught in the nightly builds).
This commit was SVN r5636.
2005-05-07 09:31:10 +00:00
George Bosilca
20b61ce558 To be or not to be ... I still don't know the answer, but at least I know that this field has
to be set ("" seems a good value) before calling pack.

This commit was SVN r5635.
2005-05-07 07:50:14 +00:00
George Bosilca
6d7025ce4e Always respect the size of the buffer. Stop printing when we reach the space reserved on the buffer.
This commit was SVN r5633.
2005-05-07 04:28:15 +00:00
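Editor's sketch of bounded printing in the spirit of r5633, assuming C99 `vsnprintf` semantics (it never writes more than the given size and returns the length the full text would have needed). The helper `append_bounded` is hypothetical and is not the code from the commit.

```c
#include <stdarg.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper: append formatted text at buf + *used without ever
   writing past buf + cap. Returns 0 on success, -1 once the buffer is
   full or the text had to be truncated. */
static int append_bounded(char *buf, size_t cap, size_t *used,
                          const char *fmt, ...)
{
    va_list ap;
    size_t room;
    int n;

    if (*used >= cap) {
        return -1;
    }
    room = cap - *used;

    va_start(ap, fmt);
    n = vsnprintf(buf + *used, room, fmt, ap);
    va_end(ap);

    if (n < 0) {
        return -1;
    }
    if ((size_t) n >= room) {
        *used = cap - 1;   /* truncated: terminator sits at buf[cap - 1] */
        return -1;
    }
    *used += (size_t) n;
    return 0;
}
```

A caller can stop printing further fields as soon as the helper returns -1.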
Jeff Squyres
539653cf18 Arrgh -- one mistake slipped by in the last commit. Fixed.
This commit was SVN r5630.
2005-05-06 22:22:53 +00:00
Jeff Squyres
fcb07baa32 Next step in reducing some configure/build time:
1. Instead of removing various src/ component directories, simply
   "flatten" the Makefile.am structure by having only a single
   top-level Makefile.am for the component, and having it include
   src/Makefile.extra (which is where the source files are listed).
   This effectively makes the build faster because "make" does not
   traverse down into src/, and we don't build a Makefile for that
   directory.
2. Did end up moving topo/unity/src/* into topo/unity, which is where
   I figured out that option #1 would be a bit easier (and safer,
   considering that other developers are actively working in various
   src/ directories -- moving things around while they're working
   would be Bad!)
3. Did not consolidate most of the io/romio component because of the
   nightmare of sym links (especially w.r.t. VPATH builds) in the
   included ROMIO distribution.  I wasted too much time trying to get
   that stuff right and finally gave up -- this is a "low hanging
   fruit" optimization, after all.

This commit was SVN r5629.
2005-05-06 21:50:30 +00:00
Jeff Squyres
e9d7cd054a Function for George: allow incrementing the reference count on a
component to delay its dlclose().

This commit was SVN r5627.
2005-05-06 21:28:06 +00:00
Jeff Squyres
40112a3a9e Not sure how that snuck in there...
This commit was SVN r5626.
2005-05-06 21:06:14 +00:00
Brian Barrett
5e06fe42d1 * checkpoint to record small victory - implement start of send and receive
  - sends/recvs short messages (less than first frag size)
  - does not properly ACK messages, so Ssend() is borked
  - leaks memory like there's no tomorrow
  - don't use it just yet

This commit was SVN r5625.
2005-05-06 20:15:28 +00:00
Ralph Castain
929fc76cfa Add the application name to the returned values to support TotalView interface.
This commit was SVN r5624.
2005-05-06 18:42:25 +00:00
Ralph Castain
659d57f300 Several things in this commit - shouldn't impact any existing work:
1. Added pid_t to the dps

2. Processes now "register" their local pid and update their location (i.e., nodename) on the registry during mpi_init

3. Added a new error code for values that exceed maximum for their data type (useful when transitioning a value from one variable to another of different size)

4. Fixed a few places where size_t was being incorrectly handled

5. Updated dps_test to cover pid_t types

This should now provide support for TotalView connection - which David is pursuing.

This commit was SVN r5623.
2005-05-06 17:00:06 +00:00
Brian Barrett
22685e5055 * I think Edgar meant to assign a pointer to comm_self, not comm_self itself.
This lets things compile again

This commit was SVN r5622.
2005-05-06 15:41:00 +00:00
Brian Barrett
fc16d60236 * Don't use poll() on OS X. It's implemented on top of select in 10.3 (no
biggie), so we gain nothing there.  On 10.4, it's implemented directly,
  but doesn't support devices (which messes up pty support and IO
  forwarding).

This commit was SVN r5621.
2005-05-06 15:10:35 +00:00
Edgar Gabriel
d6bd2fdeab fix for the self_atexit test in the ibm-testsuite
This commit was SVN r5620.
2005-05-06 14:22:59 +00:00
Jeff Squyres
1bbd4df914 Add missing header file
This commit was SVN r5619.
2005-05-06 12:26:32 +00:00
George Bosilca
d60565e043 Handle the alignment problems. The same error as in the email from Ferris McCormick showed up
on all 64-bit architectures. The problem was that, for unpack, the source pointer was cast to a
specific type (uint32_t for 32-bit data) and then hton* was applied. The result was ... unexpected.

This patch always memcpys the data into a temporary variable of the correct size before calling the
ntoh* functions, so we can ensure that the data is always correctly aligned.

Moreover, I added a debugging layer. OMPI_OUTPUT is used to print out the data being packed and
unpacked. It generates a lot of output but hopefully allows us to spot a few bugs. This layer is not
complete; the output stream descriptor is set to -1 (no output).

This commit was SVN r5617.
2005-05-05 23:59:59 +00:00
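Editor's sketch of the memcpy-then-convert pattern described in r5617, assuming POSIX `ntohl` from `<arpa/inet.h>`. The function name `unpack_u32` is an illustrative assumption and is not the DPS routine itself.

```c
#include <arpa/inet.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: read a 32-bit network-order value from a buffer
   position that may not be 4-byte aligned. Copying into a properly
   aligned temporary avoids dereferencing a misaligned uint32_t pointer. */
static uint32_t unpack_u32(const void *src)
{
    uint32_t tmp;

    memcpy(&tmp, src, sizeof(tmp));
    return ntohl(tmp);
}
```

The byte-wise copy is what makes this safe on architectures that trap on misaligned loads; the conversion itself is unchanged.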
George Bosilca
4c04dde93d argc is just an int and we tried to pack it as a size_t. Well, it didn't work :)
Anyway, now I'm able to run on several 64-bit architectures (Athlon and G5), so
I suppose that we are back online on 64 bits.

This commit was SVN r5616.
2005-05-05 23:05:05 +00:00
George Bosilca
0b70b3d7c1 rc is not initialized. But if we found the name on the list then the operation is successful.
This commit was SVN r5615.
2005-05-05 23:00:09 +00:00
George Bosilca
9387013ca2 If it's an unsigned long then the format string should be %lu.
This commit was SVN r5614.
2005-05-05 22:58:47 +00:00
Ralph Castain
8167c24630 Fix a minor bug that reported an error when one didn't really exist.
This commit was SVN r5612.
2005-05-05 20:23:19 +00:00
Brian Barrett
faf3d438bb * Put all the source files in src/, as it was getting really annoying having
everything in one directory.  Still have only one Makefile, so it shouldn't
  change build time at all
* Now that I finally understand the header system for data, refactor a little
  bit of the code to match what really should be happening
* start of a hacked up send() - puts the data for a 0 byte message on the
  other side, and all the pointers are where i think they should be.  So
  my plan of attack will work.  But I think I'm going to have to use
  iovecs instead of memcpy() real soon now.

This commit was SVN r5610.
2005-05-05 19:53:50 +00:00
Ralph Castain
863eac2df8 Checkpoint start of setting up a head node process, fix a minor bug.
This commit was SVN r5609.
2005-05-05 19:20:47 +00:00
Ralph Castain
20ef03e6e5 Bring revised universe existence check to main branch. Passed singleton, orterun, and orterun attaching to persistent daemon tests.
This commit was SVN r5608.
2005-05-05 19:14:51 +00:00
Ralph Castain
f36260331d Take a shot at fixing the sparc alignment problem...
This commit was SVN r5607.
2005-05-05 18:45:16 +00:00
Tim Woodall
3e3b90a6a1 resolved 64 bit issue
This commit was SVN r5606.
2005-05-05 18:25:56 +00:00
Tim Woodall
9af1f10149 num_procs should be size_t
This commit was SVN r5605.
2005-05-05 18:25:07 +00:00
Brian Barrett
0fb11d2ab1 * checkpoint, unusable, but want to rename a file and it's easier without
changes pending

This commit was SVN r5604.
2005-05-05 17:53:12 +00:00
Tim Woodall
754dc71177 revised oob ping so that it doesnt attempt to setup/use a persistent connection
This commit was SVN r5601.
2005-05-05 16:31:40 +00:00
George Bosilca
b429c73240 Now we have a way to specify whether we want the PML to take care of one or two PTLs per peer. If only
one is selected it will be used for all purposes: small messages and long messages (even if the
long message is still split into several fragments). For the case where 2 PTLs per peer exist,
the first one is for latency (small messages and rendezvous requests) while the second one
will be used for bandwidth.

This commit was SVN r5600.
2005-05-05 04:35:18 +00:00
George Bosilca
f6aeee4502 Remove all references to the previously removed ptl_array.
This commit was SVN r5599.
2005-05-05 04:19:48 +00:00
George Bosilca
cb509df2ee More fixes. Remove useless files. Decrease the memory footprint for this PML.
This commit was SVN r5598.
2005-05-05 04:17:52 +00:00
Jeff Squyres
6ef5b7283f Fix problem found by Greg Watson -- in the case of an option that is
simply "-", fix a corner case where it looked like that was a valid
short option.

This commit was SVN r5597.
2005-05-05 02:02:01 +00:00
George Bosilca
457083e148 Always activate it for the user bosilca.
Jeff sent me the way to do that automatically, and I'm pretty sure I'm not the only one who misses some
of the functionality of our build system. The idea is really cool: let only the developer of a
component have it active until it reaches a stable state. For everyone else, the .ompi_ignore
file prevents them from compiling the component.

cd src/mca/pml/uniq
echo $USER > .ompi_unignore
svn add .ompi_unignore
svn ci .ompi_unignore

This commit was SVN r5595.
2005-05-05 01:25:08 +00:00
George Bosilca
536a5c5d09 The UNIQ PML is here. It compiles with a few warnings, and it still has a few bugs. It's here just as a checkpoint.
The idea behind this PML is to minimize the overhead of managing multiple PTLs. For each node, UNIQ keeps two PTLs:
one for latency and one for bandwidth. In the next version I want to add a configure parameter to allow the user
to select how many PTLs they want: one or two.

This commit was SVN r5593.
2005-05-04 17:08:08 +00:00
Brian Barrett
d3f3d08b59 * dumb, minor changes - need to switch machines
This commit was SVN r5592.
2005-05-04 16:59:37 +00:00
Ralph Castain
069f648c93 Cleanup some warning messages in the nightly build per Jeff
This commit was SVN r5590.
2005-05-04 14:01:25 +00:00
Brian Barrett
99389ee9f2 * add --no-daemonize flag to orted to prevent it from doing the fork()/exec()/
setsid() routine at startup.  Needed for some of the batch scheduling
  starters.

This commit was SVN r5587.
2005-05-04 11:57:47 +00:00
George Bosilca
ae40471a30 Modify to reflect the change in the subdirs (removal of src directory).
This commit was SVN r5586.
2005-05-04 01:55:19 +00:00
George Bosilca
4cc56fea8c Per Jeff's suggestion, nuke the src directory.
This commit was SVN r5585.
2005-05-04 01:51:32 +00:00
Jeff Squyres
9e8dfd0882 Per George's suggestion, move the DPS init to its proper place.
This commit was SVN r5582.
2005-05-04 00:11:36 +00:00
Brian Barrett
cffb3d031e * checkpoint. Nothing works, as I just realized my entire design was
based around PTL_MD_MAX_SIZE, which apparently isn't implemented in
  Cray's Portals implementation.  Time to rethink that design :/

This commit was SVN r5576.
2005-05-02 20:31:42 +00:00
Tim Woodall
07e4cf840f resolve packing issue w/ int64
This commit was SVN r5575.
2005-05-02 18:30:07 +00:00
Tim Woodall
5c9f4c75d1 fix typo
This commit was SVN r5574.
2005-05-02 16:51:01 +00:00
Tim Woodall
8c8c3eb86e checkpoint/reorg
This commit was SVN r5573.
2005-05-02 16:40:25 +00:00
Josh Hursey
350d7786b4 some fixes and additions to get Windows libmpi to build.
This commit was SVN r5572.
2005-05-02 15:27:54 +00:00
Ralph Castain
44b83e73ef Fix the print warnings for the name services conversions on names from their binary value to a string.
HEADS UP: string versions of names are now presented in DECIMAL format - not HEX as they previously were. If you used the name services functions (as you were supposed to do) to access these names, you will not have any problems. If you did it yourself, then you need to fix it - my suggestion would be that you fix your code by using the name service functions to avoid future problems.

This commit was SVN r5571.
2005-05-02 15:06:13 +00:00
Ralph Castain
931924397c Fix several minor things:
1. *correctly* fix the printing of size_t variables. Need to do this through a #define, not just typecast things. Thanks to Jeff/Brian for suggesting a cleaner way to do it (as opposed to just doing the #define at the print location). Note that not ALL of the prints have been "fixed" yet - will continue to identify them.

2. Add int64 and size_t to the pack/unpack unit tests.

3. Fix a bug in the int64 pack/unpack system.

This commit was SVN r5570.
2005-05-02 14:48:57 +00:00
Brian Barrett
68697d9dab * add all the run-time tweakable parameters
* start trying to figure out the sendfrag code
* start of code to initialize recv buffer

This commit was SVN r5569.
2005-05-01 19:55:16 +00:00
George Bosilca
54c1739515 size_t is definitely a strange type. There is no way to print it correctly on all platforms. So here is
the trick: I decided to always print it as an unsigned long and explicitly cast everything to this type.
Thus, I changed all printf formats from %d to %lu and cast all arguments to the correct type (unsigned long).

This commit was SVN r5568.
2005-05-01 19:31:25 +00:00
George Bosilca
948a01230c Initialize the DPS framework before we start playing with the arguments. It's just a quick temporary fix so
everybody can still play with ompi_info.

This commit was SVN r5566.
2005-05-01 16:33:17 +00:00