1
1
Граф коммитов

3850 Коммитов

Автор SHA1 Сообщение Дата
Brian Barrett
0c6eaaebe3 * start cleaning up debugging output (still much to do)
* make buffers really big so that we pass allocmem until we figure out
  why we're not flow controlling as I expected
* set event queue to invalid intially and use that as the enabled test
  rather than a seperate bool - shrinks the module a bit
* add dropped count checks, with a panic if one occurs.  Still need to
  implement some type of retransmit logic.

This commit was SVN r5704.
2005-05-12 21:28:48 +00:00
Brian Barrett
e2c2c72b84 Changes to pass allocmem IBM test
- don't free the send buffer unless the converter tells us we need to
  - properly do the math to determine when the receive buffer has been
    fully used and unlinked itself

This commit was SVN r5703.
2005-05-12 19:52:51 +00:00
Jeff Squyres
f5657fb8ee For the rsh pls, if the launch is on the local node, just exec it --
don't bother using the launching agent (typically rsh or ssh).

This commit was SVN r5702.
2005-05-12 19:12:53 +00:00
Jeff Squyres
544f9dd780 Fix silly string error (missing +2 in the len calculation, so just
replace it with asprintf).  Reviewed by Brian.

This commit was SVN r5700.
2005-05-12 18:56:05 +00:00
Brian Barrett
189a536685 * Fix incorrect logic in orted so that --no-daemonize works as intended
* Minor formatting fixes in XGrid RAS component
* Code cleanup in XGrid PLS component:
  - If we can't get daemon contact information, kill the job at the XGrid
    level
  - Add MCA parameter pls_xgrid_delete_job that will delete the job from
    XGrid when complete (this seems like standard behavior, so it's the
    default)
  - Remove compiler warning about getting the name of a XGGrid object
  - Properly populate the daemon information for the killing code

This commit was SVN r5697.
2005-05-12 16:48:41 +00:00
Josh Hursey
4b60235383 remove unnecessary exclusion for Windows which was killing the Windows nightly build
This commit was SVN r5695.
2005-05-12 14:37:40 +00:00
Brian Barrett
decc74d15c * Enable the XGrid components. Only do anything if the XGrid contact info
variables are set.
* show the RAS priority in ompi_info

This commit was SVN r5694.
2005-05-12 03:33:59 +00:00
George Bosilca
4ef1d70034 snprintf does not really do what we expect. In some situations it will write
more than we have asked for (on my G5). Anyway now I hope I have enought memory to printout
the full description of the datatype.

This commit was SVN r5690.
2005-05-11 21:30:56 +00:00
Brian Barrett
c477907166 * ignore UNLINK messages earlier in the chian (if Portals supports them)
* process long message fragments properly

This commit was SVN r5689.
2005-05-11 20:22:18 +00:00
Josh Hursey
cc6cb5cac5 Checkpoint on Windows build.
Many changes to headers for OMPI_DECLSPEC, and 
proper placement of c_plusplus defines in those files.

mca/gpr/replica and tools are the two sets of directories
that still need work for the Windows build for this pass.

This commit was SVN r5688.
2005-05-11 20:21:10 +00:00
George Bosilca
6714dfac4e Remove all useless checks that size_t is greater or equal to zero.
This commit was SVN r5687.
2005-05-11 14:19:48 +00:00
George Bosilca
e940ab43b8 Optionally disable the tests.
This commit was SVN r5686.
2005-05-11 14:17:01 +00:00
Thara Angskun
55538e100d - another check point
- able to launch job (sort of)...but not correctly clean up, mpirun hang, blowup terminal etc.

This commit was SVN r5685.
2005-05-11 09:09:55 +00:00
George Bosilca
1a9cef70fb checkpoint
This commit was SVN r5684.
2005-05-11 06:30:10 +00:00
George Bosilca
9bd4110bb5 If we check for errors then let's check all of them.
This commit was SVN r5683.
2005-05-11 04:26:14 +00:00
George Bosilca
f9ae5a282e We are here in a macro. The arguments of the macro should be protected. Otherwise the compiler
will get confused with the precedence of the operators.

This commit was SVN r5682.
2005-05-11 04:23:47 +00:00
George Bosilca
f0adb8b4fd Adapt to the new PML interface.
This commit was SVN r5681.
2005-05-11 03:59:57 +00:00
Jeff Squyres
bc6f5a83c4 Fix a few more header installation directories
This commit was SVN r5679.
2005-05-10 23:56:23 +00:00
Jeff Squyres
6bda1ed699 Fix installation directory for the header files
This commit was SVN r5678.
2005-05-10 23:52:35 +00:00
Brian Barrett
caf8551001 * checkpoint - long messages are causing segfaults in the PML, but need to
stop for a bit

This commit was SVN r5675.
2005-05-10 20:42:57 +00:00
Brian Barrett
2ec27c0927 * add ability to respond to RNDV packets with ACKs. short MPI_Ssends now
work properly.  Still need to implement second fragment support

This commit was SVN r5674.
2005-05-10 19:39:21 +00:00
George Bosilca
956782670a One more printf with size_t solved.
This commit was SVN r5673.
2005-05-10 17:25:56 +00:00
Jeff Squyres
f8b1e19076 - Add a few help messages
- app->num_procs changed to a size_t, which hosed the initialization
  of its value to -1 (not sure why the compiler didn't complain
  #$%@#$%), which was there to catch the case when the user forgot to
  specify -np (or some other equivalent).  Fixed.

This commit was SVN r5672.
2005-05-10 17:14:53 +00:00
Josh Hursey
5d1e2c53b0 fix some library path issues
This commit was SVN r5671.
2005-05-10 16:16:36 +00:00
Brian Barrett
0cd4d15824 * Update to match Tim's changes to the PML
* Couple of improvements towards handling ACKs properly

This commit was SVN r5670.
2005-05-10 15:53:41 +00:00
Brian Barrett
eeba1b9a72 * re-enable making DSOs of the TEG PML
This commit was SVN r5669.
2005-05-10 14:56:45 +00:00
Jeff Squyres
6d8ad2b1bd When the argv add functions were changed to nosize, we forgot to
update num_env.  Doh!

This commit was SVN r5668.
2005-05-10 14:50:55 +00:00
Jeff Squyres
5d1685edbc Add unignore for jsquyres@mauve.lanl.gov
This commit was SVN r5665.
2005-05-10 12:02:36 +00:00
Jeff Squyres
98f00ce592 Add missing .h file
This commit was SVN r5663.
2005-05-10 11:45:42 +00:00
Jeff Squyres
f374559d14 Add a proper test for socklen_t
This commit was SVN r5662.
2005-05-10 11:25:22 +00:00
Tim Woodall
6a565b3fb7 rollback
This commit was SVN r5660.
2005-05-09 21:07:53 +00:00
Tim Woodall
d54c6fafe0 cleanup for data structure reorg
This commit was SVN r5659.
2005-05-09 21:05:54 +00:00
George Bosilca
876dfc4cfc Dont add the fragment to the endpoint before we have erceived some data.
This commit was SVN r5658.
2005-05-09 20:20:29 +00:00
Tim Woodall
6ab861945f corrections for data structure reorg
This commit was SVN r5657.
2005-05-09 20:03:58 +00:00
Tim Woodall
10c8338a04 moved up
This commit was SVN r5656.
2005-05-09 19:42:32 +00:00
Tim Woodall
3d3151abb3 reorg to support new pml - move ptl/teg specific data
from pml_base_* data structures
*** NOTE ***
this will break gm/ib - I will fix these shortly

This commit was SVN r5655.
2005-05-09 19:37:10 +00:00
Josh Hursey
276cf4dbd5 donno how that one creeped in. rollback this file
This commit was SVN r5653.
2005-05-09 18:03:29 +00:00
Josh Hursey
f3c537598b Export the rest of the datatypes.
This commit was SVN r5652.
2005-05-09 17:47:31 +00:00
Brian Barrett
1af69c389d * merge xgrid pls/ras components from bwbw-xgrid tmp branch into the trunk
This commit was SVN r5651.
2005-05-09 16:28:05 +00:00
Josh Hursey
82b407061f windows checkpoint
This commit was SVN r5650.
2005-05-09 16:23:00 +00:00
Brian Barrett
55310951f6 * some fixes to make SPARC asm compile. Still fails to execute, unfortunately
This commit was SVN r5649.
2005-05-09 16:10:13 +00:00
Josh Hursey
2164397a99 minor addition for dps code in Windows
This commit was SVN r5646.
2005-05-09 14:12:46 +00:00
Thara Angskun
74180750b7 - just check point
This commit was SVN r5645.
2005-05-09 07:14:58 +00:00
Jeff Squyres
a28b5ae43b Fix for a bunch of size_t issues; reviewed by George and Ralph.
- Change all uses of *printf'ing a size_t to use an explicit cast to
  (unsigned long) and the %lu escape
- change ORTE_GPR_REPLICA_MAX_SIZE to INT_MAX until bug 1345 is fixed
  (i.e., until we allow size_t in MCA params)
- ns_base_local_fns.c:orte_ns_base_get_proc_name_string(): changed
  from %0X -> %lu
- ORTE_NAME_ARGS added explicit (unsigned long) casts, and changed all
  usages of ORTE_NAME_ARGS to use %lu's

This commit was SVN r5644.
2005-05-08 13:22:55 +00:00
George Bosilca
1d87c3a87b Take in account the current index.
This commit was SVN r5643.
2005-05-08 05:52:00 +00:00
Thara Angskun
3f63c4db97 - nothing but checkpoint.
This commit was SVN r5642.
2005-05-08 03:06:36 +00:00
Thara Angskun
e951cbde90 Fixed poe priority bug.
This commit was SVN r5641.
2005-05-07 22:48:29 +00:00
Ralph Castain
84889eb9fd Missed one more memory cleanup...
This commit was SVN r5640.
2005-05-07 15:01:21 +00:00
Ralph Castain
fdbc12f412 Fix the missing segment name correctly.
As an FYI: the pack/unpack routines should be happy with a NULL string (and appear to be so). Issue here was that the constructor was not called, which means that the string pointer was not initialized to NULL as it ordinarily would have been.

This commit was SVN r5639.
2005-05-07 14:59:35 +00:00
Jeff Squyres
d9a4a30be7 Make ROMIO play nice and recognize OMPI
This commit was SVN r5638.
2005-05-07 10:30:30 +00:00
Jeff Squyres
f8a22760c6 @%#$%# Another one slipped through (static builds)
This commit was SVN r5637.
2005-05-07 10:19:28 +00:00
Jeff Squyres
02d64ed82b Oops -- missed this one (caught in the nightly builds).
This commit was SVN r5636.
2005-05-07 09:31:10 +00:00
George Bosilca
20b61ce558 To be or not to be ... I still dont know the answer but at least I know that this field have
to be set ("" seems a good value) before calling pack.

This commit was SVN r5635.
2005-05-07 07:50:14 +00:00
George Bosilca
6d7025ce4e Always respect the size of the buffer. Stop printing when we reach the space reserved on the buffer.
This commit was SVN r5633.
2005-05-07 04:28:15 +00:00
Jeff Squyres
539653cf18 Arrgh -- one mistake slipped by in the last commit. Fixed.
This commit was SVN r5630.
2005-05-06 22:22:53 +00:00
Jeff Squyres
fcb07baa32 Next step in reducing some configure/build time:
1. Instead of removing various src/ component directories, simply
   "flatten" the Makefile.am structure by having only a single
   top-level Makefile.am for the component, and having it include
   src/Makefile.extra (which is where the source files are listed).
   This effectively makes the build faster because "make" does not
   traverse down into src/, and we don't build a Makefile for that
   directory.
2. Did end up moving topo/unity/src/* into topo/unity, which is where
   I figured out that option #1 would be a bit easier (and safer,
   considering that other developers are actively working in various
   src/ directories -- moving things around while they're working
   would be Bad!)
3. Did not consolidate most of the io/romio component because of the
   nightmare of sym links (especially w.r.t. VPATH builds) in the
   included ROMIO distribution.  I wasted too much time trying to get
   that stuff right and finally gave up -- this is a "low hanging
   fruit" optimization, after all.

This commit was SVN r5629.
2005-05-06 21:50:30 +00:00
Jeff Squyres
e9d7cd054a Function for George: allow incrementing the reference count on a
component to delay its dlclose().

This commit was SVN r5627.
2005-05-06 21:28:06 +00:00
Jeff Squyres
40112a3a9e Not sure how that snuck in there...
This commit was SVN r5626.
2005-05-06 21:06:14 +00:00
Brian Barrett
5e06fe42d1 * checkpoint to record small victory - implement start of send and receive
- sends/recvs short messages (less than first frag size)
  - does not properly ACK messages, so Ssend() is borked
  - leaks memory like there's no tomorrow
  - don't use it just yet

This commit was SVN r5625.
2005-05-06 20:15:28 +00:00
Ralph Castain
929fc76cfa Add the application name to the returned values to support TotalView interface.
This commit was SVN r5624.
2005-05-06 18:42:25 +00:00
Ralph Castain
659d57f300 Several things in this commit - shouldn't impact any existing work:
1. Added pid_t to the dps

2. Processes now "register" their local pid and update their location (i.e., nodename) on the registry during mpi_init

3. Added a new error code for values that exceed maximum for their data type (useful when transitioning a value from one variable to another of different size)

4. Fixed a few places where size_t was being incorrectly handled

5. Updated dps_test to cover pid_t types

This should now provide support for TotalView connection - which David is pursuing.

This commit was SVN r5623.
2005-05-06 17:00:06 +00:00
Brian Barrett
22685e5055 * I think Edgar meant to assign a pointer to comm_self, not comm_self itself.
This lets things compile again

This commit was SVN r5622.
2005-05-06 15:41:00 +00:00
Brian Barrett
fc16d60236 * Don't use poll() on OS X. It's implemented on top of select in 10.3 (no
biggie), so we gain nothing there.  On 10.4, it's implemented directly,
  but doesn't support devices (which messes up pty support and IO
  forwarding).

This commit was SVN r5621.
2005-05-06 15:10:35 +00:00
Edgar Gabriel
d6bd2fdeab fix for the self_atexit test in the ibm-testsuite
This commit was SVN r5620.
2005-05-06 14:22:59 +00:00
Jeff Squyres
1bbd4df914 Add missing header file
This commit was SVN r5619.
2005-05-06 12:26:32 +00:00
George Bosilca
d60565e043 Handle the alignement problems. The same error as in the email from Ferris McCormick showed up
on all 64 bits architectures. The problem was the for unpack the source pointer was cast to a
specific type (uint32_t for 32 bits data) and then hton* was applied. The result was ... unexpected.

This patch always memcpy the data in a temporary variable with the correct size before calling
ntoh* functions, so we can insure that the data is always correctly aligned.

Moreover I add a debuging layer. OMPI_OUTPUT is used to print out the data being packed and
unpacked. It generate a lot of output but hopefully allow us to spot few bugs. This layer is not
completed the output stream descriptor is set to -1 (no output).

This commit was SVN r5617.
2005-05-05 23:59:59 +00:00
George Bosilca
4c04dde93d argc it's just an int and we try to pack it as a size_t. Well it didn't work :)
Anyway now I'm able to run on several 64 bits architectures (Athlon and G5) so
I suppose that we are back online on 64 bits.

This commit was SVN r5616.
2005-05-05 23:05:05 +00:00
George Bosilca
0b70b3d7c1 rc it's not initialized. But if we found the name on the list then the operation it's succesfull.
This commit was SVN r5615.
2005-05-05 23:00:09 +00:00
George Bosilca
9387013ca2 If it's a unsigned long then the format string should be %lu.
This commit was SVN r5614.
2005-05-05 22:58:47 +00:00
Ralph Castain
8167c24630 Fix a minor bug that reported an error when one didn't really exist.
This commit was SVN r5612.
2005-05-05 20:23:19 +00:00
Brian Barrett
faf3d438bb * Put all the source files in src/, as it was getting really annoying having
everything in one directory.  Still have only one Makefile, so it shouldn't
  change build time at all
* Now that I finally understand the header system for data, refactor a little
  bit of the code to match what really should be happening
* start of a hacked up send() - puts the data for a 0 byte message on the
  other side, and all the pointers are where i think they should be.  So
  my plan of attack will work.  But I think I'm going to have to use
  iovecs instead of memcpy() real soon now.

This commit was SVN r5610.
2005-05-05 19:53:50 +00:00
Ralph Castain
863eac2df8 Checkpoint start of setting up a head node process, fix a minor bug.
This commit was SVN r5609.
2005-05-05 19:20:47 +00:00
Ralph Castain
20ef03e6e5 Bring revised universe existence check to main branch. Passed singleton, orterun, and orterun attaching to persistent daemon tests.
This commit was SVN r5608.
2005-05-05 19:14:51 +00:00
Ralph Castain
f36260331d Take a shot at fixing the sparc alignment problem...
This commit was SVN r5607.
2005-05-05 18:45:16 +00:00
Tim Woodall
3e3b90a6a1 resolved 64 bit issue
This commit was SVN r5606.
2005-05-05 18:25:56 +00:00
Tim Woodall
9af1f10149 num_procs should be size_t
This commit was SVN r5605.
2005-05-05 18:25:07 +00:00
Brian Barrett
0fb11d2ab1 * checkpoint, unuseable, but want to rename a file and it's easier without
changes pending

This commit was SVN r5604.
2005-05-05 17:53:12 +00:00
Tim Woodall
754dc71177 revised oob ping so that it doesnt attempt to setup/use a persistent connection
This commit was SVN r5601.
2005-05-05 16:31:40 +00:00
George Bosilca
b429c73240 Now we have a way to specify if we want the PML to take care of one or two PTLs by peer. If only
one is selected it will be used for all purposes: small messages and long messages (even if the
long message is still split in several fragments). For the case where 2 PTLs per peer exists,
the first one is for latency (small messages and rendez-vous requests) when the second one
will be used for bandwitdh.

This commit was SVN r5600.
2005-05-05 04:35:18 +00:00
George Bosilca
f6aeee4502 Remove all references to the previously removed ptl_array.
This commit was SVN r5599.
2005-05-05 04:19:48 +00:00
George Bosilca
cb509df2ee More fixes. Remove useless files. Decrease the memory footprint for this PML.
This commit was SVN r5598.
2005-05-05 04:17:52 +00:00
Jeff Squyres
6ef5b7283f Fix problem found by Greg Watson -- in the case of an option that is
simply "-", fix a corner case where it looked like that was a valid
short option.

This commit was SVN r5597.
2005-05-05 02:02:01 +00:00
George Bosilca
457083e148 Always activate it for the user bosilca.
Jeff send me the way to do that automatically, and I'm pretty sure I'm not the only one who miss some
of the functionalities of our build system. The idea is really cool, let only the developper of a
component have it active until it reach a stable state. For all others peoples the .ompi_ignore
file prevent them for compiling the component.

cd src/mca/pml/uniq
echo $USER > .ompi_unignore
svn add .ompi_unignore
svn ci .ompi_unignore

This commit was SVN r5595.
2005-05-05 01:25:08 +00:00
George Bosilca
536a5c5d09 The UNIQ PML is here. It does compile with few warnings, and it still have few bugs. It's here just as a checkpoint.
The idea behind this PML is to minimiza the overhead of managing multiple PTL. For each node, UNIQ keep two PTL's
one for latency and one for bandwidth. One the next version I want to add a configure parameter to allow the user
to select how many PTL's he want: one or two.

This commit was SVN r5593.
2005-05-04 17:08:08 +00:00
Brian Barrett
d3f3d08b59 * dumb, minor changes - need to switch machines
This commit was SVN r5592.
2005-05-04 16:59:37 +00:00
Ralph Castain
069f648c93 Cleanup some warning messages in the nightly build per Jeff
This commit was SVN r5590.
2005-05-04 14:01:25 +00:00
Brian Barrett
99389ee9f2 * add --no-daemonize flag to orted to prevent it from doing the fork()/exec()/
setsid() routine at startup.  Needed for some of the batch scheduling
  starters.

This commit was SVN r5587.
2005-05-04 11:57:47 +00:00
George Bosilca
ae40471a30 Modify to reflect the change in the subdirs (removal of src directory).
This commit was SVN r5586.
2005-05-04 01:55:19 +00:00
George Bosilca
4cc56fea8c Per Jeff suggestion nuke the src directory.
This commit was SVN r5585.
2005-05-04 01:51:32 +00:00
Jeff Squyres
9e8dfd0882 Per George's suggestion, move the DPS init to its proper place.
This commit was SVN r5582.
2005-05-04 00:11:36 +00:00
Brian Barrett
cffb3d031e * checkpoint. Nothing works, as I just realized my entire design was
based around PTL_MD_MAX_SIZE, which apparently isn't implemented in
  Cray's Portals implementation.  Time to rethink that design :/

This commit was SVN r5576.
2005-05-02 20:31:42 +00:00
Tim Woodall
07e4cf840f resolve packing issue w/ int64
This commit was SVN r5575.
2005-05-02 18:30:07 +00:00
Tim Woodall
5c9f4c75d1 fix typo
This commit was SVN r5574.
2005-05-02 16:51:01 +00:00
Tim Woodall
8c8c3eb86e checkpoint/reorg
This commit was SVN r5573.
2005-05-02 16:40:25 +00:00
Josh Hursey
350d7786b4 some fixes and additions to get Windows libmpi to build.
This commit was SVN r5572.
2005-05-02 15:27:54 +00:00
Ralph Castain
44b83e73ef Fix the print warnings for the name services conversions on names from their binary value to a string.
HEADS UP: string versions of names are now presented in DECIMAL format - not HEX as they previously were. If you used the name services functions (as you were supposed to do) to access these names, you will not have any problems. If you did it yourself, then you need to fix it - my suggestion would be that you fix your code by using the name service functions to avoid future problems.

This commit was SVN r5571.
2005-05-02 15:06:13 +00:00
Ralph Castain
931924397c Fix several minor things:
1. *correctly* fix the printing of size_t variables. Need to do this through a #define, not just typecast things. Thanks to Jeff/Brian for suggesting a cleaner way to do it (as opposed to just doing the #define at the print location). Note that not ALL of the prints have been "fixed" yet - will continue to identify them.

2. Add int64 and size_t to the pack/unpack unit tests.

3. Fix a bug in the int64 pack/unpack system.

This commit was SVN r5570.
2005-05-02 14:48:57 +00:00
Brian Barrett
68697d9dab * add all the run-time tweakable parameters
* start trying to figure out the sendfrag code
* start of code to initialize recv buffer

This commit was SVN r5569.
2005-05-01 19:55:16 +00:00
George Bosilca
54c1739515 size_t is definitivelly a strange type. There is no way to print it correctly on all platforms. So here is
the trick: I decide to print it always as an unsigned long and explicitly cast everything to this type.
Thus, I change all printf formats from %d to %lu and cast all arguemnts to the correct type (unsigned long).

This commit was SVN r5568.
2005-05-01 19:31:25 +00:00
George Bosilca
948a01230c Initialize the DPS framework before start playing with the arguments. It's just a quick temporary fix so
everybody can still play with ompi_info.

This commit was SVN r5566.
2005-05-01 16:33:17 +00:00
Jeff Squyres
5f8580bf04 Re-add orte-types.h
This commit was SVN r5565.
2005-05-01 09:59:42 +00:00
Jeff Squyres
c75b244737 Don't try to get a lock that we already have.
This commit was SVN r5560.
2005-05-01 09:53:30 +00:00
George Bosilca
7f0e9483ce More printf warnings removed.
This commit was SVN r5558.
2005-05-01 08:27:14 +00:00
George Bosilca
a03ea3e82c Correclt compute the dargs even for the first data.
This commit was SVN r5557.
2005-05-01 08:26:20 +00:00
George Bosilca
b25a5a24be As far as I know there is no standard way to print out a size_t data. Most of the time a %ld should suffice
otherwise we have to cast everything back to int before printing it out. 

This commit was SVN r5556.
2005-05-01 08:25:38 +00:00
George Bosilca
bd75b69fa1 Remove an useless .h file. With this include basically all the files in ompi will get recompiled
when dps_internal.h get touched. Anyway the name say it should be internal to the dps system, so
there is no reason to have it included everywhere.

This commit was SVN r5555.
2005-05-01 08:24:24 +00:00
George Bosilca
6fd89c67b3 Most of the files here require the string.h include on x86 platforms. So I add
it on a common place dps_internal.h protected by all the standard stuff.

This commit was SVN r5554.
2005-05-01 03:50:23 +00:00
Jeff Squyres
bcd4797389 Commit 4 of 4 for bringing the changes over from the hetero branch.
Merged in from:

svn merge -r5506:5553 https://svn.open-mpi.org/svn/ompi/tmp/hetero .

This commit was SVN r5552.

The following SVN revisions from the original message are invalid or
inconsistent and therefore were not cross-referenced:
  r5506
  r5553
2005-05-01 00:58:06 +00:00
Jeff Squyres
6a0282b4a1 Commit 3 of 4 for bringing over the changes from the hetero branch.
Merged from:

svn merge -r5496:5506 https://svn.open-mpi.org/svn/ompi/tmp/hetero .

This commit was SVN r5551.

The following SVN revisions from the original message are invalid or
inconsistent and therefore were not cross-referenced:
  r5496
  r5506
2005-05-01 00:54:12 +00:00
Jeff Squyres
aa70022dc2 Commit 2 of 4 for bringing the changes over from the hetero branch.
Merged in from:

svn merge -r5448:5496 https://svn.open-mpi.org/svn/ompi/tmp/hetero .

This commit was SVN r5550.

The following SVN revisions from the original message are invalid or
inconsistent and therefore were not cross-referenced:
  r5448
  r5496
2005-05-01 00:53:00 +00:00
Jeff Squyres
462adee81a Commit 1 of 4 to bring in the hetero branch to the trunk. Merged in
from:

svn merge -r5440:5448 https://svn.open-mpi.org/svn/ompi/tmp/hetero .

This commit was SVN r5549.

The following SVN revisions from the original message are invalid or
inconsistent and therefore were not cross-referenced:
  r5440
  r5448
2005-05-01 00:47:35 +00:00
George Bosilca
261df8bbdf One more step toward the darray function ... This one it's just a checkpoint ...
This commit was SVN r5544.
2005-04-30 19:37:31 +00:00
George Bosilca
26492277cb Add the code for deadlock detection in the send case. However it's disable by now. Read the bug
http://bugs.osl.iu.edu/Ticket/Display.html?id=1334 for more informations.

This commit was SVN r5543.
2005-04-30 19:36:05 +00:00
Jeff Squyres
b0bd773ec5 Arrgh -- this was a debugging line that should never have been committed.
This commit was SVN r5535.
2005-04-29 21:41:14 +00:00
Jeff Squyres
2e83004126 Use an MCA param for teg's priority
This commit was SVN r5534.
2005-04-29 21:14:31 +00:00
Jeff Squyres
c13c802944 Temporary fix -- will be obviated by the size_t (et al.) fixes coming
soon.

Make ORTE_EXIT_CODE be the same as INT32, not INT8.  This allows the
full propogation of the value returned by waitpid() rather than just
the lowest 8 bites.  Also change the naming of it in orterun to be
exit_status, not exit_code (per POSIX standard naming convention).

orterun now returns the first nonzero exit status that it receives.

This commit was SVN r5530.
2005-04-29 00:36:07 +00:00
Jeff Squyres
ca35c1276c - Minor style fixes
- Return ORTE_SUCCESS, not ORTE_ERR_NOT_IMPLEMENTED

This commit was SVN r5529.
2005-04-29 00:33:40 +00:00
George Bosilca
c6ff7fdbe4 Add more fonctionality to the generic function for pack/unpack. Still some work to do here.
This commit was SVN r5528.
2005-04-28 22:03:16 +00:00
George Bosilca
a554597b72 On MAC the iovec iov_base is of type char*. So we have to explicitly cast in order to
dont get warnings.

This commit was SVN r5527.
2005-04-28 22:02:17 +00:00
George Bosilca
0874dacb9c Less warnings (cvhange some variables from unsigned to signed).
This commit was SVN r5526.
2005-04-28 22:01:05 +00:00
George Bosilca
e910a968ed On AIX the r2 is a protected register (stack/frame pointer). If we use it inside our
functions we destroy the frame pointer for the next call so very very weird things
happened. Like seg fault on i = 50 !!!
Both 32 and 64 bits versions have been modified but only the 32 version have been tested (by lack of ressources).

This commit was SVN r5525.

The following SVN revision numbers were found above:
  r2 --> open-mpi/ompi@58fdc18855
2005-04-28 21:57:31 +00:00
Jeff Squyres
6944839774 Per suggestions from Graham and Abaqus, return the first nonzero exit
status that we get (which naturally returns 0 if all return 0).  This
should pick up nonzero returns from main() after MPI_FINALIZE, but the
gpr is still reporting 0 while testing.  So orterun looks cororect for
this behavior -- investigating what's happening on the ORTE side is a
different commit...

This commit was SVN r5521.
2005-04-28 13:18:52 +00:00
Edgar Gabriel
64f95132c6 minor fix for setting the error codes
This commit was SVN r5519.
2005-04-28 08:36:10 +00:00
Tim Woodall
08ac0a95bc - shared memory current doesn't work across jobs
This commit was SVN r5518.
2005-04-27 20:16:01 +00:00
Tim Woodall
21809c342b correct check for buffered send
This commit was SVN r5516.
2005-04-27 19:46:03 +00:00
Brian Barrett
206b1bace1 * checkpoint - initialization now factored in reasonable way for both
utcp reference and Red Storm (I think)

This commit was SVN r5515.
2005-04-27 16:42:56 +00:00
Brian Barrett
ec4264a925 * fix a bunch of memory badness so that we can get through init :)
This commit was SVN r5513.
2005-04-27 15:03:55 +00:00
Jeff Squyres
fe2522f315 A bunch of changes to support MPI_INTEGER*x, MPI_REAL*x,
MPI_COMPLEX*x, and some optional C datatypes in MPI reduction
operations.  These types are not technically supported by the letter
of the MPI standard, but are implied by the spirit of it (and there
are definitely users that use them in real applications)

- Add checks in configure for back-end C types for MPI_INTEGER*x and
  MPI_REAL*x
- Create C data structs for MPI_COMPLEX*x
- Fixed typo for MPI_INTEGER8 in mpi.h
- Updated configure macros to create MPI_FORTRAN_INTEGER* defines, as
  opposed to MPI_FORTRAN_INT, which was causing [me] lots of confusion
  (between C "*_INT" names and Fortran "*_INT" names).  This caused
  some trivial updates in ddt, ompi_info, and the MPI layer to match.
- Update ompi_info to show whether we have each MPI_INTEGER*x,
  MPI_REAL*x, and MPI_COMPLEX*x
- Extended reduction operations for optional datatypes:
  - "C integer" now includes long long int, long long, and unsigned
    long long
  - "Fortran integer" now includes MPI_INTEGER*x
  - "Floating point" now includes MPI_REAL*x
  - "Complex" now includes MPI_COMPLEX*x

This commit was SVN r5511.
2005-04-27 10:23:06 +00:00
George Bosilca
789a484a88 Allow the selection of th best module.
This commit was SVN r5507.
2005-04-26 18:26:17 +00:00
George Bosilca
6673a62225 It does not really make any difference but at least it will keep some compilers quiet.
This commit was SVN r5504.
2005-04-26 05:28:50 +00:00
George Bosilca
b57ab5fb26 The return of the asprintf is always an int. Not a big deal except when we have
to catch some errors.

This commit was SVN r5503.
2005-04-26 05:27:38 +00:00
George Bosilca
668bc9a2f9 This one is quite nice ...
This commit was SVN r5502.
2005-04-26 05:26:05 +00:00
Brian Barrett
8f73d829d4 * start portals development again. nothing interesting, just some code
cleanup

This commit was SVN r5501.
2005-04-25 20:53:15 +00:00
George Bosilca
24eef59f7b Checkpoint ... I have to move to another cluster.
This commit was SVN r5499.
2005-04-24 23:42:47 +00:00
George Bosilca
dbac158804 Look like a lot of changes but the logic is exactly the same. I mostly clean-up
the whole datatype code, make it a little bit more readable and add some
additional checks for correctness. In same time I move some internal structures
from the external .h include to the internal one.

The ddt_test.c get one more datatype to test. This one look like those used
in the BLACS test code.

This commit was SVN r5498.
2005-04-24 20:08:45 +00:00
Tim Woodall
3c6fb03e9c - dont create shared memory pool if it won't be used
This commit was SVN r5494.
2005-04-22 16:18:41 +00:00
George Bosilca
88ca070e88 2 less compilation warnings.
This commit was SVN r5492.
2005-04-22 05:04:52 +00:00
George Bosilca
362b3ab3fa One less uninitialized variable.
This commit was SVN r5491.
2005-04-22 04:54:00 +00:00
George Bosilca
54d58df8d5 Adapt to the new prototypes in the datatype.h
This commit was SVN r5490.
2005-04-22 04:53:29 +00:00
George Bosilca
3f89dc79e2 Protect the mutex declarations. They should be undeclared except if the threads are enabled.
Additionally it remove some compilation warnings.

This commit was SVN r5489.
2005-04-22 04:52:40 +00:00
George Bosilca
e3c2da5db6 Cleanup the code and add some more output.
Correct a bug where the displacement was incorrectly computed in some cases.

This commit was SVN r5488.
2005-04-22 04:51:15 +00:00
George Bosilca
d6b1d9805f Correct a unpack bug related to the size of the unpacked object.
This commit was SVN r5487.
2005-04-22 04:49:57 +00:00
George Bosilca
9b090e7e71 One more test (struct containing a char followed by a double).
Modification to match the new prototypes and remove all warnings.

This commit was SVN r5486.
2005-04-22 04:48:53 +00:00
George Bosilca
72502a5d17 Match the new prototypes.
Some cleanups.

This commit was SVN r5485.
2005-04-22 04:47:55 +00:00
George Bosilca
6f92675c56 Get rid of dt_desc_t and use exclusively ompi_datatype_t.
Correctly place the const on the functions prototypes and adapt all file to match the new prototypes.

This commit was SVN r5484.
2005-04-22 04:46:01 +00:00
George Bosilca
67c905e76d Add a dump function for the convertor.
This commit was SVN r5483.
2005-04-22 04:43:28 +00:00
Brian Barrett
1ac858dcc9 * add information on whether progress or mpi threads are in the build for
ompi_info

This commit was SVN r5480.
2005-04-21 15:28:03 +00:00
Brian Barrett
591b8eebda * Default to only progressing the event library every 100 calls to
ompi_progress, unless someone is actually using it (MPI-2 dynamic,
  TCP PTL).  This is only for end of MPI_Init to start of MPI_Finalize. 
  All other times, the event library will be progressed every call
  into ompi_progress().

This commit was SVN r5479.
2005-04-21 14:58:25 +00:00
Tim Woodall
ee52046876 make sure setup is complete before waiting on child
This commit was SVN r5476.
2005-04-20 15:59:22 +00:00
Tim Woodall
d885155bae comments
This commit was SVN r5475.
2005-04-20 15:57:06 +00:00
Tim Woodall
3a5b6b654e - dont register for sigchild callbacks until setup is complete
- flush io output on sigchild

This commit was SVN r5474.
2005-04-20 15:44:57 +00:00
Tim Woodall
99ca522d39 debug option to not execute ompid - just print out command line - to enable
debug of the deamon

This commit was SVN r5473.
2005-04-20 15:43:55 +00:00
Tim Woodall
cf7a1c7631 resolve a couple of locking/progress issues
This commit was SVN r5472.
2005-04-20 15:43:00 +00:00
Tim Woodall
7713012d92 change file name so that it's not removed on cleanup
This commit was SVN r5471.
2005-04-20 15:42:20 +00:00
Tim Woodall
cc6eae3eb9 resolve locking issue
This commit was SVN r5470.
2005-04-20 15:41:59 +00:00
Tim Woodall
445d20ccab remove unlock - this is done externally
This commit was SVN r5469.
2005-04-20 14:19:14 +00:00
Jeff Squyres
e541350b9a Oops -- meant to *delete* the CCASFLAGS, not comment them out...
This commit was SVN r5468.
2005-04-20 14:18:09 +00:00
Jeff Squyres
266344210f Rename src/util/libutil.la -> src/util/libompiutil.la because Libtool
was getting confused with the recent addition of "-lutil" to get the
system-level libutil (i.e., Libtool was confusing the two and doing
Bad Things).

This commit was SVN r5467.
2005-04-20 14:14:07 +00:00
Jeff Squyres
6bbdf75df0 Plug a few memory leaks in order to test new libtool improvements from
Ralf W (core libtool developer).  There are still a few more to plug
in ompi_info (mainly concerned with shutting down OMPI/ORTE
subsystems), but they can wait...

This commit was SVN r5466.
2005-04-20 13:16:32 +00:00
Jeff Squyres
191f8b2781 Move the declaration up into the C_DECLS protected area (needed for
ompi_info). 

This commit was SVN r5465.
2005-04-20 13:15:41 +00:00
Brian Barrett
776d3c02c9 * mention to the friendly compiler that we are going to stomp on ebx and it
should probably deal with that fact already.

This commit was SVN r5463.
2005-04-20 03:30:12 +00:00
Tim Woodall
469672a361 build array of progress functions for only those ptls used
This commit was SVN r5459.
2005-04-19 22:46:11 +00:00
Tim Woodall
6a3099a1d0 correction
This commit was SVN r5458.
2005-04-19 22:01:18 +00:00
Tim Woodall
5d26f93149 redefine the macros to be empty if threading is not enabled
This commit was SVN r5457.
2005-04-19 20:50:44 +00:00
George Bosilca
29ab011aba Check with the standard and correct the behaviour of this function.
This commit was SVN r5450.
2005-04-19 15:32:30 +00:00
Josh Hursey
c5338de1ac Add a couple of file for the Windows build.
This commit was SVN r5441.
2005-04-19 14:20:00 +00:00
Thara Angskun
6569d5d569 check for POE support
This commit was SVN r5438.
2005-04-19 06:39:27 +00:00
George Bosilca
ad861063c8 I was thinking that void* is the most generic type of object pointer. And I was wrong as the type void* is
signed.

This commit was SVN r5437.
2005-04-19 06:03:53 +00:00
George Bosilca
3e54fd5fc5 Explicit cast to avoid compiler warnings.
This commit was SVN r5436.
2005-04-19 05:54:49 +00:00
George Bosilca
fa24b8332f By default data_map should be set to false.
This commit was SVN r5435.
2005-04-19 05:45:25 +00:00
George Bosilca
6a5130e2c8 Give a value to the jobid variable.
This commit was SVN r5434.
2005-04-19 05:44:53 +00:00
Josh Hursey
d1b12501fb usleep is not defined in Windows land.
sleep is the equlivant command where:
sleep(10) sleeps for 10 milliseconds.

This commit was SVN r5433.
2005-04-19 04:38:48 +00:00
Josh Hursey
fbf1ec6f2e Inching towards a compiling Windows build.
Some cleanup changes...

This commit was SVN r5432.
2005-04-19 04:24:13 +00:00
Brian Barrett
0964152893 clean up the OMPI_BUILDING #define. Rather than being defined to 1 if
we are part of the source tree and not defined otherwise, we are going
with an always defined if ompi_config.h is included policy.  If
ompi_config.h is included before mpi.h or before OMPI_BUILDING is set,
it will set OMPI_BUILDING to 1 and enable all the internal code that
is in ompi_config_bottom.h.  Otherwise, it will only include the
system configuration data (enough for defining the C and C++ interfaces
to MPI, but not perturbing the user environment).

This should fix the problems with bool and the like that the Eclipse
folks were seeing.  It also cleans up some build system hacks that
we had along the way.

Also, don't use int64_t as the default size of MPI_Offset, because it
requires us including stdint.h in mpi.h, which is something we really
shouldn't be doing.

And finally, fix a ROMIO Makefile that didn't set -DOMPI_BUILDING=1,
as ROMIO includes mpi.h, but not ompi_config.h

This commit was SVN r5430.
2005-04-19 03:51:20 +00:00
Josh Hursey
80af0b41c7 Fix include...
This commit was SVN r5429.
2005-04-19 03:46:27 +00:00
Jeff Squyres
f2806bdb03 Change a bunch of error handlers to invoke on a relevant communicator,
not MPI_COMM_WORLD

This commit was SVN r5427.
2005-04-19 00:45:29 +00:00
Josh Hursey
02fcede066 Fixed some includes.
Comment out pipes stuff for windows. need to come back and fix this properly in the future.
-his line, and those below, will be ignored--

M    iof_base_setup.c

This commit was SVN r5424.
2005-04-18 21:17:56 +00:00
Tim Woodall
f4c25492f3 - correct thread locking
- allocate enough memory to hold fifo array

This commit was SVN r5423.
2005-04-18 21:13:30 +00:00
Jeff Squyres
d4deac7e10 Fix a mistaken test / assignment
This commit was SVN r5421.
2005-04-18 19:33:50 +00:00
Brian Barrett
63bd314a0b * Update ASM tests to do more thread testing (which should help find bugs)
* Update cmpset test to call memory barrier when needed before checking the
  results
* remove unneeded sync from cmpset_32 on Power PC

This commit was SVN r5420.
2005-04-18 19:33:23 +00:00
Edgar Gabriel
0a77b54535 fix two bugs for dynamic process management, which makes the ibm spawn test pass:
- argv[0] should be the name of the executable for the spawned processes.
 - if we free a dynamic communicator (instead of disconnecting),
   the counter for dynamic communicators has to be decreased as well,
   else we core in finalize. 

This commit was SVN r5419.
2005-04-18 18:57:24 +00:00
Jeff Squyres
6fbf9c29f2 Fix minor typos
This commit was SVN r5414.
2005-04-18 14:21:29 +00:00
Brian Barrett
5b12a357c7 * make the PTY setup code conform to the prefix rules. Duh.
This commit was SVN r5410.
2005-04-17 17:50:39 +00:00
Jeff Squyres
3ea253bbb2 Updates to orterun:
- Print error messages with the basename(argv[0]) rather than
  hard-coded argv[0] so that you can see an error message beginning
  with "mpirun" when you run mpirun, etc.
- For all processes that died due to a signal:
  - If the signal was not SIGKILL, display the first N of them (where
    N defaults to 1)
  - If more than N processes died due to a non-SIGKILL signal, print
    "And X more processes aborted..." kind of message
  - Add --aborted command line parameter to change the default value
    of N
  - Also print out the total number of processes that died due to
    SIGKILL, with a disclaimer that it's impossible to know if we
    killed them or someone else killed them

This commit was SVN r5406.
2005-04-15 21:52:58 +00:00
George Bosilca
ac987a5533 Remove some compilation warnings.
This commit was SVN r5405.
2005-04-15 21:36:32 +00:00
Jeff Squyres
7ad0d67a9d After consultation with Tim: when we set the ABORTED state for a proc,
also increase the count on TERMINATED so that orterun can know when
*all* processes have died.

This commit was SVN r5404.
2005-04-15 21:35:17 +00:00
Jeff Squyres
f9ef7d4657 Make the pls's clean up the session directory of each process that dies.
This commit was SVN r5403.
2005-04-15 21:34:07 +00:00
Brian Barrett
3a55cffdcd * add missing files from pty commit
This commit was SVN r5402.
2005-04-15 21:23:25 +00:00
George Bosilca
c6e97182cf Dont go back from the progress if there are still events.
This commit was SVN r5401.
2005-04-15 21:22:46 +00:00
Brian Barrett
e3587652b7 * Add support for using ptys for stdout when doing I/O forwarding. This is
enough to make use applications be line buffered instead of block buffered,
  which makes output come much faster :)

This commit was SVN r5400.
2005-04-15 21:18:20 +00:00
George Bosilca
fe70c2ceed Correctly compute the displacement in the stack.
This commit was SVN r5399.
2005-04-15 20:48:51 +00:00
George Bosilca
d7a66701a5 As this function can be used with datatype without an optimized representation compute the count by
taking in account the size of he predefined datatype.

This commit was SVN r5398.
2005-04-15 20:47:56 +00:00
Graham Fagg
8be3558757 missed const include
This commit was SVN r5394.
2005-04-15 17:40:41 +00:00
Tim Woodall
a8e13d9636 correct return value
This commit was SVN r5393.
2005-04-15 17:19:01 +00:00
Tim Woodall
a831729d6f split close into finalize/close so that rmgr can finalize all
sub-components prior to entering close. moved pls logic to
wait on children from close to finalize.

This commit was SVN r5392.
2005-04-15 17:04:57 +00:00
Jeff Squyres
1581a8feb4 If MPI_INIT hasn't been called or MPI_FINALIZE has been called, then
just abort().

This commit was SVN r5391.
2005-04-15 16:38:44 +00:00
Edgar Gabriel
78c00921f6 saving the current status before leaving to the airport
This commit was SVN r5390.
2005-04-15 16:20:30 +00:00
George Bosilca
7fd378f5d2 If everything went just fine return OMPI_SUCCESS.
This commit was SVN r5389.
2005-04-15 16:16:44 +00:00
George Bosilca
1ae368311a Do not allow the PML to cache the GM requests by now. I'll work on that later.
This commit was SVN r5388.
2005-04-15 16:13:43 +00:00
George Bosilca
478f7a226d Just indentation :)
This commit was SVN r5387.
2005-04-15 16:12:30 +00:00