8663 Commits

Author SHA1 Message Date
Ralph Castain
f95e20e2e1 Add another test program - an MPI app that just spins. This supports testing of system response to signal-terminated processes.
Add some debugger output to the ODLS default component.

Modify the orted command communication system so that it is done via non-blocking sends. This removes the linearity of the transmission and improves the response time.

This commit was SVN r12585.
2006-11-13 21:51:34 +00:00
Andrew Friedley
a4bdcb4faa Fix a segfault that turned up in more MPI_THREAD_MULTIPLE testing.
Same sort of problem and fix as described in r12323 - mca_pml_ob1_recv_frag_progress() was segfaulting due to a NULL req_proc pointer.  The path leading to this was through the mca_pml_ob1_check_cantmatch_for_match() function, where we can match a frag using the same macros as mca_pml_ob1_frag_match() and never initialize the req_proc pointer.

This commit was SVN r12582.

The following SVN revision numbers were found above:
  r12323 --> open-mpi/ompi@c752502dee
2006-11-13 20:12:51 +00:00
Tim Prins
39bc652899 Refs trac:612
Make it so if -np was not passed and -pernode was, we map bynode

This commit was SVN r12580.

The following Trac tickets were found above:
  Ticket 612 --> https://svn.open-mpi.org/trac/ompi/ticket/612
2006-11-13 19:13:21 +00:00
Ralph Castain
4636125e2d Modify the RMGR components to allow job setup with a given jobid, and add another attribute so that we can setup triggers without launching.
Add some debugging output to the ODLS default module, and the orted.

Remove the nodename data from the ODLS info report - that info is already stored in the registry by the RMAPS framework upon completing the mapping procedure.

Add another test program that does an ORTE-only dynamic spawn (gasp!). Looks just like comm_spawn - just no MPI involved.

Modify the ODLS to release the processor when we "kill" local procs in a more scalable fashion. It previously had a sleep in it that Jeff's prior commit removed. However, he introduced some Windows code into the non-Windows component (protected by "if"s, but unnecessary). This is a more general solution he proposed - included here so I could get things to compile properly.

This commit was SVN r12579.
2006-11-13 18:51:18 +00:00
Jeff Squyres
bfdf801487 Replace a "sleep(1)" with a yield so that the orted can reap processes
much faster.

This commit was SVN r12575.
2006-11-13 12:45:03 +00:00
Jeff Squyres
a82dfac73d Fix typo.
This commit was SVN r12574.
2006-11-13 12:42:06 +00:00
Jeff Squyres
90b6dc8744 Make the attribution for the ompi_ddt_duplicate() fix a little more
specific, because it must have been a beast to track down.  :-)

This commit was SVN r12572.
2006-11-13 12:33:54 +00:00
Jeff Squyres
6ce032b8e9 * Sync with 1.1 NEWS
* Add note about TV message queue debugging

This commit was SVN r12570.
2006-11-13 12:30:23 +00:00
Tim Mattox
669c8ff5b9 Made the leak fix thank you more specific.
This commit was SVN r12568.
2006-11-13 12:26:08 +00:00
Tim Mattox
d00844ea5e Added a thank-you to the NEWS file for a memory leak fix.
This commit was SVN r12567.
2006-11-13 12:17:46 +00:00
George Bosilca
b621069638 Don't copy the reference count when we duplicate a datatype. Thanks
to Andreas Schafer for the fix.

This commit was SVN r12566.
2006-11-13 05:45:29 +00:00
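The fix in b621069638 can be illustrated with a minimal sketch. It assumes a hypothetical `Datatype` struct with an intrusive reference count; the struct, its fields, and `duplicate()` are illustrative names, not the actual `ompi_ddt_duplicate()` code. The point is only that a duplicate starts with its own fresh count instead of inheriting the source's.

```cpp
#include <cstddef>

// Hypothetical reference-counted descriptor (not the real ompi datatype).
struct Datatype {
    int refcount;        // number of holders of this object
    std::size_t size;    // payload description, copied on duplicate

    explicit Datatype(std::size_t s) : refcount(1), size(s) {}
};

// Duplicate the description but NOT the reference count: the new object
// has exactly one holder (the caller), however many the source has.
Datatype duplicate(const Datatype& src) {
    Datatype copy = src;   // copies every field, including refcount
    copy.refcount = 1;     // reset so the copy is released correctly
    return copy;
}
```

If the count were copied verbatim, the duplicate would need as many releases as the source had holders before it could be freed, which is presumably how the memory leak mentioned in the NEWS commits arose.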
Ralph Castain
8b6921f297 Initialize the rank variable before it is used.
This commit was SVN r12565.
2006-11-13 02:37:12 +00:00
Tim Mattox
30b2780b03 Update README file for SC06 1.2 beta
This commit was SVN r12563.
2006-11-12 16:31:06 +00:00
George Bosilca
2b9f6613c9 Rewrite the ompi_ddt_sndrcv function.
This commit was SVN r12561.
2006-11-12 05:56:02 +00:00
George Bosilca
2ee30fd468 This wasn't the correct way to write the if statement. What I want is
that if one of them is zero then the if will get executed.

This commit was SVN r12560.
2006-11-12 05:27:27 +00:00
Gleb Natapov
9933a6f469 The previous fix doesn't cover the case where the opcode is changed in the put/get functions.
The fix is to set the opcode to SEND at the entrance to the send function, before
checking credits and putting the fragment on the pending list. We do the same thing
in the put/get functions, i.e., we set the opcode at the entrance to the function.

This commit was SVN r12559.
2006-11-11 07:51:06 +00:00
Ralph Castain
4e50cdae52 This commit accomplishes two things:
1. Fix the "hang" condition when an application isn't found. It turned out that the ODLS had some difficulty with the process actually not having been started - hence, it never called the waitpid callback. As a result, the "terminated" trigger didn't fire, and so mpirun didn't wake up. With this change, the HNP's errmgr forces the issue by causing the trigger to fire itself when an abort condition occurs.

2. Shift the recording of the pid and the nodename from mpi_init to the orted launcher. This allows programs such as Eclipse PTP to get the pids even for non-MPI applications. In the case of bproc, the pls handles this chore since we don't use orteds in that system.

This commit was SVN r12558.
2006-11-11 04:03:45 +00:00
Tim Mattox
37b72d03cd Updated the autotools check to handle the libtool 2.1a version string.
This commit was SVN r12557.
2006-11-11 02:42:12 +00:00
Tim Mattox
40834fd0c8 Updated the SC 2006 Beta notes in the README
This commit was SVN r12556.
2006-11-11 02:37:29 +00:00
George Bosilca
ec410644ce Implement the send receive as 2 non-blocking operations. That will help us
avoid too many calls to opal_progress.

This commit was SVN r12553.
2006-11-10 23:06:19 +00:00
George Bosilca
c2c6a1b37e Correctly compute the number of elements in a segment.
For broadcast send the correct size for all intermediary nodes.

This commit was SVN r12552.
2006-11-10 23:04:50 +00:00
George Bosilca
7102147b9f Correctly detect when the specified algorithm is out of range. In
this case we reset it to zero.

This commit was SVN r12551.
2006-11-10 21:47:07 +00:00
George Bosilca
bfbd0e61f6 Minimize the number of lines of code :)
This commit was SVN r12550.
2006-11-10 20:56:08 +00:00
George Bosilca
a38cd366d7 Construct the convertor. It's not really required, but it's not in the
critical path anyway. At least in debug mode we get useful information about
where the convertor was created.

This commit was SVN r12549.
2006-11-10 20:55:06 +00:00
George Bosilca
858ab24e8e The req_mtl field has to be the last in the struct or bad things happen.
This commit was SVN r12548.
2006-11-10 20:53:41 +00:00
George Bosilca
af68171253 Use the macro to compute the number of elements in a segment in both
bcast and reduce and update the default values for the variables
as required by the comment in the coll_tuned.h file.

This commit was SVN r12546.
2006-11-10 20:04:08 +00:00
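A rough sketch of what "number of elements in a segment" means in af68171253. This is not the macro from coll_tuned.h; `compute_segcount` and its parameters are hypothetical, and the rule shown (segment size in bytes divided by element size, capped at the total count) is an assumption made for illustration.

```cpp
#include <cstddef>

// Hypothetical helper: how many elements of size type_size fit into one
// segment of segsize_bytes, given a message of count elements.
std::size_t compute_segcount(std::size_t segsize_bytes,
                             std::size_t type_size,
                             std::size_t count) {
    // No usable segment size: treat the whole message as one segment.
    if (type_size == 0 || segsize_bytes < type_size) {
        return count;
    }
    std::size_t segcount = segsize_bytes / type_size;
    // A segment never holds more elements than the message contains.
    return segcount < count ? segcount : count;
}
```

Using one helper in both bcast and reduce keeps the two collectives from disagreeing about segment boundaries, which appears to be the motivation for using a shared macro.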
George Bosilca
476b922074 Updates & upgrades:
- consistent argument checking (an algorithm which is not available
  cannot be selected)
- consistent way of computing the segcount (number of datatypes per segment).
- small cleanups.
- more informative debugging messages.

This commit was SVN r12545.
2006-11-10 19:54:09 +00:00
Dan Lacher
938e7cd8d9 Switch solaris packages over to being built dynamically instead of from
static prototype files.

Submitted by: Dan Lacher
Reviewed by: Rolf Vandevaart

This commit was SVN r12544.
2006-11-10 18:42:50 +00:00
Jeff Squyres
f04ec1bda6 * Add blurb at the top about copyrights
* Remove extra character from LANL copyright, word-wrap and punctuate.

This commit was SVN r12542.
2006-11-10 17:54:28 +00:00
Jeff Squyres
e124857353 Various updates.
This commit was SVN r12540.
2006-11-10 17:41:42 +00:00
Jeff Squyres
5419219041 * Move one more item down into 1.2
* Add future bullet about threaded openib btl progress

This commit was SVN r12539.
2006-11-10 17:28:29 +00:00
Jeff Squyres
113fc131ef - Remove some pending items
- Sync with 1.1.3 NEWS
- Add udapl bullet
- Add MPI_STATUS_SET_ELEMENTS bullet

This commit was SVN r12538.
2006-11-10 17:07:51 +00:00
Gleb Natapov
7e03b83d23 Reset opcode field to SEND. It is checked later in pending progress function.
This commit was SVN r12531.
2006-11-10 06:17:00 +00:00
George Bosilca
77ef979457 New architecture for broadcast. A generic broadcast working on a tree
description. Most of the bcast algorithms can be completed using this
generic function once we create the tree structure. Add all kinds of
trees.

There are 2 versions of the generic bcast function. One overlaps the
receives (for intermediary nodes) and then does blocking sends to all
children; in the other, all sends are non-blocking. I still have to
figure out which one gives the smallest overhead.

This commit was SVN r12530.
2006-11-10 05:53:50 +00:00
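The idea in 77ef979457 of a broadcast driven by a tree description can be sketched as follows. This is an illustration only: `TreeNode`, `build_kary_node`, and `delivery_order` are hypothetical names, and the k-ary layout rooted at rank 0 is one common choice, not necessarily any of the trees the commit adds. `fanout` is assumed to be at least 1.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical per-rank view of the tree: who sends to me, whom I send to.
struct TreeNode {
    int parent;                 // -1 for the root
    std::vector<int> children;  // ranks this node forwards the data to
};

// Describe rank's position in a k-ary tree over ranks 0..size-1,
// rooted at rank 0. fanout must be >= 1.
TreeNode build_kary_node(int rank, int size, int fanout) {
    TreeNode node;
    node.parent = (rank == 0) ? -1 : (rank - 1) / fanout;
    for (int i = 1; i <= fanout; ++i) {
        int child = rank * fanout + i;
        if (child < size) {
            node.children.push_back(child);
        }
    }
    return node;
}

// Simulate a generic broadcast: starting at the root, every rank that has
// the data forwards it to its children. Returns the order of delivery.
std::vector<int> delivery_order(int size, int fanout) {
    std::vector<int> order;
    if (size <= 0) {
        return order;
    }
    order.push_back(0);
    for (std::size_t i = 0; i < order.size(); ++i) {
        TreeNode node = build_kary_node(order[i], size, fanout);
        for (std::size_t c = 0; c < node.children.size(); ++c) {
            order.push_back(node.children[c]);
        }
    }
    return order;
}
```

Because the forwarding loop only looks at `parent` and `children`, swapping in a different tree builder (a chain is simply `fanout` of 1) changes the algorithm without touching the broadcast logic.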
George Bosilca
56748d5f57 Correctly initialize the unpack buffer.
This commit was SVN r12529.
2006-11-10 05:11:02 +00:00
George Bosilca
17405cd9c6 A temporary fix, until we figure out a better approach. The problem
is that if one adds "pml=" to the configuration file, really bad things
happen. All PMLs will get initialized, and each of them will initialize
all BTLs. This patch forces mca_pml_base_pml to get initialized in
all cases before we leave the mca_pml_base_open function.

This commit was SVN r12527.
2006-11-10 04:53:00 +00:00
George Bosilca
ab1655079d Turn off the CONVERTOR_NO_OP once we reach this point.
This commit was SVN r12526.
2006-11-09 23:56:31 +00:00
Jeff Squyres
8a08b092f6 Check to see if we need to do anything. If we don't (i.e., if all the
rcounts are 0), then just return MPI_SUCCESS.

This commit was SVN r12525.
2006-11-09 23:21:34 +00:00
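The early exit described in 8a08b092f6 amounts to a guard at the top of the function. A minimal sketch, with hypothetical names (`SKETCH_SUCCESS` stands in for MPI_SUCCESS, and `collective_sketch` is not a real Open MPI function):

```cpp
#include <cstddef>
#include <vector>

const int SKETCH_SUCCESS = 0;  // hypothetical stand-in for MPI_SUCCESS

// True when every receive count is zero, i.e. there is nothing to do.
bool all_counts_zero(const std::vector<int>& rcounts) {
    for (std::size_t i = 0; i < rcounts.size(); ++i) {
        if (rcounts[i] != 0) {
            return false;
        }
    }
    return true;
}

// Hypothetical collective entry point. did_work reports whether the
// function got past the guard, so the early return is observable.
int collective_sketch(const std::vector<int>& rcounts, bool* did_work) {
    *did_work = false;
    if (all_counts_zero(rcounts)) {
        return SKETCH_SUCCESS;  // nothing to receive anywhere: done
    }
    // ... buffer allocation and communication would happen here ...
    *did_work = true;
    return SKETCH_SUCCESS;
}
```

Checking before any allocation or communication keeps the zero-count case cheap and avoids handing zero-length buffers to the code that follows.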
Rich Graham
1cfe963071 update license.
This commit was SVN r12523.
2006-11-09 22:02:51 +00:00
Edgar Gabriel
b3fc84f669 Yes, I did make commits to the trunk, so I should probably update the LICENSE
file as well. (The corresponding files, e.g. in the group directory, have been
updated, but I forgot to update this one.)

This commit was SVN r12522.
2006-11-09 21:59:48 +00:00
George Bosilca
4cb7910a8b And now the optimization step.
This commit was SVN r12521.
2006-11-09 20:26:51 +00:00
George Bosilca
1d80f685b5 Remove one compiler warning.
This commit was SVN r12520.
2006-11-09 20:08:43 +00:00
George Bosilca
e33d1dedab Rewrite the conditions to keep them as small as possible. Correct some
of these conditions. Optimize the flags generation for the convertor.

This commit was SVN r12518.
2006-11-09 19:33:19 +00:00
George Bosilca
da1720d1ef Indentation only.
This commit was SVN r12517.
2006-11-09 19:31:52 +00:00
George Bosilca
73eec4bfef Show the MCA parameter coll_base_verbose only if Open MPI is compiled in
debug mode. Otherwise there is no debug anyway ...

This commit was SVN r12516.
2006-11-09 19:02:32 +00:00
Jeff Squyres
0a28212392 This is a workaround for a bug in the Intel C++ compiler, version 9.1
(all versions up to and including 20060925).  The issue has been
reported to Intel, along with a small [non-MPI] test program that
reproduces the problem (the test program and the OMPI C++ bindings
work fine with Intel C++ 9.0 and many other C++ compilers).

In short, a static initializer for a global variable (i.e., its
constructor is fired before main()) that takes as an argument a
reference to a typedef'd type will simply get the wrong value in the
argument.  Specifically:

{{{
namespace MPI {
    Intracomm COMM_WORLD(MPI_COMM_WORLD);
}
}}}

The constructor for MPI::Intracomm should get the value of
&ompi_mpi_comm_world.  It does not; it seems to get a random value.

As mandated by MPI-2, annex B.13.4, for C/C++ interoperability, the
prototype for this constructor is:

{{{
class Intracomm {
public:
    Intracomm(const MPI_Comm& data);
};
}}}

Experiments with icpc 9.1/20060925 have shown that removing the
reference from the prototype makes it work (!).  After lots of
discussions about this issue with a C++ expert (Doug Gregor from IU),
we decided the following (cut-n-paste from an e-mail):

-----
> So here's my question: given that OMPI's MPI_<CLASS> types are all
> pointers, is there any legal MPI program that adheres to the above
> bindings that would fail to compile or work properly if we simply
> removed the "&" from the second binding, above?

I don't know of any way that a program could detect this change. FWIW,
the C++ committee has agreed that implementations of the C++ standard
library are allowed to decide arbitrarily between const& and by-value.
If they don't care, MPI users won't care.

When you remove the '&', I suggest also removing the "const". It is
redundant, but can trigger some strange name mangling in Sun's C++
compiler.
-----

So with this change:

 * we now work again with the Intel 9.1 compiler
 * our C++ bindings do not exactly conform to the MPI-2 spec, but
   valid/legal MPI C++ apps cannot tell the difference (i.e., the
   functionality is the same)

This commit was SVN r12514.
2006-11-09 17:34:12 +00:00
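The claim in 0a28212392 that callers cannot tell a `const&` constructor from a by-value one when the handle is a pointer can be sketched as below. All names are hypothetical stand-ins (`Fake_Comm` for `MPI_Comm`, `fake_comm_world` for `ompi_mpi_comm_world`); this does not reproduce the Intel compiler bug, it only shows that both signatures accept the same call and store the same handle.

```cpp
// Hypothetical stand-ins: the C handle type is a pointer to an internal
// struct, as the commit message says is the case for Open MPI.
struct fake_comm_t { int id; };
typedef fake_comm_t* Fake_Comm;

static fake_comm_t fake_comm_world = { 42 };

// Constructor in the shape the MPI-2 binding prescribes: const reference.
class ByRef {
public:
    ByRef(const Fake_Comm& data) : comm_(data) {}
    Fake_Comm get() const { return comm_; }
private:
    Fake_Comm comm_;
};

// Constructor in the shape of the workaround: by value, no const.
class ByValue {
public:
    ByValue(Fake_Comm data) : comm_(data) {}
    Fake_Comm get() const { return comm_; }
private:
    Fake_Comm comm_;
};

// Global objects with static initializers, constructed before main(),
// mirroring the COMM_WORLD definition quoted in the commit message.
ByRef   world_ref(&fake_comm_world);
ByValue world_val(&fake_comm_world);

// Both variants should hold the address of fake_comm_world.
bool handles_match() {
    return world_ref.get() == &fake_comm_world
        && world_val.get() == &fake_comm_world;
}
```

Copying a pointer costs the same as binding a reference to it, so the by-value form gives up nothing in exchange for sidestepping the miscompiled initializer.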
George Bosilca
a14ff905f8 Update the convertor.c file. This commit was supposed to go together with
r12486 but somehow I missed it.
Update the pack and unpack functions for contiguous datatypes to minimize their
impact on performance. Keep them as condensed as possible.

This commit was SVN r12513.

The following SVN revision numbers were found above:
  r12486 --> open-mpi/ompi@8746369338
2006-11-09 17:26:55 +00:00
Sven Stork
9ba4c4a7ee - Add support for "sh" handling. Instead of detecting bash we now
  check for Bourne shell, because Bourne shell is the lowest
  common denominator for bash/ksh/sh.
- Make some shell expressions sh compatible

This commit was SVN r12509.
2006-11-09 10:16:45 +00:00
George Bosilca
a82ce427e4 Update the number of reduce algorithms available.
This commit was SVN r12503.
2006-11-08 22:20:34 +00:00
George Bosilca
0dcf0097db They are supposed to be ints not size_t.
This commit was SVN r12497.
2006-11-08 17:05:15 +00:00