1
1

5445 Коммитов

Автор SHA1 Сообщение Дата
Jeff Squyres
f8fa8f4935 Fix a problem found by Tim Prins (patch also supplied by Tim P). From
his e-mail:

I ran into a small bug in rmaps_rr.c: map_app_by_slot which was
triggered by using multiple app contexts. Basically, if not all the
slots we allocated on a node were used by an app, we would
automatically move onto the next node. This caused a problem with
multiple app contexts when the first app takes a partial allocation of
a node, the second app would not be able to access these slots because
we had already move past the node, and the byslot routine does not
wrap back around the list.

This commit was SVN r6766.
2005-08-08 18:56:17 +00:00
Jeff Squyres
54e3865c62 Minor fix -- ensure to use the installed $libdir, not anything that
may already be in the LD_LIBRARY_PATH.

This commit was SVN r6765.
2005-08-08 18:01:44 +00:00
Jeff Squyres
32e71e5c6c Fix a problem where orterun itself would not receive MCA parameters
that were set on the command line.  This was techinically exactly the
way the code was designed, but it certainly violated the Law of Least
Astonishment (even to its designer ;-) ).  So now if you execute
something like this:

   mpirun -mca pls_rsh_debug 1 -np 4 hello

You'll see debugging output from the rsh pls component, as you would
expect (this was not previously the case -- the MCA pls_rsh_debug
parame would be set to 1 in the 4 spawned hello processes, but *not*
in the orterun process).

More specifically, MCA parameters will be set in the orterun process
in the following cases:

- The new command line switch "--gmca" (or "-gmca") is used,
  indicating that the MCA parameter is "global".  --gmca also means
  that that MCA parameter will be applied to all context app's.  For
  example:

      mpirun -gmca foo bar -np 1 hello : -np 2 goodbye

  The foo MCA param will be set in both the hello and goodbye
  processes.

- If there is only one context app.  For example:

      mpirun -mca pls_rsh_debug 1 -np 4 hello

  will set pls_rsh_debug to 1 in both the orterun process and the 4
  spawned hello processes.

Also added a few more comments inside orterun to document a somewhat
confusing use of a state variable in a recursive case.

This commit was SVN r6764.
2005-08-08 16:42:28 +00:00
Jeff Squyres
473720c22f Update the README a bit
This commit was SVN r6763.
2005-08-08 15:25:59 +00:00
Ralph Castain
3c13d699f8 Remove an old file.
This commit was SVN r6762.
2005-08-08 13:41:53 +00:00
Brian Barrett
6f0b51e4db * only include sys/ioctl.h if it exists
This commit was SVN r6761.
2005-08-08 13:15:06 +00:00
Ralph Castain
e583f6a97f Add a couple of new functions to the schema framework to check if a trigger is a "standard" trigger or not, and to extract a jobid from a standard trigger. Both functions will be used in a later commit.
Ensure that the seed set_my_name function sets all the right initial info in the name services' structures.

This commit was SVN r6760.
2005-08-07 13:26:49 +00:00
Ralph Castain
c530521a8e Add several new interface functions to the name services:
1. dump_xxx - analogous to the registry's dump commands, allows you to examine the contents of the name services' structures

2. get_job_peers - get an array of process names for all processes in the specified job

This commit was SVN r6759.
2005-08-07 13:21:52 +00:00
Ralph Castain
5208f9001d Update the gpr unit tests
This commit was SVN r6758.
2005-08-07 13:09:34 +00:00
Ralph Castain
1438009dbd Properly set the MCA parameter to indicate these functions are infrastructure so that the singleton flag does not get set.
Somehow, in changing over to the new MCA interfaces, the "set" part of that logic got lost, so the singleton flag was always being set. This should repair some of the anomalous behavior seen recently where the local host was always being used for an application process.

This commit was SVN r6757.
2005-08-07 04:17:10 +00:00
Josh Hursey
3b187c4db3 Fix the 'delete container' logic in gpr to prevent recursive delete of all
containers when one is requested.

Fix a bug in gpr_replica_del_index_api which doesn't preset num_tokens and
num_keys, but assumes they are 0.

Fix orte_ras_base_node_delete() function to operate properly to delete the
appropriate container in the 'orte-node' segment when requested.

This commit was SVN r6756.
2005-08-05 23:37:39 +00:00
George Bosilca
14ffc85379 I want to have it compiled too.
This commit was SVN r6754.
2005-08-05 18:47:12 +00:00
George Bosilca
439101dd00 Remove some compilation warnings.
This commit was SVN r6753.
2005-08-05 18:37:42 +00:00
George Bosilca
342fef77d0 Change the default name for the PML PTL from mca_pml_base_ptl_t to mca_pml_teg_ptl_t.
Just for consistency. (same thing was done by Rainer on the UNIQ PML in the revision 6749).

This commit was SVN r6752.
2005-08-05 18:26:53 +00:00
George Bosilca
f062a679d9 I miss this one in the last patch (just a function name change).
This commit was SVN r6751.
2005-08-05 18:04:52 +00:00
George Bosilca
8b93cb7661 Rename all the functions starting with mca_base_modex to mca_pml_base_modex.
Change all the places where they are used to fit the new name.

Remove the code to check the remote arch from the PML. We will have a GPR mechanism
in ompi_mpi_initialize to do that.

This commit was SVN r6750.
2005-08-05 18:03:30 +00:00
Rainer Keller
c696603fce More intrusive version of patch.
Rename components from mca_pml_base... to more correct mca_pml_uniq...

This commit was SVN r6749.
2005-08-05 17:28:19 +00:00
Jeff Squyres
7678050a2f Grumble. Add *more* missing files...
This commit was SVN r6748.
2005-08-05 14:17:14 +00:00
Rainer Keller
9dc4aa583e Minimal possible change to allow static-linking of ompi:
otherwise the mca_pml_base_ptl_t-Class is doubly defined, due
to the close similarity to teg.

This commit was SVN r6747.
2005-08-05 13:57:06 +00:00
Jeff Squyres
6faa46348f Minor fix
This commit was SVN r6746.
2005-08-05 12:33:53 +00:00
Jeff Squyres
eb7365581a Copy over from the beta branch.
This commit was SVN r6745.
2005-08-05 12:21:32 +00:00
Jeff Squyres
8ea1fec353 Add missing .h file
This commit was SVN r6744.
2005-08-05 10:30:47 +00:00
Brian Barrett
16e531e373 * fix some bad error messages to actually be useful
This commit was SVN r6741.
2005-08-04 19:28:59 +00:00
Brian Barrett
18d09c14a7 * always include sys/ioctl.h for now. Needs to be protected by header check,
but this will be good enough for Linux and OS X and I don't want to change
  configure.ac mid-day for the other platforms

This commit was SVN r6740.
2005-08-04 18:48:36 +00:00
Jeff Squyres
d0a0434172 Investigating an MCA param problem -- converted over orterun to new
MCA param API in the process.

This commit was SVN r6739.
2005-08-04 18:15:47 +00:00
Jeff Squyres
22133635e8 Move common to the top
This commit was SVN r6738.
2005-08-04 18:06:22 +00:00
Josh Hursey
12031db535 Added missing help file.
This commit was SVN r6737.
2005-08-04 17:40:22 +00:00
Brian Barrett
20d61b4599 * If rdma frag doesn't complete successfully on the receiving end, don't
call the cbfunc, since it's NULL.  The sending side will do the
  "right thing"

This commit was SVN r6735.
2005-08-04 15:45:31 +00:00
Brian Barrett
ab73cc0487 * minor diagnostic printf that should have been in last commit (doh!)
This commit was SVN r6734.
2005-08-04 15:43:50 +00:00
Brian Barrett
a80b00ab5e * Don't change size of user frag - it's not needed, and causes the frag
to never be returned to the free list

This commit was SVN r6733.
2005-08-04 15:43:13 +00:00
Brian Barrett
9cfa6bba6a * If a message isn't successfully sent, reduce the pending sends counter, as
the message is no longer pending
* Try to push out new messages whenever we finish a send, whether it
   worked or not.  Means that in the case where the other side has too
  many sends pending, we'll constantly retry one (and only one, once the
  pending number is reached) message until goodness returns
* Make some warnings only happen in verbose case, as they are mainly
  diagnostics

This commit was SVN r6732.
2005-08-04 15:41:11 +00:00
Jeff Squyres
aa9bdcfec5 Make some fixes and add some features to the rsh pls:
- convert MCA params to the new API
- some style and indenting fixes
- look at local shell, and if [new] MCA param
  pls_rsh_assume_same_shell is 1, then assume that the remote shell is
  the same as the local shell.  If pls_rsh_assume_same_shell is 0, do
  a probe to figure out what the remote shell is (NOT CURRENTLY
  IMPLEMENTED! you'll get a run-time warning if you set this MCA param
  to 0).
- if the remote shell is not csh and not bash, then prefix the remote
  command with "( ! [ -e ./.profile ] || . ./.profile;" (and suffix it
  with ")") so that we run the .profile on the remote side in order to
  set PATHs and the like.  See the LAM FAQ for details (will someday
  be on the Open MPI FAQ:
  http://www.lam-mpi.org/faq/category4.php3#question8)
- add a bunch of debugging output if the MCA param pls_rsh_debug is
  enabled (or the top-level debug MCA param is enabled)
- add more help messages (and corresponding calls to opal_show_help())
  in help-pls-rsh.txt

This commit was SVN r6731.
2005-08-04 15:09:02 +00:00
Brian Barrett
c39ba3e2da * re-enable poll_recalc so that it triggers the recalc of the signals
properly.  This fixes the random hangs that we were seeing this morning
  on Linux that were a result of fixing the thread deadlock yesterday.
  (worked great on my OS X box, which uses select() instead of poll()).

This commit was SVN r6730.
2005-08-04 14:05:46 +00:00
Jeff Squyres
b2cfedf805 Add copyright headers, ompi_config.h, and stdio.h.
This commit was SVN r6729.
2005-08-04 12:06:08 +00:00
Jeff Squyres
49dfd6a527 Add doxy docs for a function that was missing them.
This commit was SVN r6728.
2005-08-03 21:14:40 +00:00
Jeff Squyres
ae7d85b6ca Fix a corner case -- if the MCA param had already been registered, be
sure to look it up if NULL != current_value.

This commit was SVN r6727.
2005-08-03 20:59:44 +00:00
Tim Prins
2d707f34a0 - make persistent daemons work with bproc
- added documentation
 - code cleanups

This commit was SVN r6726.
2005-08-03 20:24:52 +00:00
Brian Barrett
26adbfe713 checkpoint to move back to RS
* remove dead code
* add some debugging code

This commit was SVN r6725.
2005-08-03 20:21:23 +00:00
Brian Barrett
6c37ad4471 * more components to ignore on RS
* fix comment

This commit was SVN r6724.
2005-08-03 16:08:27 +00:00
Brian Barrett
67f96f7b46 * convert to new param registration code
* Fix RDMA book keeping

This commit was SVN r6723.
2005-08-03 16:02:02 +00:00
Brian Barrett
3867d59cf1 * Look at stdout instead of stdin for determining whether output is going
to a tty or not.  Now you can do something like:

     ompi_info -all | grep btl_portals

  and get the full line for each btl_portals parameter.
* For the case where stdout is a tty, we have my current nomination for
  Today's Useless OMPI Feature.  Autodetect the width of the terminal, so
  people with really wide terminals will get less wrapping

This commit was SVN r6722.
2005-08-03 14:48:48 +00:00
Jeff Squyres
11140e9cb8 We must eliminate and stamp out all forms of redundancy, however they
may appear.

(remove *error.h file from Makefile.am -- a cut-n-paste error that has
propagated to a surprising number of directories ;-) )

This commit was SVN r6721.
2005-08-03 14:47:04 +00:00
Brian Barrett
124850d1c5 * fix deadlock in signal add code. Should not call the event_loop from
a function that has the lock.  Instead, recalculate the pending
  events directly.  Same effect, but less deadlock.

This commit was SVN r6720.
2005-08-03 14:10:55 +00:00
Tim Prins
aa0525da98 Improvements in bproc support:
- we now properly support multiple application contexts
 - much improved error messages, using opal_show_help
 - fix some small bugs in the way the processes were discovering their names
 - better searching for orted
 - use the new mca parameter interface

These changes still need some testing, but they seem stable.

This commit was SVN r6719.
2005-08-02 22:22:55 +00:00
Brian Barrett
44ae604451 * Don't let -lsysfs propogate outside of the OpenIB components. Even if
OpenIB wasn't found, it was getting put in LIBS and causing issues on
  some platforms...

This commit was SVN r6718.
2005-08-02 20:25:42 +00:00
Rainer Keller
3aedbcb0b6 Aah, someone committed to a m4-file.
Take the chance to fix this typo...

This commit was SVN r6717.
2005-08-02 20:21:55 +00:00
Brian Barrett
eb2748130b * don't build TCP component if we don't have IP sockets :)
This commit was SVN r6716.
2005-08-02 20:06:34 +00:00
Rainer Keller
f5f96f424d - Use for-loop
- Minor heck for stride != 1
- indenting

This commit was SVN r6715.
2005-08-02 19:56:18 +00:00
Brian Barrett
e84a02d79c Allow the C and C++ compilers to support different methods of inline
assembly (including case where one supports inline assembly and the
other doesn't).

This commit was SVN r6714.
2005-08-02 19:51:57 +00:00
Ralph Castain
4e1837687b Finish simplified interfaces for put and subscribe - more details to come.
This commit was SVN r6713.
2005-08-02 19:43:29 +00:00