Commit graph

5566 commits

Author SHA1 Message Date
Galen Shipman
4556f5bb7c Fix for multiple calls to add_procs
This commit was SVN r7111.
2005-08-31 19:44:28 +00:00
Galen Shipman
09873f299f Fixed a race in connection establishment..
This commit was SVN r7110.
2005-08-31 19:43:22 +00:00
David Daniel
c6054662d5 Forgot to add new header to sources
This commit was SVN r7109.
2005-08-31 16:21:58 +00:00
David Daniel
a5eff8fc78 A little more clean-up. TotalView now works with --enable-debug build.
Tested with:
pls = rsh
totalview.6.6.0-2
Linux cadillac82.ccstar.lanl.gov 2.4.24 #1 SMP Thu Jul 1 15:28:04 MDT
2004 i686 i686 i386 GNU/Linux

This commit was SVN r7108.
2005-08-31 16:15:59 +00:00
Jeff Squyres
284328afe3 Add missing .h file so that it is included in the tarball
This commit was SVN r7107.
2005-08-31 11:01:28 +00:00
Jeff Squyres
b4e2c96e22 - Adjust include paths for ltdl.h
- Properly set CLEANFILES to remove F90 kruft

This commit was SVN r7106.
2005-08-31 10:56:01 +00:00
Rainer Keller
27f1174d0e - Only return the nodes actually allocated to the job.
(necessary when orted handles several jobs simultaneously).

This commit was SVN r7105.
2005-08-31 07:09:47 +00:00
Jeff Squyres
0fddc53603 If you're going to make a fix by unescaping something, then unescape
*everything*.  Oops.  :-(

This commit was SVN r7104.
2005-08-31 03:01:06 +00:00
Graham Fagg
1caec16018 snapshot as I move machines
do not use

This commit was SVN r7103.
2005-08-31 01:43:48 +00:00
George Bosilca
d64a702a5b There is a missing header. --enable-picky helps to track down this kind of error.
This commit was SVN r7102.
2005-08-31 00:47:52 +00:00
David Daniel
995641c1e6 Don't initialize proctable more than once (since the stage gate 1 trigger
seems to get fired at least twice).

This commit was SVN r7101.
2005-08-31 00:21:55 +00:00
Tim Woodall
dc0f3eef63 more corrections
This commit was SVN r7100.
2005-08-30 21:26:24 +00:00
Tim Woodall
f742e4901f we always acquire/release this lock - support threaded users
of the library

This commit was SVN r7099.
2005-08-30 21:17:33 +00:00
Tim Woodall
47b41c7081 correct regression
This commit was SVN r7098.
2005-08-30 19:38:56 +00:00
David Daniel
a5d9199e7f Adding a simple hook for TotalView that is activated if a particular MCA
parameter is set.

orterun/MPI integration still not quite working.

This commit was SVN r7097.
2005-08-30 17:34:23 +00:00
David Daniel
ced11250e4 Basic totalview support for orterun. Close to working, but need to
check hostnames are obtained correctly.

This commit was SVN r7096.
2005-08-30 17:29:43 +00:00
Galen Shipman
00e0ff729d initialize free list to rr_buf_max, report async errors to user.
This commit was SVN r7095.
2005-08-30 16:44:38 +00:00
Jeff Squyres
dd96f06470 We now also support POE
This commit was SVN r7094.
2005-08-30 16:15:01 +00:00
George Bosilca
53ccf0e58c POE is working. It can spawn jobs, redirect the output and is able to kill the job (with or without CTRL_C).
This commit was SVN r7093.
2005-08-30 16:13:55 +00:00
David Daniel
6cb97e6ade Reverting totalview support to *not* use the as yet unimplemented
orte_jobgrp_t.  Now just need to work out where to call it...

This commit was SVN r7092.
2005-08-30 12:59:04 +00:00
Jeff Squyres
81c9e7a582 Fix nested quoting
This commit was SVN r7091.
2005-08-30 10:44:55 +00:00
Jeff Squyres
774549d640 Add missing header file
This commit was SVN r7090.
2005-08-30 10:41:25 +00:00
Rainer Keller
d7901c97a5 - Del whitespaces, to make coming patch smaller.
This commit was SVN r7089.
2005-08-30 06:58:37 +00:00
Brian Barrett
77ebdf1c6f * Add some debugging output Ralph asked for when an unknown error code is
passed to opal_error

This commit was SVN r7087.
2005-08-29 23:36:53 +00:00
Brian Barrett
d8e5d80892 * add a reasonable first wack at a suppressions file for Valgrind to ignore
some stuff that we can't do anything about
* fix some more memory leaks in session_dir code

This commit was SVN r7086.
2005-08-29 23:05:52 +00:00
Brian Barrett
bf8a3632bb * bunch more memory leak / block in use fixes
This commit was SVN r7085.
2005-08-29 21:35:01 +00:00
Jeff Squyres
10488b717a More 0 count fixes.
This commit was SVN r7083.
2005-08-29 19:55:48 +00:00
Jeff Squyres
0ef7ced8f0 For items with 0 receive counts, don't try to receive.
This commit was SVN r7082.
2005-08-29 19:37:07 +00:00
Brian Barrett
07b589100e * add test for init_finalize of orte (useful for memory leak checks)
* update ORTE tests to cope with change in prototype for orte_init()

This commit was SVN r7081.
2005-08-29 19:32:46 +00:00
Brian Barrett
660ce0a486 * update tests to reflect moving path_sep out of orte_sys_info and moving
orte_sys_info out of OPAL and into ORTE

This commit was SVN r7080.
2005-08-29 19:23:25 +00:00
Tim Woodall
d34e299829 correctly decrement progress_event if tcp is not being
used so that tcp doesn't impact progress loop

This commit was SVN r7078.
2005-08-29 17:29:58 +00:00
Jeff Squyres
7a4013dcdf Update svn:ignore
This commit was SVN r7075.
2005-08-29 16:44:49 +00:00
Jeff Squyres
6089b02afd First cut of the libnuma implementation
This commit was SVN r7074.
2005-08-29 15:13:54 +00:00
Jeff Squyres
b98974d09c - Clarify the output when looking for compilers that impersonate gcc
- Setup OMPI_CC_ORIGINAL in preparation for AM subdir-objects (so that
  we can snarf libtool objects properly if $CC has been switched to
  the "compile" script)

This commit was SVN r7073.
2005-08-28 13:11:38 +00:00
Jeff Squyres
6057a17709 Fix typo in help message
This commit was SVN r7072.
2005-08-28 13:10:16 +00:00
Jeff Squyres
7d895a4f08 Add missing header file
This commit was SVN r7071.
2005-08-28 11:50:43 +00:00
George Bosilca
12c09fef92 PTL GM now uses the new configure system (m4 macros).
This commit was SVN r7070.
2005-08-28 01:38:58 +00:00
Brian Barrett
173e062fbb * Spell LIBS as LIBS not LIBX ;)
This commit was SVN r7069.
2005-08-27 17:38:50 +00:00
Brian Barrett
fc71fd5744 * fix place where Jeff changed an exit to a return and we really wanted
it to be an exit.
* Put the srun process (or what is about to become the srun process) in
  its own process group so that group-wide signals (such as the
  SIGINT sent by hitting cntl-c in a shell) are not sent to the srun
  process. 

This commit was SVN r7068.
2005-08-27 17:08:48 +00:00
George Bosilca
5b59ffbe4f Handle multiple IP addresses for the OOB TCP module. We check the addresses in order, and we give up if
and only if all of them failed.

This commit was SVN r7067.
2005-08-27 17:03:19 +00:00
Jeff Squyres
9d6c22b0d5 Commit 2 of 2 to shorten romio component filenames: change the sym
links from mca_io_romio_dist_* to io_romio_*

This commit was SVN r7066.
2005-08-27 15:33:42 +00:00
Jeff Squyres
8006b1a5a8 Part 1 of 2 of a commit to reduce filenames in the romio component
(because we kept bumping up against the max filename limit in "tar"
when making tarballs, especially if the version number got long).

This commit was SVN r7065.
2005-08-27 15:17:25 +00:00
Jeff Squyres
774f879a41 Oops -- add second string in there because we added a second %s to the
help message.

This commit was SVN r7064.
2005-08-27 13:32:25 +00:00
Jeff Squyres
b462345956 Allow command line to override ACLOCAL, AUTOHEADER, AUTOCONF,
LIBTOOLIZE, AUTOMAKE via environment variables.

This commit was SVN r7063.
2005-08-27 11:23:19 +00:00
Jeff Squyres
27554c19d7 Add missing .h file
This commit was SVN r7062.
2005-08-27 11:01:44 +00:00
Brian Barrett
2143ed4c81 * move error -> string converter registration from orte_init to
orte_init_stage1(), since not all ORTE processes call orte_init().
* Expand opal_error test case to make sure ORTE error codes print
  properly
* Make project error codes start at easy values (OPAL is -1 to -100,
  ORTE is -101 to -200, OMPI is less than -201) to make it easier
  to figure out what an error code as an integer means.  Also has
  the nice property of not changing the values of error codes every
  time a new error code is added.

This commit was SVN r7061.
2005-08-26 23:36:57 +00:00
George Bosilca
0eabdd9498 The correct way to detect if a contiguous datatype has gaps around (in the beginning or at the
end) is to check the size and the extent (not the true extent).
If there are gaps we should adapt the source and the destination to match these gaps
(especially if the gaps are in the beginning).

This commit was SVN r7060.
2005-08-26 22:41:04 +00:00
Jeff Squyres
c9cdb36b0b Finally get this right: move orte_sys_info.[ch] back into the orte
tree.
- fix up #include's throughout the tree (yay contrib/search_replace.pl!)
- remove a few extraneous #include's
- remove orte_sys_info*() from opal_init()/opal_finalize() (it's
  already in orte_init_stage1() and orte_system_finalize())
- remove dependencies in opal on orte_system_info -- util/os_path.c
  and util/os_create_dirpath.c (they only used path_sep, anyway --
  easily changed to #defines)

This commit was SVN r7059.
2005-08-26 21:03:41 +00:00
Jeff Squyres
b3bd549331 - Change a few calls from exit() to orte_abort() so that we get
session directory cleanup (among other things)
- When we get an abnormal exit in orterun (i.e., timeout expires and
  we haven't gotten termination notices from all processes), print a
  better message and exit in a better way (which includes session
  directory cleanup)
- Fix tm and poe pls's to not exit() but rather propagate the error up
  the stack (where relevant)

This commit was SVN r7058.
2005-08-26 20:36:11 +00:00
Tim Woodall
2a9ab3eb10 move obj construct back into open
This commit was SVN r7057.
2005-08-26 20:28:42 +00:00