1
1
Граф коммитов

8323 Коммитов

Автор SHA1 Сообщение Дата
Brian Barrett
d0ec0558e4 more platforms support egrep than grep -E, so use that instead
This commit was SVN r11797.
2006-09-25 19:44:12 +00:00
Tim Prins
e4f8ad303e Fix for #397
on 64 bit platforms sizeof(size_t) != sizeof(orte_std_cntr_t), and we were incorrectly 
assuming this when dealing with num procs. It worked on little endian platforms, but
not big endian. So change num_procs to type int, and cast where needed. 

This commit was SVN r11796.
2006-09-25 19:41:54 +00:00
Brian Barrett
c3306f7073 * don't abort if we get a status error - just pass it on to the next layer up
This commit was SVN r11791.
2006-09-25 17:28:24 +00:00
Brian Barrett
b18055451d * need to include errno.h to get errno
This commit was SVN r11790.
2006-09-25 17:17:43 +00:00
Brian Barrett
fa0ae7aa52 len is assigned during the big for loop() in this function, so assuming
the initialization in the start of the function is going to remain for
the life of the function is erroneous.  Need to initialize before
using in the snprintf() call.

Refs trac:340

This commit was SVN r11781.

The following Trac tickets were found above:
  Ticket 340 --> https://svn.open-mpi.org/trac/ompi/ticket/340
2006-09-25 15:34:50 +00:00
Jeff Squyres
38b34b6a7c * Use time(NULL) to seed the random number generator because it has a
wider space than getpid()
 * Include <time.h> to get time()'s prototype
 * Fix typo that prevented using /dev/urandom on systems that had it

This commit was SVN r11780.
2006-09-25 15:32:54 +00:00
Jeff Squyres
cd6a20dc05 Update to NEWS.
This commit was SVN r11775.
2006-09-25 14:53:29 +00:00
Jeff Squyres
a822c34ebf Ralf W. categorically told me that this kind of statement in
Makefile.am's is from a very old Automake bug which has long-since
been fixed.  Since we require very recent versions of AM, we don't
need these anymore.

This commit was SVN r11774.
2006-09-25 14:28:04 +00:00
Jeff Squyres
c5cc1f0c1a Add man page for wrapper compilers.
Fixes trac:358.

This commit was SVN r11773.

The following Trac tickets were found above:
  Ticket 358 --> https://svn.open-mpi.org/trac/ompi/ticket/358
2006-09-25 14:11:21 +00:00
Gleb Natapov
9cd25158d6 Fix btl_openib_max_btls parameter handling.
This commit was SVN r11772.
2006-09-25 11:18:20 +00:00
Brian Barrett
2bfb2b8a8a * Add sys/stat.h so that this compiiles on OS X
* Properly protect headers with #ifdefs

This commit was SVN r11771.
2006-09-24 18:40:55 +00:00
Brian Barrett
18994324ce OpenBSD doesn't define some constants that other OSes do.
refs trac:393

This commit was SVN r11770.

The following Trac tickets were found above:
  Ticket 393 --> https://svn.open-mpi.org/trac/ompi/ticket/393
2006-09-24 18:20:55 +00:00
Galen Shipman
01de9de9a0 Use /dev/urandom.. doesn't block like /dev/random may..
This commit was SVN r11769.
2006-09-24 07:52:42 +00:00
Brian Barrett
52004417be * fix dumb typo. Clearly shouldn't work late at night...
This commit was SVN r11767.
2006-09-23 14:51:12 +00:00
Brian Barrett
f4b602baa9 turn off debugging that shouldn't have been committed
This commit was SVN r11765.
2006-09-23 03:25:18 +00:00
Brian Barrett
ad48aa82e9 A number of changes for improved Alpha support:
* Use $31 instead of mnemonic zero for the gcc inline
    assembly test, as the GNU assembler doesn't like
    zero, but both Tru64 and GNU assembler should be fine
    with $31
  * Disable Linux timer component on Alpha.  The CPU timer
    rolls over every 10 seconds or less, so it's kinda
    worthless for our needs.
  * Fix some escaping issues when local functions are
    denoted with a $
  * Remove C++ comments from the Alpha assembly.
  * Add base assembly code for the non-inlined functions
    on Alpha

This commit was SVN r11764.
2006-09-23 03:23:57 +00:00
Tim Prins
a4f1018afd cleanup compiler warning
This commit was SVN r11763.
2006-09-23 01:17:39 +00:00
Brian Barrett
e3497a8bff Fix a number of places where we can cause errors in configure if CFLAGS /
CXXFLAGS are set to -Wall -Werror.  Thanks to Ralf for the patch.

refs trac:290

This commit was SVN r11762.

The following Trac tickets were found above:
  Ticket 290 --> https://svn.open-mpi.org/trac/ompi/ticket/290
2006-09-22 23:58:23 +00:00
Terry Dontje
d636db5832 Fixed bug trac #213 by moving the udapl btl header to being a footer.
Also fixed bug trac #346.

This commit was SVN r11760.
2006-09-22 19:28:09 +00:00
Brian Barrett
3c814fdd23 fixes trac:391
Fix for double mutex free that would cause an abort condition in the orted
whenever threads were enabled.

This commit was SVN r11759.

The following Trac tickets were found above:
  Ticket 391 --> https://svn.open-mpi.org/trac/ompi/ticket/391
2006-09-22 19:24:42 +00:00
Brian Barrett
b4480cf182 * Need to unlock on error conditions in opal_event_base_loop()
* Fix if structure if we don't use signals

Rainer Keller submitted original patch, slightly cleaned up by me.

This commit was SVN r11758.
2006-09-22 16:08:58 +00:00
Andrew Friedley
798c19d395 Blah.. we should always return after try_connect() here, not just when we have an error.
Another fix for ticket #362.

This commit was SVN r11756.
2006-09-22 15:51:11 +00:00
Brian Barrett
ab6cbb2359 * update ompi_mpi_abort to call abort_procs_request on the processes that
should die, according to the MPI standard.  It's possible that the
    ORTE layer may kill additional processes, but that's beyond our
    control and seems to be allowed by the standard (ie, it might also
    end up killing all the procs in all the jobs covered by the
    communicator).

  * update the stack trace printing code to use the framework rather
    than calling execinfo directly, so that we should be able to get
    stack traces on all the platforms we support stack tracing on
    (if the user wants stack traces on abort, of course)

This commit was SVN r11753.
2006-09-22 15:04:04 +00:00
Tim Prins
567676f3c1 - Formatting and minor cleanup
- made it so we now set the architecture of each node we discover
- remove debugging output

This commit was SVN r11751.
2006-09-22 13:24:32 +00:00
Gleb Natapov
601a6ca17a Use real subnet prefix instead of sm_lid.
This commit was SVN r11749.
2006-09-22 10:27:12 +00:00
Brian Barrett
8fc278c3a3 Rest of the fix for #325. It uses a bit more space, but now we can reasonably
tell if the remote proc should be in an exposure epoch or not.

Refs trac:325

This commit was SVN r11746.

The following Trac tickets were found above:
  Ticket 325 --> https://svn.open-mpi.org/trac/ompi/ticket/325
2006-09-21 20:49:15 +00:00
Brian Barrett
2ec0c4f593 * Fix race condition in post/wait/start/complete synchronization where one
epoch's control data could overwrite the previous epoch's data because
    we were reusing data structures between PW and SC.  Instead, we now
    have explicit post_msg and complete_msg counters for completion.

    refs trac:354

  * Only register the rdma osc callback once, as it turns out that some
    btls (MX) do somethng more than update a table during the register
    call, and each register call sucks up valuable fragments...

This commit was SVN r11745.

The following Trac tickets were found above:
  Ticket 354 --> https://svn.open-mpi.org/trac/ompi/ticket/354
2006-09-21 19:57:57 +00:00
Sven Stork
443af2a259 - remove path of the argument
This commit was SVN r11742.
2006-09-21 16:08:52 +00:00
Sven Stork
390a5ac6c6 - add portable way to start bash
- fix for ticket 382

This commit was SVN r11741.
2006-09-21 16:04:41 +00:00
George Bosilca
383c4e8c18 Don't print the optimized description if there is nothing.
This commit was SVN r11733.
2006-09-21 05:40:51 +00:00
Tim Prins
83a7f6e4de Fix for bug #369.
LoadLeveler only sets LOADL_PROCESSOR_LIST when there are 128 or less tasks allocated to a job. The POE RAS relied on this variable so I created a new RAS which uses the LoadLeveler API instead of relying on the environment variable. This still needs some testing, so for now we use the POE RAS whenever LOADL_PROCESSOR_LIST, otherwise we fall back on this component.

Unfortunately, this will require an autogen...

This commit was SVN r11732.
2006-09-21 00:08:49 +00:00
George Bosilca
645790dd9c Pedantic...
This commit was SVN r11731.
2006-09-20 22:20:10 +00:00
George Bosilca
688a16ea78 A long time waiting patch. Get rid of the comm->c_pml_procs. It was (and that was
long ago) supposed to be used as a cache for accessing the PML procs. But in
all of the PMLs the PML proc contain only one field i.e. a pointer to the ompi_proc.
This pointer can be accessed using the c_remote_group easily. Therefore, there is no
meaning of keeping the PML procs around. Slim fast commit ...

This commit was SVN r11730.
2006-09-20 22:14:46 +00:00
George Bosilca
20459bd982 Remove the HIDDEN flag. It is not used anywhere.
This commit was SVN r11729.
2006-09-20 20:57:10 +00:00
George Bosilca
1c464d340c Do not increase the reference count for the datatype if it is not required. Plus
some typos.

This commit was SVN r11728.
2006-09-20 20:14:15 +00:00
Brian Barrett
ddffc8748f * remove unneeded files that were included in the mpich2 tarball
This commit was SVN r11726.
2006-09-20 18:27:28 +00:00
Galen Shipman
04e9483aab fall back to rand() if /dev/random doesn't exist or the read to /dev/random
would block.. 

This commit was SVN r11725.
2006-09-20 16:59:44 +00:00
Andrew Friedley
8895bf7369 Fix the fix (r11718) for bug #362.
We were still waiting the entire duration of the timeout before we figured out that a connect() was successful.  Re-introduce adding the peer_send_event so that we detect immediately when a connect() completes.

Also make sure to delete the timeout event in complete_connect().

Fixed a struct timeval initialization warning reported by Jeff.

Remove an erroneous opal_output().

This commit was SVN r11724.

The following SVN revision numbers were found above:
  r11718 --> open-mpi/ompi@1b6231a9b5
2006-09-20 14:29:37 +00:00
George Bosilca
ff7e670c6a Sends and receives are now fully implemented. Correctly handle the reference counts
on groups. Correct the rearrangement of the communicators. Other minors improvements.

This commit was SVN r11722.
2006-09-20 07:51:30 +00:00
Brian Barrett
10a230373b Add a number of missing constants to the C++ bindings
refs trac:322

This commit was SVN r11720.

The following Trac tickets were found above:
  Ticket 322 --> https://svn.open-mpi.org/trac/ompi/ticket/322
2006-09-19 22:28:21 +00:00
Andrew Friedley
e776b01811 This assert fails if -mca pml_dr_enable_csum 0 is set, which isn't what we want..
This commit was SVN r11719.
2006-09-19 19:57:33 +00:00
Andrew Friedley
1b6231a9b5 Fix for running jobs that span multiple 's' partitions on IU BigRed.
Each 's' partition has its own TCP network.  It's fine to use this network for jobs that fit inside the partition, but the TCP OOB errors when trying to connect across two partitions, because there are two disjoint networks.  Each node also has another TCP network connecting ALL nodes together.

So the solution is to actually try all the available TCP interfaces on a node, instead of erroring when the first one fails.

Also, the default TCP connect() timeout is way too long (5 minutes) - use our own timeout mechanism, with the timeout value expressed as an MCA parameter.

This commit was SVN r11718.
2006-09-19 19:33:49 +00:00
George Bosilca
6f3782bbd7 When we succesfully cancel a request we have to set it's pml_complete flag to true
if we want to be able to reuse the request. If not, the request will never be freed
even if the user call MPI_Request_free.

This commit was SVN r11717.
2006-09-19 18:04:09 +00:00
Tim Prins
c4db5654fa Fix for bug #370
The POE ras did not correctly enter the number of slots per node. This fixes that.

This commit was SVN r11716.
2006-09-19 16:27:15 +00:00
Brian Barrett
d1402cf8f5 * Update ROMIO release to the one included with MPICH2-1.0.4p1, tagged in
vendor/romio as mpich2-1.0.4p1.

This commit was SVN r11715.
2006-09-19 16:13:46 +00:00
Gleb Natapov
18c54f574f add rmb()
This commit was SVN r11710.
2006-09-19 13:27:05 +00:00
Ralph Castain
977e3c5ca1 Let's see if Cyrador understands this version a little better...
This commit was SVN r11709.
2006-09-19 13:05:40 +00:00
Gleb Natapov
ac42284c16 Print more helpful message in case we can't find active port.
This commit was SVN r11706.
2006-09-19 08:56:32 +00:00
Rainer Keller
80166a9516 - fix typos
This commit was SVN r11703.
2006-09-19 07:55:41 +00:00
George Bosilca
b49ff3a3b7 We are now able to provide full information about the requests. And we can
parse all requests (sends or receives), and show them based on the communicator.

This commit was SVN r11702.
2006-09-19 06:31:42 +00:00