Commit graph

5869 commits

Author SHA1 Message Date
Tim Woodall
a74ca0062a reductions to initial memory footprint
This commit was SVN r7455.
2005-09-21 19:10:56 +00:00
Galen Shipman
4296e723c9 default free_lists to smaller size..
This commit was SVN r7454.
2005-09-21 18:55:07 +00:00
Galen Shipman
96ab5a6bd3 we can be in WAITING_ACK state without a race if the OOB ack is "slower" than
the scheduling of queued IB send operations. 

This commit was SVN r7452.
2005-09-21 16:47:08 +00:00
Tim Woodall
782e5b21cc cleanup
This commit was SVN r7451.
2005-09-21 15:34:45 +00:00
Tim Woodall
a49a442fe4 cleanup refcount logic
This commit was SVN r7450.
2005-09-21 15:32:27 +00:00
Tim Woodall
0ee34051f8 debug asserts
This commit was SVN r7449.
2005-09-21 15:30:17 +00:00
Tim Woodall
1b73d3856e possible race condition - set endpoint state before sending connect ack
This commit was SVN r7448.
2005-09-20 21:03:55 +00:00
Tim Woodall
84e0d89497 correction
This commit was SVN r7447.
2005-09-20 19:20:39 +00:00
Ralph Castain
2656ec93b5 Fix a typo so that stage_gate_2 gets correctly passed back to orterun...
This commit was SVN r7446.
2005-09-20 19:12:59 +00:00
Ralph Castain
5686e8119e Move the error name macro to the errmgr framework. Add a second level of tracing. Remove an obsolete file.
This commit was SVN r7445.
2005-09-20 17:09:11 +00:00
David Daniel
e4985c2a07 Moving totalview spin to the very end of mpi_init
This commit was SVN r7444.
2005-09-20 15:22:15 +00:00
Tim Woodall
c25ffb343a restore host option
This commit was SVN r7443.
2005-09-20 13:36:16 +00:00
Tim Woodall
29d14281c8 use the specified host names (if provided)
This commit was SVN r7442.
2005-09-20 13:33:11 +00:00
Tim Woodall
6c885acb91 corrections to handle host specifications
This commit was SVN r7441.
2005-09-20 13:32:08 +00:00
Tim Woodall
f0cec8ac0c Both -H and -host options are allowed to specify hostlist (now supported for bproc -
will look at rsh)

This commit was SVN r7440.
2005-09-20 13:31:13 +00:00
Jeff Squyres
41ba191e9a Temporarily comment out the -arch and -host options since we do not
yet have an rmapper that can handle that information.

This commit was SVN r7438.
2005-09-20 08:56:02 +00:00
Brian Barrett
fd9901f683 * shell of a portals PML, properly ompi_ignored for most of the world...
This commit was SVN r7437.
2005-09-20 08:07:08 +00:00
Tim Woodall
75d9119cf3 correction
This commit was SVN r7436.
2005-09-19 21:35:39 +00:00
Tim Woodall
e1ec160858 lookup available nodes based on mapping data (if available)
This commit was SVN r7435.
2005-09-19 21:31:00 +00:00
Tim Woodall
9c334800ad merge in environ from front-end node - giving precedence
to any user supplied values. otherwise, some c library
routines behave badly (getpwuid...)

This commit was SVN r7434.
2005-09-19 21:06:05 +00:00
Ralph Castain
bfef5928a1 Add a second trace option to pass an argument
This commit was SVN r7433.
2005-09-19 20:22:22 +00:00
Ralph Castain
86a43b1d29 Add trace to the daemons and orterun so we can tell when their callbacks are being exercised.
This commit was SVN r7432.
2005-09-19 17:20:01 +00:00
Brian Barrett
d81726833e * Add memory barriers for shared memory. Rich and I think we got them
all and the Intel tests pass slightly oversubscribed.

This commit was SVN r7431.
2005-09-19 16:28:25 +00:00
George Bosilca
60f9edf17c Create the mutex and the condition only once.
This commit was SVN r7430.
2005-09-19 16:01:29 +00:00
George Bosilca
193120d434 Handle the case where we have to subscribe to get information about the peer. We call this function
with the mutex locked, and it calls oob_send, which calls the lookup again,
so we deadlock because the mutex is already locked. The solution is to release the mutex before
going into the subscription. Of course, the logic to remove the item when something goes
wrong with the subscription then becomes a little more complex.

This commit was SVN r7429.
2005-09-19 15:59:46 +00:00
George Bosilca
703e874468 Remove a race condition. If this function is called by the progress thread, it does not have to
add an event; it can call the spawn function directly. This avoids waiting on a condition that
will never be released.

This commit was SVN r7428.
2005-09-19 15:54:53 +00:00
Ralph Castain
b589a93e29 Continue to lace the trace functionality into orte...
This commit was SVN r7427.
2005-09-19 15:29:14 +00:00
Tim Woodall
09869daf8e from the list of addresses exported by the peer, attempt to
pick an address on the same subnet. if none are found, give
up and try them in order

This commit was SVN r7426.
2005-09-19 14:47:11 +00:00
Tim Woodall
aeb5bc3f57 still need to cleanup/revise the template for mpool changes
This commit was SVN r7425.
2005-09-19 14:34:24 +00:00
George Bosilca
b70230858b Correct the misnaming problem in the GM PTL.
This commit was SVN r7424.
2005-09-19 10:34:06 +00:00
Galen Shipman
6499eb3976 init the return code..
This commit was SVN r7423.
2005-09-18 14:25:30 +00:00
George Bosilca
97673b45d1 Remove the last bad symbol from the GM PTL.
This commit was SVN r7422.
2005-09-18 12:52:37 +00:00
George Bosilca
b5cb27c006 The self should use self named files.
This commit was SVN r7421.
2005-09-18 12:37:15 +00:00
George Bosilca
a7db1763e2 Cleanups ...
This commit was SVN r7420.
2005-09-18 12:34:29 +00:00
George Bosilca
9da5fc2d65 It's already defined in the .h file.
This commit was SVN r7419.
2005-09-18 12:32:33 +00:00
Jeff Squyres
d67c31f238 Remove useless compiler warnings.
This commit was SVN r7418.
2005-09-17 10:54:48 +00:00
Jeff Squyres
f9a1e14f65 Per suggestion from our friendly Libtool developer friends, add proper
dependencies for liborte and libompi (i.e., make liborte depend on
libopal, and make libmpi depend on liborte)

This commit was SVN r7417.
2005-09-17 10:45:46 +00:00
Jeff Squyres
c1ed60be77 Fix problem of not sending appropriate flags to the wrapper
compilers. 

This commit was SVN r7416.
2005-09-17 10:44:20 +00:00
Galen Shipman
b8cb6e1c64 modified mpool module to contain flags - used to determine if the mpool will
be used in MPI_Alloc_mem operations. Note that we found an interesting bug in
which if memory was allocated by the sm mpool (via mmap) and then registered
via the mvapi mpool, the registration would fail on certain systems. 

Added mca param mpool_base_use_mem_hooks, set to 1 to enable the memory hooks
so that memory is deregistered if the user frees it behind our back. This is
only useful if the mca param mpi_leave_pinned is also set to 1. Otherwise all
registrations are deregistered within the MPI library or via
MPI_Free_buf. After testing we should probably set both mpi_leave_pinned and
mpool_base_use_mem_hooks to default to 1. 

This commit was SVN r7415.
2005-09-16 22:22:03 +00:00
Tim Woodall
f71abbf856 cleanup
This commit was SVN r7414.
2005-09-16 20:59:53 +00:00
Tim Woodall
41b6fc166e setup callback before actually launching - otherwise this is
a definite race condition

This commit was SVN r7413.
2005-09-16 20:45:25 +00:00
Brian Barrett
753b9a888e * fix compiler warning
This commit was SVN r7412.
2005-09-16 19:15:36 +00:00
Brian Barrett
c87babb565 * if start_rank == end_rank, there doesn't seem to be a requirement that
stride == 1, and the Intel tests explicitly test the case with
  strides != 1, expecting them to work

This commit was SVN r7411.
2005-09-16 18:44:21 +00:00
Brian Barrett
b3cc58b681 * use rd instead of mov because for some odd reason GAS doesn't like
moving the tick register to another register, but is fine with rd.
* properly mask out the upper 32 bits of the register containing the
  lower 32 bits so that the or with the upper 32 bits actually works
  properly.  Hopefully, gcc will optimize this into code that doesn't
  suck so much (it's much easier to deal with when you get to control
  the argument registers...)

This commit was SVN r7410.
2005-09-16 17:44:18 +00:00
Galen Shipman
808b2c1c53 threaded build fix for btl_gm..
This commit was SVN r7409.
2005-09-16 17:18:15 +00:00
Josh Hursey
575afef072 Use non blocking sends in orte_gpr_replica_remote_notify.
This fixes one of the race conditions that occur when orterun is sent a kill signal.
Before it would sometimes spin in the OOB waiting for a message to complete
to a peer that was no longer around. Stalling at this level prevented orterun
from noticing that it had received a kill signal.

This commit was SVN r7408.
2005-09-16 15:34:44 +00:00
Jeff Squyres
4637ca6027 Use AC_FD_CC uniformly (one & was missing).
This commit was SVN r7407.
2005-09-16 12:28:53 +00:00
Jeff Squyres
5b5da322e8 Fix typo and update auto tools to recent versions
This commit was SVN r7406.
2005-09-16 12:28:31 +00:00
Jeff Squyres
2b82224a4f Remove superfluous ; (actually causes an error in some cases)
This commit was SVN r7405.
2005-09-16 12:27:25 +00:00
Brian Barrett
2787d993a9 * Add checks for fork/execve/setpgid for slurm components so that they
automagically don't build on platforms without such things
* Fix for mistaken use of cache variable in assembly setup
* one more cached test hits the books

This commit was SVN r7404.
2005-09-16 04:51:09 +00:00