Tim Woodall
a74ca0062a
reductions to initial memory footprint
...
This commit was SVN r7455.
2005-09-21 19:10:56 +00:00
Galen Shipman
4296e723c9
default free_lists to smaller size..
...
This commit was SVN r7454.
2005-09-21 18:55:07 +00:00
Galen Shipman
96ab5a6bd3
we can be in WAITING_ACK state without a race if the OOB ack is "slower" than
...
the scheduling of queued IB send operations.
This commit was SVN r7452.
2005-09-21 16:47:08 +00:00
Tim Woodall
782e5b21cc
cleanup
...
This commit was SVN r7451.
2005-09-21 15:34:45 +00:00
Tim Woodall
a49a442fe4
cleanup refcount logic
...
This commit was SVN r7450.
2005-09-21 15:32:27 +00:00
Tim Woodall
0ee34051f8
debug asserts
...
This commit was SVN r7449.
2005-09-21 15:30:17 +00:00
Tim Woodall
1b73d3856e
possible race condition - set endpoint state before sending connect ack
...
This commit was SVN r7448.
2005-09-20 21:03:55 +00:00
Tim Woodall
84e0d89497
correction
...
This commit was SVN r7447.
2005-09-20 19:20:39 +00:00
Ralph Castain
2656ec93b5
Fix a typo so that stage_gate_2 gets correctly passed back to orterun...
...
This commit was SVN r7446.
2005-09-20 19:12:59 +00:00
Ralph Castain
5686e8119e
Move the error name macro to the errmgr framework. Add a second level of tracing. Remove an obsolete file.
...
This commit was SVN r7445.
2005-09-20 17:09:11 +00:00
David Daniel
e4985c2a07
Moving totalview spin to the very end of mpi_init
...
This commit was SVN r7444.
2005-09-20 15:22:15 +00:00
Tim Woodall
c25ffb343a
restore host option
...
This commit was SVN r7443.
2005-09-20 13:36:16 +00:00
Tim Woodall
29d14281c8
use the specified host names (if provided)
...
This commit was SVN r7442.
2005-09-20 13:33:11 +00:00
Tim Woodall
6c885acb91
corrections to handle host specifications
...
This commit was SVN r7441.
2005-09-20 13:32:08 +00:00
Tim Woodall
f0cec8ac0c
Both -H and -host options are allowed to specify hostlist (now supported for bproc -
...
will look at rsh)
This commit was SVN r7440.
2005-09-20 13:31:13 +00:00
Jeff Squyres
41ba191e9a
Temporarily comment out the -arch and -host options since we do not
...
yet have an rmapper that can handle that information.
This commit was SVN r7438.
2005-09-20 08:56:02 +00:00
Brian Barrett
fd9901f683
* shell of a portals PML, properly ompi_ignored for most of the world...
...
This commit was SVN r7437.
2005-09-20 08:07:08 +00:00
Tim Woodall
75d9119cf3
correction
...
This commit was SVN r7436.
2005-09-19 21:35:39 +00:00
Tim Woodall
e1ec160858
lookup available nodes based on mapping data (if available)
...
This commit was SVN r7435.
2005-09-19 21:31:00 +00:00
Tim Woodall
9c334800ad
merge in environ from front-end node - giving precedence
...
to any user supplied values. otherwise, some c library
routines behave badly (getpwuid...)
This commit was SVN r7434.
2005-09-19 21:06:05 +00:00
Ralph Castain
bfef5928a1
Add a second trace option to pass an argument
...
This commit was SVN r7433.
2005-09-19 20:22:22 +00:00
Ralph Castain
86a43b1d29
Add trace to the daemons and orterun so we can tell when their callbacks are being exercised.
...
This commit was SVN r7432.
2005-09-19 17:20:01 +00:00
Brian Barrett
d81726833e
* Add memory barriers for shared memory. Rich and I think we got them
...
all and the Intel tests pass slightly oversubscribed.
This commit was SVN r7431.
2005-09-19 16:28:25 +00:00
George Bosilca
60f9edf17c
Create the mutex and the condition only once.
...
This commit was SVN r7430.
2005-09-19 16:01:29 +00:00
George Bosilca
193120d434
In the case where we have to subscribe to get information about the peer. As we call this function
...
with the mutex locked and as this function will call oob_send which will call the lookup again
... we will deadlock as the mutex is already locked. The solution is to release the mutex before
going into the subscription. Then of course the logic to remove the item when something went
wrong with the subscription is a little bit more complex.
This commit was SVN r7429.
2005-09-19 15:59:46 +00:00
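The locking pattern described in r7429 can be sketched as follows. This is a minimal illustration, not the code from the commit: `peer_lock`, `peer_pending`, `peer_known`, `subscribe_for_peer`, and `lookup_peer` are hypothetical names, and the subscription path is reduced to a stand-in that re-acquires the same non-recursive mutex.

```c
#include <pthread.h>

/* Hypothetical state guarded by one non-recursive mutex. */
static pthread_mutex_t peer_lock = PTHREAD_MUTEX_INITIALIZER;
static int peer_pending = 0;   /* entry queued while we wait for peer info */
static int peer_known = 0;     /* peer info has arrived */

/* Stand-in for the subscription path. Like the lookup it re-enters,
 * it takes peer_lock, so the caller must not hold the lock here. */
static int subscribe_for_peer(int should_fail)
{
    if (should_fail) {
        return -1;
    }
    pthread_mutex_lock(&peer_lock);
    peer_known = 1;
    peer_pending = 0;
    pthread_mutex_unlock(&peer_lock);
    return 0;
}

static int lookup_peer(int should_fail)
{
    int rc;

    pthread_mutex_lock(&peer_lock);
    if (peer_known) {
        pthread_mutex_unlock(&peer_lock);
        return 0;
    }
    peer_pending = 1;
    /* Release the mutex before subscribing to avoid the self-deadlock. */
    pthread_mutex_unlock(&peer_lock);

    rc = subscribe_for_peer(should_fail);
    if (rc != 0) {
        /* Failure path: re-acquire the lock to remove the pending entry. */
        pthread_mutex_lock(&peer_lock);
        peer_pending = 0;
        pthread_mutex_unlock(&peer_lock);
    }
    return rc;
}
```

The extra work on the failure path is the "more complex" cleanup the message refers to: because the lock was dropped, the pending entry has to be removed under a fresh lock acquisition.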
George Bosilca
703e874468
Remove a race condition. If this function is called by the progress thread then it does not have to
...
add an event, it can call the spawn function directly. This will avoid it waiting on a condition that
will never be signaled.
This commit was SVN r7428.
2005-09-19 15:54:53 +00:00
Ralph Castain
b589a93e29
Continue to lace the trace functionality into orte...
...
This commit was SVN r7427.
2005-09-19 15:29:14 +00:00
Tim Woodall
09869daf8e
from the list of addresses exported by the peer, attempt to
...
pick an address on the same subnet. if none are found, give
up and try them in order
This commit was SVN r7426.
2005-09-19 14:47:11 +00:00
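The selection rule in r7426 could look roughly like the sketch below. It assumes IPv4 addresses and a netmask already converted to host byte order; `pick_peer_addr` and its parameters are hypothetical and do not come from the commit.

```c
#include <stddef.h>
#include <stdint.h>

/* Return the index of the first peer address that shares a subnet with
 * local_addr. If none matches, return 0 so the caller falls back to
 * trying the exported addresses in order. */
static size_t pick_peer_addr(const uint32_t *peer_addrs, size_t n,
                             uint32_t local_addr, uint32_t netmask)
{
    size_t i;

    for (i = 0; i < n; i++) {
        if ((peer_addrs[i] & netmask) == (local_addr & netmask)) {
            return i;
        }
    }
    return 0;
}
```

For example, with peers 10.0.0.5 and 192.168.1.7, a local address of 192.168.1.2, and a /24 mask, the second peer address is chosen.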
Tim Woodall
aeb5bc3f57
still need to cleanup/revise the template for mpool changes
...
This commit was SVN r7425.
2005-09-19 14:34:24 +00:00
George Bosilca
b70230858b
Correct the misnaming problem in the GM PTL.
...
This commit was SVN r7424.
2005-09-19 10:34:06 +00:00
Galen Shipman
6499eb3976
init the return code..
...
This commit was SVN r7423.
2005-09-18 14:25:30 +00:00
George Bosilca
97673b45d1
Remove the last bad symbol from the GM PTL.
...
This commit was SVN r7422.
2005-09-18 12:52:37 +00:00
George Bosilca
b5cb27c006
The self should use self named files.
...
This commit was SVN r7421.
2005-09-18 12:37:15 +00:00
George Bosilca
a7db1763e2
Cleanups ...
...
This commit was SVN r7420.
2005-09-18 12:34:29 +00:00
George Bosilca
9da5fc2d65
It's already defined in the .h file.
...
This commit was SVN r7419.
2005-09-18 12:32:33 +00:00
Jeff Squyres
d67c31f238
Remove useless compiler warnings.
...
This commit was SVN r7418.
2005-09-17 10:54:48 +00:00
Jeff Squyres
f9a1e14f65
Per suggestion from our friendly Libtool developer friends, add proper
...
dependencies for liborte and libompi (i.e., make liborte depend on
libopal, and make libmpi depend on liborte)
This commit was SVN r7417.
2005-09-17 10:45:46 +00:00
Jeff Squyres
c1ed60be77
Fix problem of not sending appropriate flags to the wrapper
...
compilers.
This commit was SVN r7416.
2005-09-17 10:44:20 +00:00
Galen Shipman
b8cb6e1c64
modified mpool module to contain flags - used to determine if the mpool will
...
be used in MPI_Alloc_mem operations. Note that we found an interesting bug in
which if memory was allocated by the sm mpool (via mmap) and then registered
via the mvapi mpool, the registration would fail on certain systems.
Added mca param mpool_base_use_mem_hooks, set to 1 to enable the memory hooks
so that memory is deregistered if the user frees it behind our back. This is
only useful if the mca param mpi_leave_pinned is also set to 1. Otherwise all
registrations are deregistered within the MPI library or via
MPI_Free_mem. After testing we should probably set both mpi_leave_pinned and
mpool_base_use_mem_hooks to default to 1.
This commit was SVN r7415.
2005-09-16 22:22:03 +00:00
Tim Woodall
f71abbf856
cleanup
...
This commit was SVN r7414.
2005-09-16 20:59:53 +00:00
Tim Woodall
41b6fc166e
setup callback before actually launching - otherwise this is
...
a definite race condition
This commit was SVN r7413.
2005-09-16 20:45:25 +00:00
Brian Barrett
753b9a888e
* fix compiler warning
...
This commit was SVN r7412.
2005-09-16 19:15:36 +00:00
Brian Barrett
c87babb565
* if start_rank == end_rank, there doesn't seem to be a requirement that
...
stride == 1, and the Intel tests explicitly test the case with
strides != 1, expecting them to work
This commit was SVN r7411.
2005-09-16 18:44:21 +00:00
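To illustrate the point in r7411, here is a small sketch of counting the ranks covered by a (first, last, stride) triplet. The commit does not name the affected function, so `range_count` is a hypothetical helper, and the validity rules are an assumption modeled on MPI-style range triplets.

```c
/* Number of ranks in the triplet (first, last, stride), or -1 if the
 * triplet is invalid. When first == last the result is 1 for any
 * nonzero stride, so stride == 1 must not be required in that case. */
static int range_count(int first, int last, int stride)
{
    if (stride == 0) {
        return -1;                     /* a zero stride never advances */
    }
    if ((last - first) * stride < 0) {
        return -1;                     /* stride points away from last */
    }
    return (last - first) / stride + 1;
}
```

With this rule, `range_count(3, 3, 5)` yields a single rank, while `range_count(0, 10, -2)` is rejected.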
Brian Barrett
b3cc58b681
* use rd instead of mov because for some odd reason GAS doesn't like
...
moving the tick register to another register, but is fine with rd.
* properly mask out the upper 32 bits of the register containing the
lower 32 bits so that the OR with the upper 32 bits actually works
properly. Hopefully, gcc will optimize this into code that doesn't
suck so much (it's much easier to deal with when you get to control
the argument registers...)
This commit was SVN r7410.
2005-09-16 17:44:18 +00:00
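The masking fix in r7410 concerns inline assembly, but the idea can be shown in plain C. The sketch below is an illustration only; `combine_halves` is a hypothetical helper and is not taken from the commit.

```c
#include <stdint.h>

/* Combine two values that each carry one 32-bit half of a counter.
 * The register holding the lower half may contain stale or
 * sign-extended upper bits, so they are masked off before the OR.
 * Without the mask, those bits would corrupt the upper half. */
static uint64_t combine_halves(uint64_t upper, uint64_t lower)
{
    return (upper << 32) | (lower & 0xFFFFFFFFULL);
}
```

For instance, an upper half of 1 and a lower register of 0xFFFFFFFF80000000 combine to 0x180000000 once the mask is applied; without it the result would be all upper bits set.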
Galen Shipman
808b2c1c53
threaded build fix for btl_gm..
...
This commit was SVN r7409.
2005-09-16 17:18:15 +00:00
Josh Hursey
575afef072
Use non blocking sends in orte_gpr_replica_remote_notify.
...
This fixes one of the race conditions when orterun is sent a kill signal.
Before it would sometimes spin in the OOB waiting for a message to complete
to a peer that was no longer around. Stalling at this level prevented orterun
from noticing that it had received a kill signal.
This commit was SVN r7408.
2005-09-16 15:34:44 +00:00
Jeff Squyres
4637ca6027
Use AC_FD_CC uniformly (one & was missing).
...
This commit was SVN r7407.
2005-09-16 12:28:53 +00:00
Jeff Squyres
5b5da322e8
Fix typo and update auto tools to recent versions
...
This commit was SVN r7406.
2005-09-16 12:28:31 +00:00
Jeff Squyres
2b82224a4f
Remove superfluous ; (actually causes an error in some cases)
...
This commit was SVN r7405.
2005-09-16 12:27:25 +00:00
Brian Barrett
2787d993a9
* Add checks for fork/execve/setpgid for slurm components so that they
...
automagically don't build on platforms without such things
* Fix for mistaken use of cache variable in assembly setup
* one more cached test hits the books
This commit was SVN r7404.
2005-09-16 04:51:09 +00:00