Galen Shipman
3c97b3f722
Modified the registration to include a base_align and bound_align for
searching the tree. Modified the memory callback to search the tree at each
page boundary for registrations. This is necessary as an application may
malloc memory and send out of any portion of that memory, even discontiguous
regions.
This commit was SVN r7510.
2005-09-27 02:01:21 +00:00
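As an illustration of the page-boundary search described in r7510, here is a minimal sketch in C. It is not the Open MPI code: the callback type `reg_lookup_fn`, the toy `demo_range` cache, and `count_registered_pages` are all hypothetical names, and the real callback searches the registration tree rather than a single range. The sketch only shows why every page overlapping a region has to be probed when an application may send from any part of a malloc'd buffer.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical lookup callback: returns nonzero when some registration
 * covers addr. It stands in for the real tree search. */
typedef int (*reg_lookup_fn)(uintptr_t addr, void *ctx);

/* Toy "cache" used for illustration only: one registered range [lo, hi). */
struct demo_range { uintptr_t lo; uintptr_t hi; };

static int demo_range_lookup(uintptr_t addr, void *ctx)
{
    const struct demo_range *r = (const struct demo_range *)ctx;
    return addr >= r->lo && addr < r->hi;
}

/* Probe every page that overlaps [base, base + len) and count the hits. */
static size_t count_registered_pages(uintptr_t base, size_t len,
                                     size_t page_size,
                                     reg_lookup_fn lookup, void *ctx)
{
    /* page_size must be a nonzero power of two for the mask below. */
    assert(page_size != 0 && (page_size & (page_size - 1)) == 0);
    if (len == 0) {
        return 0;
    }
    const uintptr_t mask = ~((uintptr_t)page_size - 1);
    uintptr_t page = base & mask;                   /* first page touched */
    const uintptr_t last = (base + len - 1) & mask; /* last page touched */
    size_t hits = 0;
    for (;;) {
        if (lookup(page, ctx)) {
            hits++;
        }
        if (page == last) {
            break; /* stop here so page never steps past the last page */
        }
        page += page_size;
    }
    return hits;
}
```

A region that starts or ends mid-page still probes the pages at both ends, because both bounds are rounded down to a page boundary before the loop begins.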
Brian Barrett
d9e80d8f2a
* increase size of event queue for receives - it was too small to be useful
on a reasonably sized machine
* if no mpool exists, don't try to malloc out an array of 0 bytes
This commit was SVN r7507.
2005-09-25 17:04:03 +00:00
Galen Shipman
9fe5844071
decrement ref count on removal of registration from mru and tree.
add misc asserts to check for proper reference counting.
ugly hack 1 -- use mallopt to never release memory ala sbrk - this is
commented out in mca_btl_mvapi_component_init
ugly hack 2 -- test registrations coming out of the tree via rcache_find. For
an unknown reason the tree is returning registrations where the address is not
within the base or bound of the registration. If this happens, we return
NULL.
comment out the code that enables mem hooks when leave_pinned is set. Note that
we can do this via an mca param, and will default it to leave_pinned with
mem_hooks once we iron out these issues.
I am adding a unit test for the rcache. Note that we have a unit test for the
rb tree but the compare function is significantly different than that used for
registrations. After we have tracked down the issues with rcache_rb we will
remove the above hacks.
This commit was SVN r7499.
2005-09-24 00:24:49 +00:00
Brian Barrett
50dc5499b4
* fix some remaining --with-btl-portals configure issues
This commit was SVN r7498.
2005-09-24 00:11:40 +00:00
Brian Barrett
0d68728b94
* add some more debugging output for send fragment issue to figure out why
Red Storm is complaining about invalid memory pointer (need to go back
to Linux and look at this with valgrind)
* Turn off send in place for now, so I can run the tests on RS and see if
everything else is ok
This commit was SVN r7497.
2005-09-23 19:30:54 +00:00
Brian Barrett
07b0b8c943
* add some useful debugging output
* fix dumb bug in btl_portals_get where I was using the dest descriptor key instead
of the source descriptor key for the match bits, resulting in a PtlGet() with
the wrong match bits
This commit was SVN r7496.
2005-09-23 15:30:18 +00:00
Tim Woodall
147716c249
added hostname to error output
This commit was SVN r7486.
2005-09-22 16:41:34 +00:00
Andrew Friedley
555ae37255
Add lib{opal,orte,mpi}.la to appropriate LIBADD's, some whitespace cleanup as well.
This commit was SVN r7477.
2005-09-22 12:28:54 +00:00
Tim Woodall
a74ca0062a
reductions to initial memory footprint
This commit was SVN r7455.
2005-09-21 19:10:56 +00:00
Galen Shipman
4296e723c9
default free_lists to smaller size..
This commit was SVN r7454.
2005-09-21 18:55:07 +00:00
Galen Shipman
96ab5a6bd3
we can be in WAITING_ACK state without a race if the OOB ack is "slower" than
the scheduling of queued IB send operations.
This commit was SVN r7452.
2005-09-21 16:47:08 +00:00
Tim Woodall
0ee34051f8
debug asserts
This commit was SVN r7449.
2005-09-21 15:30:17 +00:00
Tim Woodall
1b73d3856e
possible race condition - set endpoint state before sending connect ack
This commit was SVN r7448.
2005-09-20 21:03:55 +00:00
Brian Barrett
d81726833e
* Add memory barriers for shared memory. Rich and I think we got them
all and the Intel tests pass slightly oversubscribed.
This commit was SVN r7431.
2005-09-19 16:28:25 +00:00
Tim Woodall
aeb5bc3f57
still need to cleanup/revise the template for mpool changes
This commit was SVN r7425.
2005-09-19 14:34:24 +00:00
George Bosilca
b5cb27c006
The self should use self named files.
This commit was SVN r7421.
2005-09-18 12:37:15 +00:00
Galen Shipman
808b2c1c53
threaded build fix for btl_gm..
This commit was SVN r7409.
2005-09-16 17:18:15 +00:00
Tim Woodall
31d392af95
correct name
This commit was SVN r7376.
2005-09-14 22:35:58 +00:00
Tim Woodall
d190e6a315
handle losing a connection
This commit was SVN r7373.
2005-09-14 21:27:30 +00:00
Tim Woodall
c25fb5dab0
- fixed issue w/ btl send-in-place option that was affecting tcp
- reduced size of match header by an additional 4 bytes to 16 bytes
- corrections for buffered send (work in progress)
This commit was SVN r7371.
2005-09-14 17:08:08 +00:00
Brian Barrett
e98415eb7b
* make tree compile on OS X
This commit was SVN r7370.
2005-09-14 15:52:42 +00:00
Galen Shipman
f0b1ea52bc
if all else fails in prepare_src, pack
init the rdma_pending list in ob1
This commit was SVN r7366.
2005-09-14 04:41:33 +00:00
Brian Barrett
1290b8eed2
* some debugging to figure out why get isn't working on RS
This commit was SVN r7354.
2005-09-13 20:52:56 +00:00
George Bosilca
ad0c0cdc03
Make the GM btl compile again. There were just some typos.
This commit was SVN r7352.
2005-09-13 20:19:21 +00:00
Jeff Squyres
bbae6c3b1a
Add missing header file
This commit was SVN r7338.
2005-09-13 12:19:34 +00:00
Galen Shipman
39f25428da
missing includes, perhaps related to george's work?
This commit was SVN r7332.
2005-09-13 02:00:28 +00:00
Galen Shipman
d932cfd342
merge of rcache work into the trunk.. lotsa fun ;-)..
I regression tested before the merge, I will regression test tonight and
correct issues that might have crept in.
This commit was SVN r7329.
2005-09-12 22:28:23 +00:00
Brian Barrett
4c62c356c7
* more missing header file recovery
This commit was SVN r7328.
2005-09-12 22:13:09 +00:00
George Bosilca
8308ab42e9
GM depends on proc.h now.
This commit was SVN r7327.
2005-09-12 21:52:44 +00:00
Brian Barrett
88cd561198
* bunch of fixes for Red Storm - missing header files and the like
This commit was SVN r7325.
2005-09-12 21:45:58 +00:00
Tim Woodall
304f6254e6
additional btl flags
This commit was SVN r7324.
2005-09-12 21:38:31 +00:00
Brian Barrett
79f7ea6856
* implement btl_put for Portals
This commit was SVN r7320.
2005-09-12 20:24:43 +00:00
George Bosilca
c9fb1f32f2
And more dependencies fixes. The big commit will follow shortly.
This commit was SVN r7319.
2005-09-12 20:22:59 +00:00
George Bosilca
1b031c153b
Last commit to make the threaded case compile without warnings. Next step: try to make it work ...
Correct the source of the vpid problem (similar to the one in the SM PTL).
Add one more argument to the MCA_BTL_SM_FIFO_WRITE macro, which is passed down to the
MCA_BTL_SM_SIGNAL_PEER macro so that it has access to the fifo_fd file descriptor.
This commit was SVN r7305.
2005-09-11 20:55:22 +00:00
George Bosilca
f8d9f6121c
Typo correction ...
This commit was SVN r7303.
2005-09-11 20:49:27 +00:00
George Bosilca
c24eb702bb
Correctly compute the default sizes for the fragments.
This commit was SVN r7299.
2005-09-11 20:02:55 +00:00
Jeff Squyres
4aa75fa739
- Make opal_output_stream_t be a real opal_object_t so that it can use
a constructor, like the rest of the code base
- Convert usage in the tree to use the constructor to zero out an
instance of opal_output_stream_t
- Still need to re-enable output files
This commit was SVN r7253.
2005-09-09 10:46:54 +00:00
Tim Woodall
59f2462ef0
corrections/clarifications
This commit was SVN r7215.
2005-09-07 13:40:22 +00:00
Tim Woodall
3e002203a0
dont need to adjust size
This commit was SVN r7213.
2005-09-07 13:25:05 +00:00
Brian Barrett
ed56e743b7
* update configure.ac to use the modern version of AC_INIT and
AM_INIT_AUTOMAKE, instead of the deprecated version.
* Work around dumbness in modern AC_INIT that requires the version
number to be set at autoconf time (instead of at configure time, as
it was before). Set the version number, minus the subversion r number,
at autoconf time. Override the internal variables to include the r
number (if needed) at configure time. Basically, the right thing
should always happen. The only place it might not is the version
reported as part of configure --help will not have an r number.
* Since AM_INIT_AUTOMAKE takes a list of options, no need to specify
them in all the Makefile.am files.
* Adds support for subdir-objects, meaning that object files are put
in the directory containing source files, even if the Makefile.am is
in another directory. This should start making it feasible to
reduce the number of Makefile.am files we have in the tree, which
will greatly reduce the time to run autogen and configure.
This commit was SVN r7211.
2005-09-07 05:54:53 +00:00
Galen Shipman
e5ea1b55ef
fix for threaded build
This commit was SVN r7194.
2005-09-06 15:21:31 +00:00
Brian Barrett
6f19022db9
* Update Portals configuration to use --with-portals instead of
--with-btl-portals
* Update Red Storm build config file to match the change
This commit was SVN r7185.
2005-09-05 21:02:50 +00:00
George Bosilca
3078be40aa
First stable version of the MX BTL (at least we pass NetPIPE). The performance is not amazing
but is not that bad either.
With 2 procs on an Intel(R) Xeon(TM) CPU 3.20GHz with MYRICOM Inc. Myrinet 2000 Scalable Cluster Interconnect (rev 04) I get:
0: 1 bytes 13096 times --> 1.10 Mbps in 6.94 usec
1: 2 bytes 14408 times --> 2.17 Mbps in 7.02 usec
2: 3 bytes 14243 times --> 3.24 Mbps in 7.07 usec
3: 4 bytes 9428 times --> 4.27 Mbps in 7.15 usec
4: 6 bytes 10493 times --> 6.26 Mbps in 7.32 usec
5: 8 bytes 6834 times --> 8.18 Mbps in 7.47 usec
6: 12 bytes 8371 times --> 11.89 Mbps in 7.70 usec
7: 13 bytes 5411 times --> 12.72 Mbps in 7.80 usec
8: 16 bytes 5919 times --> 15.35 Mbps in 7.95 usec
9: 19 bytes 7074 times --> 17.66 Mbps in 8.21 usec
10: 21 bytes 7696 times --> 19.00 Mbps in 8.43 usec
11: 24 bytes 7906 times --> 20.87 Mbps in 8.77 usec
12: 27 bytes 8073 times --> 23.05 Mbps in 8.94 usec
13: 29 bytes 4972 times --> 24.32 Mbps in 9.10 usec
14: 32 bytes 5307 times --> 26.29 Mbps in 9.29 usec
15: 35 bytes 5720 times --> 33.61 Mbps in 7.95 usec
16: 45 bytes 7191 times --> 39.50 Mbps in 8.69 usec
17: 48 bytes 7670 times --> 41.33 Mbps in 8.86 usec
18: 51 bytes 7759 times --> 42.80 Mbps in 9.09 usec
19: 61 bytes 4313 times --> 47.44 Mbps in 9.81 usec
20: 64 bytes 5012 times --> 57.61 Mbps in 8.48 usec
21: 67 bytes 6083 times --> 59.31 Mbps in 8.62 usec
22: 93 bytes 6234 times --> 68.08 Mbps in 10.42 usec
23: 96 bytes 6396 times --> 80.65 Mbps in 9.08 usec
24: 99 bytes 7455 times --> 81.56 Mbps in 9.26 usec
25: 125 bytes 3926 times --> 112.46 Mbps in 8.48 usec
26: 128 bytes 5848 times --> 116.87 Mbps in 8.36 usec
27: 131 bytes 6077 times --> 119.22 Mbps in 8.38 usec
28: 189 bytes 6192 times --> 163.79 Mbps in 8.80 usec
29: 192 bytes 7572 times --> 168.01 Mbps in 8.72 usec
30: 195 bytes 7705 times --> 171.13 Mbps in 8.69 usec
31: 253 bytes 4011 times --> 210.21 Mbps in 9.18 usec
32: 256 bytes 5423 times --> 214.55 Mbps in 9.10 usec
33: 259 bytes 5535 times --> 217.64 Mbps in 9.08 usec
34: 381 bytes 5613 times --> 290.55 Mbps in 10.00 usec
35: 384 bytes 6663 times --> 296.11 Mbps in 9.89 usec
36: 387 bytes 6764 times --> 298.74 Mbps in 9.88 usec
37: 509 bytes 3451 times --> 353.78 Mbps in 10.98 usec
38: 512 bytes 4546 times --> 359.36 Mbps in 10.87 usec
39: 515 bytes 4617 times --> 361.53 Mbps in 10.87 usec
40: 765 bytes 4645 times --> 461.41 Mbps in 12.65 usec
41: 768 bytes 5270 times --> 468.59 Mbps in 12.50 usec
42: 771 bytes 5341 times --> 470.16 Mbps in 12.51 usec
43: 1021 bytes 2695 times --> 508.42 Mbps in 15.32 usec
44: 1024 bytes 3260 times --> 514.44 Mbps in 15.19 usec
45: 1027 bytes 3298 times --> 515.72 Mbps in 15.19 usec
46: 1533 bytes 3307 times --> 707.12 Mbps in 16.54 usec
47: 1536 bytes 4030 times --> 714.93 Mbps in 16.39 usec
48: 1539 bytes 4071 times --> 714.41 Mbps in 16.44 usec
49: 2045 bytes 2040 times --> 761.38 Mbps in 20.49 usec
50: 2048 bytes 2438 times --> 769.78 Mbps in 20.30 usec
51: 2051 bytes 2465 times --> 769.78 Mbps in 20.33 usec
52: 3069 bytes 2465 times --> 923.43 Mbps in 25.36 usec
53: 3072 bytes 2629 times --> 928.48 Mbps in 25.24 usec
54: 3075 bytes 2642 times --> 929.07 Mbps in 25.25 usec
55: 4093 bytes 1323 times --> 1012.38 Mbps in 30.85 usec
56: 4096 bytes 1620 times --> 1016.69 Mbps in 30.74 usec
57: 4099 bytes 1627 times --> 1015.16 Mbps in 30.81 usec
58: 6141 bytes 1625 times --> 1171.82 Mbps in 39.98 usec
59: 6144 bytes 1667 times --> 1173.85 Mbps in 39.93 usec
60: 6147 bytes 1669 times --> 1174.44 Mbps in 39.93 usec
61: 8189 bytes 835 times --> 1232.43 Mbps in 50.69 usec
62: 8192 bytes 986 times --> 1234.87 Mbps in 50.61 usec
63: 8195 bytes 988 times --> 1234.85 Mbps in 50.63 usec
64: 12285 bytes 988 times --> 1360.73 Mbps in 68.88 usec
65: 12288 bytes 967 times --> 1364.20 Mbps in 68.72 usec
66: 12291 bytes 970 times --> 1364.56 Mbps in 68.72 usec
67: 16381 bytes 485 times --> 1385.48 Mbps in 90.21 usec
68: 16384 bytes 554 times --> 1388.76 Mbps in 90.01 usec
69: 16387 bytes 555 times --> 1388.41 Mbps in 90.05 usec
70: 24573 bytes 555 times --> 1499.72 Mbps in 125.01 usec
71: 24576 bytes 533 times --> 1499.36 Mbps in 125.05 usec
72: 24579 bytes 533 times --> 1500.44 Mbps in 124.98 usec
73: 32765 bytes 266 times --> 1499.31 Mbps in 166.73 usec
74: 32768 bytes 299 times --> 1497.10 Mbps in 166.99 usec
75: 32771 bytes 299 times --> 1495.29 Mbps in 167.21 usec
76: 49149 bytes 299 times --> 1528.78 Mbps in 245.28 usec
77: 49152 bytes 271 times --> 1527.97 Mbps in 245.42 usec
78: 49155 bytes 271 times --> 1529.35 Mbps in 245.22 usec
79: 65533 bytes 135 times --> 1586.19 Mbps in 315.21 usec
80: 65536 bytes 158 times --> 1591.11 Mbps in 314.25 usec
81: 65539 bytes 159 times --> 1586.50 Mbps in 315.17 usec
82: 98301 bytes 158 times --> 1668.05 Mbps in 449.61 usec
83: 98304 bytes 148 times --> 1667.40 Mbps in 449.80 usec
84: 98307 bytes 148 times --> 1667.29 Mbps in 449.84 usec
85: 131069 bytes 74 times --> 1709.11 Mbps in 585.09 usec
86: 131072 bytes 85 times --> 1711.09 Mbps in 584.42 usec
87: 131075 bytes 85 times --> 1710.92 Mbps in 584.49 usec
88: 196605 bytes 85 times --> 1727.93 Mbps in 868.08 usec
89: 196608 bytes 76 times --> 1726.28 Mbps in 868.92 usec
90: 196611 bytes 76 times --> 1727.06 Mbps in 868.54 usec
91: 262141 bytes 38 times --> 1757.65 Mbps in 1137.87 usec
92: 262144 bytes 43 times --> 1758.69 Mbps in 1137.21 usec
93: 262147 bytes 43 times --> 1759.38 Mbps in 1136.78 usec
94: 393213 bytes 43 times --> 1801.51 Mbps in 1665.25 usec
95: 393216 bytes 40 times --> 1803.26 Mbps in 1663.65 usec
96: 393219 bytes 40 times --> 1800.73 Mbps in 1666.00 usec
97: 524285 bytes 20 times --> 1805.33 Mbps in 2215.65 usec
98: 524288 bytes 22 times --> 1806.80 Mbps in 2213.86 usec
99: 524291 bytes 22 times --> 1805.77 Mbps in 2215.14 usec
100: 786429 bytes 22 times --> 1827.24 Mbps in 3283.64 usec
101: 786432 bytes 20 times --> 1827.03 Mbps in 3284.03 usec
102: 786435 bytes 20 times --> 1827.20 Mbps in 3283.73 usec
103: 1048573 bytes 10 times --> 1840.05 Mbps in 4347.71 usec
104: 1048576 bytes 11 times --> 1839.68 Mbps in 4348.58 usec
105: 1048579 bytes 11 times --> 1840.13 Mbps in 4347.54 usec
106: 1572861 bytes 11 times --> 1853.99 Mbps in 6472.50 usec
107: 1572864 bytes 10 times --> 1854.11 Mbps in 6472.10 usec
108: 1572867 bytes 10 times --> 1854.12 Mbps in 6472.10 usec
109: 2097149 bytes 5 times --> 1861.41 Mbps in 8595.61 usec
110: 2097152 bytes 5 times --> 1861.25 Mbps in 8596.40 usec
111: 2097155 bytes 5 times --> 1860.99 Mbps in 8597.59 usec
112: 3145725 bytes 5 times --> 1868.34 Mbps in 12845.59 usec
113: 3145728 bytes 5 times --> 1868.30 Mbps in 12845.90 usec
114: 3145731 bytes 5 times --> 1868.59 Mbps in 12843.89 usec
115: 4194301 bytes 3 times --> 1872.16 Mbps in 17092.51 usec
116: 4194304 bytes 3 times --> 1872.31 Mbps in 17091.19 usec
117: 4194307 bytes 3 times --> 1872.13 Mbps in 17092.82 usec
118: 6291453 bytes 3 times --> 1875.88 Mbps in 25588.00 usec
119: 6291456 bytes 3 times --> 1875.98 Mbps in 25586.68 usec
120: 6291459 bytes 3 times --> 1875.93 Mbps in 25587.36 usec
121: 8388605 bytes 3 times --> 1877.79 Mbps in 34082.69 usec
122: 8388608 bytes 3 times --> 1877.72 Mbps in 34083.84 usec
123: 8388611 bytes 3 times --> 1877.66 Mbps in 34085.00 usec
This commit was SVN r7180.
2005-09-04 22:08:13 +00:00
Tim Woodall
b65dc08ab1
counters need to be signed as we check for <0
This commit was SVN r7155.
2005-09-02 18:26:07 +00:00
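The point of r7155 (counters must be signed when the code checks for `< 0`) can be shown with a small sketch. The names `take_token` and `decrement_unsigned` are hypothetical and not taken from the Open MPI source; the sketch only contrasts a signed decrement-and-test with what an unsigned counter does at zero.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Try to take one token. Returns 1 on success, 0 when none are left. */
static int take_token(int32_t *tokens)
{
    assert(tokens != NULL);
    if (--(*tokens) < 0) {   /* meaningful only because the type is signed */
        ++(*tokens);         /* undo the decrement */
        return 0;
    }
    return 1;
}

/* What an unsigned counter does instead: decrementing 0 wraps around to
 * UINT32_MAX, so a "< 0" test on it could never be true. */
static uint32_t decrement_unsigned(uint32_t tokens)
{
    return tokens - 1u;
}
```

With an unsigned counter the exhausted case would look like a very large token count, which is why the check only works on a signed type.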
Tim Woodall
dfe52fceef
minor changes to thread locking
This commit was SVN r7154.
2005-09-02 16:27:01 +00:00
Galen Shipman
589b1b8b5a
Additional changes to add_proc and tokens
This commit was SVN r7152.
2005-09-02 15:18:36 +00:00
Galen Shipman
a7a4da4502
Scale the SRQ based on the log base 2 of the number of peers.
This assumes that the peers have all been added via add_procs up front.
Bad things will happen if add_procs is called again later on a new set of
procs. To fix this we need to modify the SRQ, which may wreck things. Looking
into this deeper.
This commit was SVN r7142.
2005-09-02 04:06:51 +00:00
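r7142 does not state the exact scaling formula, so the following is only a sketch of one way to scale a queue depth by the log base 2 of the peer count. The rule used here (a base depth multiplied by the rounded-up log2 of the number of peers, with a minimum of one) is an assumption, and `ceil_log2` and `scaled_srq_depth` are hypothetical names, not functions from the BTL.

```c
#include <assert.h>

/* Smallest k such that 2^k >= n, for n >= 1. */
static unsigned ceil_log2(unsigned n)
{
    assert(n > 0);
    unsigned bits = 0;
    unsigned v = n - 1;
    while (v > 0) {
        bits++;
        v >>= 1;
    }
    return bits;
}

/* Assumed scaling rule (not from the commit): base_depth entries per
 * "level" of log2(num_peers), with at least one level. */
static unsigned scaled_srq_depth(unsigned base_depth, unsigned num_peers)
{
    unsigned levels = ceil_log2(num_peers);
    if (levels == 0) {
        levels = 1; /* a single peer still needs base_depth entries */
    }
    return base_depth * levels;
}
```

Because the depth is computed once from the peer count, a later add_procs call with more peers would leave the queue undersized, which matches the concern raised in the commit message.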
Galen Shipman
c8a23106c0
More fixes for sq tokens,
Additional work on multi-rail support.
This commit was SVN r7139.
2005-09-02 03:04:28 +00:00
Tim Woodall
636ab23fdb
atomic increment/test
This commit was SVN r7130.
2005-09-01 15:09:50 +00:00
Jeff Squyres
3962c53e2e
- Add to AM_CPPFLAGS $(OPAL_LTDL_CPPFLAGS) where necessary in order to
add a -I to find the included ltdl.h (vs. a system-installed ltdl.h)
- Clean up cruft in a bunch of Makefile.am's to remove now-unnecessary
AM_CPPFLAGS settings to get static-components.h for each framework
- Move the component_repository API functions out of opal/mca/base/base.h
and into opal/mca/base/mca_base_component_repository.h in order to
decrease unnecessary dependencies (e.g., before this, almost
everything in the tree depended on ltdl.h, which is unnecessary --
only a small number of files really need ltdl.h)
This commit was SVN r7127.
2005-09-01 12:16:36 +00:00