George Bosilca
5df64827c8
The MX BTL and PTL does not require the same version of the MX library. The BTL accept the
...
default version while the PTL require a patched version (iwith the mx_register_match_callback function).
This commit was SVN r7188.
2005-09-06 03:11:42 +00:00
Brian Barrett
444f92e957
* add a fairly hard to fix blocks in use report from the event library to
...
the OMPI suppressions file. Yeah, we could fix it, but that would mean
threading a finalize routine through the event library...
This commit was SVN r7187.
2005-09-05 21:04:07 +00:00
Brian Barrett
4e20c83204
* fix memory leak in ORTE startup
...
This commit was SVN r7186.
2005-09-05 21:03:02 +00:00
Brian Barrett
6f19022db9
* Update Portals configuration to use --with-portals instead of
...
--with-btl-portals
* Update Red Storm build config file tomatch change
This commit was SVN r7185.
2005-09-05 21:02:50 +00:00
Jeff Squyres
3645816aef
- Add copyrights
...
- Minor style fixes
This commit was SVN r7184.
2005-09-05 18:51:59 +00:00
Jeff Squyres
5702c8f29b
Fix help messages and docs about --with arguments
...
This commit was SVN r7183.
2005-09-05 18:51:34 +00:00
Galen Shipman
f541088bc3
typo broke the threaded build..
...
This commit was SVN r7182.
2005-09-05 16:43:01 +00:00
George Bosilca
3078be40aa
First stable version of the MX BTL (at least we pass NetPipe). The perfs are not amazing
...
but are not that bad either.
On a 2 procs Intel(R) Xeon(TM) CPU 3.20GHz with MYRICOM Inc. Myrinet 2000 Scalable Cluster Interconnect (rev 04) I get:
0: 1 bytes 13096 times --> 1.10 Mbps in 6.94 usec
1: 2 bytes 14408 times --> 2.17 Mbps in 7.02 usec
2: 3 bytes 14243 times --> 3.24 Mbps in 7.07 usec
3: 4 bytes 9428 times --> 4.27 Mbps in 7.15 usec
4: 6 bytes 10493 times --> 6.26 Mbps in 7.32 usec
5: 8 bytes 6834 times --> 8.18 Mbps in 7.47 usec
6: 12 bytes 8371 times --> 11.89 Mbps in 7.70 usec
7: 13 bytes 5411 times --> 12.72 Mbps in 7.80 usec
8: 16 bytes 5919 times --> 15.35 Mbps in 7.95 usec
9: 19 bytes 7074 times --> 17.66 Mbps in 8.21 usec
10: 21 bytes 7696 times --> 19.00 Mbps in 8.43 usec
11: 24 bytes 7906 times --> 20.87 Mbps in 8.77 usec
12: 27 bytes 8073 times --> 23.05 Mbps in 8.94 usec
13: 29 bytes 4972 times --> 24.32 Mbps in 9.10 usec
14: 32 bytes 5307 times --> 26.29 Mbps in 9.29 usec
15: 35 bytes 5720 times --> 33.61 Mbps in 7.95 usec
16: 45 bytes 7191 times --> 39.50 Mbps in 8.69 usec
17: 48 bytes 7670 times --> 41.33 Mbps in 8.86 usec
18: 51 bytes 7759 times --> 42.80 Mbps in 9.09 usec
19: 61 bytes 4313 times --> 47.44 Mbps in 9.81 usec
20: 64 bytes 5012 times --> 57.61 Mbps in 8.48 usec
21: 67 bytes 6083 times --> 59.31 Mbps in 8.62 usec
22: 93 bytes 6234 times --> 68.08 Mbps in 10.42 usec
23: 96 bytes 6396 times --> 80.65 Mbps in 9.08 usec
24: 99 bytes 7455 times --> 81.56 Mbps in 9.26 usec
25: 125 bytes 3926 times --> 112.46 Mbps in 8.48 usec
26: 128 bytes 5848 times --> 116.87 Mbps in 8.36 usec
27: 131 bytes 6077 times --> 119.22 Mbps in 8.38 usec
28: 189 bytes 6192 times --> 163.79 Mbps in 8.80 usec
29: 192 bytes 7572 times --> 168.01 Mbps in 8.72 usec
30: 195 bytes 7705 times --> 171.13 Mbps in 8.69 usec
31: 253 bytes 4011 times --> 210.21 Mbps in 9.18 usec
32: 256 bytes 5423 times --> 214.55 Mbps in 9.10 usec
33: 259 bytes 5535 times --> 217.64 Mbps in 9.08 usec
34: 381 bytes 5613 times --> 290.55 Mbps in 10.00 usec
35: 384 bytes 6663 times --> 296.11 Mbps in 9.89 usec
36: 387 bytes 6764 times --> 298.74 Mbps in 9.88 usec
37: 509 bytes 3451 times --> 353.78 Mbps in 10.98 usec
38: 512 bytes 4546 times --> 359.36 Mbps in 10.87 usec
39: 515 bytes 4617 times --> 361.53 Mbps in 10.87 usec
40: 765 bytes 4645 times --> 461.41 Mbps in 12.65 usec
41: 768 bytes 5270 times --> 468.59 Mbps in 12.50 usec
42: 771 bytes 5341 times --> 470.16 Mbps in 12.51 usec
43: 1021 bytes 2695 times --> 508.42 Mbps in 15.32 usec
44: 1024 bytes 3260 times --> 514.44 Mbps in 15.19 usec
45: 1027 bytes 3298 times --> 515.72 Mbps in 15.19 usec
46: 1533 bytes 3307 times --> 707.12 Mbps in 16.54 usec
47: 1536 bytes 4030 times --> 714.93 Mbps in 16.39 usec
48: 1539 bytes 4071 times --> 714.41 Mbps in 16.44 usec
49: 2045 bytes 2040 times --> 761.38 Mbps in 20.49 usec
50: 2048 bytes 2438 times --> 769.78 Mbps in 20.30 usec
51: 2051 bytes 2465 times --> 769.78 Mbps in 20.33 usec
52: 3069 bytes 2465 times --> 923.43 Mbps in 25.36 usec
53: 3072 bytes 2629 times --> 928.48 Mbps in 25.24 usec
54: 3075 bytes 2642 times --> 929.07 Mbps in 25.25 usec
55: 4093 bytes 1323 times --> 1012.38 Mbps in 30.85 usec
56: 4096 bytes 1620 times --> 1016.69 Mbps in 30.74 usec
57: 4099 bytes 1627 times --> 1015.16 Mbps in 30.81 usec
58: 6141 bytes 1625 times --> 1171.82 Mbps in 39.98 usec
59: 6144 bytes 1667 times --> 1173.85 Mbps in 39.93 usec
60: 6147 bytes 1669 times --> 1174.44 Mbps in 39.93 usec
61: 8189 bytes 835 times --> 1232.43 Mbps in 50.69 usec
62: 8192 bytes 986 times --> 1234.87 Mbps in 50.61 usec
63: 8195 bytes 988 times --> 1234.85 Mbps in 50.63 usec
64: 12285 bytes 988 times --> 1360.73 Mbps in 68.88 usec
65: 12288 bytes 967 times --> 1364.20 Mbps in 68.72 usec
66: 12291 bytes 970 times --> 1364.56 Mbps in 68.72 usec
67: 16381 bytes 485 times --> 1385.48 Mbps in 90.21 usec
68: 16384 bytes 554 times --> 1388.76 Mbps in 90.01 usec
69: 16387 bytes 555 times --> 1388.41 Mbps in 90.05 usec
70: 24573 bytes 555 times --> 1499.72 Mbps in 125.01 usec
71: 24576 bytes 533 times --> 1499.36 Mbps in 125.05 usec
72: 24579 bytes 533 times --> 1500.44 Mbps in 124.98 usec
73: 32765 bytes 266 times --> 1499.31 Mbps in 166.73 usec
74: 32768 bytes 299 times --> 1497.10 Mbps in 166.99 usec
75: 32771 bytes 299 times --> 1495.29 Mbps in 167.21 usec
76: 49149 bytes 299 times --> 1528.78 Mbps in 245.28 usec
77: 49152 bytes 271 times --> 1527.97 Mbps in 245.42 usec
78: 49155 bytes 271 times --> 1529.35 Mbps in 245.22 usec
79: 65533 bytes 135 times --> 1586.19 Mbps in 315.21 usec
80: 65536 bytes 158 times --> 1591.11 Mbps in 314.25 usec
81: 65539 bytes 159 times --> 1586.50 Mbps in 315.17 usec
82: 98301 bytes 158 times --> 1668.05 Mbps in 449.61 usec
83: 98304 bytes 148 times --> 1667.40 Mbps in 449.80 usec
84: 98307 bytes 148 times --> 1667.29 Mbps in 449.84 usec
85: 131069 bytes 74 times --> 1709.11 Mbps in 585.09 usec
86: 131072 bytes 85 times --> 1711.09 Mbps in 584.42 usec
87: 131075 bytes 85 times --> 1710.92 Mbps in 584.49 usec
88: 196605 bytes 85 times --> 1727.93 Mbps in 868.08 usec
89: 196608 bytes 76 times --> 1726.28 Mbps in 868.92 usec
90: 196611 bytes 76 times --> 1727.06 Mbps in 868.54 usec
91: 262141 bytes 38 times --> 1757.65 Mbps in 1137.87 usec
92: 262144 bytes 43 times --> 1758.69 Mbps in 1137.21 usec
93: 262147 bytes 43 times --> 1759.38 Mbps in 1136.78 usec
94: 393213 bytes 43 times --> 1801.51 Mbps in 1665.25 usec
95: 393216 bytes 40 times --> 1803.26 Mbps in 1663.65 usec
96: 393219 bytes 40 times --> 1800.73 Mbps in 1666.00 usec
97: 524285 bytes 20 times --> 1805.33 Mbps in 2215.65 usec
98: 524288 bytes 22 times --> 1806.80 Mbps in 2213.86 usec
99: 524291 bytes 22 times --> 1805.77 Mbps in 2215.14 usec
100: 786429 bytes 22 times --> 1827.24 Mbps in 3283.64 usec
101: 786432 bytes 20 times --> 1827.03 Mbps in 3284.03 usec
102: 786435 bytes 20 times --> 1827.20 Mbps in 3283.73 usec
103: 1048573 bytes 10 times --> 1840.05 Mbps in 4347.71 usec
104: 1048576 bytes 11 times --> 1839.68 Mbps in 4348.58 usec
105: 1048579 bytes 11 times --> 1840.13 Mbps in 4347.54 usec
106: 1572861 bytes 11 times --> 1853.99 Mbps in 6472.50 usec
107: 1572864 bytes 10 times --> 1854.11 Mbps in 6472.10 usec
108: 1572867 bytes 10 times --> 1854.12 Mbps in 6472.10 usec
109: 2097149 bytes 5 times --> 1861.41 Mbps in 8595.61 usec
110: 2097152 bytes 5 times --> 1861.25 Mbps in 8596.40 usec
111: 2097155 bytes 5 times --> 1860.99 Mbps in 8597.59 usec
112: 3145725 bytes 5 times --> 1868.34 Mbps in 12845.59 usec
113: 3145728 bytes 5 times --> 1868.30 Mbps in 12845.90 usec
114: 3145731 bytes 5 times --> 1868.59 Mbps in 12843.89 usec
115: 4194301 bytes 3 times --> 1872.16 Mbps in 17092.51 usec
116: 4194304 bytes 3 times --> 1872.31 Mbps in 17091.19 usec
117: 4194307 bytes 3 times --> 1872.13 Mbps in 17092.82 usec
118: 6291453 bytes 3 times --> 1875.88 Mbps in 25588.00 usec
119: 6291456 bytes 3 times --> 1875.98 Mbps in 25586.68 usec
120: 6291459 bytes 3 times --> 1875.93 Mbps in 25587.36 usec
121: 8388605 bytes 3 times --> 1877.79 Mbps in 34082.69 usec
122: 8388608 bytes 3 times --> 1877.72 Mbps in 34083.84 usec
123: 8388611 bytes 3 times --> 1877.66 Mbps in 34085.00 usec
This commit was SVN r7180.
2005-09-04 22:08:13 +00:00
Rainer Keller
588a62cb90
- Missed file in last commit
...
This commit was SVN r7179.
2005-09-04 20:55:27 +00:00
Rainer Keller
192625d2a1
- Once again: uninteresting cleanup to get diff smaller.
...
This commit was SVN r7178.
2005-09-04 20:54:19 +00:00
Galen Shipman
1af92a1de2
tried to acquire wrong lock..
...
This commit was SVN r7177.
2005-09-04 13:26:46 +00:00
Galen Shipman
ba9342085e
Changes to support rcache in mpool
...
This commit was SVN r7176.
2005-09-04 04:20:55 +00:00
Galen Shipman
dc769e3560
Initial checking of rcache.. svn issues required mult checkins..
...
This commit was SVN r7175.
2005-09-04 04:19:31 +00:00
Galen Shipman
2314919b73
Initial checkin of rcache
...
This commit was SVN r7174.
2005-09-04 04:17:00 +00:00
Brian Barrett
492aeecd11
* track the actual allocations within the ompi and opal free_lists, so that
...
we can free all allocated memory when the free_list is destructed. Fixes
a whole bunch of memory leaks...
This commit was SVN r7173.
2005-09-03 19:46:44 +00:00
Jeff Squyres
bc72a7722b
Updates:
...
- bcast now works properly for root!=0 and multi-fragment messages
- destroy mpool when communicator is destroyed
Still need to implement:
- "in use" flags for groups of fragments so that "wrapping around" in
the data segment doesn't overwrite not-yet-read data
- ensure that shared memory isn't removed before all processes have
finished with it (e.g., during COMM_FREE)
This commit was SVN r7172.
2005-09-03 11:49:46 +00:00
Jeff Squyres
e53fa399dc
Bump the required version of Libtool up to 1.5.16
...
This commit was SVN r7171.
2005-09-03 10:11:30 +00:00
Graham Fagg
059d7f26f8
re-enable proper freeing of allocated structures now that I know these
...
are not causing an error.
This commit was SVN r7170.
2005-09-03 06:19:17 +00:00
Graham Fagg
35447d4c32
underfilling of a only segment and the doing a PML send and esp a recv on it can
...
cause flipping hard to find memory errors...
This commit was SVN r7169.
2005-09-03 06:14:03 +00:00
Graham Fagg
36eddb6609
checkpoint
...
This commit was SVN r7168.
2005-09-03 01:41:13 +00:00
Ralph Castain
12daecb826
More cleanup
...
This commit was SVN r7167.
2005-09-03 01:22:11 +00:00
Ralph Castain
4b5b3b4164
Properly handle the argv array and clean it up when done.
...
This commit was SVN r7166.
2005-09-03 00:15:21 +00:00
Jeff Squyres
99c3edab67
Added mca_mpool_base_module_destroy() function so that you can destroy
...
an mpool module before MPI_FINALIZE (i.e., ompi_mpi_finalize())
This commit was SVN r7165.
2005-09-03 00:07:34 +00:00
Jeff Squyres
1deea5ddb9
Minor mods:
...
- style and error code updates
- use strdup instead of malloc/strcpy
This commit was SVN r7164.
2005-09-03 00:06:20 +00:00
Jeff Squyres
42550d57d1
David made the good point that this is a TCL file, so we should name
...
it ".tcl".
This commit was SVN r7163.
2005-09-02 22:39:15 +00:00
Jeff Squyres
0add040f1b
Ensure to ship openmpi-totalview.rc.
...
This commit was SVN r7162.
2005-09-02 22:10:46 +00:00
Jeff Squyres
fb4de5aec5
Totalview startup script that makes Totalview not display the source
...
code for orterun. Users can either copy this file to $HOME/.tvdrc or
they can source it from their $HOME/.tvdrc:
source /path/to/openmpi-totalview.rc
This commit was SVN r7161.
2005-09-02 22:07:24 +00:00
Josh Hursey
78da530fd2
Fix a bug that Tim highlighted in which orted coredumps when an orterun is
...
CTRL-C'd.
We were calling orte_finalize recursively which caused a segv when it tried to
use a freed framework (orte_rmgr in this case).
I added a status flag to orte_universe_info to indicate where we are in the code.
This was needed to determine if we should call orte_abort or not when shutting
down in the tcp oob.
This commit was SVN r7160.
2005-09-02 21:07:21 +00:00
Ralph Castain
0f797fd40b
Few more cleanups
...
This commit was SVN r7159.
2005-09-02 21:01:59 +00:00
Ralph Castain
4ed7752681
Continue cleaning up memory leaks during launch
...
This commit was SVN r7158.
2005-09-02 20:41:24 +00:00
Ralph Castain
f352890732
Cleaning up memory leaks for proxy operations.
...
This commit was SVN r7157.
2005-09-02 19:26:21 +00:00
Ralph Castain
4bd25e0292
Few minor memory leak cleanups
...
This commit was SVN r7156.
2005-09-02 18:50:01 +00:00
Tim Woodall
b65dc08ab1
counters need to be signed as we check for <0
...
This commit was SVN r7155.
2005-09-02 18:26:07 +00:00
Tim Woodall
dfe52fceef
minor changes to thread locking
...
This commit was SVN r7154.
2005-09-02 16:27:01 +00:00
Brian Barrett
a9863e5ff5
* properly construct / destruct the condition variable in the free list
...
structures
This commit was SVN r7153.
2005-09-02 16:00:42 +00:00
Galen Shipman
589b1b8b5a
Additional changes to add_proc and tokens
...
This commit was SVN r7152.
2005-09-02 15:18:36 +00:00
Tim Woodall
4fbf7965e3
check for rdma btls when leave pinned is set
...
This commit was SVN r7151.
2005-09-02 15:04:47 +00:00
Jeff Squyres
647f6a7648
Update data structures
...
This commit was SVN r7150.
2005-09-02 14:53:14 +00:00
Jeff Squyres
6ef4805729
Tree-based barrier and broadcast seem to be working. Now on to reduce
...
/ allreduce...
This commit was SVN r7149.
2005-09-02 12:57:47 +00:00
Jeff Squyres
a8f9df9951
Change the API slightly so that it is assumed that each process will
...
be called with a description of its memory segments to make local. It
is a small enough API that changing to support a
one-process-does-all-assignment model is simple enough if we ever need
it.
This commit was SVN r7148.
2005-09-02 12:57:02 +00:00
Jeff Squyres
560f323048
Update to ensure that the macros AC_REQUIRE'd by AC_CHECK_HEADER are
...
always invoked properly.
This commit was SVN r7147.
2005-09-02 11:01:07 +00:00
Jeff Squyres
a7fbb0f95e
Put in comments about why these assignments exist
...
This commit was SVN r7146.
2005-09-02 10:27:23 +00:00
Jeff Squyres
7e4f696501
Fix silly compiler warnings
...
This commit was SVN r7145.
2005-09-02 10:26:41 +00:00
Jeff Squyres
99db862e9b
Fix silly compiler warning
...
This commit was SVN r7144.
2005-09-02 10:18:06 +00:00
Jeff Squyres
cf19709d89
Oops -- get the TV API prefix right.
...
This commit was SVN r7143.
2005-09-02 10:16:59 +00:00
Galen Shipman
a7a4da4502
Scale the SRQ based on the log base 2 of the number of peers,
...
this assumes that the peers have all been added via add_procs up front.
Bad things will happen if add_procs is called again later on a new set of
procs to fix this we need to modify the srq which may wreck things.. looking
into this deeper..
This commit was SVN r7142.
2005-09-02 04:06:51 +00:00
Brian Barrett
827122fc62
* order libraries in the correct order so that opal_error test works
...
properly
This commit was SVN r7141.
2005-09-02 03:28:38 +00:00
Brian Barrett
7e9b777ead
* there's no var_in variable for the component-specific variables, so don't
...
have on there. Fixes static builds with GM.
This commit was SVN r7140.
2005-09-02 03:05:38 +00:00
Galen Shipman
c8a23106c0
More fixes for sq tokens,
...
Additional work on multi-rail support.
This commit was SVN r7139.
2005-09-02 03:04:28 +00:00
Graham Fagg
25787c4ca7
added segmented binary
...
renamed split so we know its a split binary tree
(tested, chain/bin/splitbin all pass simple and ibm tests)
This commit was SVN r7138.
2005-09-02 02:19:59 +00:00