Jeff Squyres
8ec6721f24
Prior fix for the ROMIO re-#define problem didn't really fix the
problem because Autoconf replaced the "#undef ..." with "#define
...". Fix this by not putting the "#undef ..." statement directly in
romioconf.h[.in] -- but rather having romioconf.h[.in] #include
romioconf-undefs.h, which has the #undef statements.
This commit was SVN r7252.
2005-09-09 10:33:04 +00:00
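The commit above relies on the fact that only the config-header template is rewritten by configure; a plain header that it #includes is left alone, so an "#undef" placed there survives. A minimal sketch of the resulting include order, collapsed into one file for illustration. PACKAGE_NAME and its values are assumed examples, not the actual macros touched by the commit:

```c
#include <assert.h>
#include <string.h>

/* Stand-in for a macro already defined by the enclosing project's
 * Autoconf-generated header (PACKAGE_NAME is an assumed example). */
#define PACKAGE_NAME "outer-project"

/* --- what would live in romioconf-undefs.h: a plain header, not a
 *     template, so the #undef is kept exactly as written --- */
#undef PACKAGE_NAME

/* --- what romioconf.h would then go on to define for itself --- */
#define PACKAGE_NAME "romio"

/* Returns whichever definition finally won. */
static const char *effective_package_name(void)
{
    assert(strlen(PACKAGE_NAME) > 0);
    return PACKAGE_NAME;
}
```

Without the intermediate #undef, the second #define would redefine the macro with a different value, which is the re-#define conflict the commit describes.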
Graham Fagg
9596d6e2f3
use correct names or cannot call functions...
use correct ddt calls (and params ext vs lb etc)
This commit was SVN r7243.
2005-09-09 02:39:57 +00:00
Galen Shipman
e2186a1cc2
Updates to the rb rcache..
This commit was SVN r7238.
2005-09-08 21:34:26 +00:00
Galen Shipman
ec2b99986f
remove rcache_base_init.c no longer needed..
This commit was SVN r7237.
2005-09-08 16:31:17 +00:00
Jeff Squyres
e8b120fbf2
- Add some #undef's so that we don't conflict with the
AC-automatically-generated macros
- Remove a useless #include
This commit was SVN r7228.
2005-09-08 09:42:19 +00:00
Galen Shipman
6c8e9ab661
more rcache changes
This commit was SVN r7227.
2005-09-07 20:37:17 +00:00
Galen Shipman
a0b82acece
Add fleshed out open,create and close
This commit was SVN r7226.
2005-09-07 20:21:15 +00:00
Jeff Squyres
b31c2f8e77
Remove unnecessary AM_CPPFLAGS
This commit was SVN r7221.
2005-09-07 18:40:56 +00:00
George Bosilca
105595bd08
Make rcache look like a component. The problem is that on Mac OS X ar is more sensitive
than on other Unix OSes and does not accept being called without any arguments. Therefore,
we need these files in order to successfully compile even if they are empty.
This commit was SVN r7220.
2005-09-07 18:09:04 +00:00
George Bosilca
23cb29aefe
The src directory is not used anymore but it's still present in the configure.params file. Anyway I don't think this file is used as the .h file does not exist ...
This commit was SVN r7219.
2005-09-07 17:16:41 +00:00
Jeff Squyres
881851604b
Update the docs on the actual algorithms used
This commit was SVN r7216.
2005-09-07 15:46:33 +00:00
Tim Woodall
59f2462ef0
corrections/clarifications
This commit was SVN r7215.
2005-09-07 13:40:22 +00:00
Jeff Squyres
9302f924ea
simplify the bcast code by taking abstract actions and making them
macros -- will help with the other algorithms
This commit was SVN r7214.
2005-09-07 13:33:43 +00:00
Tim Woodall
3e002203a0
don't need to adjust size
This commit was SVN r7213.
2005-09-07 13:25:05 +00:00
Brian Barrett
ed56e743b7
* update configure.ac to use the modern version of AC_INIT and
AM_INIT_AUTOMAKE, instead of the deprecated version.
* Work around dumbness in modern AC_INIT that requires the version
number to be set at autoconf time (instead of at configure time, as
it was before). Set the version number, minus the subversion r number,
at autoconf time. Override the internal variables to include the r
number (if needed) at configure time. Basically, the right thing
should always happen. The only place it might not is the version
reported as part of configure --help will not have an r number.
* Since AM_INIT_AUTOMAKE takes a list of options, no need to specify
them in all the Makefile.am files.
* Adds support for subdir-objects, meaning that object files are put
in the directory containing source files, even if the Makefile.am is
in another directory. This should start making it feasible to
reduce the number of Makefile.am files we have in the tree, which
will greatly reduce the time to run autogen and configure.
This commit was SVN r7211.
2005-09-07 05:54:53 +00:00
Jeff Squyres
7bab4ed269
bunches of updates
- finally added "in use" flags -- one flag protects a set of segments
- these flags now used in bcast to protect (for example) when a
message is so long that the root loops around the segments and has
to re-use old segments -- now it knows that it has to wait until the
non-root processes have finished with that set of segments before it
can start using them
- implement allreduce as a reduce followed by a bcast (per discussion
with rich)
- removed some redundant data on various data structures
- implemented query MCA param ("coll_sm_shared_mem_used_data") that
tells you how much shared memory will be used for a given set of MCA
params (e.g., number of segments, etc.). For example:
ompi_info --mca coll_sm_info_num_procs 4 --param coll sm | \
grep shared_mem_used_data
tells you that for the default MCA param values (as of r7172), for 4
processes, sm will use 548864 bytes of shared memory for its data
transfer section
- remove a bunch of .c files from the Makefile.am that aren't
implemented yet (i.e., all they do is return ERR_NOT_IMPLEMENTED)
Now on to the big Altix to test that this stuff really works...
This commit was SVN r7205.
The following SVN revision numbers were found above:
r7172 --> open-mpi/ompi@bc72a7722b
2005-09-06 21:41:55 +00:00
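The "in use" flags described in the commit above can be pictured as one counter per set of segments: the root may reuse a set only once every non-root process has finished with it. A single-process sketch of that bookkeeping follows. All names (seg_ring_t, root_claim, reader_done, NUM_SETS) are hypothetical and are not taken from the coll sm source; real shared-memory code would also need atomic updates and memory barriers on the flags.

```c
#include <assert.h>
#include <stddef.h>

#define NUM_SETS 2   /* hypothetical: number of segment sets in the ring */

/* One flag per set of segments: how many non-root processes still
 * have to finish with the set before the root may overwrite it. */
typedef struct {
    int    readers_pending[NUM_SETS];
    size_t next_set;   /* set the root will try to use next */
} seg_ring_t;

static void ring_init(seg_ring_t *r)
{
    for (size_t i = 0; i < NUM_SETS; ++i) {
        r->readers_pending[i] = 0;
    }
    r->next_set = 0;
}

/* Root side: claim the next set for writing.  Returns the set index,
 * or -1 if that set is still in use and the root has to wait. */
static int root_claim(seg_ring_t *r, int num_readers)
{
    size_t s = r->next_set;
    if (r->readers_pending[s] != 0) {
        return -1;
    }
    r->readers_pending[s] = num_readers;
    r->next_set = (s + 1) % NUM_SETS;
    return (int) s;
}

/* Non-root side: signal that this process has finished with a set. */
static void reader_done(seg_ring_t *r, int set)
{
    assert(set >= 0 && set < NUM_SETS);
    if (r->readers_pending[set] > 0) {
        --r->readers_pending[set];
    }
}
```

With two sets and three non-root processes, the root's third claim wraps around to set 0 and is refused until all three readers have called reader_done on that set.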
Tim Woodall
7ccc43a052
correct for btls that don't support get
This commit was SVN r7201.
2005-09-06 19:01:35 +00:00
Tim Woodall
a3786c4579
don't attempt to deregister if not supported
This commit was SVN r7200.
2005-09-06 18:27:53 +00:00
Tim Woodall
d7d717af46
correction for case where registration not supported
This commit was SVN r7199.
2005-09-06 18:27:07 +00:00
Galen Shipman
e5ea1b55ef
fix for threaded build
This commit was SVN r7194.
2005-09-06 15:21:31 +00:00
Galen Shipman
a47cb00f84
fix bug in threaded build..
This commit was SVN r7193.
2005-09-06 14:37:46 +00:00
Graham Fagg
b97305bbcc
a few more functions (testing)
This commit was SVN r7190.
2005-09-06 05:21:57 +00:00
George Bosilca
5df64827c8
The MX BTL and PTL do not require the same version of the MX library. The BTL accepts the
default version while the PTL requires a patched version (with the mx_register_match_callback function).
This commit was SVN r7188.
2005-09-06 03:11:42 +00:00
Brian Barrett
6f19022db9
* Update Portals configuration to use --with-portals instead of
--with-btl-portals
* Update Red Storm build config file to match change
This commit was SVN r7185.
2005-09-05 21:02:50 +00:00
Galen Shipman
f541088bc3
typo broke the threaded build..
This commit was SVN r7182.
2005-09-05 16:43:01 +00:00
George Bosilca
3078be40aa
First stable version of the MX BTL (at least we pass NetPipe). The perfs are not amazing
but are not that bad either.
On a 2 procs Intel(R) Xeon(TM) CPU 3.20GHz with MYRICOM Inc. Myrinet 2000 Scalable Cluster Interconnect (rev 04) I get:
0: 1 bytes 13096 times --> 1.10 Mbps in 6.94 usec
1: 2 bytes 14408 times --> 2.17 Mbps in 7.02 usec
2: 3 bytes 14243 times --> 3.24 Mbps in 7.07 usec
3: 4 bytes 9428 times --> 4.27 Mbps in 7.15 usec
4: 6 bytes 10493 times --> 6.26 Mbps in 7.32 usec
5: 8 bytes 6834 times --> 8.18 Mbps in 7.47 usec
6: 12 bytes 8371 times --> 11.89 Mbps in 7.70 usec
7: 13 bytes 5411 times --> 12.72 Mbps in 7.80 usec
8: 16 bytes 5919 times --> 15.35 Mbps in 7.95 usec
9: 19 bytes 7074 times --> 17.66 Mbps in 8.21 usec
10: 21 bytes 7696 times --> 19.00 Mbps in 8.43 usec
11: 24 bytes 7906 times --> 20.87 Mbps in 8.77 usec
12: 27 bytes 8073 times --> 23.05 Mbps in 8.94 usec
13: 29 bytes 4972 times --> 24.32 Mbps in 9.10 usec
14: 32 bytes 5307 times --> 26.29 Mbps in 9.29 usec
15: 35 bytes 5720 times --> 33.61 Mbps in 7.95 usec
16: 45 bytes 7191 times --> 39.50 Mbps in 8.69 usec
17: 48 bytes 7670 times --> 41.33 Mbps in 8.86 usec
18: 51 bytes 7759 times --> 42.80 Mbps in 9.09 usec
19: 61 bytes 4313 times --> 47.44 Mbps in 9.81 usec
20: 64 bytes 5012 times --> 57.61 Mbps in 8.48 usec
21: 67 bytes 6083 times --> 59.31 Mbps in 8.62 usec
22: 93 bytes 6234 times --> 68.08 Mbps in 10.42 usec
23: 96 bytes 6396 times --> 80.65 Mbps in 9.08 usec
24: 99 bytes 7455 times --> 81.56 Mbps in 9.26 usec
25: 125 bytes 3926 times --> 112.46 Mbps in 8.48 usec
26: 128 bytes 5848 times --> 116.87 Mbps in 8.36 usec
27: 131 bytes 6077 times --> 119.22 Mbps in 8.38 usec
28: 189 bytes 6192 times --> 163.79 Mbps in 8.80 usec
29: 192 bytes 7572 times --> 168.01 Mbps in 8.72 usec
30: 195 bytes 7705 times --> 171.13 Mbps in 8.69 usec
31: 253 bytes 4011 times --> 210.21 Mbps in 9.18 usec
32: 256 bytes 5423 times --> 214.55 Mbps in 9.10 usec
33: 259 bytes 5535 times --> 217.64 Mbps in 9.08 usec
34: 381 bytes 5613 times --> 290.55 Mbps in 10.00 usec
35: 384 bytes 6663 times --> 296.11 Mbps in 9.89 usec
36: 387 bytes 6764 times --> 298.74 Mbps in 9.88 usec
37: 509 bytes 3451 times --> 353.78 Mbps in 10.98 usec
38: 512 bytes 4546 times --> 359.36 Mbps in 10.87 usec
39: 515 bytes 4617 times --> 361.53 Mbps in 10.87 usec
40: 765 bytes 4645 times --> 461.41 Mbps in 12.65 usec
41: 768 bytes 5270 times --> 468.59 Mbps in 12.50 usec
42: 771 bytes 5341 times --> 470.16 Mbps in 12.51 usec
43: 1021 bytes 2695 times --> 508.42 Mbps in 15.32 usec
44: 1024 bytes 3260 times --> 514.44 Mbps in 15.19 usec
45: 1027 bytes 3298 times --> 515.72 Mbps in 15.19 usec
46: 1533 bytes 3307 times --> 707.12 Mbps in 16.54 usec
47: 1536 bytes 4030 times --> 714.93 Mbps in 16.39 usec
48: 1539 bytes 4071 times --> 714.41 Mbps in 16.44 usec
49: 2045 bytes 2040 times --> 761.38 Mbps in 20.49 usec
50: 2048 bytes 2438 times --> 769.78 Mbps in 20.30 usec
51: 2051 bytes 2465 times --> 769.78 Mbps in 20.33 usec
52: 3069 bytes 2465 times --> 923.43 Mbps in 25.36 usec
53: 3072 bytes 2629 times --> 928.48 Mbps in 25.24 usec
54: 3075 bytes 2642 times --> 929.07 Mbps in 25.25 usec
55: 4093 bytes 1323 times --> 1012.38 Mbps in 30.85 usec
56: 4096 bytes 1620 times --> 1016.69 Mbps in 30.74 usec
57: 4099 bytes 1627 times --> 1015.16 Mbps in 30.81 usec
58: 6141 bytes 1625 times --> 1171.82 Mbps in 39.98 usec
59: 6144 bytes 1667 times --> 1173.85 Mbps in 39.93 usec
60: 6147 bytes 1669 times --> 1174.44 Mbps in 39.93 usec
61: 8189 bytes 835 times --> 1232.43 Mbps in 50.69 usec
62: 8192 bytes 986 times --> 1234.87 Mbps in 50.61 usec
63: 8195 bytes 988 times --> 1234.85 Mbps in 50.63 usec
64: 12285 bytes 988 times --> 1360.73 Mbps in 68.88 usec
65: 12288 bytes 967 times --> 1364.20 Mbps in 68.72 usec
66: 12291 bytes 970 times --> 1364.56 Mbps in 68.72 usec
67: 16381 bytes 485 times --> 1385.48 Mbps in 90.21 usec
68: 16384 bytes 554 times --> 1388.76 Mbps in 90.01 usec
69: 16387 bytes 555 times --> 1388.41 Mbps in 90.05 usec
70: 24573 bytes 555 times --> 1499.72 Mbps in 125.01 usec
71: 24576 bytes 533 times --> 1499.36 Mbps in 125.05 usec
72: 24579 bytes 533 times --> 1500.44 Mbps in 124.98 usec
73: 32765 bytes 266 times --> 1499.31 Mbps in 166.73 usec
74: 32768 bytes 299 times --> 1497.10 Mbps in 166.99 usec
75: 32771 bytes 299 times --> 1495.29 Mbps in 167.21 usec
76: 49149 bytes 299 times --> 1528.78 Mbps in 245.28 usec
77: 49152 bytes 271 times --> 1527.97 Mbps in 245.42 usec
78: 49155 bytes 271 times --> 1529.35 Mbps in 245.22 usec
79: 65533 bytes 135 times --> 1586.19 Mbps in 315.21 usec
80: 65536 bytes 158 times --> 1591.11 Mbps in 314.25 usec
81: 65539 bytes 159 times --> 1586.50 Mbps in 315.17 usec
82: 98301 bytes 158 times --> 1668.05 Mbps in 449.61 usec
83: 98304 bytes 148 times --> 1667.40 Mbps in 449.80 usec
84: 98307 bytes 148 times --> 1667.29 Mbps in 449.84 usec
85: 131069 bytes 74 times --> 1709.11 Mbps in 585.09 usec
86: 131072 bytes 85 times --> 1711.09 Mbps in 584.42 usec
87: 131075 bytes 85 times --> 1710.92 Mbps in 584.49 usec
88: 196605 bytes 85 times --> 1727.93 Mbps in 868.08 usec
89: 196608 bytes 76 times --> 1726.28 Mbps in 868.92 usec
90: 196611 bytes 76 times --> 1727.06 Mbps in 868.54 usec
91: 262141 bytes 38 times --> 1757.65 Mbps in 1137.87 usec
92: 262144 bytes 43 times --> 1758.69 Mbps in 1137.21 usec
93: 262147 bytes 43 times --> 1759.38 Mbps in 1136.78 usec
94: 393213 bytes 43 times --> 1801.51 Mbps in 1665.25 usec
95: 393216 bytes 40 times --> 1803.26 Mbps in 1663.65 usec
96: 393219 bytes 40 times --> 1800.73 Mbps in 1666.00 usec
97: 524285 bytes 20 times --> 1805.33 Mbps in 2215.65 usec
98: 524288 bytes 22 times --> 1806.80 Mbps in 2213.86 usec
99: 524291 bytes 22 times --> 1805.77 Mbps in 2215.14 usec
100: 786429 bytes 22 times --> 1827.24 Mbps in 3283.64 usec
101: 786432 bytes 20 times --> 1827.03 Mbps in 3284.03 usec
102: 786435 bytes 20 times --> 1827.20 Mbps in 3283.73 usec
103: 1048573 bytes 10 times --> 1840.05 Mbps in 4347.71 usec
104: 1048576 bytes 11 times --> 1839.68 Mbps in 4348.58 usec
105: 1048579 bytes 11 times --> 1840.13 Mbps in 4347.54 usec
106: 1572861 bytes 11 times --> 1853.99 Mbps in 6472.50 usec
107: 1572864 bytes 10 times --> 1854.11 Mbps in 6472.10 usec
108: 1572867 bytes 10 times --> 1854.12 Mbps in 6472.10 usec
109: 2097149 bytes 5 times --> 1861.41 Mbps in 8595.61 usec
110: 2097152 bytes 5 times --> 1861.25 Mbps in 8596.40 usec
111: 2097155 bytes 5 times --> 1860.99 Mbps in 8597.59 usec
112: 3145725 bytes 5 times --> 1868.34 Mbps in 12845.59 usec
113: 3145728 bytes 5 times --> 1868.30 Mbps in 12845.90 usec
114: 3145731 bytes 5 times --> 1868.59 Mbps in 12843.89 usec
115: 4194301 bytes 3 times --> 1872.16 Mbps in 17092.51 usec
116: 4194304 bytes 3 times --> 1872.31 Mbps in 17091.19 usec
117: 4194307 bytes 3 times --> 1872.13 Mbps in 17092.82 usec
118: 6291453 bytes 3 times --> 1875.88 Mbps in 25588.00 usec
119: 6291456 bytes 3 times --> 1875.98 Mbps in 25586.68 usec
120: 6291459 bytes 3 times --> 1875.93 Mbps in 25587.36 usec
121: 8388605 bytes 3 times --> 1877.79 Mbps in 34082.69 usec
122: 8388608 bytes 3 times --> 1877.72 Mbps in 34083.84 usec
123: 8388611 bytes 3 times --> 1877.66 Mbps in 34085.00 usec
This commit was SVN r7180.
2005-09-04 22:08:13 +00:00
Galen Shipman
1af92a1de2
tried to acquire wrong lock..
This commit was SVN r7177.
2005-09-04 13:26:46 +00:00
Galen Shipman
ba9342085e
Changes to support rcache in mpool
This commit was SVN r7176.
2005-09-04 04:20:55 +00:00
Galen Shipman
2314919b73
Initial checkin of rcache
This commit was SVN r7174.
2005-09-04 04:17:00 +00:00
Jeff Squyres
bc72a7722b
Updates:
- bcast now works properly for root!=0 and multi-fragment messages
- destroy mpool when communicator is destroyed
Still need to implement:
- "in use" flags for groups of fragments so that "wrapping around" in
the data segment doesn't overwrite not-yet-read data
- ensure that shared memory isn't removed before all processes have
finished with it (e.g., during COMM_FREE)
This commit was SVN r7172.
2005-09-03 11:49:46 +00:00
Graham Fagg
059d7f26f8
re-enable proper freeing of allocated structures now that I know these
are not causing an error.
This commit was SVN r7170.
2005-09-03 06:19:17 +00:00
Graham Fagg
35447d4c32
underfilling of only a segment and then doing a PML send and esp. a recv on it can
cause flipping hard to find memory errors...
This commit was SVN r7169.
2005-09-03 06:14:03 +00:00
Graham Fagg
36eddb6609
checkpoint
This commit was SVN r7168.
2005-09-03 01:41:13 +00:00
Jeff Squyres
99c3edab67
Added mca_mpool_base_module_destroy() function so that you can destroy
an mpool module before MPI_FINALIZE (i.e., ompi_mpi_finalize())
This commit was SVN r7165.
2005-09-03 00:07:34 +00:00
Tim Woodall
b65dc08ab1
counters need to be signed as we check for <0
This commit was SVN r7155.
2005-09-02 18:26:07 +00:00
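The reason a counter that is checked for "<0" has to be signed can be shown in a few lines of C. This is an illustrative sketch only; the function names and the idea of a token counter are assumptions, not the code the commit changed:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Signed counter: going below zero is observable. */
static int take_token_signed(int32_t *tokens)
{
    assert(tokens != NULL);
    *tokens -= 1;
    return *tokens < 0;   /* true when no token was available */
}

/* Unsigned counter: the decrement wraps around to UINT32_MAX, so the
 * "< 0" test can never be true and the shortage goes unnoticed. */
static int take_token_unsigned(uint32_t *tokens)
{
    assert(tokens != NULL);
    *tokens -= 1;
    return *tokens < 0;   /* always false for an unsigned type */
}
```

Starting from zero, the signed version reports the underflow, while the unsigned version silently ends up holding UINT32_MAX. Some compilers warn about the always-false comparison when extra warnings are enabled.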
Tim Woodall
dfe52fceef
minor changes to thread locking
This commit was SVN r7154.
2005-09-02 16:27:01 +00:00
Galen Shipman
589b1b8b5a
Additional changes to add_proc and tokens
This commit was SVN r7152.
2005-09-02 15:18:36 +00:00
Tim Woodall
4fbf7965e3
check for rdma btls when leave pinned is set
This commit was SVN r7151.
2005-09-02 15:04:47 +00:00
Jeff Squyres
647f6a7648
Update data structures
This commit was SVN r7150.
2005-09-02 14:53:14 +00:00
Jeff Squyres
6ef4805729
Tree-based barrier and broadcast seem to be working. Now on to reduce
/ allreduce...
This commit was SVN r7149.
2005-09-02 12:57:47 +00:00
Jeff Squyres
560f323048
Update to ensure that the macros AC_REQUIRE'd by AC_CHECK_HEADER are
always invoked properly.
This commit was SVN r7147.
2005-09-02 11:01:07 +00:00
Galen Shipman
a7a4da4502
Scale the SRQ based on the log base 2 of the number of peers,
this assumes that the peers have all been added via add_procs up front.
Bad things will happen if add_procs is called again later on a new set of
procs; to fix this we need to modify the srq, which may wreck things.. looking
into this deeper..
This commit was SVN r7142.
2005-09-02 04:06:51 +00:00
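Scaling by the log base 2 of the peer count, as the commit above describes, makes the queue grow slowly as peers are added. The commit does not give the actual formula, so the sketch below assumes one possible form (a base depth multiplied by 1 + floor(log2(peers))); the names srq_depth and base_depth are hypothetical:

```c
#include <assert.h>

/* Integer floor(log2(n)); returns 0 for n == 0 or n == 1. */
static unsigned int log2_floor(unsigned int n)
{
    unsigned int r = 0;
    while (n >>= 1) {
        ++r;
    }
    return r;
}

/* Assumed scaling rule: one base_depth worth of entries, plus one more
 * for every doubling of the number of peers known at add_procs time. */
static unsigned int srq_depth(unsigned int base_depth, unsigned int num_peers)
{
    assert(base_depth > 0);
    return base_depth * (1u + log2_floor(num_peers));
}
```

Because the depth is computed once from the peer count, a later add_procs call with additional peers would leave the queue sized for the old count, which is the limitation the commit message notes.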
Galen Shipman
c8a23106c0
More fixes for sq tokens,
Additional work on multi-rail support.
This commit was SVN r7139.
2005-09-02 03:04:28 +00:00
Graham Fagg
25787c4ca7
added segmented binary
renamed split so we know its a split binary tree
(tested, chain/bin/splitbin all pass simple and ibm tests)
This commit was SVN r7138.
2005-09-02 02:19:59 +00:00
Graham Fagg
a7a20e1a57
corrected first seg recv for intermediate nodes (or else we lose a request)
This commit was SVN r7137.
2005-09-02 00:55:47 +00:00
Graham Fagg
d78e9d0e9e
get/handle extent correctly
last rank in comm is not always last in tree structure for nonzero roots
This commit was SVN r7135.
2005-09-01 22:56:47 +00:00
Graham Fagg
f4de8776a6
ops
This commit was SVN r7133.
2005-09-01 17:12:28 +00:00
Tim Woodall
636ab23fdb
atomic increment/test
This commit was SVN r7130.
2005-09-01 15:09:50 +00:00
Ralph Castain
03e45e6723
Two quick additions:
1. Added OMPI_PROC_ARCH as a defined registry key and added the code so that the architecture info gets properly transmitted across all processes using the startup message.
2. Added an OMPI_MODEX_KEY definition and removed the hard-coded "modex" key from pml_modex_exchange
This commit was SVN r7129.
2005-09-01 15:05:03 +00:00
Jeff Squyres
3962c53e2e
- Add to AM_CPPFLAGS $(OPAL_LTDL_CPPFLAGS) where necessary in order to
add a -I to find the included ltdl.h (vs. a system-installed ltdl.h)
- Clean up cruft in a bunch of Makefile.am's to remove now-unnecessary
AM_CPPFLAGS settings to get static-components.h for each framework
- Move the component_repository API functions out of opal/mca/base/base.h
and into opal/mca/base/mca_base_component_repository.h in order to
decrease unnecessary dependencies (e.g., before this, almost
everything in the tree depended on ltdl.h, which is unnecessary --
only a small number of files really need ltdl.h)
This commit was SVN r7127.
2005-09-01 12:16:36 +00:00