Aurelien Bouteiller
670956e172
Another cast mistake.
This commit was SVN r16247.
2007-09-26 21:14:35 +00:00
Aurelien Bouteiller
f7d7d58fb6
Various cast type errors on 64-bit architectures
This commit was SVN r16246.
2007-09-26 20:54:18 +00:00
Brian Barrett
56e26ed390
Need to install the mpool_rdma.h so that we can build external BTLs that
use the RDMA protocol
This commit was SVN r16237.
2007-09-26 16:58:54 +00:00
Gleb Natapov
c7105eadc7
Update Voltaire copyright.
This commit was SVN r16189.
2007-09-24 10:11:52 +00:00
Aurelien Bouteiller
0df0087f17
Investigating improvement of cache line management on shared memory
This commit was SVN r16183.
2007-09-21 20:02:56 +00:00
Josh Hursey
1fe1276fd5
Make sure to match on the communicator ID as well.
This commit was SVN r16179.
2007-09-21 18:16:02 +00:00
Josh Hursey
3e51d7bb25
Implement the MPI_Iprobe and MPI_Probe wrappers.
Remove some old, unused code.
This commit was SVN r16178.
2007-09-21 16:28:46 +00:00
Aurelien Bouteiller
d3b376a340
This patch adds actual non-blocking sender-based message logging. This improves bandwidth. Still need to work on malloc/mmap storage to reach optimal bandwidth.
This commit was SVN r16172.
2007-09-21 03:24:08 +00:00
Aurelien Bouteiller
bc318b35e2
There is room in the convertor to copy the packed data. It works; it just needs the correct memcopy added. It does not manage short messages yet, but I already have a workaround in mind for this (and it might even be better regarding latency).
This commit was SVN r16169.
2007-09-20 21:57:21 +00:00
Aurelien Bouteiller
bbac6e650a
New improved version of sender-based. Still under development, but a new framework for expressing various methods has been added.
This commit was SVN r16159.
2007-09-19 03:42:56 +00:00
Gleb Natapov
097b17d30e
Prevent a receive request from being freed while another thread holds a reference
to it or there is an outstanding completion for the request.
This commit was SVN r16153.
2007-09-18 16:18:47 +00:00
Jeff Squyres
33955a0ed0
Oops -- when converted from uint to int, -1 (the default value,
meaning "infinite") is no longer larger than the minimum required
size. So put in an appropriate test to ensure that "infinite" was not
requested.
This commit was SVN r16142.
2007-09-17 19:28:21 +00:00
Jeff Squyres
130a272cec
Fix some compiler warnings about signed/unsigned comparisons.
This commit was SVN r16139.
2007-09-17 13:08:45 +00:00
Josh Hursey
d2ef0d445a
Add some basic timing hooks so I can extract a few more detailed performance
numbers for tuning.
Switch the bookmark_recv to be non-blocking. If this is blocking then for
process counts >= 32 slight process delays were causing cascading performance
delays in the protocol. This led to checkpoints either taking about 3 sec or
45 sec (or more) for 64 procs due to the cascading delays. With the nonblocking
receive version this is no longer the case, and we get the speedup we expect for this
part of the protocol.
More tuning to come.
This commit was SVN r16137.
2007-09-16 15:13:23 +00:00
Jeff Squyres
6004e177e0
Fixes trac:1133: if you specify a max freelist size that is too small,
you'll get a helpful error message and the openib BTL will deactivate
itself.
This commit was SVN r16133.
The following Trac tickets were found above:
Ticket 1133 --> https://svn.open-mpi.org/trac/ompi/ticket/1133
2007-09-14 21:42:56 +00:00
George Bosilca
617ff3a413
Add a MCA parameter for the ELAN MAP ID file.
Fix small memory bugs, and track the final segfault. Still some work to do.
This commit was SVN r16117.
2007-09-12 21:25:35 +00:00
Aurelien Bouteiller
a1f5312afb
Fixed two little warnings
This commit was SVN r16116.
2007-09-12 21:07:11 +00:00
Aurelien Bouteiller
ccb3f75e8f
Make sure that the pml v parasite never gets loaded when the user did not request FT. This does not break the ability to switch protocols on the fly.
This commit was SVN r16114.
2007-09-12 20:47:17 +00:00
George Bosilca
1e7a791349
Remove some of the problems identified by Coverity.
This commit was SVN r16112.
2007-09-12 20:13:26 +00:00
Aurelien Bouteiller
828af95be8
Major modification of the vprotocol framework build system. With a better integration in autogen.sh, it allows for generating static-components.h the usual way.
NOTE: This build system does not work with the current autogen.sh. A modified one is under heavy testing to make sure it does not have side effects.
This commit was SVN r16110.
2007-09-12 18:46:37 +00:00
George Bosilca
05ae27c68b
Don't segfault if we receive a fragment for a non existing communicator.
Instead, drop it for now.
This commit was SVN r16105.
2007-09-12 17:52:02 +00:00
George Bosilca
c755938eb0
Coverity: release the temporary buffer on error.
This commit was SVN r16104.
2007-09-12 17:45:12 +00:00
Shiqing Fan
a0660f4deb
- Just some type casts.
This commit was SVN r16100.
2007-09-12 15:29:58 +00:00
Gleb Natapov
07c8fddeef
Fix scheduling of pending send request. It should be scheduled req_lock times.
This commit was SVN r16096.
2007-09-12 07:08:38 +00:00
George Bosilca
d8fed2cfa1
Set a default value so that some compilers stop complaining about
uninitialized values.
This commit was SVN r16094.
2007-09-11 18:00:53 +00:00
Gleb Natapov
b0614931f4
Remove mpool_tree_item from the mpool_tree before unregistering/freeing memory.
Otherwise a race exists if another thread allocates already freed memory
which is not removed from the mpool_tree yet.
This commit was SVN r16038.
2007-09-03 10:56:55 +00:00
Rainer Keller
a3b30749b0
- Only lock/unlock when using threads.
Basically revert this part of r16015.
This commit was SVN r16029.
The following SVN revision numbers were found above:
r16015 --> open-mpi/ompi@435e7d80e9
2007-08-31 12:34:48 +00:00
Rainer Keller
9c1c345c07
- head_lock is an opal_atomic_lock_t...
This commit was SVN r16028.
2007-08-31 12:20:21 +00:00
Shiqing Fan
efdcfa3807
- 'extern "C"' has been set twice. Remove one.
This commit was SVN r16022.
2007-08-30 15:03:59 +00:00
Shiqing Fan
80fdd5e2a4
- Need to be exported.
This commit was SVN r16021.
2007-08-30 14:16:03 +00:00
Gleb Natapov
79011279e5
Remove debug output.
This commit was SVN r16016.
2007-08-30 13:29:41 +00:00
Gleb Natapov
435e7d80e9
Remove rc parameter from MCA_BTL_SM_FIFO_WRITE() macro. It cannot fail in
current implementation.
This commit was SVN r16015.
2007-08-30 13:21:52 +00:00
Gleb Natapov
690fb95bda
Cleanup send scheduling code.
This commit was SVN r16014.
2007-08-30 12:10:04 +00:00
Gleb Natapov
0b0f9d14aa
Mark send request complete on PML level only when absolutely sure there is
no more work associated with this request. No more outstanding completions or
packets and send scheduling isn't running in another thread.
This commit was SVN r16013.
2007-08-30 12:08:33 +00:00
Gleb Natapov
fe414047bd
registration may be freed inside mca_mpool_rdma_deregister().
This commit was SVN r16012.
2007-08-30 10:52:38 +00:00
Gleb Natapov
091862a25a
Protect access to mca_mpool_base_tree by a lock.
This commit was SVN r16011.
2007-08-30 10:51:02 +00:00
Gleb Natapov
eac2674f66
The inner voice tells me this is a typo.
This commit was SVN r16004.
2007-08-29 13:28:47 +00:00
Jeff Squyres
466394a878
We only care about the value of ret in the
!OMPI_ENABLE_PROGRESS_THREADS case. Reviewed by Brian.
This commit was SVN r16000.
2007-08-29 01:36:17 +00:00
Jeff Squyres
c4a38f47f6
Resolve Coverity CID 467: remove unused variable / dead code.
This commit was SVN r15997.
2007-08-29 01:23:18 +00:00
Brian Barrett
59b22533f2
Enable RDMA for heterogeneous situations. Currently done by overloading
the ompi_convertor_need_buffers function to only return 0 if the convertor
is homogeneous (which it never does on the trunk, but does on v1.2, but
that's a different issue). Only enable the heterogeneous rdma code for
a btl if it supports it (via a flag), as some btls need some work for this
to work properly. Currently only TCP and OpenIB are extensively tested.
This commit was SVN r15990.
2007-08-28 21:23:44 +00:00
Gleb Natapov
fa69c5cc10
If memory on the sender's side is not registered, don't register it on the receive
side too. Otherwise the content of the recvreq->req_rdma array is replaced later
without freeing the previous content, and the refcount on the registration in the mpool becomes
wrong.
This commit was SVN r15978.
2007-08-28 07:43:06 +00:00
Rich Graham
bc97d22182
remove tabs. Remove old code that was commented out.
This commit was SVN r15975.
2007-08-28 03:08:36 +00:00
Rich Graham
4d58f9aed7
Add comments. Move temporary receive object from a free list object to
a stack object.
This commit was SVN r15971.
2007-08-27 21:41:04 +00:00
Gleb Natapov
e1a1d9d90e
Receive request converter can be accessed in parallel by a thread that receives
data and a thread that runs the RDMA schedule function. Protect access to the
converter by a lock.
This commit was SVN r15967.
2007-08-27 11:41:42 +00:00
Gleb Natapov
33196d972b
post_send() function is called without endpoint lock held from explicit credits
update function, so eager_rdma_remote.head has to be updated in a thread-safe
manner.
This commit was SVN r15966.
2007-08-27 11:37:01 +00:00
Gleb Natapov
32a61c3bf2
Credit fragment is not protected properly from concurrent access. There is a
race that can prevent further explicit credit updates from being sent. Fix the
race.
This commit was SVN r15965.
2007-08-27 11:34:59 +00:00
Gleb Natapov
065d04dfde
Do not free recvreq while schedule function is running in another thread.
This commit was SVN r15964.
2007-08-27 11:31:40 +00:00
Brad Benton
ccda5c9c74
Modified the MCA_BTL_TCP_CONNECTED case in mca_btl_tcp_endpoint_send_handler()
to always first check for a NULL frag pointer before trying to send the
fragment. This avoids an issue in multi-threaded execution in which
multiple threads working on the same endpoint can result in a thread
finding itself here with nothing to send.
This commit was SVN r15963.
2007-08-26 23:40:02 +00:00
Edgar Gabriel
a2f5cada1a
Convert the hierarch component to the new structure. More testing required before we remove the .ompi_ignore flag again.
This commit was SVN r15954.
2007-08-23 20:41:29 +00:00
Rainer Keller
1b5fa48a29
- Add missing PERUSE_COMM_REQ_REMOVE_FROM_POSTED_Q when matching
from the posted generic_recv-queue.
- Move the PERUSE_COMM_MSG_MATCH_POSTED_REQ from
MCA_PML_OB1_RECV_REQUEST_MATCHED to
mca_pml_ob1_recv_frag_match() as suggested by Terry Dontje
Only post, if this is not a probe/iprobe request.
- Do not post PERUSE_COMM_REQ_MATCH_UNEX for probes / iprobes and
do in correct order before PERUSE_COMM_MSG_REMOVE_FROM_UNEX_Q
This commit was SVN r15947.
2007-08-23 07:09:43 +00:00