Galen Shipman
218a438509
finished the ompi_free_list_t class nightmare..
...
This commit was SVN r10314.
2006-06-12 22:09:03 +00:00
Galen Shipman
18dda70fd0
make ompi_free_list_item_t a class..
...
This will go to the 1.1 branch but will probably require a few changes as
ompi_free_list_t is different in the branch..
This commit was SVN r10306.
2006-06-12 16:44:00 +00:00
Brian Barrett
d3257f22d8
* back out Galen's r10300 because it breaks the build. Real fix coming RSN.
...
This commit was SVN r10303.
The following SVN revision numbers were found above:
r10300 --> open-mpi/ompi@b0f3745791
2006-06-12 14:38:14 +00:00
Gleb Natapov
48d348b577
Don't complete send request before we've got completion on the first rndv packet.
...
Sender can receive and complete PUT request before it gets completion on the first rndv packet. senreq struct may be reused for the next MPI_Send and unexpected completion mess up the things. I sometimes got SEGV and sometimes data corruption.
This commit was SVN r10301.
2006-06-12 14:00:43 +00:00
Galen Shipman
b0f3745791
declare these as ompi_free_list_item_t's
...
This needs to go to 1.1
This commit was SVN r10300.
2006-06-12 13:26:15 +00:00
George Bosilca
7d1feffbf7
The real solution. If the sendreq->req_send.req_bytes_packed is zero then there
...
is no data to be trasfered. And this is the condition which lead to a non
initialized convertor.
This commit was SVN r10299.
2006-06-12 06:18:18 +00:00
George Bosilca
c959c2f214
Don't reset the convertor's position if it wasn't initialized before. This can
...
only happens for zero byte persistent requests.
This commit was SVN r10298.
2006-06-12 06:14:35 +00:00
Galen Shipman
9d73217637
These list items are free list items, and should inherit properly..
...
This commit was SVN r10295.
2006-06-11 20:19:12 +00:00
Brian Barrett
d5acb4e3cc
* silence dumb (and mostly useless) warning during cleanup
...
This commit was SVN r10280.
2006-06-09 21:09:53 +00:00
Brian Barrett
cc99a63169
* fix issue with PANFS not building properly - we didn't add PANFS_LIB to the
...
list of libraries
This commit was SVN r10279.
2006-06-09 20:41:12 +00:00
Jeff Squyres
a4030ad2d9
Improve the tremendously unhelpful MCA help message for the
...
btl_openib_ib_mtu and btl_mvapi_ib_mtu MCA params by showing the valid
values what what they represent (got a question about this from Cisco
testing engineers).
This commit was SVN r10277.
2006-06-09 18:02:45 +00:00
Andrew Friedley
9a92394bfd
Mostly cleanups - preprocessor fixes and removal of OPAL_OUTPUTs.
...
Also updated to match recent mpool_free changes.
This commit was SVN r10273.
2006-06-09 00:18:29 +00:00
Andrew Friedley
75176370ae
blah. somehow missed adding .ompi_ignore/.ompi_unignore.
...
This commit was SVN r10272.
2006-06-09 00:15:36 +00:00
Andrew Friedley
cca1616368
Finally committing the UD BTL.
...
UD is the Unreliable Datagram transport for Infiniband, specifically OpenIB. This BTL is derived from the existing openib BTL, which is RC (Reliable Connection) based.
Still a work in progress, as there is a lot of work left to do. Specifically, performance, scalability, and flow control need to be addressed.
Currently I'm playing around with different methods for handling receive buffers, as well as profiling to figure out where the time is going.
This commit was SVN r10271.
2006-06-09 00:13:45 +00:00
Galen Shipman
08823e56fa
check address before looking for the item in the tree corresponding to the
...
address..
All have been reviewed by brian.. putting in a changeset request..
This commit was SVN r10256.
2006-06-08 16:27:59 +00:00
Galen Shipman
636ef0cf6c
don't put back null items on the list..
...
This commit was SVN r10253.
2006-06-08 14:46:41 +00:00
Galen Shipman
429056078a
fix numerous late night errors..
...
1) don't need tree if memory is just malloc'd
2) fix memory and free list leak..
3) deregister first and then free... doh..
This commit was SVN r10251.
2006-06-08 14:23:20 +00:00
Galen Shipman
5a2ceda93f
a couple of stupid late night mistakes...
...
This commit was SVN r10250.
2006-06-08 13:39:41 +00:00
Galen Shipman
0bb8a6fca8
roll back to not use memalign
...
This commit was SVN r10249.
2006-06-08 04:34:04 +00:00
Galen Shipman
b42b0bd1af
potential fix for ticket #81
...
Added a tree to track memory allocation from MPI_Alloc_mem, this allows us to
free the registrations in a sane fashion.. also should be faster..
This commit was SVN r10248.
2006-06-08 04:29:27 +00:00
Sven Stork
c31e6f9767
use memalign instead of malloc + manually alignment in the mvapi mpool
...
revert commit 10243
This commit was SVN r10247.
2006-06-07 23:21:23 +00:00
Andrew Friedley
5ace292cc1
Should fix ticket #81 - which is specific to MVAPI, I've included the same fix for gm/openib as well.
...
uDAPL has the same problem, will fix in separate commit so it doesn't go to branch.
This commit was SVN r10243.
2006-06-07 15:52:48 +00:00
Galen Shipman
84479d0b5a
potential fix for iprobe test,, tested with openib.. will have andy try ud..
...
This commit was SVN r10232.
2006-06-06 22:10:41 +00:00
Galen Shipman
90799f82cd
copy paste error..
...
This commit was SVN r10220.
2006-06-06 02:38:29 +00:00
Galen Shipman
cc54b07aa0
add better error messages for vapi retry exceeded errors.
...
This commit was SVN r10219.
2006-06-06 02:04:56 +00:00
Galen Shipman
9e6e7575b9
doh... add the file..
...
This commit was SVN r10210.
2006-06-05 21:24:42 +00:00
Galen Shipman
f05dee0435
add help file to explain why things went south..
...
This commit was SVN r10209.
2006-06-05 21:23:45 +00:00
Galen Shipman
74c97fb784
cleanup error reporting.. use ompi_proc_t->proc_name if available this gives
...
us source/dest hostnames for communication errors..
This goes to 1.1 branch (reviewed by Brian)..
This commit was SVN r10200.
2006-06-05 20:02:41 +00:00
Brian Barrett
c70fff6ed0
* Fix for bug #44 for the trunk -- remove a bunch of warnings from the DR
...
PML when compiling on Solaris. Patch won't apply cleanly to the v1.1
branch, so a diff for that is coming up soon.
This commit was SVN r10173.
2006-06-01 18:58:38 +00:00
Galen Shipman
83ff3201b5
don't use rank or nprocs in error messages when we don't have them..
...
This should hit 1.1 and 1.0 branches..
Reviewed by Brian
This commit was SVN r10164.
2006-06-01 14:24:11 +00:00
Galen Shipman
0344ae4ac5
Fix to allow eager limit and max send size to be any size (within resource limitations). Instead of storing the ompi_free_list_t * in the fragment, we use the frag type enum, this tells us where the frag came from and where it should return.. This could also be done in mvapi but is not a high priority moving forward..
...
Review by Brian, needs to hit the trunk + 1.1 release..
This commit was SVN r10157.
2006-06-01 02:32:18 +00:00
Brian Barrett
5163f2b296
Fix for bug #36 . The MX, MVAPI, and OpenIB components don't have
...
support for progress threads, so we shouldn't build them or try to use
them when support for progress threads has been requested. The TCP, GM,
SELF, and SM BTLs should have progress thread support, so they aren't
disabled. The Portals BTL isn't compiled on platforms with threads,
so it doens't need to be updated.
This commit was SVN r10156.
2006-06-01 01:30:16 +00:00
Galen Shipman
c79efc9efb
track which list a fragment came from, allows returning based on list, not
...
on size.
This commit was SVN r10142.
2006-05-31 14:24:32 +00:00
Brian Barrett
4904e34a52
set datarootdir, necessary for Autoconf-2.60 which will define some variables
...
based upon this value (e.g., datadir, docdir).
Submitted by: Ralf Wildenhues
Reviewed by: Brian Barrett
This commit was SVN r10133.
2006-05-31 03:43:55 +00:00
Brian Barrett
6026fc98f6
* Fix M4 quoting so that AC 2.60 won't complain
...
Submitted by: Ralf Wildenhues
Reviewed by: Brian Barrett
This commit was SVN r10129.
2006-05-31 03:39:18 +00:00
Brian Barrett
c723d196c5
Rather than using fragment size to determine fragment type, use an enum.
...
Do this rather than the my_list pointer because we need to do some
things that are somewhat special because we pre-pin eager fragments but
not send fragments. Also makes a couple ideas I have slightly easier to
play around with.
This commit was SVN r10127.
2006-05-31 03:34:32 +00:00
Galen Shipman
2667c52a5d
Track fragments by list, not by size..
...
-- reviewed by Brian, needs to hit all the branches..
This commit was SVN r10078.
2006-05-25 18:07:26 +00:00
Galen Shipman
38a0561d9b
Allow maximum send size to be less than the eager limit.
...
Instead of figuring out which free list the fragment belongs to based on size
we simply store a pointer to the list which it belongs in the fragment.
This was reviewed by Brian and should hit all the branches.
This commit was SVN r10072.
2006-05-25 16:57:14 +00:00
Andrew Friedley
fa9ec2afdf
Add my sandia username for convenience
...
This commit was SVN r10071.
2006-05-25 15:49:11 +00:00
Andrew Friedley
8a3d0862ca
I can commit! *happy dance*
...
Trying to remember what I did here.. eager/max messages should work now, no RDMA yet. A number of other fixes and cleanups.
I do know of two problems:
Bad stuff happens when flooded with send frags too quickly - the BTL doesn't handle flow control.
Certain IBM tests turn up a length assertion in the datatype engine - needs more investigation.
This commit was SVN r10070.
2006-05-25 15:47:59 +00:00
Gleb Natapov
f590d8a190
fix eager RDMA on PPC64.
...
This commit was SVN r10059.
2006-05-25 11:05:12 +00:00
Jeff Squyres
dd44d36be0
Fix for ticket #25 . Ensure that in the threaded case where we have
...
This commit was SVN r10043.
2006-05-24 16:15:07 +00:00
George Bosilca
95d0395578
I'm skeptical about the ability of the compiler to correctly optimize the
...
loop local variables.
This commit was SVN r10019.
2006-05-23 03:21:15 +00:00
George Bosilca
085cac552f
Don't let TCP to create local connections, we have the self BTL for this purpose.
...
This commit was SVN r10018.
2006-05-23 03:06:32 +00:00
George Bosilca
837221831a
Temporary solution for in-bound computation of the next BTL.
...
This commit was SVN r10016.
2006-05-22 23:28:40 +00:00
George Bosilca
b8ef0cc749
Minor cleanups.
...
This commit was SVN r10001.
2006-05-21 05:55:21 +00:00
George Bosilca
e43fbd0082
Remove all useless variables. Minor cleanups.
...
This commit was SVN r10000.
2006-05-21 05:53:22 +00:00
Galen Shipman
9165882c07
fixes for failover...
...
This commit was SVN r9998.
2006-05-20 02:39:05 +00:00
Gleb Natapov
1c1b87a9f1
init mutex before use.
...
This commit was SVN r9963.
2006-05-18 09:35:11 +00:00
Jeff Squyres
15758d5f29
Fix AC_DEFINE to match what it's supposed to be defining
...
This commit was SVN r9952.
2006-05-17 03:26:43 +00:00