Rich Graham
a6bdbfab97
implement allreduce as reduce-scatter, followed by an allgather.
...
This commit was SVN r18132.
2008-04-11 04:06:29 +00:00
Jon Mason
08ead87604
Potential double free of locks
...
mca_btl_openib_endpoint_post_rr_nolock is freeing the endpoint lock on
the error case, but most/all of the functions calling this free the lock
regardless of its error case. Thus resulting is a double free of the
lock.
This commit was SVN r18131.
2008-04-10 21:15:01 +00:00
Rich Graham
70f3aab5f2
remove some code that is not needed.
...
This commit was SVN r18128.
2008-04-10 17:32:04 +00:00
Rich Graham
5c7db1e315
remove 2 race conditions in the buffer recycling logic.
...
This commit was SVN r18127.
2008-04-10 17:20:52 +00:00
Edgar Gabriel
4964434205
reverting commit 18122, since the commit was executed accidentally in the
...
wring directory. The UH copyrights do belong into this file (i.e. because of
the fix which is in the 1.2 branch, the UH copyright notes are in the header
there alreary), but I want to have the proper log for that.
This commit was SVN r18124.
2008-04-10 15:09:31 +00:00
Edgar Gabriel
f87830767a
the verification of recvcount==0 and rank = root was braking
...
inter-communicator scatter, since the root (root==MPI_ROOT) might very well
have recvcount=0. The same fix has been applied to gather.c just the other way
round.
Fixes the bug reported on the mainling list by Martin Audet. If there is a
1.2.7 this fix might be worthwhile porting it over.
Please note, that while the test works now for basic and for inter, we get a
0byte malloc warning from the inter module, which we still have to fix in a
separate patch.
This commit was SVN r18122.
2008-04-10 14:58:51 +00:00
Ralph Castain
3a0d09300b
Fully implement the inbound binomial allgather for daemon-based collectives. Supports both modex and barrier operations.
...
Comm_spawn still uses the rank=0 method - shifting that algo to the daemons is under study.
This commit was SVN r18115.
2008-04-09 22:10:53 +00:00
Rich Graham
c6783549ef
getting old
...
This commit was SVN r18110.
2008-04-09 16:55:16 +00:00
Rich Graham
1a20c3ce51
more debug.
...
This commit was SVN r18109.
2008-04-09 16:19:52 +00:00
Rich Graham
e7e18303f6
more debug.
...
This commit was SVN r18108.
2008-04-09 15:10:58 +00:00
Rich Graham
b14c6b17d5
adding debug output.
...
This commit was SVN r18107.
2008-04-09 13:32:01 +00:00
Rich Graham
10434fb2f1
add barrier synchorinzation at the end of the module init, to
...
avoid initializing shared memory variables in use.
This commit was SVN r18105.
2008-04-09 03:44:40 +00:00
Rich Graham
19bb1a2e86
fix initialization bug.
...
This commit was SVN r18104.
2008-04-08 23:34:06 +00:00
Donald Kerr
38e298cc9a
report error message in all libs, not just debug
...
This commit was SVN r18103.
2008-04-08 22:58:28 +00:00
Rich Graham
a69a8d9626
initialize the flags.
...
This commit was SVN r18102.
2008-04-08 22:16:39 +00:00
Rich Graham
8765a2bbdd
more debug code.
...
This commit was SVN r18101.
2008-04-08 20:38:20 +00:00
Rich Graham
08becf33b5
add more debugging.
...
This commit was SVN r18100.
2008-04-08 18:44:50 +00:00
Rich Graham
aa1b7dd406
more debug
...
This commit was SVN r18099.
2008-04-08 03:56:47 +00:00
Rich Graham
0c18bdeff7
more debug code.
...
This commit was SVN r18098.
2008-04-08 03:04:20 +00:00
Rich Graham
9d5a7238df
Add some debugging code.
...
This commit was SVN r18097.
2008-04-07 23:20:15 +00:00
Rich Graham
fa696734d5
add some debug code.
...
This commit was SVN r18096.
2008-04-07 21:03:23 +00:00
Shiqing Fan
28746bbcdb
Remove the memchecker macro in pml base request, used in req_wait.c, which actually is in the wrong place. Instead, one simple call from send_request_free and recv_request_free(already done) will do all the work, fast and clean.
...
This commit was SVN r18095.
2008-04-07 17:46:50 +00:00
Shiqing Fan
a1e5df1cc9
Use the new memchecker function call which is based on convertor.
...
Remove one unnecessary call.
This commit was SVN r18085.
2008-04-07 07:52:04 +00:00
Gleb Natapov
713a27dc71
Counter of created RDMA channels should be incremented immediately after channel
...
creation (not in control message completion) otherwise more than max_eager_rdma
channel may be created.
This commit was SVN r18082.
2008-04-06 13:48:45 +00:00
Rich Graham
1b54e8b76e
fix buffer management for nb-barrier.
...
This commit was SVN r18081.
2008-04-05 21:59:04 +00:00
Tim Prins
313edd8955
- Fix a problem reported on the users list where we would segfault in finalize after calling spawn if the user did not call MPI_Comm_disconnect
...
- Fix the app context constructor so it initializes all the fields.
This commit was SVN r18079.
2008-04-04 15:07:39 +00:00
Jeff Squyres
7072a32703
* Properly protect XRC stuff
...
* A few minor style fixes
This commit was SVN r18076.
2008-04-02 19:52:03 +00:00
Rich Graham
94f8fd365c
a few reduction optimizations. Add bcast.
...
This commit was SVN r18075.
2008-04-02 19:02:33 +00:00
George Bosilca
a00ca20446
More cleanups.
...
This commit was SVN r18069.
2008-04-02 06:38:33 +00:00
George Bosilca
944453c4c1
Cleanups.
...
This commit was SVN r18068.
2008-04-02 06:37:42 +00:00
Rich Graham
eb5d6096f1
add reduction routine - fix buffer recycling logic which was totally
...
broken.
This commit was SVN r18065.
2008-04-01 22:56:18 +00:00
Jeff Squyres
d944d5ec52
Just in case something goes drastically wrong, don't segv.
...
This commit was SVN r18049.
2008-03-31 21:55:07 +00:00
George Bosilca
b4f828f389
We need a newline at the nd of the file, or some compiler bark.
...
This commit was SVN r18023.
2008-03-30 19:05:56 +00:00
Gleb Natapov
b42234461a
Cleanup shared file creation on unix/linux.
...
This commit was SVN r18021.
2008-03-30 13:41:47 +00:00
Jeff Squyres
d0f12f3df0
Make a better error message.
...
This commit was SVN r18014.
2008-03-29 12:54:24 +00:00
Rich Graham
90e53ca9ee
debug the pipeline algorithm.
...
This commit was SVN r18008.
2008-03-28 15:10:07 +00:00
Aurelien Bouteiller
77653ac787
Missing .h file in makefile breaked nightly tarball distcheck...
...
This commit was SVN r18006.
2008-03-28 14:36:56 +00:00
Aurelien Bouteiller
c16339944a
Fix a coverity warning about using unsafe sprintf.
...
This commit was SVN r17999.
2008-03-27 21:24:27 +00:00
Aurelien Bouteiller
e11237aadb
Introduction of the "progress" sender_based method to replace the slow isend-self method.
...
This commit was SVN r17998.
2008-03-27 21:19:45 +00:00
Aurelien Bouteiller
93db01871e
This is part of the previous patch.
...
This commit was SVN r17997.
2008-03-27 21:06:14 +00:00
Aurelien Bouteiller
f8bf6f2c6a
Code cleanup.
...
sender_based.h is now split in two files, to solve cyclic .h files inclusion.
Most macros are now inline functions.
Variable names have been changed from places to places.
Various other small things...
This commit was SVN r17996.
2008-03-27 21:05:44 +00:00
George Bosilca
be4b153f0d
Another patch for thread safety in the TCP BTL (thanks to Pierre).
...
This commit was SVN r17993.
2008-03-27 18:36:08 +00:00
Gleb Natapov
cf40674369
Decide if sends should be throttled at the receiver and pass this to the sender
...
in an ACK message. The decision can't be done reliably at the sender.
This commit was SVN r17987.
2008-03-27 08:56:43 +00:00
Rich Graham
e2ad9c4be2
adjust to change in orte_process_info.
...
This commit was SVN r17986.
2008-03-27 01:25:28 +00:00
Rich Graham
441fb9fb9e
checkpoint.
...
This commit was SVN r17985.
2008-03-27 01:16:32 +00:00
Ralph Castain
90107f3c14
Fix an issue with comm_spawn over who sent/recv first in the modex. The modex assumes that the first name on the list is the "root" that will serve as the allgather collector/distributor. The dpm was putting that entity last, which forced us to pre-inform the parent procs of the child proc's contact info since the parent was trying to send to the child.
...
Clarify the setting of send_first in the mpi bindings (trivial, i know, but helpful)
Remove the extra xcast of child contact info to the parent job.
This commit was SVN r17952.
2008-03-25 14:57:34 +00:00
Ralph Castain
cca449e379
Move an OMPI RML tag to the OMPI layer
...
This commit was SVN r17950.
2008-03-25 13:30:48 +00:00
Jeff Squyres
5320c91ab3
Oops -- fix the constructor to also use opal_object_t instead of
...
opal_list_item_t.
This commit was SVN r17945.
2008-03-25 11:59:50 +00:00
Galen Shipman
0116041133
BTL shouldn't own the passive side's descriptor in the PML get protocol. The BTL
...
doesn't know when to free it on the passive side.
This commit was SVN r17943.
2008-03-25 01:43:41 +00:00
Jeff Squyres
ebfdd133f5
AFACT, we never put endpoints on a list.
...
This commit was SVN r17940.
2008-03-24 18:32:55 +00:00