Rich Graham
08becf33b5
add more debugging.
...
This commit was SVN r18100.
2008-04-08 18:44:50 +00:00
Rich Graham
aa1b7dd406
more debug
...
This commit was SVN r18099.
2008-04-08 03:56:47 +00:00
Rich Graham
0c18bdeff7
more debug code.
...
This commit was SVN r18098.
2008-04-08 03:04:20 +00:00
Rich Graham
9d5a7238df
Add some debugging code.
...
This commit was SVN r18097.
2008-04-07 23:20:15 +00:00
Rich Graham
fa696734d5
add some debug code.
...
This commit was SVN r18096.
2008-04-07 21:03:23 +00:00
Shiqing Fan
28746bbcdb
Remove the memchecker macro in pml base request, used in req_wait.c, which actually is in the wrong place. Instead, one simple call from send_request_free and recv_request_free(already done) will do all the work, fast and clean.
...
This commit was SVN r18095.
2008-04-07 17:46:50 +00:00
Shiqing Fan
a1e5df1cc9
Use the new memchecker function call which is based on convertor.
...
Remove one unnecessary call.
This commit was SVN r18085.
2008-04-07 07:52:04 +00:00
Gleb Natapov
713a27dc71
Counter of created RDMA channels should be incremented immediately after channel
...
creation (not in control message completion) otherwise more than max_eager_rdma
channel may be created.
This commit was SVN r18082.
2008-04-06 13:48:45 +00:00
Rich Graham
1b54e8b76e
fix buffer management for nb-barrier.
...
This commit was SVN r18081.
2008-04-05 21:59:04 +00:00
Tim Prins
313edd8955
- Fix a problem reported on the users list where we would segfault in finalize after calling spawn if the user did not call MPI_Comm_disconnect
...
- Fix the app context constructor so it initializes all the fields.
This commit was SVN r18079.
2008-04-04 15:07:39 +00:00
Jeff Squyres
7072a32703
* Properly protect XRC stuff
...
* A few minor style fixes
This commit was SVN r18076.
2008-04-02 19:52:03 +00:00
Rich Graham
94f8fd365c
a few reduction optimizations. Add bcast.
...
This commit was SVN r18075.
2008-04-02 19:02:33 +00:00
George Bosilca
a00ca20446
More cleanups.
...
This commit was SVN r18069.
2008-04-02 06:38:33 +00:00
George Bosilca
944453c4c1
Cleanups.
...
This commit was SVN r18068.
2008-04-02 06:37:42 +00:00
Rich Graham
eb5d6096f1
add reduction routine - fix buffer recycling logic which was totally
...
broken.
This commit was SVN r18065.
2008-04-01 22:56:18 +00:00
Jeff Squyres
d944d5ec52
Just in case something goes drastically wrong, don't segv.
...
This commit was SVN r18049.
2008-03-31 21:55:07 +00:00
George Bosilca
b4f828f389
We need a newline at the nd of the file, or some compiler bark.
...
This commit was SVN r18023.
2008-03-30 19:05:56 +00:00
Gleb Natapov
b42234461a
Cleanup shared file creation on unix/linux.
...
This commit was SVN r18021.
2008-03-30 13:41:47 +00:00
Jeff Squyres
d0f12f3df0
Make a better error message.
...
This commit was SVN r18014.
2008-03-29 12:54:24 +00:00
Rich Graham
90e53ca9ee
debug the pipeline algorithm.
...
This commit was SVN r18008.
2008-03-28 15:10:07 +00:00
Aurelien Bouteiller
77653ac787
Missing .h file in makefile breaked nightly tarball distcheck...
...
This commit was SVN r18006.
2008-03-28 14:36:56 +00:00
Aurelien Bouteiller
c16339944a
Fix a coverity warning about using unsafe sprintf.
...
This commit was SVN r17999.
2008-03-27 21:24:27 +00:00
Aurelien Bouteiller
e11237aadb
Introduction of the "progress" sender_based method to replace the slow isend-self method.
...
This commit was SVN r17998.
2008-03-27 21:19:45 +00:00
Aurelien Bouteiller
93db01871e
This is part of the previous patch.
...
This commit was SVN r17997.
2008-03-27 21:06:14 +00:00
Aurelien Bouteiller
f8bf6f2c6a
Code cleanup.
...
sender_based.h is now split in two files, to solve cyclic .h files inclusion.
Most macros are now inline functions.
Variable names have been changed from places to places.
Various other small things...
This commit was SVN r17996.
2008-03-27 21:05:44 +00:00
George Bosilca
be4b153f0d
Another patch for thread safety in the TCP BTL (thanks to Pierre).
...
This commit was SVN r17993.
2008-03-27 18:36:08 +00:00
Gleb Natapov
cf40674369
Decide if sends should be throttled at the receiver and pass this to the sender
...
in an ACK message. The decision can't be done reliably at the sender.
This commit was SVN r17987.
2008-03-27 08:56:43 +00:00
Rich Graham
e2ad9c4be2
adjust to change in orte_process_info.
...
This commit was SVN r17986.
2008-03-27 01:25:28 +00:00
Rich Graham
441fb9fb9e
checkpoint.
...
This commit was SVN r17985.
2008-03-27 01:16:32 +00:00
Ralph Castain
90107f3c14
Fix an issue with comm_spawn over who sent/recv first in the modex. The modex assumes that the first name on the list is the "root" that will serve as the allgather collector/distributor. The dpm was putting that entity last, which forced us to pre-inform the parent procs of the child proc's contact info since the parent was trying to send to the child.
...
Clarify the setting of send_first in the mpi bindings (trivial, i know, but helpful)
Remove the extra xcast of child contact info to the parent job.
This commit was SVN r17952.
2008-03-25 14:57:34 +00:00
Ralph Castain
cca449e379
Move an OMPI RML tag to the OMPI layer
...
This commit was SVN r17950.
2008-03-25 13:30:48 +00:00
Jeff Squyres
5320c91ab3
Oops -- fix the constructor to also use opal_object_t instead of
...
opal_list_item_t.
This commit was SVN r17945.
2008-03-25 11:59:50 +00:00
Galen Shipman
0116041133
BTL shouldn't own the passive side's descriptor in the PML get protocol. The BTL
...
doesn't know when to free it on the passive side.
This commit was SVN r17943.
2008-03-25 01:43:41 +00:00
Jeff Squyres
ebfdd133f5
AFACT, we never put endpoints on a list.
...
This commit was SVN r17940.
2008-03-24 18:32:55 +00:00
Ralph Castain
dc7f45dafd
Remove the obsolete and largely unused orte_system_info structure. The only fields that were used in that struct were nodeid and nodename - these have been transferred to the orte_process_info structure.
...
Only one place used the user name field - session_dir, when formulating the name of the top-level directory. Accordingly, the code for getting the user's id has been moved to the session_dir code.
This commit was SVN r17926.
2008-03-23 23:10:15 +00:00
Rich Graham
a7c836a2b0
fix location of the restrict key word.
...
Make the tag in the fan-in/fan-out algorithm be fragment based.
This commit was SVN r17903.
2008-03-21 01:40:36 +00:00
Rich Graham
2c66d396b7
take care of some bit-rot with the fanin-fanout method.
...
This commit was SVN r17902.
2008-03-21 01:08:49 +00:00
Rich Graham
b9520e61dc
get the sm optimized allreduce working for all but user defined
...
operations. Added to the reduction operations a set of reduction
functions that take 2 input buffers and one output buffer to avoid
some extra memory copies. These can't be used with user defined
operations. The intel c collective suite passes both original, and
new (new, not the user defined operations).
This commit was SVN r17901.
2008-03-20 23:51:16 +00:00
Galen Shipman
dcac824f59
Fix problem in releasing fragments during GET_END event (didn't check that
...
portals btl has ownership and therefor didn't free the frag as it should) this
causes leakage and hangs in MPI_Finalize.
Also added a bit more debugging.
This commit was SVN r17900.
2008-03-20 22:46:32 +00:00
George Bosilca
efa89bfa3f
Revert r17857. The context should be set in one case ... when we call prepare_{src|dst}
...
without calling a get or put. So, just keep it here until a better solution is
found.
This commit was SVN r17872.
The following SVN revision numbers were found above:
r17857 --> open-mpi/ompi@d460ccfbf9
2008-03-18 19:01:27 +00:00
George Bosilca
8943ae0b4e
Cleanup plus some typos.
...
This commit was SVN r17858.
2008-03-18 03:03:33 +00:00
George Bosilca
d460ccfbf9
No need to check for NULL there. The bml_btl is set correctly
...
on the upper level.
This commit was SVN r17857.
2008-03-18 03:02:31 +00:00
George Bosilca
39353ebb44
Cleanup.
...
This commit was SVN r17855.
2008-03-18 02:56:50 +00:00
George Bosilca
76deec135e
The .h file is not used anymore (it contain the descriptor cache). Update the
...
Makefile.am file as well.
This commit was SVN r17854.
2008-03-18 02:50:24 +00:00
George Bosilca
1d04ec4ded
Correct the connection logic for TCP. Now we have not only a cleaner
...
connection, but a more thread safe one. Thanks to Pierre for his
help on this.
This commit was SVN r17853.
2008-03-18 02:42:16 +00:00
Jeff Squyres
61290c0e51
Remove a useless file.
...
This commit was SVN r17852.
2008-03-18 01:50:47 +00:00
Ralph Castain
be7d0a8a4d
Fix a problem introduced by the conversion of orte_pointer_array to opal_pointer_array. We used to derive the app context's index from the returned index of the orte_pointer_array_add function - this parameter was lost in the transition to opal_pointer_array_add. As a result, we no longer knew the index of the app_context, so everything is launched with app0.
...
This commit was SVN r17851.
2008-03-17 23:48:10 +00:00
Edgar Gabriel
570bbea5e0
fixing the allgather problem reported on the mailing list. The problem was
...
that at one locatin we had the local-size instead of the remote size as a
receive argument.
This commit was SVN r17849.
2008-03-17 19:42:18 +00:00
Gleb Natapov
9b6db25182
Fix compilation warning.
...
This commit was SVN r17839.
2008-03-17 13:37:57 +00:00
Pavel Shamis
54ad8d7446
The issue was reported/fixed by Jon Mason one month ago but the fix was not committed. So I'm commiting it now.
...
This commit was SVN r17835.
2008-03-17 11:13:06 +00:00