openmpi

Author	SHA1	Message	Date
Gleb Natapov	2b6cbd6299	Separate the frag list for RDMA descriptors into two lists, one for src descriptors and another for dst descriptors. This provides a partial solution to the OB1 protocol deadlock problem. We can limit the number of RDMA descriptors (by setting btl_openib_free_list_max to something different from -1), and if we are lucky enough to hit this limit before we fail to register more memory, the protocol will not deadlock. When we had only one list for src/dst descriptors, we deadlocked when we reached the max limit for the list. This commit was SVN r13844.	2007-02-28 13:43:38 +00:00
Sven Stork	870740efe2	- proper export symbols that are required by other components. This commit was SVN r13841.	2007-02-28 12:51:55 +00:00
Rainer Keller	0889ebd59f	- Eliminate warnings, that PGI-6.2.5 issues with -Minform=inform This commit was SVN r13840.	2007-02-28 08:36:34 +00:00
Li-Ta Lo	c5d8c221b0	added binomial tree based Gather algorithm, passed IBM and Intel tests This commit was SVN r13835.	2007-02-28 01:11:01 +00:00
Jelena Pjesivac-Grbovic	627533fe4a	Adding segmented ring algorithm for Allreduce for commutative operations. Algorithm allows user to specify the segment size to be used for computation/communication overlap. The additional memory requirement for the algorithm is 2 x segment size. It performed well for (really) large message sizes over MX and it passed intel Allreduce_c and Allreduce_loc_c tests. This commit was SVN r13832.	2007-02-27 20:32:30 +00:00
Jeff Squyres	38c976d527	Remove redundant declaration of ompi_err_unknown. This commit was SVN r13829.	2007-02-27 19:37:42 +00:00
Sven Stork	d8a369936e	- Fix more symbols that should be exported. This commit was SVN r13824.	2007-02-27 15:17:17 +00:00
George Bosilca	533dfff56d	Only do the preconnection stage if we found the local proc. It's mostly to make some compilers complain less about uninitialized values. This commit was SVN r13805.	2007-02-26 22:24:44 +00:00
George Bosilca	bec20422ee	Remove the warnings about printf data-type mismatch. This commit was SVN r13804.	2007-02-26 22:20:35 +00:00
Brian Barrett	6d70f5fbe0	don't define malloc and friends in opal_config, as it causes problems when we later include malloc.h This commit was SVN r13803.	2007-02-26 21:34:48 +00:00
Li-Ta Lo	c860bd1be5	fixed a typo in the comment This commit was SVN r13802.	2007-02-26 19:20:46 +00:00
Li-Ta Lo	73a73b1c78	added ASCII graph on reduce_log_intra This commit was SVN r13801.	2007-02-26 19:15:37 +00:00
Pavel Shamis	6fe84f581b	mpool_base_module_destroy was removing all modules from a list instead of removing specific one. Fixing the bug. This commit was SVN r13795.	2007-02-26 16:25:20 +00:00
Brian Barrett	d9e0e80190	Make some debugging output only looked at when debugging is enabled This commit was SVN r13777.	2007-02-25 01:03:19 +00:00
Bill D'Amico	db1c2a58c4	Removed cruft - unused variables causing warnings during OMPI build. This commit was SVN r13772.	2007-02-23 18:55:41 +00:00
Tim Prins	dbe82c70d6	Get rid of stale file, and remove the (unused) references to it. This commit was SVN r13771.	2007-02-23 13:50:39 +00:00
Tim Prins	bddd06bcdb	pedantic formatting... This commit was SVN r13766.	2007-02-23 00:54:41 +00:00
Tim Prins	f35f67ed1c	(very) minor correction to helpfile This commit was SVN r13758.	2007-02-22 16:02:12 +00:00
Ron Brightwell	e15e85a0b6	Fix a problem with long unexpected messages that was causing hangs. Long unexpected messages were not generating PUT_START events because the MD for long unexpected messages was configured to ignore start events. When a long unexpected message arrived, it traversed the match list, and ended up in the long unexpected MD. As the long message is being consumed, the code called PtlMDUpdate() to look for the message, but there was no event that indicated that it had arrived. So, the update succeeded. Once the long unexpected message was consumed, the PUT_END event showed up in the event queue -- except the code wasn't looking for it anymore. The PUT_START events exist specifically to handle ordering between short and long unexpected messages, so PUT_START events can't be ignored on long unexpected messages. Modified the code to generate PUT_START events for both long and short unexpected messages and handle matching up START and END events appropriately. This commit was SVN r13746.	2007-02-21 21:59:48 +00:00
Li-Ta Lo	049921a5ec	the temporary buffer is not needed for the MPI_IN_PLACE cases if the underlying Gather is implemented correctly This commit was SVN r13740.	2007-02-21 20:39:56 +00:00
Jelena Pjesivac-Grbovic	36156f39c2	Modification to allreduce ring algorithm: - the block sizes are computed in a more uniform way. The first k blocks may be 1 element larger than the remaining blocks. The algorithm passed Intel Allreduce_c and Allreduce_loc_c tests, and IMB-3.2 Allreduce, over TCP and both btl and mtl MX (up to 128 processes). The algorithm still only supports commutative operations. This commit was SVN r13738.	2007-02-21 19:30:08 +00:00
Josh Hursey	c573171b7d	Mostly a cleanup commit. - Implement the BML/r2 finalize function - Cleanup the btl close routine - Wire up a pml_base_verbose MCA parameter so you can actually watch the PML selection logic if you really want to. - Fix a potential segfault in the selection logic. ompi_pointer_array_get_item() may return NULL, so we have to check for it This commit was SVN r13734. The following SVN revision numbers were found above: r2 --> open-mpi/ompi@58fdc18855	2007-02-21 16:18:43 +00:00
Jelena Pjesivac-Grbovic	b608887466	Adding variant of linear alltoall algorithm where the number of outstanding requests can be limited using mca parameters. The implementation passed Intel, IMB-3.2, and mpi_test_suite tests over TCP and MX up to 128 processes (64 nodes), on both 32-bit and 64-bit machines. It is not activated by default, but it should be useful for really large communicator sizes. This commit was SVN r13720.	2007-02-20 04:25:00 +00:00
Jeff Squyres	f820e44112	Remove a gcc-ism from the code (defining an anonymous union in the middle of a struct). Now we properly define and name the union outside the struct and simply create an instance of it inside the struct. This commit was SVN r13709.	2007-02-19 18:21:57 +00:00
Brian Barrett	727f64aecf	It appears that SEND_COMPLETE on a 0 byte message with BTLs that don't support SEND_IN_PLACE causes badness because the BTL tries to use the not-exactly-complete convertor. Don't need it in this situation anyway. This commit was SVN r13700.	2007-02-19 02:43:26 +00:00
George Bosilca	020b8ade70	A slightly better fix for the data mismatch compiler complaints. This commit was SVN r13695.	2007-02-17 05:23:57 +00:00
Jelena Pjesivac-Grbovic	d2d02642ca	Removing compilation warnings about the output format. This commit was SVN r13693.	2007-02-16 23:32:47 +00:00
Rich Graham	b925d6588d	add some missing error checking - thanks to Ron B. This commit was SVN r13692.	2007-02-16 22:19:24 +00:00
George Bosilca	04138c23af	No more warnings. This commit was SVN r13683.	2007-02-16 16:25:58 +00:00
Pavel Shamis	edeab0e912	Adding Mellanox Technologies copyright to files touched by Mellanox. This commit was SVN r13669.	2007-02-15 18:03:20 +00:00
Jeff Squyres	b7b893b771	Need to EXTRA_DIST README.txt so that it gets included in the tarball; the *_DATA files are not automatically picked up for inclusion into the tarball. This commit was SVN r13667.	2007-02-15 16:21:25 +00:00
Jelena Pjesivac-Grbovic	e532b928af	Adding segmented binary reduce algorithm which works with non-commutative operations. Implementation passed intel: MPI_Reduce_c , MPI_Reduce_loc_c, and MPI_Reduce_user_c tests over TCP, BTL MX, and MTL MX, as well as, mpi_test_suite Reduce tests (up to 64 nodes). The algorithm is still not activated by decision function (will be in the near future). This commit was SVN r13657.	2007-02-14 22:38:38 +00:00
Pavel Shamis	2483cefc57	Additional check if descriptor is NULL. It prevents mca_pml_dr_sendreq_cleanup_active failure on segfault. This commit was SVN r13647.	2007-02-14 10:43:43 +00:00
Brian Barrett	c00d841741	Fix hang on Cray machine introduced with r13582. The modex will never fire when on the Cray machine (aka when the NULL GPR is in use). This commit was SVN r13638. The following SVN revision numbers were found above: r13582 --> open-mpi/ompi@041beeb1b6	2007-02-13 18:34:03 +00:00
Gleb Natapov	4d4b0a022a	Add error callback to sm BTL. Call it when allocation of the initial circular buffer fails. If cb is already allocated, but it is full and allocation of additional cb fails, we spin waiting for receiver to free space in existing cb. This commit was SVN r13635.	2007-02-13 12:01:36 +00:00
George Bosilca	2e042c91cf	Once we compute the local offset use it (instead of the global one). This commit was SVN r13634.	2007-02-13 09:34:04 +00:00
George Bosilca	22eca30b45	One less compiler warning. This commit was SVN r13633.	2007-02-13 09:32:57 +00:00
George Bosilca	7b7fecad85	More output when position_debug is enabled. This commit was SVN r13631.	2007-02-13 09:28:39 +00:00
George Bosilca	5214e3751b	Correctly handle the pack and unpack for contiguous types with gaps. The pack/unpack leave the convertor in a consistent state, such that the next operation will succeed. This commit was SVN r13630.	2007-02-13 09:28:05 +00:00
Jeff Squyres	dd35fb73ff	* Fix some "MPI:Exception" typos (needs 2 :'s) * Update exactly how we handle MPI exceptions, particularly with respect to MPI-1 section 3.2.5, and how error handlers are only invoked for the ''first'' request that generates an exception. * Update the "see also" section to be consistent across all 8 MPI_Test* and MPI_Wait* functions. * Fixes trac:560 This commit was SVN r13619. The following Trac tickets were found above: Ticket 560 --> https://svn.open-mpi.org/trac/ompi/ticket/560	2007-02-12 18:08:42 +00:00
Gleb Natapov	1033002595	Fix memory leak. Free allocated descriptor if operation cannot proceed. This commit was SVN r13610.	2007-02-12 09:47:51 +00:00
Jelena Pjesivac-Grbovic	b52dc9e427	Modifying fixed decision function for reduce to utilize linear algorithm only for really small communicator sizes. This commit was SVN r13597.	2007-02-10 00:31:10 +00:00
Brian Barrett	8b28e5b33d	Allow the OOB to connect between all MPI applications during MPI_INIT without also establishing MPI connectivity. This commit was SVN r13595.	2007-02-09 20:17:37 +00:00
Brian Barrett	262cbbc5c9	Back out r13593, which contained a change that shouldn't be committed. This commit was SVN r13594. The following SVN revision numbers were found above: r13593 --> open-mpi/ompi@81472363ea	2007-02-09 20:13:02 +00:00
Brian Barrett	81472363ea	Allow the OOB to connect between all MPI applications during MPI_INIT without also establishing MPI connectivity. This commit was SVN r13593.	2007-02-09 20:11:40 +00:00
Brian Barrett	041beeb1b6	Share currently selected PML in the modex information, then check whenever adding new procs that the remote proc's pml is the same as our local pml. Turns the hangs from mismatched PMLs into an abort, which is better, I think. This commit was SVN r13582.	2007-02-09 16:38:16 +00:00
Galen Shipman	f98a442c82	Fix a problem in the selection logic for MX. Basically we need to be able to open MTL MX and BTL MX and initialize them at the same time. The problem is that both call mx_init and mx_finalize, solution is to add an external entity that does the init and finalize (based on ref counting). This commit was SVN r13576.	2007-02-09 03:19:38 +00:00
Jeff Squyres	260f1fd468	Fixes trac:817 The C++ bindings were not tracking keyvals properly -- they were freeing some internal meta data when Free_keyval() was called, not when the keyval was actually destroyed (keyvals are refcounted in the C layer, just like all other MPI objects, because they can live for long after their corresponding Free call is invoked). This commit fixes this problem and several other things: * Add infrastructure on the ompi_attribute_keyval_t for an "extra" destructor pointer that will be invoked during the "real" destructor (i.e., when OBJ_RELEASE puts the refcount to 0). This allows calling back into the C++ layer to release meta data associated with the keyval. * Adjust all cases where keyvals are created to pass in relevant destructors (NULL or the C++ destructor). * Do essentially the same for MPI::Comm, MPI::Win, and MPI::Datatype: * Move several functions out of the .cc file into the _inln.h file since they no longer require locks * Make the 4 Create_keyval() functions call a common back-end keyval creation function that does the Right Thing depending on whether C or C++ function pointers were used for the keyval functions. The back-end function does not call the corresponding C MPI__create_keyval function, but rather does the work itself so that it can associate a "destructor" callback for the C++ bindings for when the keyval is actually destroyed. Change a few type names to be more indicative of what they are (mostly dealing with keyvals [not "keys"]). * Add the 3 missing bindings for MPI::Comm::Create_keyval(). * Remove MPI::Comm::comm_map (and associated types) because it's no longer necessary in the intercepts -- it was a by-product of being a portable C++ bindings layer. Now we can just query the C layer directly to figure out what type a communicator is. This solves some logistics / callback issues, too. * Rename several types, variables, and fix many comments in the back-end C attribute implementation to make the names really reflect what they are (keyvals vs. attributes). The previous names heavily overloaded the name "key" and were ''extremely'' confusing. This commit was SVN r13565. The following Trac tickets were found above: Ticket 817 --> https://svn.open-mpi.org/trac/ompi/ticket/817	2007-02-08 23:50:04 +00:00
Jeff Squyres	33619d6b43	Minor fixes for the ompi_bitmap class that were found while investigating #817: * Remove use of legal_numbits member and always just use the full size of the array. There was a corner case where legal_numbits was not an even multiple of the number of bits in the array where bits would not get freed properly, usually causing wasted Fortran MPI handles, or, as in the case of #817, wasted attribute keyvals (i.e., the user freed them, but the bitmap didn't reflect the free). * Re-order some error checks to ensure that we don't segv (we don't currently trigger this problem anywhere; I just noticed it while doing the other attribute keyval and legal_numbits work). Since this change affects all Fortran MPI handles, I ran all the intel and ibm tests and all still pass with this change. This commit was SVN r13561.	2007-02-08 18:20:36 +00:00
Gleb Natapov	4e5deec496	Fix previous patch. In case of different sm base use pointer to tail after recalculating it. This commit was SVN r13557.	2007-02-08 14:59:18 +00:00
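
The block-size rule mentioned in the r13738 message above (36156f39c2: "The first k blocks may be 1 element larger than the remaining blocks") can be illustrated with a small sketch. This is not the Open MPI code; it is written in Python for brevity. The function names are hypothetical, and it assumes that k is the remainder of the element count divided by the number of processes.

```python
def ring_block_counts(count, nprocs):
    """Split `count` elements into `nprocs` blocks whose sizes differ by at most one.

    Assumption (not taken from the Open MPI source): k = count % nprocs,
    and the first k blocks each receive one extra element.
    """
    if nprocs <= 0:
        raise ValueError("nprocs must be positive")
    if count < 0:
        raise ValueError("count must be non-negative")
    base, k = divmod(count, nprocs)
    return [base + 1 if i < k else base for i in range(nprocs)]


def ring_block_offsets(counts):
    """Return the starting element index of each block, given the block sizes."""
    offsets = []
    position = 0
    for c in counts:
        offsets.append(position)
        position += c
    return offsets
```

For example, ring_block_counts(10, 4) gives blocks of 3, 3, 2 and 2 elements, so no block is more than one element larger than any other and the sizes always sum to the original count.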

2426 Commits