openmpi

Author	SHA1	Message	Date
Rich Graham	bc97d22182	remove tabs. Remove old code that was commented out. This commit was SVN r15975.	2007-08-28 03:08:36 +00:00
Rich Graham	4d58f9aed7	Add comments. Move temporary receive object from a free list object to a stack object. This commit was SVN r15971.	2007-08-27 21:41:04 +00:00
Brian Barrett	8b9e8054fd	Move modex from pml base to general ompi runtime, since it's used by more than just the PML/BTLs these days. Also clean up the code so that it handles the situation where not all nodes register information for a given node (rather than just spinning until that node sends information, like we do today). Includes r15234 and r15265 from the /tmp/bwb-modex branch. This commit was SVN r15310. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r15234 r15265	2007-07-09 17:16:34 +00:00
Sven Stork	21f12f29f8	- fix a sm bug that causes segfaults in the case of threaded builds. The problem is that in the case of threaded builds for every fifo a head and tail lock will be allocated inside the shared memory segment and the ptr is stored inside the fifo. In the case that the sm backend file will be mapped in all processes at the same address (mostly the case for non-thread builds) this is fine, but in the cases when the processes map the file at different addresses these addresses cause big trouble in processes other than the one that allocated the locks. Therefore the send lock addresses have to be recalculated to match the local mapping of the processes that use them. This commit was SVN r15291.	2007-07-05 14:26:32 +00:00
Gleb Natapov	b88b7dedfe	Rename btl_rdma_offset to btl_pipeline_send_length. This commit was SVN r15153.	2007-06-21 07:12:40 +00:00
Galen Shipman	3401bd2b07	Add optional ordering to the BTL interface. This is required to tighten up the BTL semantics. Ordering is not guaranteed, but, if the BTL returns an order tag in a descriptor (other than MCA_BTL_NO_ORDER) then we may request another descriptor that will obey ordering w.r.t. the other descriptor. This will allow sane behavior for RDMA networks, where local completion of an RDMA operation on the active side does not imply remote completion on the passive side. If we send a FIN message after local completion and the FIN is not ordered w.r.t. the RDMA operation then badness may occur as the passive side may now try to deregister the memory and the RDMA operation may still be pending on the passive side. Note that this has no impact on networks that don't suffer from this limitation as the ORDER tag can simply always be specified as MCA_BTL_NO_ORDER. This commit was SVN r14768.	2007-05-24 19:51:26 +00:00
George Bosilca	b2e805db61	Nothing relevant. Indentation, typos, change PTL to BTL. This commit was SVN r14727.	2007-05-23 14:03:52 +00:00
Gleb Natapov	3ebaff8dfe	Implement new BTL parameters: We eagerly send data up to btl_*_eager_limit with the match. Upon ACK of the MATCH we start using send/receives of size btl_*_max_send_size up to the btl_*_rdma_pipeline_offset. After the btl_*_rdma_pipeline_offset we begin using RDMA writes of size btl_*_rdma_pipeline_frag_size. Now, on a per message basis we only use the above protocol if the message is larger than btl_*_min_rdma_pipeline_size. btl_*_eager_limit -> same; btl_*_max_send_size -> same; btl_*_rdma_pipeline_offset -> btl_*_min_rdma_size; btl_*_rdma_pipeline_frag_size -> btl_*_max_rdma_size; btl_*_min_rdma_pipeline_size is new. This patch also moves all BTL common parameters initialisation into btl_base_mca.c file. This commit was SVN r14681.	2007-05-17 07:54:27 +00:00
Jeff Squyres	51f286d737	Just like r14289 on the ORTE trunk: Per discussions with Brian and Ralph, make a slight correction in where components are installed. Use $pkglibdir, not $libdir/openmpi, so that when compiled in the orte trunk, components are installed to the right directory (because the component search path is checking $pkglibdir). This commit was SVN r14345. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r14289	2007-04-12 11:19:42 +00:00
Gleb Natapov	d41ca417e8	Delete declaration of non-existent functions and no longer relevant comment. This commit was SVN r14341.	2007-04-12 08:12:31 +00:00
Li-Ta Lo	ec8a859a44	fixed typo This commit was SVN r14207.	2007-04-03 17:21:54 +00:00
Gleb Natapov	e5450613b5	Add new SM BTL parameter btl_sm_cb_max_num. If set to a value greater than zero it limits the number of circular buffers allocated between each pair of peers. This allows for tighter memory usage control. This commit was SVN r14120.	2007-03-22 12:21:42 +00:00
Gleb Natapov	efe0323d35	Initialize fifos at SM BTL init time instead of waiting for first send. This wastes slightly more memory, but prevents problems when a fifo cannot be allocated later during a job run when memory resources are exhausted. This commit was SVN r14119.	2007-03-22 12:18:44 +00:00
Gleb Natapov	c389c47d79	Fix SM connectivity calculations. This commit was SVN r14109.	2007-03-21 13:29:19 +00:00
Gleb Natapov	a1a14aa4c3	Add memory barriers during SM btl initialization. This commit was SVN r14099.	2007-03-21 10:25:10 +00:00
Gleb Natapov	e551c5f1a3	Get rid of separate sm BTL for different shared memory base addresses. Now that we precalculate most of the addresses there is no point in having a separate BTL for this. The sm_progress() code becomes much simpler as a result. This commit was SVN r14071.	2007-03-20 08:15:58 +00:00
Josh Hursey	dadca7da88	Merging in the jjhursey-ft-cr-stable branch (r13912 : HEAD). This merge adds Checkpoint/Restart support to Open MPI. The initial frameworks and components support a LAM/MPI-like implementation. This commit follows the risk assessment presented to the Open MPI core development group on Feb. 22, 2007. This commit closes trac:158 More details to follow. This commit was SVN r14051. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r13912 The following Trac tickets were found above: Ticket 158 --> https://svn.open-mpi.org/trac/ompi/ticket/158	2007-03-16 23:11:45 +00:00
Gleb Natapov	be018944d2	Clean up circular buffer implementation. Get rid of _same_base_address() functions by pre-calculating everything in advance. This commit was SVN r13923.	2007-03-05 14:27:26 +00:00
Gleb Natapov	8078ae5977	Optimize sm communication. Pass message type (MCA_BTL_SM_FRAG_ACK/ MCA_BTL_SM_FRAG_SEND) and status success/fail in low bits of pointers we are passing through circular buffer. The rank that receives ACK doesn't need to look into data it received and this is a big win since this data is not in the cache of the rank's CPU. (Note that we can use low bits of pointers because free_list always returns pointers aligned at least to cache line size). This commit was SVN r13922.	2007-03-05 14:24:09 +00:00
Gleb Natapov	90fb58de4f	When frags are allocated from mpool by free_list the frag structure is also allocated from mpool memory (which is registered memory for RDMA transports). This is not a problem for small jobs, but for a big number of ranks the amount of wasted memory is large. This commit was SVN r13921.	2007-03-05 14:17:50 +00:00
Gleb Natapov	4d4b0a022a	Add error callback to sm BTL. Call it when allocation of the initial circular buffer fails. If cb is already allocated, but it is full and allocation of additional cb fails, we spin waiting for receiver to free space in existing cb. This commit was SVN r13635.	2007-02-13 12:01:36 +00:00
George Bosilca	b611e6d7dc	Fewer warnings. This commit was SVN r13419.	2007-02-01 17:51:43 +00:00
Brian Barrett	58b325b03f	Two changes to improve the sm situation with spawn: * have the mpool size be based on MCW, not num procs in other jobs we know about. Solves the problem of the spawned job having a much bigger than needed sm file * Can't assume that "me" is in the list of procs passed to addprocs, so need to use slightly different logic and not go through all of add procs unless there's a proc in my job that isn't me. This seems to greatly improve the situation, although there still seems to be more of a slowdown through MPI_INIT for the children (if there are more than one child) than MPI_INIT for the parent if there are 'n' children compared to 'n' parents. Hopefully that made sense ;) This commit was SVN r13417.	2007-02-01 17:18:35 +00:00
Brian Barrett	a34e67d743	Remove unneeded PARAM_INIT_FILE variable in configure.params files used by components that use configure.m4 for configuration or are always built. The macro has not been needed since moving to configure types other than configure.stub Fixes trac:590 This commit was SVN r13031. The following Trac tickets were found above: Ticket 590 --> https://svn.open-mpi.org/trac/ompi/ticket/590	2007-01-08 03:44:22 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Brian Barrett	6f8b366acb	Rename liborte to libopen-rte and libopal to libopen-pal per telecon today and bug #632. Refs trac:632 This commit was SVN r12762. The following Trac tickets were found above: Ticket 632 --> https://svn.open-mpi.org/trac/ompi/ticket/632	2006-12-05 18:27:24 +00:00
George Bosilca	126a68dc9a	Big datatype commit. Remove all unused features of the datatype engine. As the memory allocation logic is completely done outside the data-type engine (in the PML) there is no need for any special case inside the data-type engine. There are fewer arguments for ompi_convertor_pack and ompi_convertor_unpack as well (the last field free_after is not required anymore as there is no memory allocated in the engine itself). This change affects all components using datatypes. I tested most of them, but I might have missed some ... If that's the case please let me know (don't shoot the pianist!!). This commit was SVN r12331.	2006-10-26 23:11:26 +00:00
George Bosilca	d7268557a8	Complete the SM BTL changes. Now all displacements are ptrdiff_t and there are no warnings about any issue with signed/unsigned. This commit was SVN r12234.	2006-10-20 19:28:12 +00:00
George Bosilca	c86214f420	Fix the SM BTL issues. The problem seems to come from the fact that the maximum number of nodes on the SM file should be signed, as we use the -1 to unlimit it. This commit was SVN r12227.	2006-10-20 17:25:53 +00:00
George Bosilca	06563b5dec	Last set of explicit conversions. We are now close to zero warnings on all platforms. The only exceptions (and I will not deal with them anytime soon) are on Windows: - the write functions which require the length to be an int when it's a size_t on all UNIX variants. - all iovec manipulation functions where the iov_len is again an int when it's a size_t on most of the UNIXes. As these only happen on Windows, I think we're set for now :) This commit was SVN r12215.	2006-10-20 03:57:44 +00:00
Brian Barrett	51b2a0fd3f	A couple of changes to improve shared memory behavior when resources get constrained: * Make sure we always have a number of eager fragments available that scales with the number of processes communicating with a given proc over shared memory * Use FREE_LIST_GET instead of FREE_LIST_WAIT to return an error to the PML when resource exhaustion occurs * Don't dereference the frag during alloc unless we're sure it's not NULL Reviewed by: Galen Refs trac:413 This commit was SVN r12053. The following Trac tickets were found above: Ticket 413 --> https://svn.open-mpi.org/trac/ompi/ticket/413	2006-10-06 21:13:49 +00:00
George Bosilca	a3ad4a7fc8	The visibility flags (and/or Windows friendly export) are now on for all BTLs. This commit was SVN r11662.	2006-09-14 22:19:39 +00:00
George Bosilca	3f0a7cad9e	The last patch for Windows support. Mostly casting and conversion to C++ friendly headers. This commit was SVN r11400.	2006-08-24 16:38:08 +00:00
Galen Shipman	e5c594c211	More updates for the async error handler for btl's. In order to provide backwards compatibility the framework versions are bumped and the handler registration function is at the end of the btl struct. Testing done on sm, openib, and gm.. This commit was SVN r11256.	2006-08-17 22:02:01 +00:00
Galen Shipman	3b49953ce2	Add error callback to the btl interface, this allows errors to be delivered to the upper layer asynchronously although there are some issues with this.. such as there are multiple consumers of the btl's.. who get's the This commit was SVN r11232.	2006-08-16 20:21:38 +00:00
Brian Barrett	4c101c6394	* rename the collectives sm bootstrap area to be consistent with other shared memory segments * make sure to properly unlink the collectives sm bootstrap area at shutdown * Add missing / in the path for the mpool shared memory segment * make sure to release the common_mmap structure in the SM btl after unlinking the file during shutdown This commit was SVN r10886.	2006-07-19 20:55:29 +00:00
George Bosilca	21c542f0a5	Make the SM BTL FT friendly. Now there are 3 FT friendly BTLs: TCP, SM and self. This commit was SVN r10780.	2006-07-13 07:42:18 +00:00
George Bosilca	d00e6e29e8	Create a close function for the mpool SM module, in order to allow the cleanup. The mca_common_sm_mmap file was left over by the SM mpool, and there was nobody able to unmap and unlink it. This commit was SVN r10770.	2006-07-12 22:12:07 +00:00
George Bosilca	fd39203262	As the self proc is marked as local, there will always be at least one local proc. Don't create the SM file until we really know there is someone else on the same node. This commit was SVN r10740.	2006-07-11 17:05:13 +00:00
Galen Shipman	218a438509	finished the ompi_free_list_t class nightmare.. This commit was SVN r10314.	2006-06-12 22:09:03 +00:00
Galen Shipman	c79efc9efb	track which list a fragment came from, allows returning based on list, not on size. This commit was SVN r10142.	2006-05-31 14:24:32 +00:00
Jeff Squyres	dd44d36be0	Fix for ticket #25 . Ensure that in the threaded case where we have This commit was SVN r10043.	2006-05-24 16:15:07 +00:00
Jeff Squyres	e24377a89c	Back out a pair of commits from George from last week because they apparently don't work properly: r9869, r9868 (sm btl alignment issues) This commit was SVN r9936. The following SVN revision numbers were found above: r9868 --> open-mpi/ompi@9b985c3216 r9869 --> open-mpi/ompi@adedf511fb	2006-05-16 16:48:43 +00:00
George Bosilca	adedf511fb	Remove the printf that I unfortunately committed. This commit was SVN r9869.	2006-05-10 00:02:54 +00:00
George Bosilca	9b985c3216	Force the useful data to be aligned on special boundary. It is 32 bits right now. Some testing on large NUMA machines should be done in order to make sure that we need to export this variable out to the MCA layer. This commit was SVN r9868.	2006-05-09 21:46:10 +00:00
George Bosilca	a386fccccc	Increase the default limits for the SM BTL. These new values allow better performance on all the clusters I was able to test. This commit was SVN r9867.	2006-05-09 21:44:24 +00:00
Tim Woodall	350d5b1713	change hardcoded values into mca params This commit was SVN r9815.	2006-05-04 15:20:18 +00:00
Tim Woodall	c7ee5e13bc	simplification - don't swap src/dst pointers - always leave both src/dst pointing to same segments This commit was SVN r9357.	2006-03-21 18:20:17 +00:00
Tim Woodall	712468dbef	add diagnostic interface This commit was SVN r9328.	2006-03-17 17:39:41 +00:00
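
The send protocol described in commit 3ebaff8dfe can be sketched as a small planning function. This is only an illustration and not Open MPI code: the `BtlParams` class and the `plan_transfer` function are hypothetical names. It assumes that the pipeline offset is measured from the start of the message, and that messages at or below the minimum pipeline size are finished with plain send/receive fragments. The commit message states neither point.

```python
from dataclasses import dataclass


@dataclass
class BtlParams:
    """Hypothetical container for the per-BTL limits named in the commit."""
    eager_limit: int
    max_send_size: int
    rdma_pipeline_offset: int
    rdma_pipeline_frag_size: int
    min_rdma_pipeline_size: int


def plan_transfer(size, p):
    """Return a list of (kind, nbytes) steps for a message of `size` bytes."""
    # Step 1: the eager part travels together with the match header.
    eager = min(size, p.eager_limit)
    plan = [("eager", eager)]
    sent = eager

    # The pipelined RDMA protocol is only used for large enough messages.
    use_rdma = size > p.min_rdma_pipeline_size

    # Step 2: send/receive fragments up to the pipeline offset
    # (assumption: without RDMA, send/receive covers the whole remainder).
    send_end = min(size, p.rdma_pipeline_offset) if use_rdma else size
    while sent < send_end:
        n = min(p.max_send_size, send_end - sent)
        plan.append(("send", n))
        sent += n

    # Step 3: everything past the offset goes out as RDMA writes.
    while sent < size:
        n = min(p.rdma_pipeline_frag_size, size - sent)
        plan.append(("rdma_write", n))
        sent += n

    return plan
```

With toy limits such as `BtlParams(4, 8, 20, 16, 32)`, a 50-byte message is planned as one eager step, two send fragments, and two RDMA writes, while a 30-byte message never leaves the send/receive path.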


95 Commits