openmpi

Автор	SHA1	Сообщение	Дата
George Bosilca	80c02647c8	Each level (OPAL/ORTE/OMPI) should only return it's own constants, instead of the current mismatch. This commit was SVN r25230.	2011-10-04 14:50:31 +00:00
George Bosilca	4184baa67a	Remove the proc_guid from the BTL proc structure. Instead use directly the one stored in the ompi_proc_t. This commit was SVN r24461.	2011-02-25 00:36:08 +00:00
Donald Kerr	47dc1bd493	fix #1828 ; rework the private data connection establishment process; reviewed by terry d. This commit was SVN r20889.	2009-03-26 17:54:44 +00:00
Donald Kerr	ef55aae401	fix #1829 : udapl btl support for relaxed ordering This commit was SVN r20772.	2009-03-13 01:01:00 +00:00
Rainer Keller	296a6fb275	- So much fun along the way: we normally don't do opal/include/opal/... Just use the std. opal/... This commit was SVN r20766.	2009-03-12 19:21:11 +00:00
Rainer Keller	4c0e8e1e69	- Header orte/mca/oob/base/base.h is probably the wrong one to include anyhow -- if oob functionality is neededm then orte/mca/oob/oob.h Nevertheless compiles fine with -Wimplicit-function-declaration This commit was SVN r20641.	2009-02-26 04:20:03 +00:00
Ralph Castain	eaa57e29b6	Revert r20480 as this breaks the trunk. The dpm.h include file has defines for OMPI_RML tags that are required for wireup. This commit was SVN r20482. The following SVN revision numbers were found above: r20480 --> open-mpi/ompi@62282fefe5	2009-02-09 14:14:45 +00:00
Rainer Keller	62282fefe5	- Get rid of #include "ompi/mca/dpm/dpm.h" This commit was SVN r20480.	2009-02-09 02:56:10 +00:00
Donald Kerr	e57435a5d4	udapl btl fix for #1725 ; replace WAIT with GET This commit was SVN r20227.	2009-01-08 13:41:36 +00:00
Donald Kerr	213daa58da	support for solaris relaxed ordering This commit was SVN r20167.	2008-12-24 15:05:12 +00:00
Jeff Squyres	671f0c379d	Remove a whole pile of orte/util/show_help.h's that I missed. :-( This commit was SVN r18437.	2008-05-14 11:32:33 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already make the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 1. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not thei opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()). * Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so you if explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent view the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing to messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active. = New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Donald Kerr	843a35094f	adding local work queue accounting This commit was SVN r18352.	2008-05-01 21:01:51 +00:00
Donald Kerr	ef8f807c1c	was not passing correct variable to dat_strerror This commit was SVN r17749.	2008-03-05 21:45:16 +00:00
Ralph Castain	d70e2e8c2b	Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately. Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer This commit was SVN r17632.	2008-02-28 01:57:57 +00:00
Donald Kerr	437e280829	removing a few superfluous casts when the base or super is available This commit was SVN r17554.	2008-02-22 20:10:55 +00:00
Donald Kerr	58bf7f5a1d	add uintptr_t to prevent the possibility of a signed extension occuring This commit was SVN r17456.	2008-02-14 19:16:34 +00:00
Donald Kerr	5f884b1ca4	fix for #1130 - adds support for multi-rail configurations This commit was SVN r17152.	2008-01-17 17:30:50 +00:00
Donald Kerr	908b514ac5	update use of internal tag values to accommodate the active message change found in r17140 This commit was SVN r17148. The following SVN revision numbers were found above: r17140 --> open-mpi/ompi@6310ce955c	2008-01-16 21:17:25 +00:00
George Bosilca	906e8bf1d1	Replace the ompi_pointer_array with opal_pointer_array. The next step (sometimes after the merge with the ORTE branch), the opal_pointer_array will became the only pointer_array implementation (the orte_pointer_array will be removed). This commit was SVN r17007.	2007-12-21 06:02:00 +00:00
Donald Kerr	d05d3afaed	clean up and make consistent the reporting out from the udapl btl; report out readeable event string instead of just a number This commit was SVN r16954.	2007-12-13 15:32:26 +00:00
Tim Prins	5a795128af	Change it so that different components in orte use unique rml tags This commit was SVN r15881.	2007-08-16 14:02:35 +00:00
Donald Kerr	8ecbc71ed2	add support for connection private data, off by default This commit was SVN r14878.	2007-06-05 19:29:50 +00:00
Donald Kerr	2ed72bf2e2	break evd_qlen into individual qlens (async,dto,conn); add checks based on udapl limits and number of peers This commit was SVN r14659.	2007-05-15 17:47:00 +00:00
Donald Kerr	436d370d51	latency improvements: use ompi_free_list_init_ex, create optimal alignment parameter, remove rdma guarantee path, replace dat_lmt_sync_rdma with use of volatile This commit was SVN r14634.	2007-05-09 19:41:25 +00:00
Donald Kerr	80d984441f	change so that we only check connection queue when expecting a connection; create a mca parameter that controls frequency at which the async queue is checked This commit was SVN r14511.	2007-04-25 17:46:25 +00:00
Donald Kerr	cae24fcde1	move mca parameter registration into own .c and .h files This commit was SVN r14493.	2007-04-24 18:34:16 +00:00
Donald Kerr	3f428af7b8	couple of minor changes to fix #973 and seperated eager rdma fragments into structure only and data only area This commit was SVN r14470.	2007-04-23 17:41:34 +00:00
Gleb Natapov	90fb58de4f	When frags are allocated from mpool by free_list the frag structure is also allocated from mpool memory (which is registered memory for RDMA transports) This is not a problem for a small jobs, but for a big number of ranks an amount of waisted memory is big. This commit was SVN r13921.	2007-03-05 14:17:50 +00:00
Donald Kerr	ed097d17c1	fix for bug #749 , though I can not confirm without a linux compiler This commit was SVN r13090.	2007-01-11 22:25:13 +00:00
Donald Kerr	80f2cbb498	add udapl rdma capabilities into the udapl btl This commit was SVN r13082.	2007-01-11 15:22:08 +00:00
Brian Barrett	48ec0b2071	Revert out r12974, 12976, and 12991 as George has provided a less intrusive fix for now... This commit was SVN r12997. The following SVN revision numbers were found above: r12974 --> open-mpi/ompi@27cea44a9c	2007-01-04 22:07:37 +00:00
Brian Barrett	27cea44a9c	Fix a number of issues with the ompi_ptr_t: * Make sure that the pval always writes to the correct portion of the lval. This only matters on 32 bit big endian machines. * On 32 bit machines when assigning to pval, the other 4 bytes of lval weren't being written, which could lead to bogus data We use macros so that there aren't casts all over the code and the pval assignment can occur to the correct 4 bytes. Refs trac:587 This commit was SVN r12974. The following Trac tickets were found above: Ticket 587 --> https://svn.open-mpi.org/trac/ompi/ticket/587	2007-01-03 19:47:48 +00:00
Gleb Natapov	190e7a27cd	Merge with gleb-mpool branch. All RDMA components use same mpool now (rdma). udapl/openib/vapi/gm mpools a deprecated. rdma mpool has parameter that allows to limit its size mpool_rdma_rcache_size_limit (default is 0 - unlimited). This commit was SVN r12878.	2006-12-17 12:26:41 +00:00
Ralph Castain	6d6cebb4a7	Bring over the update to terminate orteds that are generated by a dynamic spawn such as comm_spawn. This introduces the concept of a job "family" - i.e., jobs that have a parent/child relationship. Comm_spawn'ed jobs have a parent (the one that spawned them). We track that relationship throughout the lineage - i.e., if a comm_spawned job in turn calls comm_spawn, then it has a parent (the one that spawned it) and a "root" job (the original job that started things). Accordingly, there are new APIs to the name service to support the ability to get a job's parent, root, immediate children, and all its descendants. In addition, the terminate_job, terminate_orted, and signal_job APIs for the PLS have been modified to accept attributes that define the extent of their actions. For example, doing a "terminate_job" with an attribute of ORTE_NS_INCLUDE_DESCENDANTS will terminate the given jobid AND all jobs that descended from it. I have tested this capability on a MacBook under rsh, Odin under SLURM, and LANL's Flash (bproc). It worked successfully on non-MPI jobs (both simple and including a spawn), and MPI jobs (again, both simple and with a spawn). This commit was SVN r12597.	2006-11-14 19:34:59 +00:00
Terry Dontje	bc93adee26	Fixed connection inversion bug by putting in sequence checking for first sendrecv exchanges for each connection. This was to fix Trac #390. This commit was SVN r11821.	2006-09-26 13:53:00 +00:00
Terry Dontje	d636db5832	Fixed bug trac #213 by moving the udapl btl header to being a footer. Also fixed bug trac #346. This commit was SVN r11760.	2006-09-22 19:28:09 +00:00
Donald Kerr	ba1688dff2	Removing component level lock from mca_btl_udapl_endpoint_finish_eager() routine because it is already locked before entry. Will be evaluating entire lock scheme but this one was blocking as it was. This commit was SVN r11161.	2006-08-11 18:46:06 +00:00
Andrew Friedley	b7e0484c37	Give up on dat_ep_query() and instead manually send our address information across the wire after connection establishment. I've introduced a race condition - seeing occasional LOCAL_LENGTH errors on the receive side. I think I'm mixing up eager/max somehow - will look at it more on monday. This commit was SVN r10690.	2006-07-07 21:48:16 +00:00
Andrew Friedley	365c81d6e9	Fix a few issues reported by Terry Dontje: 1. ompi/mca/btl/udapl/btl_udapl_proc.c should be including btl_udapl_endpoint.h for mca_btl_udapl_proc_insert function. 2. btl_udapl_endpoint.c it looks like you are using &endpoint->endpoint_lock when you should use &ep->endpoint_lock in a OPAL_THREAD_LOCK call. 3. btl_udapl_frag.h has a couple opal_list_item_t's that should be ompi_free_list_item_t in the _FRAG_ALLOC_{EAGER,MAX} macros. This commit was SVN r10442.	2006-06-20 17:13:44 +00:00
Andrew Friedley	c68c6ac122	A number of fixes and the usual cleanup.. - Added some basic flow control to limit number of posted sends. - Merged endpoint send/recv lock into single endpoint lock. - Set the LMR triplet length in the send path, not at allocation time. This has to be done because upper layers might send less than the amount allocated. - Alter the tie-breaker if statement protecting the second call to dat_ep_connect(). The logic was reversed compared to the tie- breaker for the first dat_ep_connect(), making it possible for 3 or more processes to form a deadlock loop. - Some asserts were added for debugging purposes.. leaving them in place for now. This commit was SVN r10317.	2006-06-12 22:42:01 +00:00
Andrew Friedley	8a3d0862ca	I can commit! happy dance Trying to remember what I did here.. eager/max messages should work now, no RDMA yet. A number of other fixes and cleanups. I do know of two problems: Bad stuff happens when flooded with send frags too quickly - the BTL doesn't handle flow control. Certain IBM tests turn up a length assertion in the datatype engine - needs more investigation. This commit was SVN r10070.	2006-05-25 15:47:59 +00:00
Andrew Friedley	345551cb36	Checkpoint before starting work on max-sized frags (maybe user too?). - Some initial work on prepare_src - Move some fragment initialization around - Fix a union casting issue on picky compilers, identified by Don Kerr - Other small cleanups/bugfixes This commit was SVN r9662.	2006-04-19 22:20:22 +00:00
Andrew Friedley	d461b55696	- Implement OOB connection handshaking via the ORTE RML. To start a connect, we send our local addr_t OOB. Remote side then matches endpoints and calls dat_ep_connect(). Everything should be the same as before from here, except that client/server roles are reversed. - Properly set our buffer size when posting receives. When the frag used to transfer address information is recycled by the free list, the wrong buffer size was being used, which caused buffer overflow errors. - Finally put the uDAPL error handling stuff in the mpool component. - Remove a few more OPAL_OUTPUTs. This commit was SVN r9569.	2006-04-07 15:26:05 +00:00
Andrew Friedley	74b2f77a4c	The expected cleanup/refactoring commit.. Not much got tested that wasn't already - I've uncovered a connection establishment deadlock and wanted to get these changes committed before I attack it. The big changes: - Moved much of the connection code from btl_udapl_component.c to btl_udapl_endpoint.c. - Cleaned up initialization of various fragment members. - MCA_BTL_UDAPL_ERROR macro, which is compiled in/out appropriately. This commit was SVN r9496.	2006-03-31 16:25:19 +00:00
Andrew Friedley	0eba366b07	Various pieces all over to make basic small message send/recv work. Next step is clean up the code.. it is in need of refactoring and testing. Thanks to Brian for help in troubleshooting! This commit was SVN r9466.	2006-03-29 21:55:41 +00:00
Andrew Friedley	48d61cd99a	Mostly fragment/LMR handling fixes: - Grab the mpool_registration in _frag_common_constructor() - Save the LMR context in the segment key - No need for cookie variables - can just cast the frag - No need to memcpy() data when recv'ing - Add an LMR triplet to the fragment structure and initialize it in btl_udapl_alloc(). - Whitespace/typo fixes, remove some opal_output() calls Looks like I can use triplets describing sub-regions of registered LMR's. So I do this - prior to this patch I was sending the entire free list memory over, which isn't correct :) Back to an earlier problem - when sending address information right after connection establishment, the receiving end receives a DTO completion event and appears to have good data. But the sending end never receives a DTO completion event indicating the send completed, and never completes the client side of the connection. This commit was SVN r9386.	2006-03-23 16:21:08 +00:00
Andrew Friedley	cf9246f7b9	Long overdue commit.. many changes. In short, I'm very close to having connection establishment and eager send/recv working. Part of the connection process involves sending address information from the client to server. For some reason, I am never receiving an event indicating completetion of the send on the client side. Otherwise, connection establishment is working and eager send/recv should be trivial from here. Some more detailed changes: - Send partially implemented, just handles starting up new connections. - Several support functions implemented for establishing connection. Client side code went in btl_udapl_endpoint.c, server side in btl_udapl_component.c - Frags list and send/recv locks added to the endpoint structure. - BTL sets up a public service point, which listens for new connections. Steps over ports that are already bound, iterating through a range of ports. - Remove any traces of recv frags, don't think I need them after all. - Pieces of component_progress() implemented for connection establishment. - Frags have two new types for connection establishment - CONN_SEND and CONN_RECV. - Many other minor cleanups not affecting functionality This commit was SVN r9345.	2006-03-21 00:12:55 +00:00
Brian Barrett	566a050c23	Next step in the project split, mainly source code re-arranging - move files out of toplevel include/ and etc/, moving it into the sub-projects - rather than including config headers with <project>/include, have them as <project> - require all headers to be included with a project prefix, with the exception of the config headers ({opal,orte,ompi}_config.h mpi.h, and mpif.h) This commit was SVN r8985.	2006-02-12 01:33:29 +00:00
Andrew Friedley	b37e18916f	Many different things, the big ones: - Start filling in the progress function, focusing on connection establishment. - Initialize udapl mpool and free lists - Create/destroy a protection zone with each IA - Misc organization as I learn how things work This commit was SVN r8969.	2006-02-10 21:49:15 +00:00

1 2

51 Коммитов