openmpi

Author	SHA1	Message	Date
Terry Dontje	12baa72580	This commit fixes trac:1306 This commit was SVN r18718. The following Trac tickets were found above: Ticket 1306 --> https://svn.open-mpi.org/trac/ompi/ticket/1306	2008-06-24 14:38:11 +00:00
George Bosilca	1eb62b6c48	Remove a warning. Close ticket #1357 . This commit was SVN r18717.	2008-06-24 14:23:02 +00:00
Jeff Squyres	3f95b906c5	Fixes trac:1085: Improves SLURM configure logic to also allow OS X or any platform where srun is found in the PATH. This commit was SVN r18714. The following Trac tickets were found above: Ticket 1085 --> https://svn.open-mpi.org/trac/ompi/ticket/1085	2008-06-23 23:12:55 +00:00
Jeff Squyres	281c37afcc	Ensure to ignore the "empty" CPC components. This commit was SVN r18702.	2008-06-21 11:39:53 +00:00
Jeff Squyres	807e2cc742	Mark a notable place where we need to return an error up to the BTL or PML. This commit was SVN r18701.	2008-06-20 22:11:49 +00:00
Jeff Squyres	5ded50df0e	* Fix a > that should be == * Ensure to destroy the correct QP (local->id[num]->qp will always have a valid pointer in it, even if we set up a dummy qp) * Note two notable places where we need to figure out how to propagate errors up from the CPC to the main BTL / PML when errors occur. Probably have the same issue in IBCM, too. This commit was SVN r18700.	2008-06-20 22:09:30 +00:00
Jeff Squyres	0074126886	Per #1352 , most iWARP adapters today cannot handle connections between two processes on the same server (!). So for today, we'll simply mark all local processes that use iWARP adapters as "unreachable". More details in #1352. This commit was SVN r18699.	2008-06-20 22:08:00 +00:00
Jeff Squyres	f4145fce7a	Ensure that we don't try to shut down a thread that is not [yet] there (e.g., if you're excluding some devices, their destructors will be invoked before the async event thread was set up for them). This commit was SVN r18698.	2008-06-20 19:30:51 +00:00
Jeff Squyres	ed17b51204	Adjust the max_inline default size down so that it can be accepted on multiple adapters (e.g., Chelsio T3). But we need to figure out how to determine a good value for the resident adapter(s) at runtime. It's problematic because, for example, Mellanox ConnectX and Chelsio T3 report max_inline values differently at run-time. If you ibv_create_qp with a max_inline value of 0, ConnectX reports back a value that is a formula based on a few other values (e.g., max_send_sge and max_recv_sge). But T3 always reports back "64". We're looking into this to figure out the best way -- reducing the default right now should allow other adapters to run while we figure it out. This commit was SVN r18697.	2008-06-20 18:24:04 +00:00
George Bosilca	54e7e03695	One less warning. This commit was SVN r18695.	2008-06-20 17:50:19 +00:00
Jeff Squyres	7905db57bd	Slightly decrease the number of buffers for the NetXen adapter This commit was SVN r18691.	2008-06-20 01:00:22 +00:00
George Bosilca	bc9b950162	Honor ^ for the PML selection. This commit was SVN r18683.	2008-06-19 16:50:46 +00:00
Ralph Castain	3b5e80fa61	Shift responsibility for preconnecting the oob to the orte routed framework, which is the only place that knows what needs to be done. Only the direct module will actually do anything - it uses the same algo as the original preconnect function. This commit was SVN r18677.	2008-06-19 13:48:26 +00:00
Pavel Shamis	4537827973	Making the qp allocation more optimized. - sq parameter was replaced with max_inline parameter - inline is allocated only for relevant QPs This commit was SVN r18675.	2008-06-19 08:40:39 +00:00
George Bosilca	8e7c35e76c	These symbols are only available via the module/component structure, so they don't have to be globally visible. This commit was SVN r18666.	2008-06-18 08:20:02 +00:00
Ralph Castain	0532d799d6	Complete implementation of the --without-rte-support configure option. Working with Brian, this has been tested on RedStorm. Some minor changes to help facilitate debugger support so that both mpirun and yod can operate with it. Still to be completed. This commit was SVN r18664.	2008-06-18 03:15:56 +00:00
Lenny Verkhovsky	f4811d6c4d	NUMA Awareness support. Gleb's patch This commit was SVN r18658.	2008-06-15 13:43:28 +00:00
Brian Barrett	79ad6d983e	- The ptmalloc2 memory manager component is now by default built as a standalone library named libopenmpi-malloc. Users wanting to use leave_pinned with ptmalloc2 will now need to link the library into their application explicitly. All other users will use the libc-provided allocator instead of Open MPI's ptmalloc2. This change may be overridden with the configure option enable-ptmalloc2-internal - The leave_pinned options will now default to using mallopt on Linux in the cases where ptmalloc2 was not linked in. mallopt will also only be available if munmap can be intercepted (the default whenever Open MPI is not compiled with --without-memory-manager). - Open MPI will now complain and refuse to use leave_pinned if no memory intercept / mallopt option is available. This commit was SVN r18654.	2008-06-13 22:32:49 +00:00
Galen Shipman	44cd373a87	I also forgot to initialize the convertor max_data; George probably copied this dumb mistake from me. This commit was SVN r18653.	2008-06-13 18:33:43 +00:00
George Bosilca	170b9c344e	Mea culpa. I forgot to initialize the max_data before the call to the convertor. This commit was SVN r18651.	2008-06-12 17:24:39 +00:00
Pavel Shamis	dc3f14736d	Fixing QP initialization stuff. This commit was SVN r18650.	2008-06-11 16:31:39 +00:00
Galen Shipman	a239877b78	revert my previous boneheadedness This commit was SVN r18634.	2008-06-10 01:19:04 +00:00
George Bosilca	dc0ab0d0a8	Enable the sendi path. This commit was SVN r18633.	2008-06-09 23:03:56 +00:00
Galen Shipman	4ef4a9520f	remove showhelp.. This commit was SVN r18628.	2008-06-09 20:53:01 +00:00
Aurelien Bouteiller	ebe6df4c06	Moving the pml_v_output global variable inside the pml_v structure. This should avoid one of the missing symbols when visibility is enabled. This commit was SVN r18627.	2008-06-09 20:38:44 +00:00
Galen Shipman	9efbec0383	fix normal send path remove unneeded checks This commit was SVN r18624.	2008-06-09 20:25:27 +00:00
Galen Shipman	dbd282fcad	doh.. fix GET protocol.. This commit was SVN r18623.	2008-06-09 19:45:44 +00:00
Ralph Castain	9613b3176c	Effectively revert the orte_output system and return to direct use of opal_output at all levels. Retain the orte_show_help subsystem to allow aggregation of show_help messages at the HNP. After much work by Jeff and myself, and quite a lot of discussion, it has become clear that we simply cannot resolve the infinite loops caused by RML-involved subsystems calling orte_output. The original rationale for the change to orte_output has also been reduced by shifting the output of XML-formatted vs human readable messages to an alternative approach. I have globally replaced the orte_output/ORTE_OUTPUT calls in the code base, as well as the corresponding .h file name. I have test compiled and run this on the various environments within my reach, so hopefully this will prove minimally disruptive. This commit was SVN r18619.	2008-06-09 14:53:58 +00:00
George Bosilca	2aec094d56	The PML V is a component so it should use OMPI_MODULE_DECLSPEC. This commit was SVN r18610.	2008-06-06 17:43:57 +00:00
Josh Hursey	1de50b523c	Fix some Coverity 'Event set_but_not_used' highlights. Thanks to Jeff for bringing them to my attention. This commit was SVN r18606.	2008-06-06 14:38:41 +00:00
Jeff Squyres	1a748bc7be	First cut at the NetEffect NE020 NIC. This commit was SVN r18599.	2008-06-05 20:24:24 +00:00
Jeff Squyres	9109f7126a	Per CID 988, free some memory that would be leaked in an error condition. This commit was SVN r18597.	2008-06-05 20:04:38 +00:00
Jeff Squyres	f0d465c30a	Slightly simplify the code and remove a compiler warning. This commit was SVN r18596.	2008-06-05 19:08:08 +00:00
Jeff Squyres	b1999bbba3	* Use inclusive NIC/HCA language * Add a description of receive_queues This commit was SVN r18595.	2008-06-05 19:07:22 +00:00
Pavel Shamis	7b9024bc05	Updating Mellanox's Copyright in files touched in 2008 This commit was SVN r18592.	2008-06-05 13:40:26 +00:00
Pavel Shamis	379e00050c	Fixing openib btl finalize flow. Bug fix for #1286 . This commit was SVN r18590.	2008-06-05 12:20:13 +00:00
Jeff Squyres	91a281080a	Fix a compiler warning for a case that would never really happen anyway. Rename a variable to be a bit more descriptive. This commit was SVN r18585.	2008-06-04 19:10:23 +00:00
Jeff Squyres	bc584dedd6	Remove a compiler warning that would never happen in practice. This commit was SVN r18584.	2008-06-04 19:03:02 +00:00
Jeff Squyres	6e37dd0ef0	Fix some 32/64 printf errors once and for all This commit was SVN r18582.	2008-06-04 14:39:37 +00:00
Pavel Shamis	0a8321e08d	Calls to APM functions should be protected with OMPI_HAVE_THREADS. This commit was SVN r18581.	2008-06-04 14:27:41 +00:00
Jeff Squyres	5e918ad25d	Add first cut of NetXen iWARP NIC definition. May still be refined with more experimentation. This commit was SVN r18580.	2008-06-04 12:11:45 +00:00
Pavel Shamis	c73ed2b256	Updating cpc name from xrc to xoob. This commit was SVN r18571.	2008-06-04 08:50:30 +00:00
George Bosilca	4d8cbbc167	Add Pasha's patch as it correctly solves the issues. In fact, in the current incarnation these functions do not need the inline keyword anymore. This commit was SVN r18558.	2008-06-03 16:03:36 +00:00
Ralph Castain	c992e99035	Remove the tags from orte_output_open and the filtering operation from orte_output - this will be handled differently to improve the XML output interface This commit was SVN r18557.	2008-06-03 14:24:01 +00:00
Jeff Squyres	69d78c6739	Fixes trac:1215: adds specific show_help messages about PP vs. SRQ/XRC RNR retry exceeded errors. This commit was SVN r18554. The following Trac tickets were found above: Ticket 1215 --> https://svn.open-mpi.org/trac/ompi/ticket/1215	2008-06-02 11:03:48 +00:00
Jeff Squyres	8c267d50a3	Fixes trac:1121. We already show_help when we fail to create queues, so I just made the message a little more verbose such that it may be that OMPI is trying to use a feature that is not supported on the hardware. This commit was SVN r18553. The following Trac tickets were found above: Ticket 1121 --> https://svn.open-mpi.org/trac/ompi/ticket/1121	2008-05-30 19:03:58 +00:00
George Bosilca	e361bcb64c	Send optimizations. 1. The send path gets shorter. The BTL is allowed to return > 0 to specify that the descriptor was pushed to the network, and that the memory attached to it is available again for the upper layer. The MCA_BTL_DES_SEND_ALWAYS_CALLBACK flag can be used by the PML to force the BTL to always trigger the callback. Unmodified BTLs will continue to work as expected, as they will return OMPI_SUCCESS, which forces the PML to have exactly the same behavior as before. Some BTLs have been modified: self, sm, tcp, mx. 2. Add send immediate interface to BTL. The idea is to have a mechanism of allowing the BTL to take advantage of send optimizations such as the ability to deliver data "inline". Some network APIs such as Portals allow data to be sent using a "thin" event without packing data into a memory descriptor. This interface change allows the BTL to use such capabilities and allows for other optimizations in the future. All existing BTLs except for Portals and sm have this interface set to NULL. This commit was SVN r18551.	2008-05-30 03:58:39 +00:00
Galen Shipman	4da4c44210	Receive side changes, basically uses multiple active message callbacks rather than using a single receive callback followed by a switch on the header. Also fast pathed the matching for small fragments. This commit was SVN r18549.	2008-05-30 01:29:09 +00:00
Jeff Squyres	728ee47be4	Just check for the presence of $sysfsdir/class/infiniband and check that it's a directory. That's good enough to know that the OpenFabrics kernel drivers have been loaded. If you have no RDMA devices and don't want to see the OMPI warning about not finding any devices, then don't start the OpenFabrics kernel drivers. This commit was SVN r18540.	2008-05-29 14:19:51 +00:00
Nysal Jan	25ac3629e9	eHCA does not have SRQ. Adding receive_queues value so that it works out of the box This commit was SVN r18537.	2008-05-29 13:55:39 +00:00
Jeff Squyres	d5bf8fe005	Remove unused variables. This commit was SVN r18532.	2008-05-29 11:58:16 +00:00
Jeff Squyres	e5ea9d08ca	Fixes trac:1305: check to see if $sysfsdir/class/infiniband exists and is non-empty. If not, then exit the openib btl silently. This addresses the case where libibverbs is installed (which is getting more common) and therefore the openib BTL was built/installed, but the kernel drivers are not loaded (presumably because there is no RDMA hardware present). In this case, "mpirun a.out" will not issue a warning. There appears to be no good way to definitively tell if there are no RDMA hardware devices present. For example, if libibverbs/the openib BTL is installed, there are no RDMA devices present, but the RDMA hardware kernel drivers ''are'' loaded, OMPI will warn that it was unable to find suitable devices. This warning is easily eliminated by unloading the kernel drivers. This commit was SVN r18530. The following Trac tickets were found above: Ticket 1305 --> https://svn.open-mpi.org/trac/ompi/ticket/1305	2008-05-28 22:05:47 +00:00
Pavel Shamis	28c763f751	Fixing the error flow when somebody tries to use XRC without XOOB. This commit was SVN r18527.	2008-05-28 15:56:04 +00:00
Pavel Shamis	2c81b0ab9a	Fixing compilation warning in btl_openib_connect_ibcm.c This commit was SVN r18526.	2008-05-28 15:20:48 +00:00
Pavel Shamis	879a9fe45c	setup_qps() may exit with error. This commit was SVN r18523.	2008-05-28 11:36:38 +00:00
Pavel Shamis	e657a03143	Fixing broken XRC initialization flow. This commit was SVN r18522.	2008-05-28 11:31:38 +00:00
Rolf vandeVaart	18879285c7	Fix the selection logic to prevent memory leaks. More work may be done in the priority logic but for now we just fix the leaks and preserve current behavior. This commit fixes trac:1307. This commit was SVN r18504. The following Trac tickets were found above: Ticket 1307 --> https://svn.open-mpi.org/trac/ompi/ticket/1307	2008-05-27 14:16:39 +00:00
Pavel Shamis	6596d19c90	Adding new ConnectX vendor_part_id. Fix for ticket #1310 . This commit was SVN r18495.	2008-05-26 12:25:49 +00:00
Jeff Squyres	e1f118d0e6	Remove unused variable This commit was SVN r18491.	2008-05-24 13:05:04 +00:00
Rolf vandeVaart	5baa733ad5	Fix another warning (using a variable before it was initialized.) Thanks Jeff for pointing this out. This commit was SVN r18489.	2008-05-23 13:57:55 +00:00
Jeff Squyres	1b50e5f6a5	Use the right variable in the output This commit was SVN r18487.	2008-05-23 13:11:12 +00:00
Rich Graham	b08839f9f5	change reduce-scatter/gather for non-power of 2. Spreading out the load for the non-power of 2 phase of the reduction. This commit was SVN r18486.	2008-05-22 21:42:42 +00:00
Rich Graham	f2a4b67809	automate the allreduce selection logic. This commit was SVN r18484.	2008-05-22 20:53:35 +00:00
Jeff Squyres	8faeeab81a	Style cleanup only: s/struct foo/foo_t/g to conform to rest of code base This commit was SVN r18483.	2008-05-22 19:26:00 +00:00
Jeff Squyres	1f7f0e1f96	Fixes trac:1281 * s/port/tcp_port/g where relevant to disambiguate TCP port from device port * Rework ipaddrcheck to make it work in the LMC>0 case This commit was SVN r18482. The following Trac tickets were found above: Ticket 1281 --> https://svn.open-mpi.org/trac/ompi/ticket/1281	2008-05-22 19:18:15 +00:00
Rich Graham	5900415a25	for non-powers of 2, distribute the work on the first step among all the procs doing the work. This commit was SVN r18480.	2008-05-22 18:50:53 +00:00
Jon Mason	d0e26b1cf6	Add pretty comments for _iwarp. This commit was SVN r18478.	2008-05-22 18:02:20 +00:00
Jeff Squyres	62ac6533e0	* Add proper copyrights * Ensure _iwarp.h is always included, or you'll get warnings on platforms that don't have the RDMACM * Add skeleton for function descriptions in comments in iwarp.h This commit was SVN r18477.	2008-05-22 17:41:43 +00:00
Jeff Squyres	28b56c389a	Only check if the opal_ifindex is >= 0 (opal_ifbegin() and opal_ifnext() return -1 upon completion); don't check it against opal_ifcount() -- the interface indexes aren't necessarily related to how many interfaces were found. This commit was SVN r18476.	2008-05-22 02:10:23 +00:00
Jeff Squyres	27978b29f8	Fixes trac:1302: ensure to also use the LID for identifying an incoming IBCM request (not just the port number). This commit was SVN r18475. The following Trac tickets were found above: Ticket 1302 --> https://svn.open-mpi.org/trac/ompi/ticket/1302	2008-05-22 01:28:34 +00:00
George Bosilca	c31cc5b270	Remove a warning about line being unused. This commit was SVN r18472.	2008-05-21 20:46:22 +00:00
George Bosilca	df2156568d	The Elan BTL is now thread safe, and can be built in all conditions. This commit was SVN r18471.	2008-05-21 20:44:37 +00:00
Pak Lui	1585789e8b	Fix the undeclared variable. This commit was SVN r18470.	2008-05-21 04:09:54 +00:00
Jon Mason	b9c25efbd2	Modify to comply with the "prefix rule" and remove "static inline" for the non-rdmacm enabled case. This should fix Ticket #1294. This commit was SVN r18468.	2008-05-20 23:28:59 +00:00
Jeff Squyres	64f61ebd07	Fixes trac:1285. Really. This commit has the same commit message as r18450, but without the extra bonus memory corruption that was introduced. This commit was SVN r18467. The following SVN revision numbers were found above: r18450 --> open-mpi/ompi@5295902ebe The following Trac tickets were found above: Ticket 1285 --> https://svn.open-mpi.org/trac/ompi/ticket/1285	2008-05-20 21:53:42 +00:00
Edgar Gabriel	0500420bec	Fixing a bug in the inter-communicator scatter operation, where we accidentally used rcount instead of scounts. This commit was SVN r18466.	2008-05-20 21:17:19 +00:00
Rolf vandeVaart	74d0259480	Add new implementation of barrier. This shows better performance on some clusters. However, no decision logic is changed by this commit so default behavior has not changed. This is only selectable by runtime parameters. This commit was SVN r18464.	2008-05-20 17:37:41 +00:00
Rolf vandeVaart	71091a19c3	Fix bug in spacing of code per https://svn.open-mpi.org/trac/ompi/wiki/CodingStyle . This commit was SVN r18463.	2008-05-20 14:11:10 +00:00
Rolf vandeVaart	763f5259a8	Fix memory leak of 88 bytes that occurred on each call to MPI_Comm_dup. Need to release the items and the item list after selecting the collective modules that are being used. Reviewed by Jeff Squyres. This commit was SVN r18457.	2008-05-19 21:34:01 +00:00
Jeff Squyres	01a7f7eeb6	Switch orte_output* -> OPAL_OUTPUT* for two reasons: 1. We can't use orte_output in the CPC service thread because orte is not thread safe 2. Use the macro version so that they're compiled out of production builds This commit was SVN r18455.	2008-05-19 17:42:51 +00:00
Jeff Squyres	7154776465	Removed unused variable / compiler warning. This commit was SVN r18454.	2008-05-19 13:41:45 +00:00
Jeff Squyres	76fc8dd188	Revert r18450 -- there is some memory badness in there somewhere... This commit was SVN r18451. The following SVN revision numbers were found above: r18450 --> open-mpi/ompi@5295902ebe	2008-05-18 19:11:45 +00:00
Jeff Squyres	5295902ebe	Fixes trac:1285: * allow receive_queues to be specified in the INI file * detect when multiple different receive_queues are specified and gracefully abort However, accomplishing these goals ran into multiple difficulties. By putting receive_queues in the INI file: 1. we may not find the value until we've already traversed multiple HCAs 2. we may find multiple different receive_queues values But since the openib btl initializes as it discovers each HCA/port/LID (including the BSRQ data), if we find a new receive_queues value late in the discovery process, then all the BSRQ data that was previously initialized will likely be invalid. So I had to pull all the BSRQ initialization out until after the rest of the discovery / initialization process. Additionally, note that if the user specifies the MCA parameter btl_openib_receive_queues, it trumps whatever was in the INI file. So in this case, there can never be a receive_queues conflict. This commit does the following (Jon wrote part of this, too): * adapt _ini.c to accept the "receive_queues" field in the file * move 90% of _setup_qps() from _ini.c to _component.c * move what was left of _setup_qps() into the main _register_mca_params() function * adapt init_one_hca() to detect conflicting receive_queues values from the INI file * after the _component.c loop calling init_one_hca(): * call setup_qps() to parse the final receive_queues string value * traverse all resulting btls and initialize their HCAs (if they weren't already): set up some lists and call prepare_hca_for_use() I tested this code on a dual-HCA system where I artificially put in differing receive_queues values in the INI file for the two different types of HCAs that I have and it all seemed to work. This commit was SVN r18450. The following Trac tickets were found above: Ticket 1285 --> https://svn.open-mpi.org/trac/ompi/ticket/1285	2008-05-18 18:50:56 +00:00
Jeff Squyres	caacaadb0a	Minor shuffling of code: no need to query the GID in the iWARP case. This commit was SVN r18446.	2008-05-16 03:36:48 +00:00
Jeff Squyres	9f1b5237fe	Ensure to return an error rather than continue This commit was SVN r18445.	2008-05-16 03:36:11 +00:00
Jeff Squyres	6546898f09	Minor style cleanups; nothing very important in this commit. This commit was SVN r18444.	2008-05-16 03:28:20 +00:00
Jeff Squyres	5c91f53848	Fix a minor memory leak This commit was SVN r18443.	2008-05-16 03:27:42 +00:00
Rolf vandeVaart	375406e1fa	Remove the ignore files as decided at Tuesday's developers conference call. Now, hierarchical collectives will be compiled in but the priority is still at 0 requiring a user to set mca parameters to enable them. This commit was SVN r18440.	2008-05-15 01:26:52 +00:00
Josh Hursey	35a2af28d1	Cleanup the CRCP Coord timing functionality. Provides a rough assessment of time each element of the algorithm is taking. There are more details in the code regarding how to use this feature. Also shift a few of the orte_output back to opal_output. I'm experiencing an odd problem with locks in the oob/tcp when using orte_output. I haven't had time to track it down yet. This commit was SVN r18439.	2008-05-14 19:54:20 +00:00
Jeff Squyres	671f0c379d	Remove a whole pile of orte/util/show_help.h's that I missed. :-( This commit was SVN r18437.	2008-05-14 11:32:33 +00:00
Jeff Squyres	e7ecd56bd2	This commit represents a bunch of work on a Mercurial side branch. As such, the commit message back to the master SVN repository is fairly long. = ORTE Job-Level Output Messages = Add two new interfaces that should be used for all new code throughout the ORTE and OMPI layers (we already made the search-and-replace on the existing ORTE / OMPI layers): * orte_output(): (and corresponding friends ORTE_OUTPUT, orte_output_verbose, etc.) This function sends the output directly to the HNP for processing as part of a job-specific output channel. It supports all the same outputs as opal_output() (syslog, file, stdout, stderr), but for stdout/stderr, the output is sent to the HNP for processing and output. More on this below. * orte_show_help(): This function is a drop-in-replacement for opal_show_help(), with two differences in functionality: 1. the rendered text help message output is sent to the HNP for display (rather than outputting directly into the process' stderr stream) 2. the HNP detects duplicate help messages and does not display them (so that you don't see the same error message N times, once from each of your N MPI processes); instead, it counts "new" instances of the help message and displays a message every ~5 seconds when there are new ones ("I got X new copies of the help message...") opal_show_help and opal_output still exist, but they only output in the current process. The intent for the new orte_* functions is that they can apply job-level intelligence to the output. As such, we recommend that all new ORTE and OMPI code use the new orte_* functions, not the opal_* functions. === New code === For ORTE and OMPI programmers, here's what you need to do differently in new code: * Do not include opal/util/show_help.h or opal/util/output.h. Instead, include orte/util/output.h (this one header file has declarations for both the orte_output() series of functions and orte_show_help()).
* Effectively s/opal_output/orte_output/gi throughout your code. Note that orte_output_open() takes a slightly different argument list (as a way to pass data to the filtering stream -- see below), so if you explicitly call opal_output_open(), you'll need to slightly adapt to the new signature of orte_output_open(). * Literally s/opal_show_help/orte_show_help/. The function signature is identical. === Notes === * orte_output'ing to stream 0 will do something similar to what opal_output'ing did, so leaving a hard-coded "0" as the first argument is safe. * For systems that do not use ORTE's RML or the HNP, the effect of orte_output_* and orte_show_help will be identical to their opal counterparts (the additional information passed to orte_output_open() will be lost!). Indeed, the orte_* functions simply become trivial wrappers to their opal_* counterparts. Note that we have not tested this; the code is simple but it is quite possible that we mucked something up. = Filter Framework = Messages sent via the new orte_* functions described above and messages output via the IOF on the HNP will now optionally be passed through a new "filter" framework before being output to stdout/stderr. The "filter" OPAL MCA framework is intended to allow preprocessing of messages before they are sent to their final destinations. The first component that was written in the filter framework was to create an XML stream, segregating all the messages into different XML tags, etc. This will allow 3rd party tools to read the stdout/stderr from the HNP and be able to know exactly what each text message is (e.g., a help message, another OMPI infrastructure message, stdout from the user process, stderr from the user process, etc.). Filtering is not active by default. Filter components must be specifically requested, such as: {{{ $ mpirun --mca filter xml ... }}} There can only be one filter component active.
= New MCA Parameters = The new functionality described above introduces two new MCA parameters: * '''orte_base_help_aggregate''': Defaults to 1 (true), meaning that help messages will be aggregated, as described above. If set to 0, all help messages will be displayed, even if they are duplicates (i.e., the original behavior). * '''orte_base_show_output_recursions''': An MCA parameter to help debug one of the known issues, described below. It is likely that this MCA parameter will disappear before v1.3 final. = Known Issues = * The XML filter component is not complete. The current output from this component is preliminary and not real XML. A bit more work needs to be done to have configure.m4 search for an appropriate XML library/link it in/use it at run time. * There are possible recursion loops in the orte_output() and orte_show_help() functions -- e.g., if RML send calls orte_output() or orte_show_help(). We have some ideas how to fix these, but figured that it was ok to commit before feature freeze with known issues. The code currently contains sub-optimal workarounds so that this will not be a problem, but it would be good to actually solve the problem rather than have hackish workarounds before v1.3 final. This commit was SVN r18434.	2008-05-13 20:00:55 +00:00
Jon Mason	125eb5a2ed	Convert from the Linux ifaddrs to the OMPI ifaddrs, which should unbreak Solaris. This commit was SVN r18433.	2008-05-13 18:34:22 +00:00
Jeff Squyres	d8e5608053	Remove all retransmission code; the IBCM kernel module handles all of that for us. This commit was SVN r18432.	2008-05-13 16:10:34 +00:00
Jon Mason	74bf1ae25f	Fix compiler warnings This commit was SVN r18431.	2008-05-13 16:01:58 +00:00
Jon Mason	4ead9442b5	Add in IDs for all Chelsio iWARP capable adapters This commit was SVN r18428.	2008-05-12 21:59:03 +00:00
Jeff Squyres	6b26895ad4	A little style update -- constants on the left... This commit was SVN r18426.	2008-05-12 12:05:16 +00:00
Jeff Squyres	16cde0e5fa	Fix compile error on older OFED systems This commit was SVN r18425.	2008-05-12 11:56:14 +00:00
Gleb Natapov	6844ff32ba	Return OMPI_ERR_RESOURCE_BUSY from sm->btl_send() function if there is no place in cb. This will prevent OB1 from doing early completion of small sends. This commit was SVN r18424.	2008-05-12 07:15:29 +00:00
Gleb Natapov	31d2797a2f	If an RDMA PUT is received before the ACK and registration of memory fails, don't start sending fragments by copy in/out before the ACK is received, as we don't yet know the pointer to the receive request. The pipeline protocol sometimes doesn't send an ACK, though, so this case is still broken. This commit was SVN r18423.	2008-05-11 12:40:55 +00:00
Gleb Natapov	0827e537fa	Don't include rdma/rdma_cma.h if !OMPI_HAVE_RDMACM. This commit was SVN r18422.	2008-05-11 11:58:02 +00:00
Jon Mason	99ab66e131	RDMACM code cleanup. This patch adds some much needed comments, reduces the amount of code wrapping, and rearranges and removes redundant code. This commit was SVN r18417.	2008-05-08 21:20:12 +00:00
Josh Hursey	da2f1c58e2	Some checkpoint/restart cleanup. * Remove the opal_only option. This was suffering from bit rot, and no one uses it. It can be added back fairly easily if wanted. * Cleanup metadata interactions at the local level. * Touch up some of the INC functionality (fix typos and a minor ordering issue) This commit was SVN r18416.	2008-05-08 18:47:47 +00:00
Jon Mason	88e5f2a339	Abstract iWARP subnet ID functions (sans build break). The iWARP subnet ID determination should not be in the RDMACM cpc, as it was in the previous version, as this violates the cpc abstraction that is present throughout the code. Also, this patch uses the opal_list_t data struct instead of using its own linked lists. This attempt includes iwarp.c and iwarp.h. This commit was SVN r18414.	2008-05-08 14:38:14 +00:00
Jeff Squyres	60f39a30f6	Revert r18409; that commit broke the build because it forgot to add the btl_openib_iwarp.c and btl_openib_iwarp.h files. This commit was SVN r18410. The following SVN revision numbers were found above: r18409 --> open-mpi/ompi@056bbb68c8	2008-05-08 00:22:21 +00:00
Jon Mason	056bbb68c8	Abstract iWARP subnet ID functions. The iWARP subnet ID determination should not be in the RDMACM cpc, as it was in the previous version, as this violates the cpc abstraction that is present throughout the code. Also, this patch uses the opal_list_t data struct instead of using its own linked lists. This commit was SVN r18409.	2008-05-07 23:59:43 +00:00
Ralph Castain	7c7b9b0486	Do a little cleanup on the opal graph class and opal carto framework to conform to OMPI naming conventions and avoid potential conflict with user applications - no change in functionality, passes carto test program This commit was SVN r18407.	2008-05-07 19:33:49 +00:00
Jeff Squyres	157cea378f	* A few fixes to make IP address and port number comparisons properly * A few indenting and style fixes This commit was SVN r18405.	2008-05-07 16:56:07 +00:00
Jeff Squyres	bfae8ea828	The comment wasn't long enough; I felt the need to make it longer (and explain a little more ;-) ). This commit was SVN r18404.	2008-05-07 16:53:05 +00:00
Jeff Squyres	63abb3eb9b	Clarify a comment / fix typos. This commit was SVN r18402.	2008-05-07 14:51:36 +00:00
Shiqing Fan	8393fb5d47	Use the new memchecker_call function for memory checking of non-blocking communication. This commit was SVN r18399.	2008-05-07 12:28:51 +00:00
Ralph Castain	ff70636024	Allgather_list needs its own tag to avoid conflicting with the allgather modex operation. All spawned procs must decode the port of the spawning process so they can communicate in direct routed mode. This fixes comm_spawn for all routing modes. This commit was SVN r18395.	2008-05-07 03:03:56 +00:00
Rolf vandeVaart	0e32dd1022	Add MPI_Alltoallv to tuned collectives and add a pairwise implementation of MPI_Alltoallv. However, do not change the default behavior for now. The only way to use new pairwise implementation is via mca parameters. This commit was SVN r18394.	2008-05-07 02:31:24 +00:00
Jon Mason	502d164908	Create subnet IDs for iWARP. This enables subnet differentiation for iWARP devices, and rearranges initialization so that the services are available when they are needed. This commit was SVN r18393.	2008-05-06 22:43:52 +00:00
Jon Mason	9c724128f8	Handle no IP Address in rdmacm more resiliently If there is no IP Address, have rdmacm log the correct error and let another cpc have a go at it. This is being done by splitting off the IP address checking logic for the modex message creation, and having it log the correct error in the error case. This commit was SVN r18392.	2008-05-06 22:31:29 +00:00
Jon Mason	46bfd42c09	Fix compile warnings in rdmacm Fix some reported compiler warnings and make the code a little prettier. This commit was SVN r18391.	2008-05-06 22:19:28 +00:00
Jon Mason	9066168cd1	Prevent iWARP qp flush errors. For iWARP, the TCP connection is tied to the QP once the QP is in RTS. And destroying the QP is thus tied to connection teardown for iWARP. This is a key distinction from IB, I think. Anyway, to destroy the connection in iWARP you must move the QP out of RTS, either into CLOSING for a nice graceful close, or to ERROR if you want to be rude. In both cases, all pending non-completed SQ and RQ WRs must be flushed. This patch ignores all flush errors reaped by the cq and removes an earlier attempt to work around this in the rdmacm cpc. This commit was SVN r18388.	2008-05-06 21:57:40 +00:00
Josh Hursey	9971bc9d95	Merge in the mca_base_select changes per RFC: http://www.open-mpi.org/community/lists/devel/2008/04/3779.php {{{ svn merge -r 18276:18380 https://svn.open-mpi.org/svn/ompi/tmp-public/jjh-mca-play . }}} Any components not in the trunk, but in one of the affected frameworks must be updated. Contact the list, look at the RFC, or look at the diff for how to do this. Sorry for the early commit of this, but I wanted to get it in today (per RFC) and didn't know if I would have a chance later today. This commit was SVN r18381.	2008-05-06 18:08:45 +00:00
Jeff Squyres	a06d4023b8	Oops -- missed one sys_errlist -> strerror(). This commit was SVN r18378.	2008-05-06 13:22:36 +00:00
Jeff Squyres	4154e587de	strerror() is much better. This commit was SVN r18376.	2008-05-05 21:06:07 +00:00
Shiqing Fan	f35a06119c	Use memchecker_convertor_call function instead of the old one. Move the function to the place that we can use convertor. This commit was SVN r18370.	2008-05-05 13:57:27 +00:00
Jon Mason	a3bf503e01	Remove error on rdma cm If there are multiple QP's, RDMACM will not send a message if the qpnum != 0. In doing so, it will log an error unnecessarily. This removes that. This commit was SVN r18363.	2008-05-02 20:12:01 +00:00
Jon Mason	3989981578	Enable support of num_proc > num_nodes Add the logic to support using port numbers, instead of simply using the IP address of the sending node to determine which endpoint to connect. Since each process calls the cpc query function, it will generate its own port to listen on thus enabling this to work. This commit was SVN r18362.	2008-05-02 16:20:28 +00:00
Jeff Squyres	ba5615a18f	Merge in /tmp-public/cpc3 branch to trunk. oob/xoob still remains the default CPC. This commit was SVN r18356.	2008-05-02 11:52:33 +00:00
Donald Kerr	843a35094f	adding local work queue accounting This commit was SVN r18352.	2008-05-01 21:01:51 +00:00
George Bosilca	a69ac964df	Allow any order in the list of Elan vpid. This commit was SVN r18350.	2008-05-01 20:32:03 +00:00
Josh Hursey	dcd21d7d07	Some checkpoint/restart fixes in response to r18338 (changes in modex). Things should be working now. This commit was SVN r18348. The following SVN revision numbers were found above: r18338 --> open-mpi/ompi@3e55fe6f6d	2008-05-01 17:48:13 +00:00
Ralph Castain	3e55fe6f6d	Fold in the revised modex scheme. Move the ompi_proc_t modex portions to the RTE level since the daemons already have that info. Provide each process with the equivalent of a "nidmap" - both a map of what nodes are in the job, and a map of which node each process is on. This enables the use of static ports, though that hasn't been turned "on" in this commit. Update the rsh tree spawn capability so we spawn the next wave of daemons before launching our own local procs. Add an ability to encode nodenames for large clusters with contiguous node name numbering schemes - this allows communication of all node names in a few bytes instead of tens-of-bytes/node. This commit was SVN r18338.	2008-04-30 19:49:53 +00:00
Pavel Shamis	61cc8843bf	The r17940 broke the XRC code. The endpoint may be appended to list during XOOB connection bring up. This commit was SVN r18328. The following SVN revision numbers were found above: r17940 --> open-mpi/ompi@ebfdd133f5	2008-04-29 13:22:40 +00:00
Galen Shipman	ced88a338b	include portals modex fun in the distro This commit was SVN r18325.	2008-04-28 18:51:54 +00:00
Brad Penoff	c699236be2	updating SCTP BTL to configure properly with FreeBSD 7 This commit was SVN r18324.	2008-04-28 04:19:10 +00:00
George Bosilca	6e6c370917	Rollback r18274 as it's legal to have a sequence number smaller than the expected one. It doesn't necessarily mean the message is duplicated, it can simply signify the message is out of sequence and the counter overflowed. This commit was SVN r18323. The following SVN revision numbers were found above: r18274 --> open-mpi/ompi@73c9de3af9	2008-04-27 18:35:54 +00:00
Aurelien Bouteiller	611d52fa95	Fix a bug that prevented using the same port (as returned by Open_port) for several Comm_accept calls. This commit was SVN r18303.	2008-04-25 20:41:44 +00:00
Aurelien Bouteiller	c20b020ea6	Fix ticket #1275 . The pml v can now be correctly deactivated on the configure command line. Also fix a dist target under some unusual circumstances. This commit was SVN r18291.	2008-04-24 21:42:54 +00:00
Josh Hursey	2c736873bb	Fix a checkpoint/restart bug that causes a restarted application to occasionally throw a SIGSEGV or SIGPIPE due to invalid socket descriptors. The problem was caused by a bad ordering between the restart of the ORTE level tcp connections (in the OOB - out-of-band communication) and the Open MPI level tcp connections (BTLs). Before this commit ORTE would shutdown and restart the OOB completely before the OMPI level restarted its tcp connections. What would happen is that a socket descriptor used by the OMPI level on checkpoint was assigned to the ORTE level on restart. But the OMPI level had no knowledge that the socket descriptor it was previously using has been recycled so it closed it on restart. This caused the ORTE level to break as the newly created socket descriptor was closed without its knowledge. The fix is to have the OMPI level shutdown tcp connections, allow the ORTE level to restart, and then allow the OMPI level to restart its connections. This seems obvious, and I'm surprised that this bug has not cropped up sooner. I'm confident that this specific problem has been fixed with this commit. Thanks to Eric Roman and Tamer El Sayed for their help in identifying this problem, and patience while I was fixing it. * Add a new state {{{OPAL_CRS_RESTART_PRE}}}. This state identifies when we are on the down slope of the INC (finalize-like) which is useful when you want to close, but not reopen a component set for fear of interfering with a lower level. * Use this new state in OMPI level coordination. Here we want to make sure to play well with both the OMPI/BTL/TCP and ORTE/OOB/TCP components. * Update ft_event functions in PML and BML to handle the new restart state. * Add an additional flag to the error output in OOB/TCP so we can see what the socket descriptor was on failure as this can be helpful in debugging. This commit was SVN r18276.	2008-04-24 17:54:22 +00:00
George Bosilca	3ccac4f803	Oops ... This commit was SVN r18275.	2008-04-24 15:54:52 +00:00
George Bosilca	73c9de3af9	Bark if we got a wrong sequence number. Here wrong means that the seq number is smaller than what we expect. This commit was SVN r18274.	2008-04-24 15:48:43 +00:00
Rich Graham	4d1ae7b05f	accidentally made a change in the wrong place. This commit was SVN r18262.	2008-04-23 17:32:05 +00:00
Rich Graham	293dd6ad4e	add myself to list of people building this module. This commit was SVN r18261.	2008-04-23 17:25:36 +00:00
Rich Graham	7658cc79e4	Pass in the correct module to the reduction call. This commit was SVN r18260.	2008-04-23 17:23:30 +00:00
Adrian Knoth	c53d3c3c22	reverted r18169,r18170 due to connection reset by peer on odin/sif This commit was SVN r18255. The following SVN revision numbers were found above: r18169 --> open-mpi/ompi@20473bfda2 r18170 --> open-mpi/ompi@d34dfbe12c	2008-04-23 15:26:15 +00:00
Josh Hursey	cc83d41ad9	Merge in tmp/jjh-scratch {{{ svn merge -r 18218:18240 https://svn.open-mpi.org/svn/ompi/tmp/jjh-scratch . }}} Contains: * Primarily a fix for a user reported problem where a cached file descriptor is causing a SIGPIPE on restart. * Cleanup some small memory leaks from using mca_base_param_env_var() - Thanks Jeff * Cleanup ORTE FT tool compilation in non-FT builds - Thanks Tim P. * Cleanup mpi interface with misplaced {{{OPAL_CR_ENTER_LIBRARY}}} - Thanks Terry * Some other sundry cleanup items all dealing with C/R functionality in the trunk. This commit was SVN r18241.	2008-04-23 00:17:12 +00:00
Tim Mattox	0215474cb8	Fix two bugs in coll_sm_module.c from bit-rot: Fixed a selection bug, and removed a bogus "free(proc)" call which ultimately caused MPI_Finalize to crash. This commit was SVN r18235.	2008-04-22 18:41:21 +00:00
Jeff Squyres	c40740947f	Fix minor spelling error. This commit was SVN r18229.	2008-04-22 13:11:50 +00:00
Galen Shipman	27c425b304	make portals level ack's optional (require ACK by default) This commit was SVN r18228.	2008-04-21 22:22:18 +00:00
Rich Graham	df35223603	add selection logic for barrier and reduce. This commit was SVN r18215.	2008-04-19 22:40:04 +00:00
Rich Graham	bee8b42f29	remove debug code that would not let people run. Add infrastructure for blocking-barrier. This commit was SVN r18214.	2008-04-19 01:34:04 +00:00
Galen Shipman	92e3b8671f	nasty memory bug... This commit was SVN r18207.	2008-04-18 03:01:53 +00:00
Ralph Castain	fa082cafa9	Shift the architecture calculation from the ompi/datatype engine to the opal/util area. This allows us to compute the architecture earlier in the launch and communicate it outside of the modex. Note: this is an early preliminary step in the movement of portions of the datatype engine to the opal layer. This commit was SVN r18198.	2008-04-17 20:43:56 +00:00
Tim Prins	eb94fa48ce	the port name is only relevant at the root, so only look at it there. This commit was SVN r18188.	2008-04-17 12:37:10 +00:00
Tim Prins	3582e11200	cleanup some warnings on 32 bit systems This commit was SVN r18187.	2008-04-17 12:25:05 +00:00
Rich Graham	6c77fa4921	add a blocking shared memory algorithm. This commit was SVN r18185.	2008-04-16 22:10:23 +00:00
Ralph Castain	7b91f8baff	Cleanup and fix bugs in the MPI dynamics section. Modify the dpm API so it properly takes ports instead of process names (as correctly identified by Aurelien). Fix race conditions in the use of ompi-server. Fix incompatibilities between the mpi bindings and the dpm implementation that could cause segfaults due to uninitialized memory. Fix the ompi-server -h cmd line option so it actually tells you something! Add two new testing codes to the orte/test/mpi area: accept and connect. This commit was SVN r18176.	2008-04-16 14:27:42 +00:00
Shiqing Fan	1c4c7e0f2f	Add memchecker support for osc rdma communication. This commit was SVN r18173.	2008-04-16 13:29:55 +00:00
Shiqing Fan	79da2fdd2c	Use the new memchecker convertor function. Remove some unnecessary memchecker calls. This commit was SVN r18172.	2008-04-16 13:24:35 +00:00
Adrian Knoth	d34dfbe12c	fixed misleading comment. This commit was SVN r18170.	2008-04-16 11:26:15 +00:00
Adrian Knoth	20473bfda2	on incoming connections, compare with every possible source address. Rationale (taken from the code): /* This is PITA. We never know which source address an * incoming/outgoing packet will have, so even with * btl_tcp_if_include/exclude on the remote end, we * might get a different source address. * * If this address isn't included in btl_proc->proc_addrs, * we would erroneously drop the connection */ merge -r18165:18167 to the trunk. This commit was SVN r18169. The following SVN revisions from the original message are invalid or inconsistent and therefore were not cross-referenced: r18165 r18167	2008-04-16 11:24:09 +00:00
Adrian Knoth	e981a259bb	btl_tcp_disable_family=4 and btl_tcp_disable_family=6 are mutually exclusive, so this should result in "unreachable" when set differently between peers. This commit was SVN r18168.	2008-04-16 10:14:58 +00:00
Adrian Knoth	75c54616c7	renamed opal_sockaddr2str to opal_net_get_hostname for WANT_PEER_DUMP=1 This commit was SVN r18154.	2008-04-15 19:23:47 +00:00
Jeff Squyres	72af302360	Remove unused variable. This commit was SVN r18151.	2008-04-15 14:58:32 +00:00
Aurelien Bouteiller	0f311ed824	Make sure the function returns NULL when no elan adapter is available instead of a random value. This commit was SVN r18136.	2008-04-11 21:03:01 +00:00
Aurelien Bouteiller	20592cbcbf	Fixes a warning about mallocing 0 bytes when no elan adapter is available. This commit was SVN r18135.	2008-04-11 20:59:12 +00:00
Rich Graham	249445d61f	added reduce-scatter followed by gather to root. This commit was SVN r18133.	2008-04-11 13:49:08 +00:00
Rich Graham	a6bdbfab97	implement allreduce as reduce-scatter, followed by an allgather. This commit was SVN r18132.	2008-04-11 04:06:29 +00:00
Jon Mason	08ead87604	Potential double free of locks mca_btl_openib_endpoint_post_rr_nolock is freeing the endpoint lock on the error case, but most/all of the functions calling this free the lock regardless of its error case. Thus resulting in a double free of the lock. This commit was SVN r18131.	2008-04-10 21:15:01 +00:00
Rich Graham	70f3aab5f2	remove some code that is not needed. This commit was SVN r18128.	2008-04-10 17:32:04 +00:00
Rich Graham	5c7db1e315	remove 2 race conditions in the buffer recycling logic. This commit was SVN r18127.	2008-04-10 17:20:52 +00:00
Edgar Gabriel	4964434205	reverting commit 18122, since the commit was executed accidentally in the wrong directory. The UH copyrights do belong into this file (i.e. because of the fix which is in the 1.2 branch, the UH copyright notes are in the header there already), but I want to have the proper log for that. This commit was SVN r18124.	2008-04-10 15:09:31 +00:00
Edgar Gabriel	f87830767a	the verification of recvcount==0 and rank = root was breaking inter-communicator scatter, since the root (root==MPI_ROOT) might very well have recvcount=0. The same fix has been applied to gather.c just the other way round. Fixes the bug reported on the mailing list by Martin Audet. If there is a 1.2.7 this fix might be worthwhile porting it over. Please note, that while the test works now for basic and for inter, we get a 0byte malloc warning from the inter module, which we still have to fix in a separate patch. This commit was SVN r18122.	2008-04-10 14:58:51 +00:00
Ralph Castain	3a0d09300b	Fully implement the inbound binomial allgather for daemon-based collectives. Supports both modex and barrier operations. Comm_spawn still uses the rank=0 method - shifting that algo to the daemons is under study. This commit was SVN r18115.	2008-04-09 22:10:53 +00:00
Rich Graham	c6783549ef	getting old This commit was SVN r18110.	2008-04-09 16:55:16 +00:00
Rich Graham	1a20c3ce51	more debug. This commit was SVN r18109.	2008-04-09 16:19:52 +00:00
Rich Graham	e7e18303f6	more debug. This commit was SVN r18108.	2008-04-09 15:10:58 +00:00
Rich Graham	b14c6b17d5	adding debug output. This commit was SVN r18107.	2008-04-09 13:32:01 +00:00
Rich Graham	10434fb2f1	add barrier synchronization at the end of the module init, to avoid initializing shared memory variables in use. This commit was SVN r18105.	2008-04-09 03:44:40 +00:00
Rich Graham	19bb1a2e86	fix initialization bug. This commit was SVN r18104.	2008-04-08 23:34:06 +00:00
Donald Kerr	38e298cc9a	report error message in all libs, not just debug This commit was SVN r18103.	2008-04-08 22:58:28 +00:00
Rich Graham	a69a8d9626	initialize the flags. This commit was SVN r18102.	2008-04-08 22:16:39 +00:00
Rich Graham	8765a2bbdd	more debug code. This commit was SVN r18101.	2008-04-08 20:38:20 +00:00
Rich Graham	08becf33b5	add more debugging. This commit was SVN r18100.	2008-04-08 18:44:50 +00:00
Rich Graham	aa1b7dd406	more debug This commit was SVN r18099.	2008-04-08 03:56:47 +00:00
Rich Graham	0c18bdeff7	more debug code. This commit was SVN r18098.	2008-04-08 03:04:20 +00:00
Rich Graham	9d5a7238df	Add some debugging code. This commit was SVN r18097.	2008-04-07 23:20:15 +00:00
Rich Graham	fa696734d5	add some debug code. This commit was SVN r18096.	2008-04-07 21:03:23 +00:00
Shiqing Fan	28746bbcdb	Remove the memchecker macro in pml base request, used in req_wait.c, which actually is in the wrong place. Instead, one simple call from send_request_free and recv_request_free(already done) will do all the work, fast and clean. This commit was SVN r18095.	2008-04-07 17:46:50 +00:00
Shiqing Fan	a1e5df1cc9	Use the new memchecker function call which is based on convertor. Remove one unnecessary call. This commit was SVN r18085.	2008-04-07 07:52:04 +00:00
Gleb Natapov	713a27dc71	Counter of created RDMA channels should be incremented immediately after channel creation (not in control message completion) otherwise more than max_eager_rdma channel may be created. This commit was SVN r18082.	2008-04-06 13:48:45 +00:00
Rich Graham	1b54e8b76e	fix buffer management for nb-barrier. This commit was SVN r18081.	2008-04-05 21:59:04 +00:00
Tim Prins	313edd8955	- Fix a problem reported on the users list where we would segfault in finalize after calling spawn if the user did not call MPI_Comm_disconnect - Fix the app context constructor so it initializes all the fields. This commit was SVN r18079.	2008-04-04 15:07:39 +00:00
Jeff Squyres	7072a32703	* Properly protect XRC stuff * A few minor style fixes This commit was SVN r18076.	2008-04-02 19:52:03 +00:00
Rich Graham	94f8fd365c	a few reduction optimizations. Add bcast. This commit was SVN r18075.	2008-04-02 19:02:33 +00:00
George Bosilca	a00ca20446	More cleanups. This commit was SVN r18069.	2008-04-02 06:38:33 +00:00
George Bosilca	944453c4c1	Cleanups. This commit was SVN r18068.	2008-04-02 06:37:42 +00:00
Rich Graham	eb5d6096f1	add reduction routine - fix buffer recycling logic which was totally broken. This commit was SVN r18065.	2008-04-01 22:56:18 +00:00
Jeff Squyres	d944d5ec52	Just in case something goes drastically wrong, don't segv. This commit was SVN r18049.	2008-03-31 21:55:07 +00:00
George Bosilca	b4f828f389	We need a newline at the end of the file, or some compilers bark. This commit was SVN r18023.	2008-03-30 19:05:56 +00:00
Gleb Natapov	b42234461a	Cleanup shared file creation on unix/linux. This commit was SVN r18021.	2008-03-30 13:41:47 +00:00
Jeff Squyres	d0f12f3df0	Make a better error message. This commit was SVN r18014.	2008-03-29 12:54:24 +00:00
Rich Graham	90e53ca9ee	debug the pipeline algorithm. This commit was SVN r18008.	2008-03-28 15:10:07 +00:00
Aurelien Bouteiller	77653ac787	Missing .h file in makefile broke nightly tarball distcheck... This commit was SVN r18006.	2008-03-28 14:36:56 +00:00
Aurelien Bouteiller	c16339944a	Fix a coverity warning about using unsafe sprintf. This commit was SVN r17999.	2008-03-27 21:24:27 +00:00

2708 commits