openmpi

Автор	SHA1	Сообщение	Дата
Brian Barrett	5dec52af8d	remove some now unneeded debugging This commit was SVN r26364.	2012-04-30 22:50:52 +00:00
Brian Barrett	c654ee6afc	* Use triggered operations for restart barrier as well This commit was SVN r26363.	2012-04-30 22:48:10 +00:00
Brian Barrett	91a9973bde	* Make flow control on by default * Move alarm code back into a triggered operation This commit was SVN r26362.	2012-04-30 22:25:40 +00:00
Brian Barrett	e6a0a1cf8a	* Make sure to release all resources on failed send * Avoid triggered ops until we get everything debugged * Simplify flowctl interface a bit This commit was SVN r26356.	2012-04-27 21:11:01 +00:00
Nathan Hjelm	c36ab84116	ugni: missed a couple of lines in the last commit This commit was SVN r26340.	2012-04-25 14:24:48 +00:00
Nathan Hjelm	a753fe91f7	fix merge This commit was SVN r26332.	2012-04-24 21:16:51 +00:00
Nathan Hjelm	0eb18b9699	ob1: update copyrights This commit was SVN r26331.	2012-04-24 20:19:15 +00:00
Nathan Hjelm	0a0e487d9c	ob1: add emacs mode/indentation defaults This commit was SVN r26330.	2012-04-24 20:19:06 +00:00
Nathan Hjelm	9a35f96bda	ob1: add support for get fallback on put/send This commit was SVN r26329.	2012-04-24 20:18:56 +00:00
Nathan Hjelm	93780c63be	replace tabs w/ spaces This commit was SVN r26328.	2012-04-24 20:18:45 +00:00
Nathan Hjelm	0f60858a01	ugni: improve handling of smsg completions This commit was SVN r26327.	2012-04-24 20:18:35 +00:00
Nathan Hjelm	e3b9040e69	vader: remove maffinity code This commit was SVN r26321.	2012-04-24 15:38:03 +00:00
Nathan Hjelm	363bd184e7	ugni: re-disable uGNI for local procs This commit was SVN r26318.	2012-04-23 21:12:12 +00:00
Nathan Hjelm	ca3ceb840c	ugni: add mca parameter to control the number of smsg retries This commit was SVN r26317.	2012-04-23 21:12:05 +00:00
Nathan Hjelm	95b12f140a	ugni: cleanup frag setup code This commit was SVN r26316.	2012-04-23 21:11:57 +00:00
Nathan Hjelm	37ca31b295	ugni: remove unused completion queue This commit was SVN r26315.	2012-04-23 21:11:39 +00:00
Nathan Hjelm	1340f9c65a	ugni update: - Move endpoint code back up to BTL - Use opal_pointer_array_t for bounce buffer to identify local smsg completions. - Update and reenable sendi - Create a new endpoint for FMA/BTE transactions (keep local smsg/fma transactions seperate) - Move reverse get code into btl_ugni_put.c - Move eager get code into btl_ugni_get.c - Handle remote SMSG overruns correctly - Added support for inplace sends - etc This commit was SVN r26307.	2012-04-19 21:51:55 +00:00
Nathan Hjelm	2b9827f45c	ugni: restrict number of memory registrations per process This commit was SVN r26306.	2012-04-19 21:51:44 +00:00
Jeff Squyres	253444c6d0	== Highlights == 1. New mpifort wrapper compiler: you can utilize mpif.h, use mpi, and use mpi_f08 through this one wrapper compiler 1. mpif77 and mpif90 still exist, but are sym links to mpifort and may be removed in a future release 1. The mpi module has been re-implemented and is significantly "mo' bettah" 1. The mpi_f08 module offers many, many improvements over mpif.h and the mpi module This stuff is coming from a VERY long-lived mercurial branch (3 years!); it'll almost certainly take a few SVN commits and a bunch of testing before I get it correctly committed to the SVN trunk. == More details == Craig Rasmussen and I have been working with the MPI-3 Fortran WG and Fortran J3 committees for a long, long time to make a prototype MPI-3 Fortran bindings implementation. We think we're at a stable enough state to bring this stuff back to the trunk, with the goal of including it in OMPI v1.7. Special thanks go out to everyone who has been incredibly patient and helpful to us in this journey: * Rolf Rabenseifner/HLRS (mastermind/genius behind the entire MPI-3 Fortran effort) * The Fortran J3 committee * Tobias Burnus/gfortran * Tony !Goetz/Absoft * Terry !Donte/Oracle * ...and probably others whom I'm forgetting :-( There's still opportunities for optimization in the mpi_f08 implementation, but by and large, it is as far along as it can be until Fortran compilers start implementing the new F08 dimension(..) syntax. Note that gfortran is currently unsupported for the mpi_f08 module and the new mpi module. gfortran users will a) fall back to the same mpi module implementation that is in OMPI v1.5.x, and b) not get the new mpi_f08 module. The gfortran maintainers are actively working hard to add the necessary features to support both the new mpi_f08 module and the new mpi module implementations. This will take some time. As mentioned above, ompi/mpi/f77 and ompi/mpi/f90 no longer exist. All the fortran bindings implementations have been collated under ompi/mpi/fortran; each implementation has its own subdirectory: {{{ ompi/mpi/fortran/ base/ - glue code mpif-h/ - what used to be ompi/mpi/f77 use-mpi-tkr/ - what used to be ompi/mpi/f90 use-mpi-ignore-tkr/ - new mpi module implementation use-mpi-f08/ - new mpi_f08 module implementation }}} There's also a prototype 6-function-MPI implementation under use-mpi-f08-desc that emulates the new F08 dimension(..) syntax that isn't fully available in Fortran compilers yet. We did that to prove it to ourselves that it could be done once the compilers fully support it. This directory/implementation will likely eventually replace the use-mpi-f08 version. Other things that were done: * ompi_info grew a few new output fields to describe what level of Fortran support is included * Existing Fortran examples in examples/ were renamed; new mpi_f08 examples were added * The old Fortran MPI libraries were renamed: * libmpi_f77 -> libmpi_mpifh * libmpi_f90 -> libmpi_usempi * The configury for Fortran was consolidated and significantly slimmed down. Note that the F77 env variable is now IGNORED for configure; you should only use FC. Example: {{{ shell$ ./configure CC=icc CXX=icpc FC=ifort ... }}} All of this work was done in a Mercurial branch off the SVN trunk, and hosted at Bitbucket. This branch has got to be one of OMPI's longest-running branches. Its first commit was Tue Apr 07 23:01:46 2009 -0400 -- it's over 3 years old! :-) We think we've pulled in all relevant changes from the OMPI trunk (e.g., Fortran implementations of the new MPI-3 MPROBE stuff for mpif.h, use mpi, and use mpi_f08, and the recent Fujitsu Fortran patches). I anticipate some instability when we bring this stuff into the trunk, simply because it touches a LOT of code in the MPI layer in the OMPI code base. We'll try our best to make it as pain-free as possible, but please bear with us when it is committed. This commit was SVN r26283.	2012-04-18 15:57:29 +00:00
Brian Barrett	8a70747da2	Fix some naming that doesn't make a ton of sense This commit was SVN r26277.	2012-04-18 01:05:18 +00:00
Brian Barrett	f4d4e87176	add some flow control debugging output This commit was SVN r26276.	2012-04-17 23:14:05 +00:00
Brian Barrett	fe0dfc8e26	First take at flow control protocol This commit was SVN r26274.	2012-04-17 21:46:21 +00:00
Brian Barrett	dde6f094eb	In preperation for flow control changes coming, always utilize ACKs for message completion. This commit was SVN r26272.	2012-04-16 17:25:27 +00:00
Terry Dontje	81d7fcaf82	back out r26255 to avoid cross component linkage so Solaris can build a usable openib btl This commit was SVN r26269. The following SVN revision numbers were found above: r26255 --> open-mpi/ompi@fe25b8704b	2012-04-13 18:08:54 +00:00
Nathan Hjelm	f88babfb92	ugni: minor updates This commit was SVN r26262.	2012-04-10 19:56:19 +00:00
Mike Dubman	34acf769d4	mtl_mxm: support canceling messages This commit was SVN r26256.	2012-04-09 16:02:05 +00:00
Mike Dubman	fe25b8704b	performance fix: set alignment for openib internal buffers Thanks to Jeff/Pasha for valuable comments Thanks to Valentin Petrov for implementation This commit was SVN r26255.	2012-04-09 08:06:15 +00:00
George Bosilca	f09e3ce5a4	Spring cleanup. Nothing important. This commit was SVN r26247.	2012-04-06 15:48:07 +00:00
George Bosilca	654c75ff24	As suggested on the mailing list a while back, switch the default alltoallv algorithm to pairwise exchange instead of the default one. This might improve the scheduling and relax the pressure on the network. This commit was SVN r26246.	2012-04-06 15:47:29 +00:00
Ralph Castain	bd8b4f7f1e	Sorry for mid-day commit, but I had promised on the call to do this upon my return. Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code. Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch. This commit was SVN r26242.	2012-04-06 14:23:13 +00:00
Brian Barrett	d46d55ee9b	If we're locking the local window, need to wait until the lock returns. This commit was SVN r26234.	2012-04-04 16:27:24 +00:00
Josh Hursey	d1571b027a	Fix a few error return paths This commit was SVN r26233.	2012-04-04 15:11:03 +00:00
Nathan Hjelm	b0c3c18e02	Initial upload of grdma mpool This commit was SVN r26232.	2012-04-03 23:03:03 +00:00
Mike Dubman	ff1c84c53f	revert previous commit This commit was SVN r26206.	2012-03-29 14:07:13 +00:00
Mike Dubman	43a5775e8a	performance fix: set alignment for openib internal buffers This commit was SVN r26205.	2012-03-29 14:00:08 +00:00
Nathan Hjelm	d62c0f1872	ugni: handle smsg failure in mca_btl_ugni_ep_connect_finish This commit was SVN r26202.	2012-03-28 05:40:16 +00:00
Brian Barrett	451af0e832	Ensure async progress for long unexpected messages by waiting for an event on the ME. The events we're likely to see are LINK (the ME was added to the match list), PUT (weird to see first, but means that the ME was linked to the match list and then matched), or PUT_OVERFLOW, meaning the message was unexpected. This commit was SVN r26199.	2012-03-26 22:54:35 +00:00
Brian Barrett	2a26d0f9a2	Forgot to add new file in the last commit. Mark ME as invalid once we see a completion event, and look for events before trying to unlink. This commit was SVN r26198.	2012-03-26 22:39:05 +00:00
Brian Barrett	0e91084385	* Add type field to the request structure to deal with random user requests (ie, cancel) * Implement cancel for receives. Sends are slightly more complicated... This commit was SVN r26197.	2012-03-26 22:32:36 +00:00
Brian Barrett	61a090e0d1	Checking for NULL function pointers and direct-call semantics can't work together, so implement all functions in the MTL interface for all MTLs. The only places NULL was still being set was for add_comm/del_comm, and matched probe, both of which are straight forward to implement (or return ERROR_NOT_IMPLEMENTED, since the PML can't emulate matched probe). This commit was SVN r26194.	2012-03-26 19:27:03 +00:00
Brian Barrett	cdaf110c0f	* Implement mtl_send in addition to mtl_sendi This commit was SVN r26193.	2012-03-26 19:19:11 +00:00
Brian Barrett	27c8f71773	Start of the flow control implementation. #defined out for now. This commit was SVN r26192.	2012-03-26 01:31:58 +00:00
Jeff Squyres	fa8980157a	Fix typo. This commit was SVN r26183.	2012-03-23 00:12:32 +00:00
Brian Barrett	cce936b94c	* Implement matched probe for the CM PML. Required adding a peer field to the ompi_message_t structure to properly initialize convertor (the peer is available in the request in OB1, and wasn't needed when I did the original implementation). * Implement matched probe for the Portals4 MTL and add NULL function pointers for the other MTLs. * Add add_comm and del_comm functions to portals4 MTL so that direct call almost works again. * Add NEWS item that we've implemented matched probe This commit was SVN r26180.	2012-03-22 22:55:59 +00:00
Brian Barrett	4d12616b64	Frank pointed out that PTL_OK is zero and PtlHandleIsEqual either returns PTL_OK or PTL_FAIL and that I had these backwards. This commit was SVN r26179.	2012-03-22 15:58:00 +00:00
Brian Barrett	1c6b5a1358	* Set all appropriate flags for portal table entries * split eq into send and receive eqs so that we can control the number of outstanding events in send eq and ensure we never lose an ack * Shouldn't ever truncate on short unexpected receive bocks, so don't set the truncate bit * Track active vs. waiting for free short unexpected receive blocks so to ensure an active short unexpected receive block is posted coming out of flow control. Also allow creation of "temporary" blocks which should be released once FREE event is received. * Slight reorganization of some code in preparation for more flow control work. This commit was SVN r26174.	2012-03-21 22:20:55 +00:00
Mike Dubman	a45898ea9c	fix support for fca 2.2, warning fixes on rhel 6.x This commit was SVN r26166.	2012-03-20 10:00:52 +00:00
Nathan Hjelm	135ac32b64	ugni: use hash table to keep track of smsg frag completion This commit was SVN r26154.	2012-03-15 20:15:59 +00:00
Nathan Hjelm	fca42347e3	ugni: use hash table to keep track of smsg frag completion This commit was SVN r26153.	2012-03-15 20:13:32 +00:00
Nathan Hjelm	deddf0b33e	ugni: fix frag leak in sendi This commit was SVN r26152.	2012-03-15 20:13:20 +00:00
Nathan Hjelm	99f05d56e3	ugni: updated parameters and code cleanup This commit was SVN r26151.	2012-03-15 20:13:11 +00:00
Nathan Hjelm	921176745d	vader: remove lock based fifos This commit was SVN r26150.	2012-03-15 20:12:59 +00:00
Nathan Hjelm	4e01440b05	vader: clean frag alloc/return This commit was SVN r26149.	2012-03-15 20:12:46 +00:00
Terry Dontje	e73df369e4	Update bfo pml with code from ob1 to support mprobe, improbe, mrecv, imrecv and cuda. This commit was SVN r26145.	2012-03-15 10:20:46 +00:00
Christopher Yeoh	524de80eaa	Adds support for Cross Memory Attach in the sm btl. This feature can be enabled at compile time with --with-cma passed to configure. At runtime it is also necessary to add "--mca btl btl_sm_use_cma 1" to the mpirun command. If both CMA and KNEM are compiled in and enabled at runtime then KNEM will take precedence and CMA will disable itself This commit was SVN r26134.	2012-03-14 06:29:09 +00:00
Mike Dubman	bd7abd72a9	in mca_mtl_mxm, don't allow negative tags for MPI_ANY_TAG This commit was SVN r26128.	2012-03-09 22:11:14 +00:00
Rolf vandeVaart	41870ce6ee	Mostly fix some of the verbose output. Also fix issue where memory handle was blocking other registration. This commit was SVN r26124.	2012-03-09 21:28:56 +00:00
George Bosilca	de1078a71b	Thanks to Alex Margolin for pointing out this relique. This commit was SVN r26121.	2012-03-09 14:01:45 +00:00
Pavel Shamis	102da281c4	OPENIB BTL - use orte_show_help instead of BTL_ERROR print in case ibv_reg_mr failed. This commit was SVN r26111.	2012-03-08 09:04:03 +00:00
Mike Dubman	4e7e7d7c3f	print error which is ignored on upper layer This commit was SVN r26106.	2012-03-06 14:25:56 +00:00
George Bosilca	a78a7bd8e8	The tuned collectives can now deal with more than 2Gb of data. This commit was SVN r26103.	2012-03-05 22:23:44 +00:00
George Bosilca	762b3e13a9	Use the correct name for the datatype destruction function. This commit was SVN r26100.	2012-03-05 15:54:53 +00:00
George Bosilca	7d523a8852	Avoid calling the bcast with counts larger than INT_MAX. This commit was SVN r26098.	2012-03-05 14:30:30 +00:00
George Bosilca	e8c358c188	Allow Open MPI to deal with size_t internally. This commit was SVN r26097.	2012-03-05 14:10:26 +00:00
Abhishek Kulkarni	08ca0f80bc	Fix a C/R bug where the restart hung due to dangling fds in the openib btl. This commit was SVN r26094.	2012-03-04 06:57:33 +00:00
George Bosilca	f83670211e	Allow the user to define dynamic rules for messages larger than 2GB. This commit was SVN r26084.	2012-03-02 21:16:23 +00:00
George Bosilca	8791ade293	Help he selection of the right algorithm for large data (> 2Gb). Thanks to Fujitsu for the patch. This commit was SVN r26080.	2012-03-02 19:12:22 +00:00
Terry Dontje	3e70cad203	Correct a few alignment problems to address the issue brought up in ticket #2964 This commit was SVN r26078.	2012-03-01 17:29:40 +00:00
Nathan Hjelm	f1525bdbff	ob1: fix two fragment leaks - MAJOR! get src descriptor leaks if mca_bml_base_send fails - minor. descriptor leaked in mca_pml_send_request_start_copy if the btl returns OMPI_ERR_RESOURCE_BUSY. This commit was SVN r26077.	2012-03-01 15:53:39 +00:00
Mike Dubman	540b3c0c25	update mxm mtl to changes in mxm api This commit was SVN r26073.	2012-02-29 22:02:34 +00:00
Nathan Hjelm	a7209e309a	ugni: opps, sendi was missing from Makefile.am This commit was SVN r26067.	2012-02-28 16:10:35 +00:00
Edgar Gabriel	636cf786be	local_heap_sort should really be a static function. This commit was SVN r26065.	2012-02-28 14:42:56 +00:00
Vishwanath Venkatesan	7c9c3ede61	Modified implementation for the static segmentation read algorithm with improved performance and bug fixes. This commit was SVN r26056.	2012-02-24 20:55:33 +00:00
Vishwanath Venkatesan	d5a9223a9a	Removed a variable which was allocated but never used. This commit was SVN r26055.	2012-02-24 20:48:52 +00:00
Vishwanath Venkatesan	326bc69df4	Modified implementation for static file write all algorithm which fixes all the previous bugs and provides improved performance. This commit was SVN r26054.	2012-02-24 20:47:09 +00:00
Rolf vandeVaart	b0a84b0a7d	New btl that extends sm btl to support GPU transfers within a node. Uses new CUDA IPC support. Also, a few minor changes in PML to take advantage of it. This code has no effect unless user asks for it explicitly via configure arguments. Otherwise, it is either #ifdef'ed out or not compiled. This commit was SVN r26039.	2012-02-24 02:13:33 +00:00
Nathan Hjelm	8217c46666	ompi_free_list: allocate payload if payload size > 0 in the fl_mpool = NULL case This commit was SVN r26027.	2012-02-23 16:47:28 +00:00
Nathan Hjelm	9843cd0466	ugni: missed one more merge typo This commit was SVN r26026.	2012-02-23 16:39:15 +00:00
Nathan Hjelm	d7cd95c802	vader: fix typo This commit was SVN r26025.	2012-02-23 16:29:45 +00:00
Nathan Hjelm	c83fe003a0	ugni: fix branch merge typo This commit was SVN r26024.	2012-02-23 16:16:21 +00:00
Rolf vandeVaart	c7a0ce2755	Two new mpools. They are not used now (and by default, not compiled) but they will be soon. Provide support for GPU buffer transfers within a node. This commit was SVN r26008.	2012-02-22 23:32:36 +00:00
Nathan Hjelm	dc94b6a3fb	vader: minor fast box optimization This commit was SVN r26002.	2012-02-22 20:53:49 +00:00
Nathan Hjelm	4c7b7c675a	vader: minor code cleanup. move xpmem_create to component_init This commit was SVN r25999.	2012-02-22 18:32:40 +00:00
Mike Dubman	81bd5eee8d	in mxm, use sender_len field and not actual_len when returning result from probe This commit was SVN r25993.	2012-02-21 19:55:16 +00:00
Jeff Squyres	b6a90434e4	Fix some include file header ordering issues for some BSDs, suggested by Paul Hargrove. This commit was SVN r25984.	2012-02-21 13:32:14 +00:00
Jeff Squyres	7fb1144d27	Add missing AC_CONFIG_FILES This commit was SVN r25974.	2012-02-20 22:59:03 +00:00
Jeff Squyres	6a109c8e15	Ensure that we have aio.h before trying to compile this component. This commit was SVN r25966.	2012-02-20 15:53:20 +00:00
George Bosilca	bd9402fd8d	Fix call to error_cb. This commit was SVN r25946.	2012-02-17 03:18:51 +00:00
Jeff Squyres	4c0d24ff9a	Improve the help messages for tcp_btl_if_[in\|ex]clude This commit was SVN r25940.	2012-02-16 15:59:18 +00:00
Nathan Hjelm	1ee6d5d21a	ugni: fix typos in mca_btl_ugni_put This commit was SVN r25937.	2012-02-15 23:26:03 +00:00
Mike Dubman	6ec768f0c6	fix #2971 This commit was SVN r25908.	2012-02-12 09:28:42 +00:00
Nathan Hjelm	8c9dc990a9	ugni: fixed typo This commit was SVN r25899.	2012-02-10 01:27:31 +00:00
Nathan Hjelm	994127cb6b	ugni: move mca_btl_ugni_smsg_mbox_t back to btl_ugni_endpoint.h This commit was SVN r25898.	2012-02-10 01:11:54 +00:00
Nathan Hjelm	0ccfd3e6db	ugni btl update. changes: - re-enable sendi - move smsg common code into btl_ugni_smsg.h - added new parameters for smsg/eager frags - use get for frags larger than the smsg_limit - bug fixes - code cleanup This commit was SVN r25897.	2012-02-10 00:47:29 +00:00
Mike Dubman	b18a1611c3	- if everything is ok set return value to OMPI_SUCCESS in mtl/mxm This commit was SVN r25879.	2012-02-08 14:19:58 +00:00
Christopher Yeoh	bc26adcc32	Fixes trac:2998 Adds a lock to protect the sm pending_sends list from concurrent access Fixes bug where btl_sm_process_pending_sends would return an item to the free list and then continue to use it for a little while cmr:v1.6 This commit was SVN r25878. The following Trac tickets were found above: Ticket 2998 --> https://svn.open-mpi.org/trac/ompi/ticket/2998	2012-02-08 01:32:36 +00:00
Brian Barrett	25d48e22fa	Implementation of the MPI-3 Matched Probe functionality. Currently only implemented in the OB1 PML, will return NOT_SUPPORTED in other PMLs. This commit was SVN r25865.	2012-02-06 17:35:21 +00:00
Mike Dubman	6188ab7317	* ep init refactoring * split ep_info into fragments to fit PMI limit This commit was SVN r25857.	2012-02-02 15:00:47 +00:00
Vishwanath Venkatesan	15ebe838e9	Modified implementation of two_phase read all similar to the changes for the write all incorporating romio style partitioning. This commit was SVN r25853.	2012-02-01 18:30:13 +00:00
Vishwanath Venkatesan	158374bdd0	Dynamic file write all algorithm optimized by using derived datatype for receiving actual data thereby avoiding the merging step in the fbtl. This commit was SVN r25852.	2012-02-01 18:20:44 +00:00
Vishwanath Venkatesan	b9026ccbd0	Fix for two-phase generating flattened datatype using decoded iovec for handling non-contiguous memory and contiguous file cases. This commit was SVN r25850.	2012-02-01 17:23:51 +00:00
Jeff Squyres	4f35b62154	Components should not be linking to top-level libraries -- and definitely should not be linking to more than libmpi.la! (remember that libmpi.la now wholly contains libopen-rte.la, which wholly contains libopen-pal.la). This commit was SVN r25843.	2012-01-31 20:43:27 +00:00
Mike Dubman	92873872f5	revert r25813 This commit was SVN r25816. The following SVN revision numbers were found above: r25813 --> open-mpi/ompi@8ed781d7e9	2012-01-30 13:22:38 +00:00
Mike Dubman	8ed781d7e9	add mca param to enable/disable mxm This commit was SVN r25813.	2012-01-30 11:14:20 +00:00
Mike Dubman	9f0ca9dfc0	fix: extract source from imm request fields instead from depricated api This commit was SVN r25812.	2012-01-30 10:37:37 +00:00
Ralph Castain	a0edae52f2	Ensure the wrapper flags get entered in the right order, with -lpmi coming before the alps util libs This commit was SVN r25809.	2012-01-27 20:56:21 +00:00
Nathan Hjelm	97dad0ac49	ugni: don't release eager fragments until we get local smsg completion This commit was SVN r25796.	2012-01-27 00:32:43 +00:00
Nathan Hjelm	669f0afd14	ugni: poll smsg mailbox until it is empty This commit was SVN r25794.	2012-01-26 20:50:09 +00:00
Nathan Hjelm	2a83297f96	silence vader warnings This commit was SVN r25793.	2012-01-26 20:07:33 +00:00
Mike Dubman	6c954ad43f	set mxm to call opal_progress in tight loops This commit was SVN r25788.	2012-01-26 18:33:43 +00:00
Nathan Hjelm	3d9bc68435	reorder udcm init/finalize code. fixes trac:2973 This commit was SVN r25787. The following Trac tickets were found above: Ticket 2973 --> https://svn.open-mpi.org/trac/ompi/ticket/2973	2012-01-26 16:28:55 +00:00
Shiqing Fan	2c9a4beffd	Add and remove a few components for windows build. This commit was SVN r25775.	2012-01-25 09:01:27 +00:00
Nathan Hjelm	7b9bf6fe41	ugni: remove another erroneous error message This commit was SVN r25768.	2012-01-23 21:23:01 +00:00
Nathan Hjelm	f3b60062cb	ugni: remove erroneous error message This commit was SVN r25767.	2012-01-23 21:05:24 +00:00
Nathan Hjelm	521546aaa3	bug fix: ugni: pack only as many bytes as the pml requested This commit was SVN r25766.	2012-01-23 17:21:45 +00:00
Ralph Castain	f03b82ab0a	Don't fiddle with the port_name memory as, per standard, this is an input-only parameter This commit was SVN r25756.	2012-01-20 13:15:41 +00:00
Ralph Castain	be3dfb6a1a	Ensure that we only add -lpmi once to the wrapper compilers, no matter how many components might use it. This commit was SVN r25753.	2012-01-20 04:56:38 +00:00
Ralph Castain	f5c43e8d60	Get the header files into the tarball This commit was SVN r25746.	2012-01-19 21:02:20 +00:00
Nathan Hjelm	804a494036	zero out ugni fragments in constructor This commit was SVN r25731.	2012-01-17 19:52:26 +00:00
Vishwanath Venkatesan	1e95d8b1e2	remove the MPI functions used in these files by the OMPI internal corresponding functionality and also add error checking in these for functions which did not have them' This commit was SVN r25723.	2012-01-13 17:21:51 +00:00
Rolf vandeVaart	16d676aa5b	Fix minor issue with CUDA. Cannot register overlappiing regions. This commit was SVN r25716.	2012-01-12 13:00:42 +00:00
Samuel Gutierrez	63869c431b	init seg_num_procs_inited to zero before the atomic add. This commit was SVN r25710.	2012-01-11 03:37:23 +00:00
Nathan Hjelm	96c1df8d90	clean up vader registration code This commit was SVN r25704.	2012-01-10 22:33:22 +00:00
Edgar Gabriel	fb4d1a7099	remove the MPI functions used in this file by the OMPI internal corresponding functionality. This commit was SVN r25703.	2012-01-10 19:55:05 +00:00
Nathan Hjelm	f65f6f5c39	bugfix: ugni: increase smsg mailbox size to a multiple of 4096 This commit was SVN r25702.	2012-01-10 19:50:25 +00:00
Mike Dubman	37dc53bbc9	mxm: return the MXM_REQ_SEND_SYNC flag to mxm_req_send This commit was SVN r25694.	2012-01-06 18:56:28 +00:00
Mike Dubman	3b97d609a8	mtl_mxm: fix double free This commit was SVN r25693.	2012-01-06 16:22:58 +00:00
Samuel Gutierrez	d1a44ecd34	send packed buffers instead of using iovecs in common sm rml. this commit will hopefully resolve the periodic bus errors that some mtt tests have been encountering. This commit was SVN r25692.	2012-01-05 00:11:59 +00:00
Rolf vandeVaart	9441f33981	Improve an error message. Replace tabs with spaces. This commit was SVN r25688.	2012-01-03 15:19:01 +00:00
Rolf vandeVaart	8073f5002a	Some additional CUDA specific code. Adding a few more support functions that will be used in future development. This commit was SVN r25684.	2011-12-29 12:31:54 +00:00
Edgar Gabriel	e0139a2d7e	provide descriptions about the functionality of these frameworks. This commit was SVN r25682.	2011-12-22 19:42:00 +00:00
Vishwanath Venkatesan	0f928be1d5	Modifying selection logic back to select two-phase at the cases it should. This commit was SVN r25681.	2011-12-22 01:01:32 +00:00
Vishwanath Venkatesan	37c8470e3d	modified implementation for two-phase write_all incorporating romio style domain partitioning This commit was SVN r25680.	2011-12-22 00:16:29 +00:00
Vishwanath Venkatesan	738a67b704	Removing duplicate code while setting default file view and using internal file-set-view for setting the default file view This commit was SVN r25679.	2011-12-21 21:50:47 +00:00
Rolf vandeVaart	6ca186fb64	Delay some initialization until needed. This eliminates some warnings and removes need for CUDA init before MPI_Init. This commit was SVN r25678.	2011-12-21 15:21:57 +00:00
Samuel Gutierrez	519f71ab7e	silences valgrind warning in common sm (Syscall param writev(vector[...]) points to uninitialised byte(s)). probably also silences a large stack allocation warning in coverity. This commit was SVN r25666.	2011-12-16 23:17:48 +00:00
Samuel Gutierrez	0ca6603fa0	remove some unused cruft in shmem. minor common sm cleanup. This commit was SVN r25665.	2011-12-16 22:43:55 +00:00
Nathan Hjelm	71527c8058	minor ugni btl code cleanup This commit was SVN r25618.	2011-12-10 08:20:46 +00:00
Nathan Hjelm	c8a4687402	don't set SIGSEGV to default This commit was SVN r25610.	2011-12-09 21:54:05 +00:00
Nathan Hjelm	e03d23d96e	Intial support for Cray's uGNI interface (XE-6/XK-6) This commit was SVN r25608.	2011-12-09 21:24:07 +00:00
Nathan Hjelm	87b7e85d53	rfc timeout. retry registration after removing old registration from lru This commit was SVN r25587.	2011-12-07 18:20:44 +00:00
Josh Hursey	e56b4de2c9	Fixes trac:2550 : Cleanup comment in crcp_bkmrk_pml.h This commit was SVN r25585. The following Trac tickets were found above: Ticket 2550 --> https://svn.open-mpi.org/trac/ompi/ticket/2550	2011-12-07 14:50:04 +00:00
Jeff Squyres	c10f41c87e	Do not build these frameworks when --disable-mpi-io is specified. Fixes some Cisco MTT MPI install errors. This commit was SVN r25566.	2011-12-02 22:11:23 +00:00
Ralph Castain	07655e2945	Handle the case where the allocator "fibs" to us about the node names. In some cases (ahem...you know who you are!), the allocator will tell us a node number (e.g., "16"). However, the daemon will return a node name (e.g., "nid0016") - leaving us not recognizing its location. So provide a new parameter (can't have too many!) that handles this situation by stripping the prefix from the returned node name. Also do a little cleanup to ensure we cleanly exit from errors, without generating too many annoying messages. This commit was SVN r25562.	2011-12-02 14:10:08 +00:00
Ralph Castain	357ac14530	Can't return a numerical value here This commit was SVN r25559.	2011-12-02 10:36:57 +00:00
Nathan Hjelm	bb1fec0407	added put/get btl descriptor flags This commit was SVN r25553.	2011-11-30 21:37:23 +00:00
Ralph Castain	c56acf60ca	Although we never really thought about it, we made an unconscious assumption in the mapper system - we assumed that the daemons would be placed on nodes in the order that the nodes appear in the allocation. In other words, we assumed that the launch environment would map processes in node order. Turns out, this isn't necessarily true. The Cray, for example, launches processes in a toroidal pattern, thus causing the daemons to wind up somewhere other than what we thought. Other environments (e.g., slurm) are also capable of such behavior, depending upon the default mapping algorithm they are told to use. Resolve this problem by making the daemon-to-node assignment in the affected environments when the daemon calls back and tells us what node it is on. Order the nodes in the mapping list so they are in daemon-vpid order as opposed to the order in which they show in the allocation. For environments that don't exhibit this mapping behavior (e.g., rsh), this won't have any impact. Also, clean up the vm launch procedure a little bit so it more closely aligns with the state machine implementation that is coming, and remove some lingering "slave" code. This commit was SVN r25551.	2011-11-30 19:58:24 +00:00
Jeff Squyres	6fbbfd0f7a	Gah! r25545 acidentally included ''waaaay'' more stuff than it was supposed to. I.e., half-baked/not complete stuff. This commit backs out all of r25545. Sorry folks! This commit was SVN r25546. The following SVN revision numbers were found above: r25545 --> open-mpi/ompi@7f9ae11faf	2011-11-29 23:24:52 +00:00
Jeff Squyres	7f9ae11faf	Per http://www.open-mpi.org/community/lists/users/2011/11/17862.php , to make MPI_IN_PLACE (and other sentinel Fortran constants) work on OS X, we need to use the following compiler (linker) flag: -Wl,-commons,use_dylibs So if we're compiling on OS X, test to see if that flag works with the compiler. If so, add it to the wrapper FFLAGS and FCFLAGS (note that per a future update, we'll only have one Fortran compiler anyway). Fixes trac:1982. This commit was SVN r25545. The following Trac tickets were found above: Ticket 1982 --> https://svn.open-mpi.org/trac/ompi/ticket/1982	2011-11-29 23:05:54 +00:00
Terry Dontje	5209de048c	add code to service_thread_start to handle EBADF returns from select. This commit fixes trac:2922. This commit was SVN r25520. The following Trac tickets were found above: Ticket 2922 --> https://svn.open-mpi.org/trac/ompi/ticket/2922	2011-11-29 16:49:59 +00:00
Samuel Gutierrez	375162c693	this commit fixes a few things. 1. silence warning in common sm. 2. remove unneeded config code in common sm. 3. move opal_shmem_base_close to a better place in opal_finalize. 4. fix opal_path_nfs output. This commit was SVN r25518.	2011-11-28 23:41:19 +00:00
George Bosilca	0bd2bf9aae	The number of segments accepted should be bounded by MCA_BTL_DES_MAX_SEGMENTS and not by 2. This commit was SVN r25515.	2011-11-28 17:19:12 +00:00
Nathan Hjelm	f8c8c641f1	added asserts to warn developers that ob1/csum match fragments do not support more than 2 segments This commit was SVN r25514.	2011-11-28 16:12:25 +00:00
Samuel Gutierrez	b4edf0ff5c	getting ready for 1.5 port of the shared memory enhancements. remove some unused/unneeded stuff and minor style update. This commit was SVN r25513.	2011-11-28 16:08:32 +00:00
Ralph Castain	9b59d8de6f	This is actually a much smaller commit than it appears at first glance - it just touches a lot of files. The --without-rte-support configuration option has never really been implemented completely. The option caused various objects not to be defined and conditionally compiled some base functions, but did nothing to prevent build of the component libraries. Unfortunately, since many of those components use objects covered by the option, it caused builds to break if those components were allowed to build. Brian dealt with this in the past by creating platform files and using "no-build" to block the components. This was clunky, but acceptable when only one organization was using that option. However, that number has now expanded to at least two more locations. Accordingly, make --without-rte-support actually work by adding appropriate configury to prevent components from building when they shouldn't. While doing so, remove two frameworks (db and rmcast) that are no longer used as ORCM comes to a close (besides, they belonged in ORCM now anyway). Do some minor cleanups along the way. This commit was SVN r25497.	2011-11-22 21:24:35 +00:00
Ralph Castain	6310361532	At long last, the fabled revision to the affinity system has arrived. A more detailed explanation of how this all works will be presented here: https://svn.open-mpi.org/trac/ompi/wiki/ProcessPlacement The wiki page is incomplete at the moment, but I hope to complete it over the next few days. I will provide updates on the devel list. As the wiki page states, the default and most commonly used options remain unchanged (except as noted below). New, esoteric and complex options have been added, but unless you are a true masochist, you are unlikely to use many of them beyond perhaps an initial curiosity-motivated experimentation. In a nutshell, this commit revamps the map/rank/bind procedure to take into account topology info on the compute nodes. I have, for the most part, preserved the default behaviors, with three notable exceptions: 1. I have at long last bowed my head in submission to the system admin's of managed clusters. For years, they have complained about our default of allowing users to oversubscribe nodes - i.e., to run more processes on a node than allocated slots. Accordingly, I have modified the default behavior: if you are running off of hostfile/dash-host allocated nodes, then the default is to allow oversubscription. If you are running off of RM-allocated nodes, then the default is to NOT allow oversubscription. Flags to override these behaviors are provided, so this only affects the default behavior. 2. both cpus/rank and stride have been removed. The latter was demanded by those who didn't understand the purpose behind it - and I agreed as the users who requested it are no longer using it. The former was removed temporarily pending implementation. 3. vm launch is now the sole method for starting OMPI. It was just too darned hard to maintain multiple launch procedures - maybe someday, provided someone can demonstrate a reason to do so. As Jeff stated, it is impossible to fully test a change of this size. I have tested it on Linux and Mac, covering all the default and simple options, singletons, and comm_spawn. That said, I'm sure others will find problems, so I'll be watching MTT results until this stabilizes. This commit was SVN r25476.	2011-11-15 03:40:11 +00:00
Jeff Squyres	e8dcad6017	This typo has been here since August 2005. :-) This commit was SVN r25468.	2011-11-11 03:01:52 +00:00
Brian Barrett	45a27e4f9f	For now, ignore LINK event This commit was SVN r25467.	2011-11-11 02:49:03 +00:00
Brad Benton	96395c916e	de-tab'd This commit was SVN r25465.	2011-11-09 19:45:12 +00:00
Brad Benton	0712b911a5	Updated IBM copyright This commit was SVN r25464.	2011-11-09 19:38:53 +00:00
Mike Dubman	00c27afd52	fix pid This commit was SVN r25463.	2011-11-09 17:53:59 +00:00
Nathan Hjelm	d603f31976	removed ptr member from seg_key union This commit was SVN r25460.	2011-11-08 15:44:54 +00:00
Mike Dubman	71398b658e	fix: OMPI_ERR_CONNECTION_FAILED available in v1.5, unavailable in trunk This commit was SVN r25459.	2011-11-08 12:34:01 +00:00
Mike Dubman	4cf9e1323d	fix: return correct error on connection failure This commit was SVN r25452.	2011-11-07 06:13:17 +00:00
Nathan Hjelm	8962ce25b0	fixed some compiler errors caused by seg_key changes. osc/rdma may need to be updated to use btls that use 128 bit segment keys This commit was SVN r25448.	2011-11-06 20:19:14 +00:00
Samuel Gutierrez	e03bc93fb7	only use pmi grpcomm and pubsub during the direct launch case. use PMI environment variable to setup vpid in ess alps on cray xe systems. add pmi test code. This commit was SVN r25447.	2011-11-06 17:28:40 +00:00
Nathan Hjelm	520a7c570e	changes to seg_key needed for a new btl This commit was SVN r25445.	2011-11-06 16:19:09 +00:00
Rolf vandeVaart	f777fe8eba	Change tab to spaces. This commit was SVN r25433.	2011-11-04 17:18:30 +00:00
Christopher Yeoh	fb57a74a40	Removes pointless memmove which because of a previous memcpy will always have identical source and destination pointers. See #2871 Plugs a couple of minor memory leaks related to remote qp info This commit was SVN r25431.	2011-11-04 00:15:08 +00:00
Mike Dubman	7595a80a63	fix self pid This commit was SVN r25424.	2011-11-03 06:46:20 +00:00
Nathan Hjelm	211e2dbdf3	clean up tab characters This commit was SVN r25413.	2011-11-02 15:07:57 +00:00
Ralph Castain	14966e0f8f	Cleanup PMI startup - if a component isn't selected, it should finalize PMI IFF it started it. Otherwise, components that aren't selected can finalize PMI when it is in use by other parts of the system. This commit was SVN r25407.	2011-11-01 16:25:12 +00:00
Mike Dubman	3edd77ea25	update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag This commit was SVN r25403.	2011-10-31 22:36:15 +00:00
Mike Dubman	6b50ba22a6	select mxm ptl based on user preferences This commit was SVN r25401.	2011-10-31 10:17:43 +00:00
Samuel Gutierrez	0ba13e2f8e	fix typo. use PMI_Initialized for init status instead of PMI_Init. This commit was SVN r25378.	2011-10-27 22:41:50 +00:00
Nathan Hjelm	ee087de073	added fast boxes to vader This commit was SVN r25376.	2011-10-27 20:22:46 +00:00
Mike Dubman	f96ae43e23	pass jobid to mxm/sm module This commit was SVN r25375.	2011-10-27 13:14:52 +00:00
Nathan Hjelm	82efe131dc	made btl_vader_max_inline_send a configurable parameter and updated and enabled sendi This commit was SVN r25374.	2011-10-26 22:15:42 +00:00
Nathan Hjelm	033179d6ac	fixed bug in frag initialization This commit was SVN r25373.	2011-10-26 19:29:37 +00:00
George Bosilca	6fdb040eef	ORTE_ERROR to OPAL_ERROR. This commit was SVN r25372.	2011-10-26 15:59:43 +00:00
George Bosilca	9d8e84142f	Survivor!!! This commit was SVN r25371.	2011-10-26 00:58:55 +00:00
Nathan Hjelm	05114ffb51	fixed off by one error This commit was SVN r25369.	2011-10-25 22:07:47 +00:00
George Bosilca	72f731f25f	The SM2 collective component has not been updated in a long time. Rich, the original developer, agrees with this removal. This commit was SVN r25368.	2011-10-25 22:07:09 +00:00
Nathan Hjelm	e887d595c7	fix potential bug with non-contiguous sends This commit was SVN r25367.	2011-10-25 19:21:45 +00:00
Nathan Hjelm	433cfa3665	use single copy for some sends This commit was SVN r25365.	2011-10-25 18:38:42 +00:00
Mike Dubman	9ffeeb69d9	fix help message This commit was SVN r25364.	2011-10-25 14:02:43 +00:00
Samuel Gutierrez	663f4546f5	fix define typo in psm mtl. This commit was SVN r25362.	2011-10-24 18:38:12 +00:00
Ralph Castain	955d8e7d46	Allow apps to use pmi when launched by mpirun, if desired, without affecting daemons This commit was SVN r25359.	2011-10-23 15:57:13 +00:00
Nathan Hjelm	fb19f56965	Cray doesn't define PMI2_SUCCESS This commit was SVN r25354.	2011-10-21 16:34:22 +00:00
Nathan Hjelm	cd68dbe2b8	only try to build vader if xpmem is installed. unignore vader This commit was SVN r25352.	2011-10-21 15:45:05 +00:00
Ralph Castain	3e72fccacf	Cray's PMI implementation is quite different from slurm's - they extended PMI-1 by adding some, but not all, of the PMI-2 APIs. So you can't just switch to using PMI-2 functions as it isn't a complete implementation. Instead, you have to selectively figure out which ones they have in PMI-2, and use any missing ones from PMI-1. What fun. Modify the configure logic and the PMI components to accommodate Cray's approach. Refactor the PMI error reporting code so it resides in only one place. Cray actually decided -not- to define the PMI-2 error codes, so we have to use the PMI-1 codes instead. More fun. This commit was SVN r25348.	2011-10-21 04:54:38 +00:00
Ralph Castain	e2adc8fa3a	Ignore until Nathan can fix - probably configure problem This commit was SVN r25347.	2011-10-21 03:43:01 +00:00
Ralph Castain	5947f61b86	Remove windows reference for now This commit was SVN r25346.	2011-10-21 01:19:03 +00:00
Nathan Hjelm	414677a082	default to no xpmem support This commit was SVN r25345.	2011-10-20 22:13:45 +00:00
Nathan Hjelm	808a73a5c5	removed erroneous add of .deps This commit was SVN r25343.	2011-10-20 21:41:51 +00:00
Nathan Hjelm	3dbaaf6879	initial commit of vader (xpmem) btl This commit was SVN r25342.	2011-10-20 21:39:44 +00:00
Nathan Hjelm	586403f052	more pmi return code wtf This commit was SVN r25337.	2011-10-20 17:53:04 +00:00
Nathan Hjelm	e1e8837992	add a uintptr_t to the seg_key union This commit was SVN r25334.	2011-10-19 21:48:52 +00:00
George Bosilca	1bc5da0911	These are supposed to be OPAL level errors. This commit was SVN r25326.	2011-10-19 14:22:09 +00:00
Ralph Castain	72a4b0bd8a	Fix constants This commit was SVN r25325.	2011-10-19 14:14:58 +00:00
George Bosilca	efd88e10d7	Cleanup the error codes. Get rid of all the useless ones, and mark the distinction between ORTE and OMPI errors. This commit was SVN r25323.	2011-10-19 03:51:53 +00:00
Ralph Castain	0bf4f48aa3	Don't need priority in this framework This commit was SVN r25308.	2011-10-17 22:39:15 +00:00
Ralph Castain	8f0ef54130	Complete implementation of pmi support. Ensure we support both mpirun and direct launch within same configuration to avoid requiring separate builds. Add support for generic pmi, not just under slurm. Add publish/subscribe support, although slurm's pmi implementation will just return an error as it hasn't been done yet. This commit was SVN r25303.	2011-10-17 20:51:22 +00:00
Ralph Castain	e7f6be5385	Unused variable This commit was SVN r25301.	2011-10-17 18:59:22 +00:00
Ralph Castain	2eaadcfab9	Remove unused variable This commit was SVN r25284.	2011-10-14 15:32:18 +00:00
Vishwanath Venkatesan	8dd07bdceb	Removing .ompi_ignore and .ompi_unignore from fs/pvfs2 and fbtl/pvfs2 This commit was SVN r25283.	2011-10-14 00:40:11 +00:00
Vishwanath Venkatesan	8f6b29e95b	Fixing the default file view issue and merging contiguous lengths and offsets for explicit offset case. This commit was SVN r25281.	2011-10-13 19:50:45 +00:00
Jeff Squyres	2c6254b70d	Second change from Intel. This commit was SVN r25279.	2011-10-12 23:26:34 +00:00
Jeff Squyres	28118d0611	Updte the parameters for the Intel iWARP devices, per request from Faisal Latif <faisal.latif@intel.com>. This commit was SVN r25278.	2011-10-12 22:58:30 +00:00
Brian Barrett	d8b5b544ad	Update list name to match change in spec This commit was SVN r25273.	2011-10-12 20:09:39 +00:00
Rainer Keller	4e6a6fc146	- Check, whether the compiler supports __builtin_clz (count leading zeroes); if so, use it for bit-operations like opal_cube_dim and opal_hibit. Implement two versions of power-of-two. In case of opal_next_poweroftwo, this reduces the average execution time from 83 cycles to 4 cycles (Intel Nehalem, icc, -O2, inlining, measured rdtsc, with loop over 2^27 values). Numbers for other functions are similar (but of course heavily depend on the usage, e.g. opal_hibit() with a start of 4 does not save much). The bsr instruction on AMD Opteron is also not as fast. - Replace various places where the next power-of-two is computed. Tested on Intel Nehalem Cluster with openib, compilers GNU-4.6.1 and Intel-12.0.4 using mpi_testsuite -t "Collective" with 128 processes. This commit was SVN r25270.	2011-10-11 22:49:01 +00:00
George Bosilca	74c88a9e48	This was never used (sm_ctl_header). This commit was SVN r25267.	2011-10-11 20:37:00 +00:00
George Bosilca	ca6c282f23	Small cleanups in the SM BTL. This commit was SVN r25266.	2011-10-11 20:32:10 +00:00
George Bosilca	3241bea696	Apply a patch provided by Sébastien Boisvert fixing an issue with the probe fairness. This commit was SVN r25265.	2011-10-11 20:28:33 +00:00
George Bosilca	4fd78c4683	Keep track of the last probe on each communicator, so we can probe all peers in a round-robin fashion. A little bit more fair ... This commit was SVN r25264.	2011-10-11 20:24:54 +00:00
George Bosilca	2fefd3a928	Don't forget to move the pointer back by the true_lb. This commit was SVN r25262.	2011-10-11 20:15:49 +00:00
George Bosilca	649af6c925	Enumerated mixed with another type (int) is tolerated but easily fixable. This commit was SVN r25241.	2011-10-09 03:54:52 +00:00
George Bosilca	07f6ce235f	Return an OMPI_ error not an ORTE_. This commit was SVN r25232.	2011-10-04 14:57:24 +00:00
George Bosilca	ce7935c8fa	Obviously these were not needed. This commit was SVN r25231.	2011-10-04 14:56:34 +00:00
George Bosilca	80c02647c8	Each level (OPAL/ORTE/OMPI) should only return it's own constants, instead of the current mismatch. This commit was SVN r25230.	2011-10-04 14:50:31 +00:00
Mike Dubman	7a9ae43276	added support for shared memory transport in mxm This commit was SVN r25220.	2011-10-03 12:59:55 +00:00
Brian Barrett	fc29ffebdb	* remove two aborts that aren't necessary This commit was SVN r25214.	2011-09-29 22:27:23 +00:00
Brian Barrett	14f32a1a54	* Clean up progress function * Only print returnable errors when verbose=1. Still print errors when we're going to abort, since those obviously aren't returnable This commit was SVN r25213.	2011-09-29 22:26:33 +00:00
Brian Barrett	758f8a4d87	* More debugging output * Make recv short block events use the callback mechanism so that can add overflow debugging This commit was SVN r25212.	2011-09-29 21:59:48 +00:00
Brian Barrett	c08ea5c0f5	Set options correctly for the two pts This commit was SVN r25211.	2011-09-29 21:56:37 +00:00
Brian Barrett	05f800abae	Properly unpack data for long unexpected This commit was SVN r25210.	2011-09-29 17:25:45 +00:00
Rolf vandeVaart	3d8c6b83a9	Make some error messages more helpful This commit was SVN r25209.	2011-09-29 16:32:46 +00:00
Brian Barrett	bb9e73232a	* Leverage hdr_data and opcount to improve debugging * Clean up handling of short synchronous messages This commit was SVN r25208.	2011-09-28 21:18:47 +00:00
Brian Barrett	71d8300607	* Fix name clash with macros in mtl_portals4.h * hdr_data now includes opcount and length for all messages, which is the match bits for long and rndv messages * Re-add probe implementation This commit was SVN r25207.	2011-09-28 16:53:01 +00:00
Brian Barrett	2fb8045fad	clean up printfs This commit was SVN r25206.	2011-09-28 15:28:46 +00:00
Brian Barrett	26e781f002	* Remove triggered code for now * Move from per-endpoint send/recv count to just send side op count This commit was SVN r25205.	2011-09-28 15:25:39 +00:00
Brian Barrett	592c1ab6db	* revert probe and size information changes, since it seems to break everything This commit was SVN r25204.	2011-09-28 14:57:19 +00:00
Brian Barrett	211b5c7824	* Make triggered protocol only work for non-wildcard receives * Always encode length in header data to make probe work * General send/receive cleanups * Implement iprobe This commit was SVN r25197.	2011-09-27 22:45:00 +00:00
Brian Barrett	77c560be42	updates to match new api changes This commit was SVN r25196.	2011-09-27 20:38:22 +00:00
Brad Benton	0f2475c554	Modified set_remote_info() to use memmove() instead of memcpy() when copying rem_qp info. This avoids potential errors when src & dest overlap. This is a workaround for the issue in #2871 This commit was SVN r25180.	2011-09-26 20:07:36 +00:00
Vishwanath Venkatesan	2ee2b478d8	Modifying selection logic to select dynamic for cases where two_phase fails. This commit was SVN r25171.	2011-09-20 21:57:23 +00:00
Pavel Shamis	29c4981caa	Removing unused include from openib/ofud btls. This include causes compilation failure on macos platform. This commit was SVN r25170.	2011-09-20 19:25:59 +00:00
Rolf vandeVaart	0749a220e8	Add support for MPI_IN_PLACE to MPI_Exscan. Required for MPI 2.2 compliance. Reviewed by Jeff Squyres. This fixes trac:2221. This commit was SVN r25165. The following Trac tickets were found above: Ticket 2221 --> https://svn.open-mpi.org/trac/ompi/ticket/2221	2011-09-20 14:54:41 +00:00
Nathan Hjelm	98b56108c4	add unconnect datagram connection manager (udcm) This commit was SVN r25160.	2011-09-19 21:24:58 +00:00
Nathan Hjelm	8cd550f49f	fixed error in last commit This commit was SVN r25159.	2011-09-19 21:13:59 +00:00
Nathan Hjelm	de950959ee	print a more meaningful error message when ibv_create_qp fails This commit was SVN r25158.	2011-09-19 21:12:22 +00:00
Josh Hursey	2d25d70a1c	Missing header for opal_timer_base_get_cycles This commit was SVN r25157.	2011-09-19 19:52:58 +00:00
George Bosilca	9687e7f38e	This commit fixes trac:2679 and should be added to cmr:v1.4:reviewer=jsquyres and cmr:v1.5:reviewer=jsquyres This commit was SVN r25155. The following Trac tickets were found above: Ticket 2679 --> https://svn.open-mpi.org/trac/ompi/ticket/2679	2011-09-18 00:58:26 +00:00
Steve Wise	e4629259f0	Update T4 openib btl defaults. - add 2 new device ids. - default rq depth to 64, which proved good for large runs. This commit should be added to cmr:v1.4:reviewer=jsquyres and cmr:v1.5:reviewer=jsquyres This commit was SVN r25145.	2011-09-14 22:12:25 +00:00
Steve Wise	e5bba38434	Increase the rdmacm cpc address resolution timeout to 30 seconds. Global rdmacm_resolve_timeout defaults to 1000 (1000 ms), which is way too small for even a 16 node x 12 core iwarp cluster in the presence of drops. Bump up the default to 30000ms. This commit fixes trac:2860 and should be added to cmr:v1.4:reviewer=jsquyres and cmr:v1.5:reviewer=jsquyres This commit was SVN r25144. The following Trac tickets were found above: Ticket 2860 --> https://svn.open-mpi.org/trac/ompi/ticket/2860	2011-09-14 21:52:58 +00:00
Nathan Hjelm	3048ce043d	permanently disable ibcm This commit was SVN r25137.	2011-09-13 15:10:51 +00:00
Shiqing Fan	8bf5a61265	Fix another compile error for Windows. This commit was SVN r25129.	2011-09-12 14:19:42 +00:00
Ralph Castain	b11f93a039	Check add_procs return value This commit was SVN r25122.	2011-09-11 14:53:26 +00:00
Sylvain Jeaugey	002d39f345	Restored Bull vendor id for ConnectX card This commit was SVN r25121.	2011-09-07 15:58:42 +00:00
Edgar Gabriel	4bc2e9b023	fix a typo and add an actual pvfs function in the configure link-test. This commit was SVN r25120.	2011-09-07 15:46:41 +00:00
Edgar Gabriel	196c3819e2	- revamp the configure logic to detect pvfs2 and lustre - slight change in the selection logic of the fs module, which makes the ompio independent of the file system type (otherwise ompio would also have required a configure script). This commit was SVN r25118.	2011-09-07 10:39:47 +00:00
Mike Dubman	29b63fee81	better support of pml/cm for mxm This commit was SVN r25113.	2011-09-06 06:38:57 +00:00
Shiqing Fan	16193771ba	Add one missing header file. Fix the MTT build for Windows. This commit was SVN r25112.	2011-08-31 13:15:05 +00:00
Rainer Keller	9d5afc58c6	- Fix breakage of the epoch changes with PGI: Don't juse include pre-processor macros between two strins ("s1" #if 0 ... "s2")... Rather print out the epoch as 0 always... This commit was SVN r25110.	2011-08-31 08:40:31 +00:00
Wesley Bland	4e7ff0bd5e	By popular demand the epoch code is now disabled by default. To enable the epochs and the resilient orte code, use the configure flag: --enable-resilient-orte This will define both: ORTE_ENABLE_EPOCH ORTE_RESIL_ORTE This commit was SVN r25093.	2011-08-26 22:16:14 +00:00
Edgar Gabriel	9b6cf80074	- update the svn:ignore properties for some generated files - add ompi_unignore files for the pvfs2 and lustre components to debug the configure problem This commit was SVN r25090.	2011-08-26 13:23:29 +00:00
Edgar Gabriel	f46ef05c6e	ompi_ignore some components that depend on the configure logic, since some libs don't seem to propagate correctly under certain circumstances. This makes hopefully the nightly tests pass. also, remove the files that should not have been committed in the first place :-) This commit was SVN r25085.	2011-08-26 00:49:32 +00:00
Ralph Castain	71e74990de	Add missing includes so this compiles under Mac OSX This commit was SVN r25084.	2011-08-25 23:04:24 +00:00
Edgar Gabriel	4d23ea19cb	add a missing header file. This commit was SVN r25081.	2011-08-25 21:28:28 +00:00
Edgar Gabriel	61ac1dbcf3	silence some warnings. This commit was SVN r25080.	2011-08-25 21:22:34 +00:00
Edgar Gabriel	52063267df	commit of the OMPIO modules and frameworks. This commit was SVN r25079.	2011-08-25 20:08:17 +00:00
Mike Dubman	98f382ba0e	fixes in mxm mtl This commit was SVN r25066.	2011-08-19 22:18:17 +00:00
Shiqing Fan	627f1dd351	Correct several export declarations. This commit was SVN r25047.	2011-08-15 09:45:51 +00:00
Jeff Squyres	1cbfb53801	r24976 wasn't quite right -- you now actually get a warning if you specify btl_tcp_if_include because btl_tcp_if_exclude is defaulted to the loopback devices. This commit does a few things: * Introduce a new OPAL MCA base function: mca_base_param_check_exclusive_string(). It checks to see that the ''user'' does not set two MCA parameters that are mutually exclusive by checking the source of those MCS param values. * Use the above function in many BTLs (and the OOB TCP) to ensure that <foo>_if_include and <foo>_if_exclude are not both specified ''by the user''. * Re-arrange many of these BTLs to move their MCA registration code into a separate component_register() function (vs. the component_open() function). This code has been nominally reviewed and checked by Ralph, George, Terry, and Shiqing. This commit was SVN r25043. The following SVN revision numbers were found above: r24976 --> open-mpi/ompi@8f4ac54336	2011-08-10 17:24:36 +00:00
Mike Dubman	e3c869d83b	fix double free This commit was SVN r25041.	2011-08-10 05:47:55 +00:00
Mike Dubman	a751cd93d3	improve debug macro availability This commit was SVN r25022.	2011-08-09 10:54:08 +00:00
Mike Dubman	bfd75de6f9	fix selection logic: if no suitable device found - disqulaify mxm w/o complains. This commit was SVN r25021.	2011-08-09 07:09:37 +00:00
Wesley Bland	09274cd047	Make sure that the epoch is initialized everywhere so we don't get weird output during valgrind. This shouldn't have caused any problems with any actual execution. Just extra warnings in valgrind. This commit was SVN r25015.	2011-08-08 15:11:55 +00:00
Mike Dubman	1d3f5e1314	better mxm selection mechanism, some refactoring This commit was SVN r25005.	2011-08-07 12:06:49 +00:00
Yevgeny Kliteynik	7068dc64eb	Dynamic SL rework: - Added dynamic SL support to xoob - Fixed seg fault in finalization - All the code has been moved to separate files: connect/btl_openib_connect_sl.{c,h} - The new files compilation is conditionalized This commit was SVN r24991.	2011-08-04 20:26:08 +00:00
Rolf vandeVaart	3d3b3d4dad	Add support for CUDA registering sm and openib buffers. Feature is disabled by default. This commit was SVN r24987.	2011-08-04 10:15:45 +00:00
Mike Dubman	7b18ab2fa9	remove unused includes This commit was SVN r24980.	2011-08-03 07:07:29 +00:00
Mike Dubman	45ea375531	code and readme updates, some refactoring This commit was SVN r24977.	2011-08-02 14:30:11 +00:00
Jeff Squyres	8f4ac54336	Fixes trac:2838: add a warning message and disqualify the TCP BTL if both btl_tcp_if_include and btl_tcp_if_exclude are specified. This commit was SVN r24976. The following Trac tickets were found above: Ticket 2838 --> https://svn.open-mpi.org/trac/ompi/ticket/2838	2011-08-01 23:30:33 +00:00
Yevgeny Kliteynik	c1ab24c687	openib: added Mellanox ConnectX3 device ID to the device parameters ini file This commit was SVN r24947.	2011-07-26 12:06:43 +00:00
Mike Dubman	aefffa073d	initial implementation of MXM MTL layer This commit was SVN r24946.	2011-07-26 04:36:21 +00:00
Mike Dubman	96ef2fc0e4	fix handling datatypes which have a gap in the beginning This commit was SVN r24936.	2011-07-25 06:30:09 +00:00
Terry Dontje	fbda6aaf89	Fixes trac:2532 issues with 32-bit binaries This commit was SVN r24891. The following Trac tickets were found above: Ticket 2532 --> https://svn.open-mpi.org/trac/ompi/ticket/2532	2011-07-13 16:38:03 +00:00
Jeff Squyres	51ac69b05f	Remove a now-nonexistent file This commit was SVN r24874.	2011-07-11 23:51:41 +00:00
Abhishek Kulkarni	5501f83fb5	shmem fixes to make the trunk build with C/R flags on. This commit was SVN r24871.	2011-07-10 23:32:23 +00:00
Jeff Squyres	b2b781e537	Fix a few miscelaneous memory leaks. This commit was SVN r24865.	2011-07-08 16:39:58 +00:00
Mike Dubman	fd17f20ed5	Currently MTLs do no handle communicator contexts in any special way, they only add the context id to the tag selection of the underlying messaging meachinsm. We would like to enable an MTL to maintain its own context data per-communicator. This way an MTL will be able to queue incoming eager messages and rendezvous requests per-communicator basis. The MTL will be allowed to override comm->c_pml_comm member, since it's unused in pml_cm anyway. This commit was SVN r24858.	2011-07-06 18:25:49 +00:00
Shiqing Fan	1ed0f40d35	Fix a few type casts on Windows. This commit was SVN r24857.	2011-07-06 08:08:53 +00:00
Yevgeny Kliteynik	4fbe68dd86	Removing trailing white spaces in all the openib btl code. This commit was SVN r24855.	2011-07-04 14:00:41 +00:00
Yevgeny Kliteynik	5cae33503d	Changing the weird non-ASCII sign with '*' This commit was SVN r24854.	2011-07-04 13:39:38 +00:00
Brian Barrett	a4b2bd903b	* Implement long-ago discussed RFC to add a callback data pointer in the request completion callback * Use the completion callback pointer to remove all need for opal_progress calls in the one-sided layer This commit was SVN r24848.	2011-06-30 20:05:16 +00:00
Rolf vandeVaart	e6295159ae	Fix compilation of file due to some changes in btl structure. This commit was SVN r24847.	2011-06-30 19:22:41 +00:00
Yevgeny Kliteynik	b05211148d	Supporting dynamic SL (#2674 ) - Added enable/disable configuration parameter for dynamic SL - All the dynamic SL code is conditionalized - Removed libibmad dependency - Using only one include - ib_types.h (part of opensm-devel package) - Removed all the macro and data types definitions, using the existing definitions from ib_types.h instead - general cleaning here and there The async mode is not implemented yet - stay tuned... This commit was SVN r24830.	2011-06-28 14:28:29 +00:00
Wesley Bland	84be81df95	Standardize the initialization of the EPOCH's. Everyone will be starting at MIN anyway (until we implement restart of course) so there's no reason to set the epoch to INVALID and then immediately reset them to MIN. This way there's less room to make mistakes later. This commit was SVN r24829.	2011-06-28 14:20:33 +00:00
Wesley Bland	e1ba09ad51	Add a resilience to ORTE. Allows the runtime to continue after a process (or ORTED) failure. Note that more work will be necessary to allow the MPI layer to take advantage of this. Per RFC: http://www.open-mpi.org/community/lists/devel/2011/06/9299.php This commit was SVN r24815.	2011-06-23 20:38:02 +00:00
Brian Barrett	e8817f3f63	* Don't send acks for expected triggered messages; still need to get the rest of the data * Don't ask for UNLINK events for persistent long unexpected ME or the get MEs. This commit was SVN r24814.	2011-06-23 16:21:10 +00:00
Samuel Gutierrez	81f38b258a	commit of new shared memory backing facility framework (shmem) and its components. This commit was SVN r24795.	2011-06-21 15:41:57 +00:00
Jeff Squyres	3d8ef08912	Minor updates. This commit was SVN r24791.	2011-06-20 17:59:37 +00:00
Jeff Squyres	c4f9debe21	Fix some names -- PTLs died a long time ago! This commit was SVN r24787.	2011-06-20 17:28:27 +00:00
George Bosilca	65661a3cb4	Dont use a temporary string. This commit was SVN r24786.	2011-06-20 09:29:19 +00:00
Brian Barrett	09d89242d6	Crank up the number of short receive blocks so that we're unlikely to hit the flow control case. Uses about same amount of memory as the Portals 3.3 implementations This commit was SVN r24782.	2011-06-16 21:58:53 +00:00
Brian Barrett	4fec0c198d	updtae short recv blocks to properly setup for triggered operations (where they also store the triggered start message) This commit was SVN r24777.	2011-06-16 16:51:59 +00:00
Brian Barrett	83154af74d	Check return codes a bit more closely Fix broken debug output in any_source recv case Other minor code cleanups This commit was SVN r24774.	2011-06-13 15:18:55 +00:00
Edgar Gabriel	0173a00f6b	replace the switch-case statement on the basic datatypes by a series of if-elseif statements to make it compile with OpenMPi again. Fixes trac:2808 This commit was SVN r24768. The following Trac tickets were found above: Ticket 2808 --> https://svn.open-mpi.org/trac/ompi/ticket/2808	2011-06-09 15:35:35 +00:00
Rolf vandeVaart	610421a0da	Fix registration of common parameters in sm btl. This was broken by earlier checkin. Now we can adjust them via MCA parameters again and see the right values from ompi_info. This commit was SVN r24763.	2011-06-09 13:57:46 +00:00
Brian Barrett	a7c682cdb0	Fix starting buffer point for triggered get. Should be after the eager part of the message This commit was SVN r24752.	2011-06-06 17:08:13 +00:00
Rolf vandeVaart	d1fdbadc91	Fix broken basic allocator. Not sure how this ever worked. This commit was SVN r24746.	2011-06-03 14:43:54 +00:00
Brian Barrett	b778d785fb	Add some debugging output and fix some places where the output id and verbosity level were swapped This commit was SVN r24740.	2011-06-01 17:20:18 +00:00
Brian Barrett	37d5c7e2ca	* Add ability to set long protocol with MCA parameter * Instead of static arrays of send/recv counts, put them in the endpoint This commit was SVN r24735.	2011-05-26 21:53:39 +00:00
Brian Barrett	beb1bc70b2	* Add support for using modex to exchange NID/PID pairs when using Portals4. Rather than try to support a bunch of lightweight environments like I did with the Portals3 code, always use the "modex" and hack the grpcomm for the SHMEM implementation to return the right nid/pid for a remote process by "magic". This commit was SVN r24733.	2011-05-25 22:10:27 +00:00
Ralph Castain	b47ec2ee87	Remove lingering references to opal_profile option This commit was SVN r24709.	2011-05-18 18:27:29 +00:00
Ralph Castain	502cc0747f	My my...cleanup a disconnect between the man pages and how we implemented comm_spawn_multiple. We allow an info key per executable. Also fix the -host and -add-host info keys - they are supposed to accept comma-separated lists. This commit was SVN r24706.	2011-05-17 20:12:31 +00:00
Mike Dubman	36db9c6233	* updated copyrights * added support for non-contig data layout in FCA This commit was SVN r24702.	2011-05-16 14:43:11 +00:00
Brian Barrett	d8b7ea315e	First take at implementing rndv and triggered protocols This commit was SVN r24699.	2011-05-13 05:57:16 +00:00
Brian Barrett	43902221cc	* Fix bad argument to PtlGet in long receive * Fix bad params when configuring ME for long unexpected This commit was SVN r24698.	2011-05-13 03:56:03 +00:00
Brian Barrett	8376e0e507	Use free list get instead of wait; this is a constrained resource that will never come back, as it scales with the number of windows and not some more dynamic resources... This commit was SVN r24685.	2011-05-05 17:19:59 +00:00
Jeff Squyres	d1d2cd0a87	Make the description of mca_btl_openib_cq_size be more accurate of what it really is/does. cmr:v1.5.4:kliteyn cmr:v1.4.4:reviewer=kliteyn This commit was SVN r24684.	2011-05-05 13:10:11 +00:00
Brian Barrett	3d4b7ecbaf	updates for API changes This commit was SVN r24628.	2011-04-20 16:48:27 +00:00
Jeff Squyres	25a8944e09	Fixes trac:2776. Let the openib BTL auto-detect its bandwidth. cmr:v1.5.4 This commit was SVN r24621. The following Trac tickets were found above: Ticket 2776 --> https://svn.open-mpi.org/trac/ompi/ticket/2776	2011-04-19 16:31:36 +00:00
Josh Hursey	045035963a	Fix return code from MPI_Probe and MPI_Iprobe. Instead of returning MPI_SUCCESS every time they are called regardless of the status of the call, they should return a value representative of the action. So similar to MPI_Wait/MPI_Test they will return MPI_SUCCESS if the action was successfull, or the value that matches status.MPI_ERROR for the operation if it is unsuccessful. This was discussed on the [http://www.open-mpi.org/community/lists/devel/2011/03/9109.php ompi-devel list] This commit was SVN r24551.	2011-03-22 13:29:29 +00:00
Eugene Loh	2770a12beb	Continue clean up of thread options started in r22841, 22842, and 22849. No need for any CMRs to 1.5... that was already done in CMR 2728. This commit was SVN r24545. The following SVN revision numbers were found above: r22841 --> open-mpi/ompi@b400b84162	2011-03-18 21:36:35 +00:00
Jeff Squyres	82f9474fec	Revert r24533 and r24507 until the compile errors can be fixed. This commit was SVN r24541. The following SVN revision numbers were found above: r24507 --> open-mpi/ompi@4ce1936fed r24533 --> open-mpi/ompi@3204af2d36	2011-03-18 13:33:02 +00:00
George Bosilca	13d2998d54	When the BTL TCP is trying to connect to a peer, output it's process name in addition to all the information. This commit was SVN r24534.	2011-03-16 20:20:14 +00:00
Mike Dubman	3204af2d36	* temporary fix for ib btl compilation with old ofed versions 1.3.x. This commit was SVN r24533.	2011-03-16 17:53:51 +00:00
Ralph Castain	3e2c836e51	Initial cut at integrating new mapper capabilities into comm_spawn. Support specification of a mapper to use, and setting of npernode value. Other info flags can also be defined, but these will serve as examples for now - someone who wants to extend this to all the available mapping controls is welcome to do so. This commit was SVN r24522.	2011-03-12 15:39:56 +00:00
Samuel Gutierrez	0867454a06	Fixes CID #1665 . This commit was SVN r24519.	2011-03-12 03:41:49 +00:00
Samuel Gutierrez	5cff21842a	a friday night in sf, nm. fixes CID 1666. This commit was SVN r24517.	2011-03-12 02:39:31 +00:00
Doron Shoham	4ce1936fed	Fix the following for dynamic SL patch: * rename ib_path_rec_service_level -> ib_path_record_service_level * use mad.h and ib_types.h * free all resources * move ibv_post_recv to be just before ibv_post_send * cleanup and beatify code This commit was SVN r24507.	2011-03-10 16:19:00 +00:00
Jeff Squyres	ec90a3ba6d	Fix a few memory leaks, and ensure that coll sm is also registering the common SM MCA params. This commit was SVN r24497.	2011-03-08 17:36:59 +00:00
Jeff Squyres	2600672b31	Fix minor memory leak. This commit was SVN r24494.	2011-03-08 15:21:33 +00:00
Ralph Castain	63f38e38bb	Fix ompi-server: remove extra command flag in buffer being sent to mpirun, ensure that tools route messages thru a remote HNP This commit was SVN r24491.	2011-03-05 17:12:46 +00:00
Jeff Squyres	67cdecf262	Add svn:ignore and a configure.m4 that always fails (because this component is only built via cmake... right?). This commit was SVN r24487.	2011-03-03 18:00:42 +00:00
Shiqing Fan	8cefb6e67f	Fix the errors in Makefile, the source file list was not updated. Sorry about the MTT create failure. This commit was SVN r24485.	2011-03-03 15:20:30 +00:00
Mike Dubman	70392ac1dc	fca: broadcast comm_new return status to from rank0 to all ranks prior to exiting with an error This commit was SVN r24481.	2011-03-02 22:18:43 +00:00
Shiqing Fan	8c2865bd6a	Add the new WinVerbs btl component. This commit was SVN r24477.	2011-03-02 09:14:44 +00:00
George Bosilca	27fecda12c	Allow the one sided components to correctly retrieve the op to be applied. Correct the MPI validation process of the MPI_Accumulate arguments. Fix another potential problem not yet reported. If we convert the MPI datatypes direclty into OPAL datatypes, we will restrict their number to the locally different types. Which might not be identical on the remote node, if we are in a heterogeneous environment. So, for MPI One sided only deal with MPI level types, never simplify them on OPAL types (at least on the args). The unfortunate outcome is that we need to create the args for all datatypes. This commit was SVN r24466.	2011-02-25 20:43:17 +00:00
George Bosilca	4184baa67a	Remove the proc_guid from the BTL proc structure. Instead use directly the one stored in the ompi_proc_t. This commit was SVN r24461.	2011-02-25 00:36:08 +00:00
George Bosilca	6fc4c22037	Pedantic. This commit was SVN r24460.	2011-02-25 00:29:48 +00:00
George Bosilca	ceb519a026	Fix an annoying warning from gcc about uninitialized variables. This commit was SVN r24459.	2011-02-25 00:29:20 +00:00
George Bosilca	87f3109df4	Cleanups. This commit was SVN r24458.	2011-02-25 00:28:32 +00:00
Jeff Squyres	4cb8a42e7b	Add btl_openib_gid_index MCA param to allow selecting which GID to use from an openfabrics port's GID table. This commit was SVN r24456.	2011-02-24 14:09:22 +00:00
Doron Shoham	47a0752856	max_hw_msg_size should be 0 (default) or greater This commit was SVN r24455.	2011-02-24 09:17:18 +00:00
Doron Shoham	e41e15c8db	cosmetic fixes in openib btl: * replace tabs with ws * remove unnecessary casting * use proper escape codes for printf() like functions This commit was SVN r24445.	2011-02-23 15:50:37 +00:00
Jeff Squyres	b468c71b47	Use complete types This commit was SVN r24434.	2011-02-22 22:34:44 +00:00
Shiqing Fan	90eeba252e	Make openib compile again for Windows. Update the CMake script for checking mca subdirs. Add windows support for __attribute__ packed structures. Define usleep and posix_memalign with equivalent windows functions. And a few minor fixes, type casts. This commit was SVN r24429.	2011-02-22 15:49:27 +00:00
Doron Shoham	e5eef80364	fix type warning in openib btl This commit was SVN r24419.	2011-02-21 15:13:30 +00:00
Mike Dubman	89ba89e812	- added support for upcomming FCA v2.1 version This commit was SVN r24418.	2011-02-21 14:08:24 +00:00
Donald Kerr	995d46344c	simplify the way IBV_ACCESS_SO is discovered This commit was SVN r24409.	2011-02-17 04:28:56 +00:00
Donald Kerr	2b60b165aa	on Solaris, when IBV_ACCESS_SO is available, use strong ordered memory region for eager rdma connection This commit was SVN r24395.	2011-02-16 05:37:22 +00:00
Mike Dubman	81222e1fe7	* fix PGI compiler support which does not have __BASE_FILE__ macro This commit was SVN r24369.	2011-02-10 06:42:37 +00:00
Ethan Mallove	c6fd141923	missing include This commit was SVN r24368.	2011-02-09 17:59:55 +00:00
Eugene Loh	cd5c2e794f	Some minor changes to help the openib BTL build and run on Solaris: - poll() can return POLLRDNORM even if not requested (Solaris bug) - MIN macro not defined in btl_openib.c and while we're at it, we clean up the MIN definition in ad_bgl_pset.h - btl_openib_connect_rdmacm.c was calling rdma_destroy_id() twice leading to undefined behavior (a hang on Solaris) This commit was SVN r24356.	2011-02-03 23:53:21 +00:00
Jeff Squyres	30e164e246	Fix all the problems with "make distcheck" caused by the new ROMIO import so that we can finally get a trunk nightly tarball\! This commit was SVN r24292.	2011-01-24 21:10:14 +00:00
Jeff Squyres	700d601dfc	Also need to check the "flag" value, because if flag!=true, then the value of "local_spawn" (and "non_mpi") is not set by ompi_info_get*(). This commit was SVN r24286.	2011-01-22 16:27:58 +00:00
Jeff Squyres	89fb26eb1c	Add missing line continuation character to prevent a Makefile syntax error This commit was SVN r24285.	2011-01-22 11:13:28 +00:00
Rolf vandeVaart	8171370287	Fix typo which broke builds when configured with hetero and debug. This commit was SVN r24283.	2011-01-21 17:10:09 +00:00
Abhishek Kulkarni	a1090575c2	Nitpick: Get rid of a redundant OPAL_SOS_GET_ERROR_CODE. This commit was SVN r24282.	2011-01-20 23:48:11 +00:00
Abhishek Kulkarni	3243b16bb3	Decode SOS error code before checking it with the native error code. This commit was SVN r24281.	2011-01-20 23:21:38 +00:00
Rolf vandeVaart	6a5ad29c36	Update configure command since it changed. This commit was SVN r24275.	2011-01-20 14:42:12 +00:00
Sylvain Jeaugey	46b711e164	Fixes trac:1888 introduced by r24264 : make Romio autogen.sh executable. This commit was SVN r24272. The following SVN revision numbers were found above: r24264 --> open-mpi/ompi@0e921bba7f The following Trac tickets were found above: Ticket 1888 --> https://svn.open-mpi.org/trac/ompi/ticket/1888	2011-01-20 09:20:34 +00:00
Rolf vandeVaart	acd38ff746	Final changes from jsquyres review. Moved configure code from upper level into btl configure.m4. Changed prefix from "OMPI" to "BTL" in preprocessor macro. Add an mca param that shows it has been configured in. This commit was SVN r24270.	2011-01-19 20:58:22 +00:00
Brian Barrett	4859bb82e2	* Update to support direct call * Add missing cancel (not that it does anything useful) * Fix bug in opal_output call This commit was SVN r24269.	2011-01-19 20:49:28 +00:00
Brian Barrett	8f6a19b0fc	export component/module interface so that direct call works again This commit was SVN r24268.	2011-01-19 20:47:17 +00:00
Rolf vandeVaart	f22f76a6ff	Add byte swapping macro for failover control message per jsquyres review. This commit was SVN r24266.	2011-01-19 19:58:35 +00:00
Rolf vandeVaart	e75b86d3ab	Fix some issues from jsquyres review. 1. Use asprintf instead of snprintf 2. Return remote_proc where possible. 3. Remove dead code. 4. Fix two comment typos. This commit was SVN r24265.	2011-01-19 16:09:17 +00:00
Sylvain Jeaugey	0e921bba7f	Romio Refresh from mpich2-1.3.1. Work by Pascal Deveze, tested through bitbucket by Jeff Squyres (https://bitbucket.org/devezep/new-romio-for-openmpi ). This commit was SVN r24264.	2011-01-19 15:55:10 +00:00
George Bosilca	5390fd6f33	Reshape the datatype engine. The basic types are built down in OPAL. MPI types are either direct link to these basic predefined types, or a combination of them. Anyway, the first items in the datatype list belong to OPAL, the second round are MPI datatypes created by composing basic OPAL datatypes, and the last batch are mapped datatype (direct correspondance between an OMPI datatype and an OPAL one such as int -> int32_t). Modify the op to fit this new scheme. This commit was SVN r24247.	2011-01-13 06:08:54 +00:00
Jeff Squyres	f08433c1e1	Fixes trac:2669. Apparently, gcc 4.4.x and 4.5.x complain about the ''possibility'' of us calling free() on a non-heap variable. We know that this case can never happen because the refcount will absolutely not go to zero here. We think it may be gcc being a bit too aggressive on the warnings. However, since this happens with gcc 4.4.x and 4.5.x, and since gcc 4.5.x ship in RHEL6 and Fedora 14 (and others), someone '''will''' complain about this in the future, so we might as well code around it so that we don't have to keep explaining "despite the warning, it's really ok." The workaround is pretty simple: just OBJ_RELEASE the values from ompi_mpi_comm_parent before it is re-assigned to the new intercommunicator. Then the compiler's static code analysis can't possibly tell that it's not a heap variable, and we're ok. So yes, we are still calling OBJ_RELEASE on a non-heap variable. But free() '''will never be called''' on it because of the refcount. This commit was SVN r24214. The following Trac tickets were found above: Ticket 2669 --> https://svn.open-mpi.org/trac/ompi/ticket/2669	2011-01-10 21:12:27 +00:00
Ethan Mallove	82054cb02c	Include <stdlib.h> instead of <malloc.h>. This avoids a compiler error on some systems caused by the definition of malloc in opal_config_bottom.h getting expanded in the system malloc.h when OPAL_ENABLE_MEM_DEBUG is set to 1. This commit was SVN r24210.	2011-01-06 18:16:36 +00:00
Jeff Squyres	58445f3775	After being hit by "why is openib not working?" ''again'', add a verbose statement that shows up when you --mca btl_base_verbose 100. It clearly states that the openib BTL disqualifies itself when MPI_THREAD_MULTIPLE is used. This commit was SVN r24209.	2011-01-05 22:01:15 +00:00
Eugene Loh	9bbcd51c5a	Properly initialize ep->btl_max_send_size, ep->btl_pipeline_send_length, and ep->btl_send_limit in mca_bml_r2_del_proc_btl() so that the loops will correctly compute new endpoint max/min after the BTL has been removed. See http://www.open-mpi.org/community/lists/devel/2011/01/8829.php This commit was SVN r24202.	2011-01-04 20:35:33 +00:00
Mike Dubman	4a2e29eb32	updated Makefile with a new file This commit was SVN r24199.	2011-01-01 14:11:49 +00:00
Mike Dubman	c56e3141cb	fca: fix segmentation fault when no underlying collective implementation is found This commit was SVN r24198.	2010-12-31 12:03:49 +00:00
Josh Hursey	bbfdf04a81	Fix a couple of 'unused variable' warnings, and one return value warning. {{{ base/paffinity_base_service.c: In function ‘opal_paffinity_base_cset2mapstr’: base/paffinity_base_service.c:623: warning: unused variable ‘range_last’ base/paffinity_base_service.c:623: warning: unused variable ‘range_first’ base/paffinity_base_service.c:622: warning: unused variable ‘count’ base/paffinity_base_service.c:622: warning: unused variable ‘m’ }}} {{{ connect/btl_openib_connect_oob.c: In function ‘init_ud_qp’: connect/btl_openib_connect_oob.c:1111: warning: control reaches end of non-void function connect/btl_openib_connect_oob.c: In function ‘init_device’: connect/btl_openib_connect_oob.c:1235: warning: unused variable ‘i’ connect/btl_openib_connect_oob.c: In function ‘get_pathrecord_sl’: connect/btl_openib_connect_oob.c:1323: warning: unused variable ‘i’ }}} This commit was SVN r24196.	2010-12-30 15:37:50 +00:00
Doron Shoham	834625cc51	Currently the service lever is passed as static parameter (ib_service_level), but the service level is possibly dynamic and if so the only way to get a proper value is to ask the SA. New mca parameter is added (ib_path_rec_service_level) - positive value means that we should get the SL from the SA. This is usable for torus topologies where different SL value is used for different endpoints. A cache is kept of ib queue pairs used to communicate with the SA for a particular device and port and path record SL values retrieved from that SA. The interaction with the cache assumes that there are no recursive calls to these routines. This must be solved either by code flow, by using higher level locks, or by adding a locking mechanism to these routines along with some method for avoiding deadlock. This code use a UD queue pair to talk to the SA, and not need to chmod /dev/infiniband/umad* for use by normal users. The request to the SA is a SubnAdmGet(), not a SubnAdmGetTable(). In the future we might add a support of a SubnAdmGetTable(), but it will require implementing RMPP (Reliable Multi-Packet Transaction Protocol) and I'm not sure we want to do that. This patched is based on the work of David McMillen <davem@systemfabricworks.com>. This commit was SVN r24195.	2010-12-30 08:20:24 +00:00
Mike Dubman	3d517c0285	ABI cleanups This commit was SVN r24193.	2010-12-28 07:11:46 +00:00
Mike Dubman	b339a7a07b	Add FCA 1.2/2.0 backward compatibility, depending on OMPI_FCA_VERSION_xx macro definition. This commit was SVN r24192.	2010-12-27 21:32:34 +00:00
Doron Shoham	bfe611d3bd	This patch fixes bugs #2627 (1.5.2) and #2623 (1.4.2) - Sending large messages over RDMA fails. The patch includes the following: * Add new mca parameter - btl_openib_max_hw_msg_size - Maximum size (in bytes) of a single fragment of a long message when using the RDMA protocols (must be > 0 and <= hw capabilities). * If btl_openib_max_hw_msg_size is larger than the maximum hw limitation print error message. * Change the default openib flags to include only PUT and not GET. * Print error message if user choose manually GET flag in openib btl. * In prepare_dst: limit the message size to be the minimum of both endpoint's hw_limitation and the user limitation (if requested). This commit was SVN r24191.	2010-12-23 11:48:43 +00:00
Brian Barrett	9876e65137	Fix race condition in unlock code, as well as a small memory leak. Somehow they got fixed in the pt2pt implementation, but not the RDMA implementation. Thanks to Guillaume Thouvenin for finding this issue. This commit was SVN r24188.	2010-12-20 22:15:29 +00:00
Rolf vandeVaart	a57e5587f6	Brief description of bfo functionality. This commit was SVN r24186.	2010-12-17 19:12:00 +00:00
Jeff Squyres	b113b1a382	Add the btl_tcp_if_seq MCA parameter. From the help string: If specified, a comma-delimited list of TCP interfaces. Interfaces will be assigned, one to each MPI process, in a round-robin fashion on each server. For example, if the list is "eth0,eth1" and four MPI processes are run on a single server, then local ranks 0 and 2 will use eth0 and local ranks 1 and 3 will use eth1. This feature is only useful for environments with virtual ethernet interfaces on the same network. For example, if eth0 and eth1 are virtual interfaces to the same NIC on the same subnet, and if the NIC provides different hardware resources to eth0 and eth1 (not just different kernel resources), some HOL blocking and congestion issues can be eased in a modest fashion. This commit was SVN r24181.	2010-12-16 00:54:32 +00:00
Samuel Gutierrez	7d8d7769ee	remove unneeded include in sm btl. This commit was SVN r24165.	2010-12-13 17:30:47 +00:00
Brian Barrett	6cf74eeb03	Fix bug in looking at convertor_unpack return code. Always print debug on error message for now. This commit was SVN r24163.	2010-12-10 22:36:47 +00:00
Brian Barrett	a26fadb26e	Bring Portals4 updates back into the trunk This commit was SVN r24154.	2010-12-07 20:11:25 +00:00
Shiqing Fan	f43862420c	Convert the bad dos line endings to unix style for all windows related files. This commit was SVN r24137.	2010-12-02 12:08:08 +00:00
Nathan Hjelm	94d4aa7253	fixed wrong include This commit was SVN r24133.	2010-12-01 23:10:12 +00:00
Rolf vandeVaart	ad4c411ab0	Delete old utility script. This commit was SVN r24101.	2010-11-30 15:18:07 +00:00
Rolf vandeVaart	fdf59375d6	Add a few #ifdefs for clarity. This commit was SVN r24098.	2010-11-29 19:34:04 +00:00
Samuel Gutierrez	5dbb688feb	Oops! Remove unneeded memory barrier. This commit was SVN r24096.	2010-11-29 16:10:44 +00:00
Samuel Gutierrez	74cff0e2a5	Update memory barriers in shared memory (sm) BTL. Fixes periodic segmentation faults when using Intel 10.0.025 or PGI 9.0-3. See #2619 for more details. Thanks Brian and George! This commit was SVN r24095.	2010-11-29 15:57:54 +00:00
Mike Dubman	956e030f28	support for dynamic rules to control offload This commit was SVN r24094.	2010-11-29 04:11:57 +00:00
Eugene Loh	fcf1d993b8	Clarify in ompi/mca/btl/base that eager limits include header. This commit was SVN r24085.	2010-11-24 03:21:33 +00:00
Rolf vandeVaart	29a7398df9	Move #define to Makefile for clarity. Add #ifdef to priority section. This commit was SVN r24084.	2010-11-23 22:19:32 +00:00
Rolf vandeVaart	f65309364b	Add closing comment to #ifdefs. No code changes. This commit was SVN r24083.	2010-11-23 21:42:20 +00:00
Rolf vandeVaart	b2d457f049	Make format match rest of files in directory. This commit was SVN r24078.	2010-11-22 15:10:02 +00:00
Rolf vandeVaart	9c57108e20	Change #ifdef to #if to match OMPI coding conventions. This commit was SVN r24067.	2010-11-17 20:34:15 +00:00
Rolf vandeVaart	90bbb33919	Move variable declaration at beginning of code block to avoid warnings. Also, add memchecker code to csum to keep PMLs consistent. This commit was SVN r24066.	2010-11-17 18:01:56 +00:00
Shiqing Fan	c11bdec1c8	revert r24059, need a better solution for windows build. This commit was SVN r24063. The following SVN revision numbers were found above: r24059 --> open-mpi/ompi@74927c7ab0	2010-11-17 16:09:07 +00:00
Shiqing Fan	74927c7ab0	Remove unnecessary semi-colons, they break windows build. This commit was SVN r24059.	2010-11-17 00:39:38 +00:00
Shiqing Fan	ba2dbff82d	Check for addressability in MPI_*_init, since buffer passed by the application should have been already allocated, but might be not initialized. Check in MPI_Start / MPI_Startall for defined-ness of the buffer passed into the send request(s). This commit was SVN r24054.	2010-11-16 01:01:12 +00:00
Jeff Squyres	e4744b4ed5	Per http://www.open-mpi.org/community/lists/devel/2010/11/8671.php , change a bunch of OMPI_<foo> names to OPAL_<foo>. This commit was SVN r24046.	2010-11-12 23:22:11 +00:00
Greg Koenig	0694a3203b	This was a small mistake introduced in r23925 in the changes to libevent. This commit was SVN r24043. The following SVN revision numbers were found above: r23925 --> open-mpi/ompi@fceabb2498	2010-11-11 21:54:28 +00:00
Rolf vandeVaart	72d06215d5	Add some missing semi-colons. This commit was SVN r24041.	2010-11-11 19:25:25 +00:00
Mike Dubman	5a7d76bb9c	resolve many warnings, comply to c99 This commit was SVN r24040.	2010-11-11 12:14:31 +00:00
Rolf vandeVaart	c74df90729	Fix compile warnings for unused functions. This commit was SVN r24033.	2010-11-10 21:04:32 +00:00

... 6 7 8 9 10 ...

4097 Коммитов