openmpi

Автор	SHA1	Сообщение	Дата
Nathan Hjelm	584c457352	ugni: update smsg defaults and add parameter to control local completion queue size This commit was SVN r26399.	2012-05-07 17:22:49 +00:00
Nathan Hjelm	bfcf67391a	ugni: set fragment id from opal_pointer_array_add This commit was SVN r26398.	2012-05-07 17:22:42 +00:00
Nathan Hjelm	b3dc726e9d	ugni: don't create completion queues until add_procs This commit was SVN r26397.	2012-05-07 17:22:35 +00:00
Nathan Hjelm	0e48ea1f65	vader: remove #include of headers that no longer exist This commit was SVN r26396.	2012-05-07 17:22:28 +00:00
Nathan Hjelm	a32d4c648d	ob1: rewind convertor after failed send This commit was SVN r26395.	2012-05-07 17:22:22 +00:00
Jeff Squyres	2ba10c37fe	Per RFC, bring in the following changes: * Remove paffinity, maffinity, and carto frameworks -- they've been wholly replaced by hwloc. * Move ompi_mpi_init() affinity-setting/checking code down to ORTE. * Update sm, smcuda, wv, and openib components to no longer use carto. Instead, use hwloc data. There are still optimizations possible in the sm/smcuda BTLs (i.e., making multiple mpools). Also, the old carto-based code found out how many NUMA nodes were ''available'' -- not how many were used ''in this job''. The new hwloc-using code computes the same value -- it was not updated to calculate how many NUMA nodes are used ''by this job.'' * Note that I cannot compile the smcuda and wv BTLs -- I ''think'' they're right, but they need to be verified by their owners. * The openib component now does a bunch of stuff to figure out where "near" OpenFabrics devices are. '''THIS IS A CHANGE IN DEFAULT BEHAVIOR!!''' and still needs to be verified by OpenFabrics vendors (I do not have a NUMA machine with an OpenFabrics device that is a non-uniform distance from multiple different NUMA nodes). * Completely rewrite the OMPI_Affinity_str() routine from the "affinity" mpiext extension. This extension now understands hyperthreads; the output format of it has changed a bit to reflect this new information. * Bunches of minor changes around the code base to update names/types from maffinity/paffinity-based names to hwloc-based names. * Add some helper functions into the hwloc base, mainly having to do with the fact that we have the hwloc data reporting ''all'' topology information, but sometimes you really only want the (online \| available) data. This commit was SVN r26391.	2012-05-07 14:52:54 +00:00
Mike Dubman	1b475523de	add support for FDR speed This commit was SVN r26385.	2012-05-06 05:53:05 +00:00
Nathan Hjelm	b6ae288a59	fix segfault when pml direct enabled This commit was SVN r26371.	2012-05-01 23:12:41 +00:00
Brian Barrett	0ae2277796	Add a backoff mechanism for re-establishing communication This commit was SVN r26366.	2012-05-01 15:53:00 +00:00
Brian Barrett	74ade8b181	need to order the pending list before we restart This commit was SVN r26365.	2012-04-30 23:06:00 +00:00
Brian Barrett	5dec52af8d	remove some now unneeded debugging This commit was SVN r26364.	2012-04-30 22:50:52 +00:00
Brian Barrett	c654ee6afc	* Use triggered operations for restart barrier as well This commit was SVN r26363.	2012-04-30 22:48:10 +00:00
Brian Barrett	91a9973bde	* Make flow control on by default * Move alarm code back into a triggered operation This commit was SVN r26362.	2012-04-30 22:25:40 +00:00
Brian Barrett	e6a0a1cf8a	* Make sure to release all resources on failed send * Avoid triggered ops until we get everything debugged * Simplify flowctl interface a bit This commit was SVN r26356.	2012-04-27 21:11:01 +00:00
Nathan Hjelm	c36ab84116	ugni: missed a couple of lines in the last commit This commit was SVN r26340.	2012-04-25 14:24:48 +00:00
Nathan Hjelm	a753fe91f7	fix merge This commit was SVN r26332.	2012-04-24 21:16:51 +00:00
Nathan Hjelm	0eb18b9699	ob1: update copyrights This commit was SVN r26331.	2012-04-24 20:19:15 +00:00
Nathan Hjelm	0a0e487d9c	ob1: add emacs mode/indentation defaults This commit was SVN r26330.	2012-04-24 20:19:06 +00:00
Nathan Hjelm	9a35f96bda	ob1: add support for get fallback on put/send This commit was SVN r26329.	2012-04-24 20:18:56 +00:00
Nathan Hjelm	93780c63be	replace tabs w/ spaces This commit was SVN r26328.	2012-04-24 20:18:45 +00:00
Nathan Hjelm	0f60858a01	ugni: improve handling of smsg completions This commit was SVN r26327.	2012-04-24 20:18:35 +00:00
Nathan Hjelm	e3b9040e69	vader: remove maffinity code This commit was SVN r26321.	2012-04-24 15:38:03 +00:00
Nathan Hjelm	363bd184e7	ugni: re-disable uGNI for local procs This commit was SVN r26318.	2012-04-23 21:12:12 +00:00
Nathan Hjelm	ca3ceb840c	ugni: add mca parameter to control the number of smsg retries This commit was SVN r26317.	2012-04-23 21:12:05 +00:00
Nathan Hjelm	95b12f140a	ugni: cleanup frag setup code This commit was SVN r26316.	2012-04-23 21:11:57 +00:00
Nathan Hjelm	37ca31b295	ugni: remove unused completion queue This commit was SVN r26315.	2012-04-23 21:11:39 +00:00
Nathan Hjelm	1340f9c65a	ugni update: - Move endpoint code back up to BTL - Use opal_pointer_array_t for bounce buffer to identify local smsg completions. - Update and reenable sendi - Create a new endpoint for FMA/BTE transactions (keep local smsg/fma transactions seperate) - Move reverse get code into btl_ugni_put.c - Move eager get code into btl_ugni_get.c - Handle remote SMSG overruns correctly - Added support for inplace sends - etc This commit was SVN r26307.	2012-04-19 21:51:55 +00:00
Nathan Hjelm	2b9827f45c	ugni: restrict number of memory registrations per process This commit was SVN r26306.	2012-04-19 21:51:44 +00:00
Jeff Squyres	253444c6d0	== Highlights == 1. New mpifort wrapper compiler: you can utilize mpif.h, use mpi, and use mpi_f08 through this one wrapper compiler 1. mpif77 and mpif90 still exist, but are sym links to mpifort and may be removed in a future release 1. The mpi module has been re-implemented and is significantly "mo' bettah" 1. The mpi_f08 module offers many, many improvements over mpif.h and the mpi module This stuff is coming from a VERY long-lived mercurial branch (3 years!); it'll almost certainly take a few SVN commits and a bunch of testing before I get it correctly committed to the SVN trunk. == More details == Craig Rasmussen and I have been working with the MPI-3 Fortran WG and Fortran J3 committees for a long, long time to make a prototype MPI-3 Fortran bindings implementation. We think we're at a stable enough state to bring this stuff back to the trunk, with the goal of including it in OMPI v1.7. Special thanks go out to everyone who has been incredibly patient and helpful to us in this journey: * Rolf Rabenseifner/HLRS (mastermind/genius behind the entire MPI-3 Fortran effort) * The Fortran J3 committee * Tobias Burnus/gfortran * Tony !Goetz/Absoft * Terry !Donte/Oracle * ...and probably others whom I'm forgetting :-( There's still opportunities for optimization in the mpi_f08 implementation, but by and large, it is as far along as it can be until Fortran compilers start implementing the new F08 dimension(..) syntax. Note that gfortran is currently unsupported for the mpi_f08 module and the new mpi module. gfortran users will a) fall back to the same mpi module implementation that is in OMPI v1.5.x, and b) not get the new mpi_f08 module. The gfortran maintainers are actively working hard to add the necessary features to support both the new mpi_f08 module and the new mpi module implementations. This will take some time. As mentioned above, ompi/mpi/f77 and ompi/mpi/f90 no longer exist. All the fortran bindings implementations have been collated under ompi/mpi/fortran; each implementation has its own subdirectory: {{{ ompi/mpi/fortran/ base/ - glue code mpif-h/ - what used to be ompi/mpi/f77 use-mpi-tkr/ - what used to be ompi/mpi/f90 use-mpi-ignore-tkr/ - new mpi module implementation use-mpi-f08/ - new mpi_f08 module implementation }}} There's also a prototype 6-function-MPI implementation under use-mpi-f08-desc that emulates the new F08 dimension(..) syntax that isn't fully available in Fortran compilers yet. We did that to prove it to ourselves that it could be done once the compilers fully support it. This directory/implementation will likely eventually replace the use-mpi-f08 version. Other things that were done: * ompi_info grew a few new output fields to describe what level of Fortran support is included * Existing Fortran examples in examples/ were renamed; new mpi_f08 examples were added * The old Fortran MPI libraries were renamed: * libmpi_f77 -> libmpi_mpifh * libmpi_f90 -> libmpi_usempi * The configury for Fortran was consolidated and significantly slimmed down. Note that the F77 env variable is now IGNORED for configure; you should only use FC. Example: {{{ shell$ ./configure CC=icc CXX=icpc FC=ifort ... }}} All of this work was done in a Mercurial branch off the SVN trunk, and hosted at Bitbucket. This branch has got to be one of OMPI's longest-running branches. Its first commit was Tue Apr 07 23:01:46 2009 -0400 -- it's over 3 years old! :-) We think we've pulled in all relevant changes from the OMPI trunk (e.g., Fortran implementations of the new MPI-3 MPROBE stuff for mpif.h, use mpi, and use mpi_f08, and the recent Fujitsu Fortran patches). I anticipate some instability when we bring this stuff into the trunk, simply because it touches a LOT of code in the MPI layer in the OMPI code base. We'll try our best to make it as pain-free as possible, but please bear with us when it is committed. This commit was SVN r26283.	2012-04-18 15:57:29 +00:00
Brian Barrett	8a70747da2	Fix some naming that doesn't make a ton of sense This commit was SVN r26277.	2012-04-18 01:05:18 +00:00
Brian Barrett	f4d4e87176	add some flow control debugging output This commit was SVN r26276.	2012-04-17 23:14:05 +00:00
Brian Barrett	fe0dfc8e26	First take at flow control protocol This commit was SVN r26274.	2012-04-17 21:46:21 +00:00
Brian Barrett	dde6f094eb	In preperation for flow control changes coming, always utilize ACKs for message completion. This commit was SVN r26272.	2012-04-16 17:25:27 +00:00
Terry Dontje	81d7fcaf82	back out r26255 to avoid cross component linkage so Solaris can build a usable openib btl This commit was SVN r26269. The following SVN revision numbers were found above: r26255 --> open-mpi/ompi@fe25b8704b	2012-04-13 18:08:54 +00:00
Nathan Hjelm	f88babfb92	ugni: minor updates This commit was SVN r26262.	2012-04-10 19:56:19 +00:00
Mike Dubman	34acf769d4	mtl_mxm: support canceling messages This commit was SVN r26256.	2012-04-09 16:02:05 +00:00
Mike Dubman	fe25b8704b	performance fix: set alignment for openib internal buffers Thanks to Jeff/Pasha for valuable comments Thanks to Valentin Petrov for implementation This commit was SVN r26255.	2012-04-09 08:06:15 +00:00
George Bosilca	f09e3ce5a4	Spring cleanup. Nothing important. This commit was SVN r26247.	2012-04-06 15:48:07 +00:00
George Bosilca	654c75ff24	As suggested on the mailing list a while back, switch the default alltoallv algorithm to pairwise exchange instead of the default one. This might improve the scheduling and relax the pressure on the network. This commit was SVN r26246.	2012-04-06 15:47:29 +00:00
Ralph Castain	bd8b4f7f1e	Sorry for mid-day commit, but I had promised on the call to do this upon my return. Roll in the ORTE state machine. Remove last traces of opal_sos. Remove UTK epoch code. Please see the various emails about the state machine change for details. I'll send something out later with more info on the new arch. This commit was SVN r26242.	2012-04-06 14:23:13 +00:00
Brian Barrett	d46d55ee9b	If we're locking the local window, need to wait until the lock returns. This commit was SVN r26234.	2012-04-04 16:27:24 +00:00
Josh Hursey	d1571b027a	Fix a few error return paths This commit was SVN r26233.	2012-04-04 15:11:03 +00:00
Nathan Hjelm	b0c3c18e02	Initial upload of grdma mpool This commit was SVN r26232.	2012-04-03 23:03:03 +00:00
Mike Dubman	ff1c84c53f	revert previous commit This commit was SVN r26206.	2012-03-29 14:07:13 +00:00
Mike Dubman	43a5775e8a	performance fix: set alignment for openib internal buffers This commit was SVN r26205.	2012-03-29 14:00:08 +00:00
Nathan Hjelm	d62c0f1872	ugni: handle smsg failure in mca_btl_ugni_ep_connect_finish This commit was SVN r26202.	2012-03-28 05:40:16 +00:00
Brian Barrett	451af0e832	Ensure async progress for long unexpected messages by waiting for an event on the ME. The events we're likely to see are LINK (the ME was added to the match list), PUT (weird to see first, but means that the ME was linked to the match list and then matched), or PUT_OVERFLOW, meaning the message was unexpected. This commit was SVN r26199.	2012-03-26 22:54:35 +00:00
Brian Barrett	2a26d0f9a2	Forgot to add new file in the last commit. Mark ME as invalid once we see a completion event, and look for events before trying to unlink. This commit was SVN r26198.	2012-03-26 22:39:05 +00:00
Brian Barrett	0e91084385	* Add type field to the request structure to deal with random user requests (ie, cancel) * Implement cancel for receives. Sends are slightly more complicated... This commit was SVN r26197.	2012-03-26 22:32:36 +00:00
Brian Barrett	61a090e0d1	Checking for NULL function pointers and direct-call semantics can't work together, so implement all functions in the MTL interface for all MTLs. The only places NULL was still being set was for add_comm/del_comm, and matched probe, both of which are straight forward to implement (or return ERROR_NOT_IMPLEMENTED, since the PML can't emulate matched probe). This commit was SVN r26194.	2012-03-26 19:27:03 +00:00

1 2 3 4 5 ...

3807 Коммитов