openmpi

Автор	SHA1	Сообщение	Дата
Vishwanath Venkatesan	1e95d8b1e2	remove the MPI functions used in these files by the OMPI internal corresponding functionality and also add error checking in these for functions which did not have them' This commit was SVN r25723.	2012-01-13 17:21:51 +00:00
Rolf vandeVaart	16d676aa5b	Fix minor issue with CUDA. Cannot register overlappiing regions. This commit was SVN r25716.	2012-01-12 13:00:42 +00:00
Samuel Gutierrez	63869c431b	init seg_num_procs_inited to zero before the atomic add. This commit was SVN r25710.	2012-01-11 03:37:23 +00:00
Nathan Hjelm	96c1df8d90	clean up vader registration code This commit was SVN r25704.	2012-01-10 22:33:22 +00:00
Edgar Gabriel	fb4d1a7099	remove the MPI functions used in this file by the OMPI internal corresponding functionality. This commit was SVN r25703.	2012-01-10 19:55:05 +00:00
Nathan Hjelm	f65f6f5c39	bugfix: ugni: increase smsg mailbox size to a multiple of 4096 This commit was SVN r25702.	2012-01-10 19:50:25 +00:00
Mike Dubman	37dc53bbc9	mxm: return the MXM_REQ_SEND_SYNC flag to mxm_req_send This commit was SVN r25694.	2012-01-06 18:56:28 +00:00
Mike Dubman	3b97d609a8	mtl_mxm: fix double free This commit was SVN r25693.	2012-01-06 16:22:58 +00:00
Samuel Gutierrez	d1a44ecd34	send packed buffers instead of using iovecs in common sm rml. this commit will hopefully resolve the periodic bus errors that some mtt tests have been encountering. This commit was SVN r25692.	2012-01-05 00:11:59 +00:00
Rolf vandeVaart	9441f33981	Improve an error message. Replace tabs with spaces. This commit was SVN r25688.	2012-01-03 15:19:01 +00:00
Rolf vandeVaart	8073f5002a	Some additional CUDA specific code. Adding a few more support functions that will be used in future development. This commit was SVN r25684.	2011-12-29 12:31:54 +00:00
Edgar Gabriel	e0139a2d7e	provide descriptions about the functionality of these frameworks. This commit was SVN r25682.	2011-12-22 19:42:00 +00:00
Vishwanath Venkatesan	0f928be1d5	Modifying selection logic back to select two-phase at the cases it should. This commit was SVN r25681.	2011-12-22 01:01:32 +00:00
Vishwanath Venkatesan	37c8470e3d	modified implementation for two-phase write_all incorporating romio style domain partitioning This commit was SVN r25680.	2011-12-22 00:16:29 +00:00
Vishwanath Venkatesan	738a67b704	Removing duplicate code while setting default file view and using internal file-set-view for setting the default file view This commit was SVN r25679.	2011-12-21 21:50:47 +00:00
Rolf vandeVaart	6ca186fb64	Delay some initialization until needed. This eliminates some warnings and removes need for CUDA init before MPI_Init. This commit was SVN r25678.	2011-12-21 15:21:57 +00:00
Samuel Gutierrez	519f71ab7e	silences valgrind warning in common sm (Syscall param writev(vector[...]) points to uninitialised byte(s)). probably also silences a large stack allocation warning in coverity. This commit was SVN r25666.	2011-12-16 23:17:48 +00:00
Samuel Gutierrez	0ca6603fa0	remove some unused cruft in shmem. minor common sm cleanup. This commit was SVN r25665.	2011-12-16 22:43:55 +00:00
Nathan Hjelm	71527c8058	minor ugni btl code cleanup This commit was SVN r25618.	2011-12-10 08:20:46 +00:00
Nathan Hjelm	c8a4687402	don't set SIGSEGV to default This commit was SVN r25610.	2011-12-09 21:54:05 +00:00
Nathan Hjelm	e03d23d96e	Intial support for Cray's uGNI interface (XE-6/XK-6) This commit was SVN r25608.	2011-12-09 21:24:07 +00:00
Nathan Hjelm	87b7e85d53	rfc timeout. retry registration after removing old registration from lru This commit was SVN r25587.	2011-12-07 18:20:44 +00:00
Josh Hursey	e56b4de2c9	Fixes trac:2550 : Cleanup comment in crcp_bkmrk_pml.h This commit was SVN r25585. The following Trac tickets were found above: Ticket 2550 --> https://svn.open-mpi.org/trac/ompi/ticket/2550	2011-12-07 14:50:04 +00:00
Jeff Squyres	c10f41c87e	Do not build these frameworks when --disable-mpi-io is specified. Fixes some Cisco MTT MPI install errors. This commit was SVN r25566.	2011-12-02 22:11:23 +00:00
Ralph Castain	07655e2945	Handle the case where the allocator "fibs" to us about the node names. In some cases (ahem...you know who you are!), the allocator will tell us a node number (e.g., "16"). However, the daemon will return a node name (e.g., "nid0016") - leaving us not recognizing its location. So provide a new parameter (can't have too many!) that handles this situation by stripping the prefix from the returned node name. Also do a little cleanup to ensure we cleanly exit from errors, without generating too many annoying messages. This commit was SVN r25562.	2011-12-02 14:10:08 +00:00
Ralph Castain	357ac14530	Can't return a numerical value here This commit was SVN r25559.	2011-12-02 10:36:57 +00:00
Nathan Hjelm	bb1fec0407	added put/get btl descriptor flags This commit was SVN r25553.	2011-11-30 21:37:23 +00:00
Ralph Castain	c56acf60ca	Although we never really thought about it, we made an unconscious assumption in the mapper system - we assumed that the daemons would be placed on nodes in the order that the nodes appear in the allocation. In other words, we assumed that the launch environment would map processes in node order. Turns out, this isn't necessarily true. The Cray, for example, launches processes in a toroidal pattern, thus causing the daemons to wind up somewhere other than what we thought. Other environments (e.g., slurm) are also capable of such behavior, depending upon the default mapping algorithm they are told to use. Resolve this problem by making the daemon-to-node assignment in the affected environments when the daemon calls back and tells us what node it is on. Order the nodes in the mapping list so they are in daemon-vpid order as opposed to the order in which they show in the allocation. For environments that don't exhibit this mapping behavior (e.g., rsh), this won't have any impact. Also, clean up the vm launch procedure a little bit so it more closely aligns with the state machine implementation that is coming, and remove some lingering "slave" code. This commit was SVN r25551.	2011-11-30 19:58:24 +00:00
Jeff Squyres	6fbbfd0f7a	Gah! r25545 acidentally included ''waaaay'' more stuff than it was supposed to. I.e., half-baked/not complete stuff. This commit backs out all of r25545. Sorry folks! This commit was SVN r25546. The following SVN revision numbers were found above: r25545 --> open-mpi/ompi@7f9ae11faf	2011-11-29 23:24:52 +00:00
Jeff Squyres	7f9ae11faf	Per http://www.open-mpi.org/community/lists/users/2011/11/17862.php , to make MPI_IN_PLACE (and other sentinel Fortran constants) work on OS X, we need to use the following compiler (linker) flag: -Wl,-commons,use_dylibs So if we're compiling on OS X, test to see if that flag works with the compiler. If so, add it to the wrapper FFLAGS and FCFLAGS (note that per a future update, we'll only have one Fortran compiler anyway). Fixes trac:1982. This commit was SVN r25545. The following Trac tickets were found above: Ticket 1982 --> https://svn.open-mpi.org/trac/ompi/ticket/1982	2011-11-29 23:05:54 +00:00
Terry Dontje	5209de048c	add code to service_thread_start to handle EBADF returns from select. This commit fixes trac:2922. This commit was SVN r25520. The following Trac tickets were found above: Ticket 2922 --> https://svn.open-mpi.org/trac/ompi/ticket/2922	2011-11-29 16:49:59 +00:00
Samuel Gutierrez	375162c693	this commit fixes a few things. 1. silence warning in common sm. 2. remove unneeded config code in common sm. 3. move opal_shmem_base_close to a better place in opal_finalize. 4. fix opal_path_nfs output. This commit was SVN r25518.	2011-11-28 23:41:19 +00:00
George Bosilca	0bd2bf9aae	The number of segments accepted should be bounded by MCA_BTL_DES_MAX_SEGMENTS and not by 2. This commit was SVN r25515.	2011-11-28 17:19:12 +00:00
Nathan Hjelm	f8c8c641f1	added asserts to warn developers that ob1/csum match fragments do not support more than 2 segments This commit was SVN r25514.	2011-11-28 16:12:25 +00:00
Samuel Gutierrez	b4edf0ff5c	getting ready for 1.5 port of the shared memory enhancements. remove some unused/unneeded stuff and minor style update. This commit was SVN r25513.	2011-11-28 16:08:32 +00:00
Ralph Castain	9b59d8de6f	This is actually a much smaller commit than it appears at first glance - it just touches a lot of files. The --without-rte-support configuration option has never really been implemented completely. The option caused various objects not to be defined and conditionally compiled some base functions, but did nothing to prevent build of the component libraries. Unfortunately, since many of those components use objects covered by the option, it caused builds to break if those components were allowed to build. Brian dealt with this in the past by creating platform files and using "no-build" to block the components. This was clunky, but acceptable when only one organization was using that option. However, that number has now expanded to at least two more locations. Accordingly, make --without-rte-support actually work by adding appropriate configury to prevent components from building when they shouldn't. While doing so, remove two frameworks (db and rmcast) that are no longer used as ORCM comes to a close (besides, they belonged in ORCM now anyway). Do some minor cleanups along the way. This commit was SVN r25497.	2011-11-22 21:24:35 +00:00
Ralph Castain	6310361532	At long last, the fabled revision to the affinity system has arrived. A more detailed explanation of how this all works will be presented here: https://svn.open-mpi.org/trac/ompi/wiki/ProcessPlacement The wiki page is incomplete at the moment, but I hope to complete it over the next few days. I will provide updates on the devel list. As the wiki page states, the default and most commonly used options remain unchanged (except as noted below). New, esoteric and complex options have been added, but unless you are a true masochist, you are unlikely to use many of them beyond perhaps an initial curiosity-motivated experimentation. In a nutshell, this commit revamps the map/rank/bind procedure to take into account topology info on the compute nodes. I have, for the most part, preserved the default behaviors, with three notable exceptions: 1. I have at long last bowed my head in submission to the system admin's of managed clusters. For years, they have complained about our default of allowing users to oversubscribe nodes - i.e., to run more processes on a node than allocated slots. Accordingly, I have modified the default behavior: if you are running off of hostfile/dash-host allocated nodes, then the default is to allow oversubscription. If you are running off of RM-allocated nodes, then the default is to NOT allow oversubscription. Flags to override these behaviors are provided, so this only affects the default behavior. 2. both cpus/rank and stride have been removed. The latter was demanded by those who didn't understand the purpose behind it - and I agreed as the users who requested it are no longer using it. The former was removed temporarily pending implementation. 3. vm launch is now the sole method for starting OMPI. It was just too darned hard to maintain multiple launch procedures - maybe someday, provided someone can demonstrate a reason to do so. As Jeff stated, it is impossible to fully test a change of this size. I have tested it on Linux and Mac, covering all the default and simple options, singletons, and comm_spawn. That said, I'm sure others will find problems, so I'll be watching MTT results until this stabilizes. This commit was SVN r25476.	2011-11-15 03:40:11 +00:00
Jeff Squyres	e8dcad6017	This typo has been here since August 2005. :-) This commit was SVN r25468.	2011-11-11 03:01:52 +00:00
Brian Barrett	45a27e4f9f	For now, ignore LINK event This commit was SVN r25467.	2011-11-11 02:49:03 +00:00
Brad Benton	96395c916e	de-tab'd This commit was SVN r25465.	2011-11-09 19:45:12 +00:00
Brad Benton	0712b911a5	Updated IBM copyright This commit was SVN r25464.	2011-11-09 19:38:53 +00:00
Mike Dubman	00c27afd52	fix pid This commit was SVN r25463.	2011-11-09 17:53:59 +00:00
Nathan Hjelm	d603f31976	removed ptr member from seg_key union This commit was SVN r25460.	2011-11-08 15:44:54 +00:00
Mike Dubman	71398b658e	fix: OMPI_ERR_CONNECTION_FAILED available in v1.5, unavailable in trunk This commit was SVN r25459.	2011-11-08 12:34:01 +00:00
Mike Dubman	4cf9e1323d	fix: return correct error on connection failure This commit was SVN r25452.	2011-11-07 06:13:17 +00:00
Nathan Hjelm	8962ce25b0	fixed some compiler errors caused by seg_key changes. osc/rdma may need to be updated to use btls that use 128 bit segment keys This commit was SVN r25448.	2011-11-06 20:19:14 +00:00
Samuel Gutierrez	e03bc93fb7	only use pmi grpcomm and pubsub during the direct launch case. use PMI environment variable to setup vpid in ess alps on cray xe systems. add pmi test code. This commit was SVN r25447.	2011-11-06 17:28:40 +00:00
Nathan Hjelm	520a7c570e	changes to seg_key needed for a new btl This commit was SVN r25445.	2011-11-06 16:19:09 +00:00
Rolf vandeVaart	f777fe8eba	Change tab to spaces. This commit was SVN r25433.	2011-11-04 17:18:30 +00:00
Christopher Yeoh	fb57a74a40	Removes pointless memmove which because of a previous memcpy will always have identical source and destination pointers. See #2871 Plugs a couple of minor memory leaks related to remote qp info This commit was SVN r25431.	2011-11-04 00:15:08 +00:00
Mike Dubman	7595a80a63	fix self pid This commit was SVN r25424.	2011-11-03 06:46:20 +00:00
Nathan Hjelm	211e2dbdf3	clean up tab characters This commit was SVN r25413.	2011-11-02 15:07:57 +00:00
Ralph Castain	14966e0f8f	Cleanup PMI startup - if a component isn't selected, it should finalize PMI IFF it started it. Otherwise, components that aren't selected can finalize PMI when it is in use by other parts of the system. This commit was SVN r25407.	2011-11-01 16:25:12 +00:00
Mike Dubman	3edd77ea25	update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag This commit was SVN r25403.	2011-10-31 22:36:15 +00:00
Mike Dubman	6b50ba22a6	select mxm ptl based on user preferences This commit was SVN r25401.	2011-10-31 10:17:43 +00:00
Samuel Gutierrez	0ba13e2f8e	fix typo. use PMI_Initialized for init status instead of PMI_Init. This commit was SVN r25378.	2011-10-27 22:41:50 +00:00
Nathan Hjelm	ee087de073	added fast boxes to vader This commit was SVN r25376.	2011-10-27 20:22:46 +00:00
Mike Dubman	f96ae43e23	pass jobid to mxm/sm module This commit was SVN r25375.	2011-10-27 13:14:52 +00:00
Nathan Hjelm	82efe131dc	made btl_vader_max_inline_send a configurable parameter and updated and enabled sendi This commit was SVN r25374.	2011-10-26 22:15:42 +00:00
Nathan Hjelm	033179d6ac	fixed bug in frag initialization This commit was SVN r25373.	2011-10-26 19:29:37 +00:00
George Bosilca	6fdb040eef	ORTE_ERROR to OPAL_ERROR. This commit was SVN r25372.	2011-10-26 15:59:43 +00:00
George Bosilca	9d8e84142f	Survivor!!! This commit was SVN r25371.	2011-10-26 00:58:55 +00:00
Nathan Hjelm	05114ffb51	fixed off by one error This commit was SVN r25369.	2011-10-25 22:07:47 +00:00
George Bosilca	72f731f25f	The SM2 collective component has not been updated in a long time. Rich, the original developer, agrees with this removal. This commit was SVN r25368.	2011-10-25 22:07:09 +00:00
Nathan Hjelm	e887d595c7	fix potential bug with non-contiguous sends This commit was SVN r25367.	2011-10-25 19:21:45 +00:00
Nathan Hjelm	433cfa3665	use single copy for some sends This commit was SVN r25365.	2011-10-25 18:38:42 +00:00
Mike Dubman	9ffeeb69d9	fix help message This commit was SVN r25364.	2011-10-25 14:02:43 +00:00
Samuel Gutierrez	663f4546f5	fix define typo in psm mtl. This commit was SVN r25362.	2011-10-24 18:38:12 +00:00
Ralph Castain	955d8e7d46	Allow apps to use pmi when launched by mpirun, if desired, without affecting daemons This commit was SVN r25359.	2011-10-23 15:57:13 +00:00
Nathan Hjelm	fb19f56965	Cray doesn't define PMI2_SUCCESS This commit was SVN r25354.	2011-10-21 16:34:22 +00:00
Nathan Hjelm	cd68dbe2b8	only try to build vader if xpmem is installed. unignore vader This commit was SVN r25352.	2011-10-21 15:45:05 +00:00
Ralph Castain	3e72fccacf	Cray's PMI implementation is quite different from slurm's - they extended PMI-1 by adding some, but not all, of the PMI-2 APIs. So you can't just switch to using PMI-2 functions as it isn't a complete implementation. Instead, you have to selectively figure out which ones they have in PMI-2, and use any missing ones from PMI-1. What fun. Modify the configure logic and the PMI components to accommodate Cray's approach. Refactor the PMI error reporting code so it resides in only one place. Cray actually decided -not- to define the PMI-2 error codes, so we have to use the PMI-1 codes instead. More fun. This commit was SVN r25348.	2011-10-21 04:54:38 +00:00
Ralph Castain	e2adc8fa3a	Ignore until Nathan can fix - probably configure problem This commit was SVN r25347.	2011-10-21 03:43:01 +00:00
Ralph Castain	5947f61b86	Remove windows reference for now This commit was SVN r25346.	2011-10-21 01:19:03 +00:00
Nathan Hjelm	414677a082	default to no xpmem support This commit was SVN r25345.	2011-10-20 22:13:45 +00:00
Nathan Hjelm	808a73a5c5	removed erroneous add of .deps This commit was SVN r25343.	2011-10-20 21:41:51 +00:00
Nathan Hjelm	3dbaaf6879	initial commit of vader (xpmem) btl This commit was SVN r25342.	2011-10-20 21:39:44 +00:00
Nathan Hjelm	586403f052	more pmi return code wtf This commit was SVN r25337.	2011-10-20 17:53:04 +00:00
Nathan Hjelm	e1e8837992	add a uintptr_t to the seg_key union This commit was SVN r25334.	2011-10-19 21:48:52 +00:00
George Bosilca	1bc5da0911	These are supposed to be OPAL level errors. This commit was SVN r25326.	2011-10-19 14:22:09 +00:00
Ralph Castain	72a4b0bd8a	Fix constants This commit was SVN r25325.	2011-10-19 14:14:58 +00:00
George Bosilca	efd88e10d7	Cleanup the error codes. Get rid of all the useless ones, and mark the distinction between ORTE and OMPI errors. This commit was SVN r25323.	2011-10-19 03:51:53 +00:00
Ralph Castain	0bf4f48aa3	Don't need priority in this framework This commit was SVN r25308.	2011-10-17 22:39:15 +00:00
Ralph Castain	8f0ef54130	Complete implementation of pmi support. Ensure we support both mpirun and direct launch within same configuration to avoid requiring separate builds. Add support for generic pmi, not just under slurm. Add publish/subscribe support, although slurm's pmi implementation will just return an error as it hasn't been done yet. This commit was SVN r25303.	2011-10-17 20:51:22 +00:00
Ralph Castain	e7f6be5385	Unused variable This commit was SVN r25301.	2011-10-17 18:59:22 +00:00
Ralph Castain	2eaadcfab9	Remove unused variable This commit was SVN r25284.	2011-10-14 15:32:18 +00:00
Vishwanath Venkatesan	8dd07bdceb	Removing .ompi_ignore and .ompi_unignore from fs/pvfs2 and fbtl/pvfs2 This commit was SVN r25283.	2011-10-14 00:40:11 +00:00
Vishwanath Venkatesan	8f6b29e95b	Fixing the default file view issue and merging contiguous lengths and offsets for explicit offset case. This commit was SVN r25281.	2011-10-13 19:50:45 +00:00
Jeff Squyres	2c6254b70d	Second change from Intel. This commit was SVN r25279.	2011-10-12 23:26:34 +00:00
Jeff Squyres	28118d0611	Updte the parameters for the Intel iWARP devices, per request from Faisal Latif <faisal.latif@intel.com>. This commit was SVN r25278.	2011-10-12 22:58:30 +00:00
Brian Barrett	d8b5b544ad	Update list name to match change in spec This commit was SVN r25273.	2011-10-12 20:09:39 +00:00
Rainer Keller	4e6a6fc146	- Check, whether the compiler supports __builtin_clz (count leading zeroes); if so, use it for bit-operations like opal_cube_dim and opal_hibit. Implement two versions of power-of-two. In case of opal_next_poweroftwo, this reduces the average execution time from 83 cycles to 4 cycles (Intel Nehalem, icc, -O2, inlining, measured rdtsc, with loop over 2^27 values). Numbers for other functions are similar (but of course heavily depend on the usage, e.g. opal_hibit() with a start of 4 does not save much). The bsr instruction on AMD Opteron is also not as fast. - Replace various places where the next power-of-two is computed. Tested on Intel Nehalem Cluster with openib, compilers GNU-4.6.1 and Intel-12.0.4 using mpi_testsuite -t "Collective" with 128 processes. This commit was SVN r25270.	2011-10-11 22:49:01 +00:00
George Bosilca	74c88a9e48	This was never used (sm_ctl_header). This commit was SVN r25267.	2011-10-11 20:37:00 +00:00
George Bosilca	ca6c282f23	Small cleanups in the SM BTL. This commit was SVN r25266.	2011-10-11 20:32:10 +00:00
George Bosilca	3241bea696	Apply a patch provided by Sébastien Boisvert fixing an issue with the probe fairness. This commit was SVN r25265.	2011-10-11 20:28:33 +00:00
George Bosilca	4fd78c4683	Keep track of the last probe on each communicator, so we can probe all peers in a round-robin fashion. A little bit more fair ... This commit was SVN r25264.	2011-10-11 20:24:54 +00:00
George Bosilca	2fefd3a928	Don't forget to move the pointer back by the true_lb. This commit was SVN r25262.	2011-10-11 20:15:49 +00:00
George Bosilca	649af6c925	Enumerated mixed with another type (int) is tolerated but easily fixable. This commit was SVN r25241.	2011-10-09 03:54:52 +00:00
George Bosilca	07f6ce235f	Return an OMPI_ error not an ORTE_. This commit was SVN r25232.	2011-10-04 14:57:24 +00:00
George Bosilca	ce7935c8fa	Obviously these were not needed. This commit was SVN r25231.	2011-10-04 14:56:34 +00:00

1 2 3 4 5 ...

3678 Коммитов