openmpi

Автор	SHA1	Сообщение	Дата
Ralph Castain	f8a72feb25	Silence unitialized var warning This commit was SVN r29036.	2013-08-16 21:39:28 +00:00
Ralph Castain	c5f395d36a	Silence unitialized var warnings This commit was SVN r29035.	2013-08-16 21:37:35 +00:00
Ralph Castain	c74c54e18d	Cleanup uninitialized warnings This commit was SVN r29033.	2013-08-16 21:23:09 +00:00
Ralph Castain	33beab5918	Avoid segfault due to uninitialized variable This commit was SVN r29030.	2013-08-16 21:10:38 +00:00
Ralph Castain	7d2e3028d6	Add unique info_key to documentation This commit was SVN r29029.	2013-08-14 04:24:17 +00:00
Ralph Castain	bebe852057	Add new info key for publish that allows user to designate that the port is to be unique - i.e., to return an error if that service has already been published. Default is to overwrite This commit was SVN r29028.	2013-08-14 04:21:17 +00:00
Ralph Castain	318467c04f	If we only have global scope, then don't fall back to looking at local scope if the lookup target wasn't found else we will hang This commit was SVN r29025.	2013-08-13 04:45:33 +00:00
Nathan Hjelm	6c75699068	coll/ml: fix typo in assert that could cause an abort in debug builds. cmr=v1.7.3:reviewer=manjugv This commit was SVN r29024.	2013-08-12 14:31:44 +00:00
Jeff Squyres	c09ec204ad	Change usNIC BTL to always use small fragments when there is a non-contiguous converter. We can't "convert on the fly" because the # of bytes requested may not divide evenly into the convertor data type. This commit was SVN r29014.	2013-08-11 17:04:13 +00:00
Nathan Hjelm	47320713bb	coll/ml: do not register variables in open and fix a bug in the coll/ml parser cmr=v1.7.3:reviewer=pasha This commit was SVN r29010.	2013-08-09 17:55:30 +00:00
Rolf vandeVaart	cd72024a3c	Refactor some of the initialization code. This commit was SVN r29009.	2013-08-09 14:54:17 +00:00
Edgar Gabriel	f7391eca23	Lazy open does not work for the addproc sharedfp component since it starts by spawning a process using MPI_Comm_spawn. For this, the first operation has to be collective which we can not guarantuee outside of the MPI_File_open operation. This commit was SVN r29008.	2013-08-06 20:48:20 +00:00
Edgar Gabriel	e348f5567f	add unignore for me. This commit was SVN r29007.	2013-08-06 20:47:08 +00:00
Jeff Squyres	ed130dcef0	Add missing Fortran mpi module TKR implementation for MPI_Get_address This commit was SVN r29005.	2013-08-06 15:08:00 +00:00
George Bosilca	837b3363fe	Silence few warnings. This commit was SVN r29004.	2013-08-06 09:38:30 +00:00
George Bosilca	710d3836d5	Use a recv convertor for the pack external case. This commit was SVN r29003.	2013-08-06 09:09:42 +00:00
George Bosilca	4adaaa0b2b	Fix the profiling prototypes and the copyright. This commit was SVN r29000.	2013-08-05 21:07:32 +00:00
George Bosilca	a938f8fcc5	Add all missing prototypes for the _x functions. This commit was SVN r28999.	2013-08-05 20:49:31 +00:00
George Bosilca	47b1128993	It must be an MPI_Count. This commit was SVN r28998.	2013-08-05 20:49:00 +00:00
Brian Barrett	2cc947513b	* Fix some compile errors * Need to subtract 1 off the size so that we stay in the bit length requirements This commit was SVN r28997.	2013-08-05 18:49:48 +00:00
Jeff Squyres	87910daf51	Fix a collection of bugs found by QA and Coverity, and make some minor improvements: * Fix minor memory leaks during component_init * Ensure that an initialization loop does not underflow an unsigned int * Improve mlock limit checking * Fix set of BTL modules created during component_init when failing to get QP resources or otherwise excluding some (but not all) usnic verbs devices * Fix/improve error messages to be consistent with other Cisco documentation * Randomize the initial sliding window sequence number so that we silently drop incoming frames from previous jobs that still have existant processes in the middle of dying (and are still transmitting) * Ensure we don't break out of add_procs too soon and create an asymetrical view of what interfaces are available This commit was SVN r28975.	2013-08-01 16:56:15 +00:00
Nathan Hjelm	8429485a39	mpool/grdma: use the rcache even if not using mpi_leave_pinned or mpi_leave_pinned_pipeline This change should improve performance is the non-pinned case where the same memory region is involved in multiple simultaneous transfers. cmr=v1.7.3:reviewer=brbarret This commit was SVN r28973.	2013-07-31 23:50:41 +00:00
Matthias Jurenz	5c43ae156c	Fixed # 3704 This commit was SVN r28967.	2013-07-31 07:38:24 +00:00
Jeff Squyres	c7ff45d046	This define is not needed in mpi.h (it's private to the implementation, and is already included in opal_config.h) This commit was SVN r28964.	2013-07-30 21:33:19 +00:00
Nathan Hjelm	befcd8b63e	Disable mpi_param_check by default if parameter checking is compiled out. cmr=v1.7.3:reviewer=rhc This commit was SVN r28960.	2013-07-26 17:44:10 +00:00
Nathan Hjelm	1382d4fb53	Fix typos in _Complex ops This commit was SVN r28959.	2013-07-26 17:02:45 +00:00
Nathan Hjelm	9c519c5ce8	Bump MPI version to 2.2 now that in place alltoall and reductions on MPI_C_COMPLEX and MPI_CXX_COMPLEX are supported This commit was SVN r28958.	2013-07-26 15:51:38 +00:00
Jeff Squyres	c59770651f	Propagate the use of MPI_COUNT_KIND everywhere. This commit was SVN r28956.	2013-07-25 22:41:48 +00:00
Jeff Squyres	a001f01f05	Remove the other bogus KIND from here; it looks like old kruft This commit was SVN r28955.	2013-07-25 22:41:12 +00:00
Nathan Hjelm	7097ba3992	MPI_COUNT_KIND is defined in mpif-config.h This commit was SVN r28954.	2013-07-25 21:42:27 +00:00
Nathan Hjelm	e4f105ffb3	revert change that shouldn't have been part of r28952 This commit was SVN r28953. The following SVN revision numbers were found above: r28952 --> open-mpi/ompi@cb90a4a7fc	2013-07-25 20:23:55 +00:00
Nathan Hjelm	cb90a4a7fc	Add simple algorithms to support MPI_IN_PLACE for MPI_Alltoall, MPI_Alltoallv, and MPI_Alltoallw. Working on faster algorithms for tuned that will come at a later time. cmr=v1.7.3:ticket=trac:2965 This commit was SVN r28952. The following Trac tickets were found above: Ticket 2965 --> https://svn.open-mpi.org/trac/ompi/ticket/2965	2013-07-25 19:19:41 +00:00
Nathan Hjelm	99adeb7f6e	Fix support for complex datatypes when fortran is not available but _Complex is This commit was SVN r28951.	2013-07-25 19:08:21 +00:00
Edgar Gabriel	012f99c3b6	ompi_ignore this component until we find a solution for the dependence on libmpi for the external process that is being spawned by this component. This commit was SVN r28945.	2013-07-24 20:02:47 +00:00
Nathan Hjelm	22868b9f68	MPI_T: add man pages for MPI_T_* functions and fix typos in tool file names This commit was SVN r28943.	2013-07-24 18:19:40 +00:00
Jeff Squyres	f7337b8f77	Correct faulty max payload and MTU computations (and update some debugging that helped us find those). This commit was SVN r28942.	2013-07-24 16:06:28 +00:00
Ralph Castain	db214a2321	Refs trac:3697 - use the opal_pmi_error function instead of ompi_error as the returned error codes are from PMI This commit was SVN r28941. The following Trac tickets were found above: Ticket 3697 --> https://svn.open-mpi.org/trac/ompi/ticket/3697	2013-07-24 04:05:41 +00:00
Jeff Squyres	5323051047	Use sysfs to check MPI has enough VFs, QPs, and CQs Use the new sysfs files to check that there are enough VFs, QPs, and CQs for all the MPI processes on this server. Move the checking code into its own subroutine to make it smaller and easier to read/grok. This commit was SVN r28937.	2013-07-24 00:38:32 +00:00
Nathan Hjelm	90f5bd9424	Add missing f08 binding declarations for MPI_Count functions This commit was SVN r28936.	2013-07-23 19:00:06 +00:00
Rolf vandeVaart	a3995f73d3	Update trunk to match OMPI 1.7.3 due to code reviews. This commit was SVN r28934.	2013-07-23 17:58:21 +00:00
Nathan Hjelm	c6e586a81d	MPI-3: fortran support for large counts using derived datatypes Jeff: - Make sure not to go over 72 characters. Love Fortran! - Ensure to include 'mpif-config.h' in Type_size_x. This commit was SVN r28933.	2013-07-23 15:36:03 +00:00
Nathan Hjelm	c4c69b4ddf	MPI-3: add support for large counts using derived datatypes Add support for MPI_Count type and MPI_COUNT datatype and add the required MPI-3 functions MPI_Get_elements_x, MPI_Status_set_elements_x, MPI_Type_get_extent_x, MPI_Type_get_true_extent_x, and MPI_Type_size_x. This commit adds only the C bindings. Fortran bindins will be added in another commit. For now the MPI_Count type is define to have the same size as MPI_Offset. The type is required to be at least as large as MPI_Offset and MPI_Aint. The type was initially intended to be a ssize_t (if it was the same size as a long long) but there were issues compiling romio with that definition (despite the inclusion of stddef.h). I updated the datatype engine to use size_t instead of uint32_t to support large datatypes. This will require some review to make sure that 1) the changes are beneficial, 2) nothing was broken by the change (I doubt anything was), and 3) there are no performance regressions due to this change. Increase the maximum number of predifined datatypes to support MPI_Count Put common get_elements code to ompi/datatype/ompi_datatype_get_elements.c Update MPI_Get_count to reflect changes in MPI-3 (return MPI_UNDEFINED when the count is too large for an int) This commit was SVN r28932.	2013-07-23 15:35:14 +00:00
Matthias Jurenz	c4a7dded5f	Changes to VT: - configure: Removed double slashes in path names which make trouble when building RPMs on Fedora (see #3688) This commit was SVN r28924.	2013-07-23 08:12:18 +00:00
Nathan Hjelm	1349b825c2	MPI-2.2: Add C++ datatypes to mpi.h and fix support for MPI_C_*COMPLEX This commit was SVN r28919.	2013-07-22 23:45:45 +00:00
Ralph Castain	59a71765cf	Hmmm...these error outputs will never occur, which is probably not what the author intended. So do the output and THEN jump to the error exit. This commit was SVN r28918.	2013-07-22 22:58:03 +00:00
Edgar Gabriel	8ffc1aac89	update the _component.c files in ompio to use the explicit assignment of the mca_register_component_params element of the structure. This commit was SVN r28914.	2013-07-22 21:11:05 +00:00
Nathan Hjelm	b17cd13c09	sharedfp: ensure sharedfp components register their parameters in mca_register_component_params not mca_component_open This commit was SVN r28910.	2013-07-22 17:53:58 +00:00
Jeff Squyres	b437041aeb	Update one more comment. This commit was SVN r28908.	2013-07-22 17:29:00 +00:00
Jeff Squyres	4b6006402d	Use the RTE framework instead of calling ORTE directly. Brian (rightfully) hit me on the head with the don't-use-ORTE-use-the-rte-framework clue bat; the usnic BTL now nicely plays with the RTE framework. This commit was SVN r28907.	2013-07-22 17:28:23 +00:00
Jeff Squyres	ca9da8a554	Fix minor typo in the comments/docs. This commit was SVN r28905.	2013-07-22 17:24:17 +00:00
Rolf vandeVaart	67badf384c	Only search SONAME of library. Expand comments. This commit was SVN r28904.	2013-07-22 15:54:45 +00:00
Brian Barrett	e1d72409cd	add missing header This commit was SVN r28897.	2013-07-21 19:40:31 +00:00
Brian Barrett	704f1ecc18	fix non-orte builds of PSM This commit was SVN r28893.	2013-07-21 19:12:32 +00:00
Brian Barrett	05ab9cbaa6	Need to ship pmi_internal.h This commit was SVN r28891.	2013-07-21 19:00:50 +00:00
Brian Barrett	495384d8b7	Update documentation in rte.h to match recent changes This commit was SVN r28887.	2013-07-20 22:14:12 +00:00
Brian Barrett	414ba3dad8	Update PMI RTE to match error handling changes that were part of r28852. Note that the PMI RTE still doesn't listen for asynchronous errors, so the error handler still won't ever actually do anything :). This commit was SVN r28886. The following SVN revision numbers were found above: r28852 --> open-mpi/ompi@e4e678e234	2013-07-20 22:09:02 +00:00
Brian Barrett	5bfd980968	update PMI RTE component to adapt to ORTE changes This commit was SVN r28885.	2013-07-20 22:06:47 +00:00
Brian Barrett	d984d25da3	Remove orte header file from sharedfp components (OMPI layer should not include ORTE layer with the RTE framework). Thankfully, nothing used orte_show_help, so easy fix. This commit was SVN r28884.	2013-07-20 22:03:44 +00:00
Jeff Squyres	194b285447	First commit of the Cisco usNIC BTL. This BTL accesses the Cisco usNIC Linux device via the Linux verbs API via Unreliable Datagram queue pairs. A few noteworthy points: * This BTL does most of its own fragmentation; it tells the PML that it has a very high max_send_size (much higher than the network MTU). * Since UD fragments are, by definition, unreliable, the usnic BTL handles all of its own reliability via a sliding window approach using the opal_hotel construct and many tricks stolen from the corpus of knowledge surrounding efficient TCP. * There is a fun PML latency-metric based optimization for NUMA awareness of short messages. * Note that this is ''not'' a generic UD verbs BTL; it is specific to the Cisco usNIC device. This commit was SVN r28879.	2013-07-19 22:13:58 +00:00
Jeff Squyres	3546163c48	Devices that do not support RC QP's are also intentionally skipped; don't warn about skipping them. This commit was SVN r28874.	2013-07-19 19:05:18 +00:00
Ralph Castain	e4e678e234	Per the RFC and discussion on the devel list, update the RTE-MPI error handling interface. There are a few differences in the code from the original RFC that came out of the discussion - I've captured those in the following writeup George and I were talking about ORTE's error handling the other day in regards to the right way to deal with errors in the updated OOB. Specifically, it seemed a bad idea for a library such as ORTE to be aborting the job on its own prerogative. If we lose a connection or cannot send a message, then we really should just report it upwards and let the application and/or upper layers decide what to do about it. The current code base only allows a single error callback to exist, which seemed unduly limiting. So, based on the conversation, I've modified the errmgr interface to provide a mechanism for registering any number of error handlers (this replaces the current "set_fault_callback" API). When an error occurs, these handlers will be called in order until one responds that the error has been "resolved" - i.e., no further action is required - by returning OMPI_SUCCESS. The default MPI layer error handler is specified to go "last" and calls mpi_abort, so the current "abort" behavior is preserved unless other error handlers are registered. In the register_callback function, I provide an "order" param so you can specify "this callback must come first" or "this callback must come last". Seemed to me that we will probably have different code areas registering callbacks, and one might require it go first (the default "abort" will always require it go last). So you can append and prepend, or go first. Note that only one registration can declare itself "first" or "last", and since the default "abort" callback automatically takes "last", that one isn't available. :-) The errhandler callback function passes an opal_pointer_array of structs, each of which contains the name of the proc involved (which can be yourself for internal errors) and the error code. This is a change from the current fault callback which returned an opal_pointer_array of just process names. Rationale is that you might need to see the cause of the error to decide what action to take. I realize that isn't a requirement for remote procs, but remember that we will use the SAME interface to report RTE errors internal to the proc itself. In those cases, you really do need to see the error code. It is legal to pass a NULL for the pointer array (e.g., when reporting an internal failure without error code), so handlers must be prepared for that possibility. If people find that too burdensome, we can remove it. Should we ever decide to create a separate callback path for internal errors vs remote process failures, or if we decide to do something different based on experience, then we can adjust this API. This commit was SVN r28852.	2013-07-19 01:08:53 +00:00
Ralph Castain	8a8b4896be	Need to protect libgen.h as some systems might not have it This commit was SVN r28845.	2013-07-18 20:21:37 +00:00
Edgar Gabriel	185e365dad	make the sm sharedfp component compile on Mac. This commit was SVN r28844.	2013-07-18 20:17:14 +00:00
Edgar Gabriel	93cef82873	remove the ylib component from the fcoll framework. It is not used, there are no plans to use it. We can always recover it from svn if we would ever change our minds. This commit was SVN r28840.	2013-07-18 16:18:06 +00:00
Pavel Shamis	68969ba6e5	Removing bogus references in iboffload code. cmr:v1.7:reviewer=hjelmn This commit was SVN r28834.	2013-07-17 22:35:24 +00:00
Rolf vandeVaart	49663fb802	Move CUDA-aware configurary to its own file and other minor changes due to review. This commit was SVN r28832.	2013-07-17 22:12:29 +00:00
Edgar Gabriel	6e8522fec5	infuse life into the shared file pointer framework. For this: - extend the framework API - remove the dummy component, not require anymore - add four components to perform the actual job. This commit was SVN r28828.	2013-07-17 21:55:24 +00:00
Edgar Gabriel	ac694b7056	in preparation for the new shared file pointer components to be committed soon: - add a new abstraction layer to be used internally for some operations - add a new mca parameter to control lazy intialization of shared file pointer structures This commit was SVN r28826.	2013-07-17 21:30:50 +00:00
Vishwanath Venkatesan	ce8f8f0829	Changing the MPI Datatype from MPI_LONG to OMPI_OFFSET_DATATYPE for send/recv offsets This commit was SVN r28822.	2013-07-17 19:16:53 +00:00
Nathan Hjelm	d4c6029cf3	sbgp/ibnet: set mca_sbgp_ibnet_component.mtu to IBV_MTU_1024 before registering it. cmr:v1.7:reviewer=pasha This commit was SVN r28821.	2013-07-17 19:16:31 +00:00
Rolf vandeVaart	7a45be8bde	Fix variable initialization. This commit was SVN r28819.	2013-07-17 17:37:35 +00:00
Nathan Hjelm	5999906dec	Remote duplicate mpi/tool from DIST_SUBDIRS This commit was SVN r28818.	2013-07-17 04:35:47 +00:00
Nathan Hjelm	f0aeb36d80	Fix warnings in ob1 introduced by the pvar commit This commit was SVN r28817.	2013-07-17 03:41:05 +00:00
Ralph Castain	956317ac1e	Cleanup the MPIT errors - include the generated mpit lib in libompi, set ignores, remove the now unused mpit directory This commit was SVN r28816.	2013-07-17 03:13:13 +00:00
Rolf vandeVaart	f95c95cf79	Additional cleanup of how libraries and paths are searched. This commit was SVN r28815.	2013-07-16 18:40:55 +00:00
Nathan Hjelm	38bcbc4696	mpit: fix behavior when returning strings This commit was SVN r28804.	2013-07-16 16:03:48 +00:00
Nathan Hjelm	e6e9f2c6fd	Add profiling function definitions for MPI_T and add a missing type into mpi.h This commit was SVN r28803.	2013-07-16 16:03:33 +00:00
Nathan Hjelm	35673ea400	Add example performance variables to ob1: unexpected message queue length, posted receive length This commit was SVN r28801.	2013-07-16 16:02:25 +00:00
Nathan Hjelm	d446675526	MCA: Per-RFC, add support for performance variables This commit adds an API for registering and querying performance variables (mca_base_pvar) in the MCA base. The existing MCA variable system API has been updated to reflect the new API: MCA variable groups have performance variables, and new types have been added (double, unsigned long long) to reflect what is required by the MPI_T interface. Additionally, the MCA variable group code has been split into its own set of files: mca_base_var_group.[ch]. Details of the new API can be found in doxygen comments in the header: mca_base_pvar.h. Other changes to the variable system: - Use an opal_hash_table to speed up variable/group lookup. - Clean up code associated with MCA variable types. - Registered performance variables are printed by ompi_info -a. In the future an option should be added to control this behavior. Changes to OMPI: - Added full support for the MPI_T performance variable interface. This commit was SVN r28800.	2013-07-16 16:02:13 +00:00
Rolf vandeVaart	54b1fbdb4a	Better error message code. Remove commented out code. This commit was SVN r28793.	2013-07-15 22:27:34 +00:00
Rolf vandeVaart	4d2c2bcefe	Better error message. Remove a tab. This commit was SVN r28791.	2013-07-15 19:39:54 +00:00
George Bosilca	f7ac610bee	Fix an issue with the packing/unpacking of datatype representations identified by Takahiro Kawashima. The packed length was reported as a max bound and not provided on the unpacking side, so the unpacking buffer could become out of sync with the content stored after the packed representation. The fix force the packing operation itself before reporting the length, so we always report now the real number of bytes in the packed representation. cmr:v1.7.3:reviewer=jsquyres This commit was SVN r28790.	2013-07-15 10:54:22 +00:00
Mike Dubman	5bd2e15cbb	support for ConnectX3-Pro card. cmr:v1.7:reviewer=jsquyres cmr:v1.6:reviewer=jsquyres This commit was SVN r28787.	2013-07-14 06:44:19 +00:00
Nathan Hjelm	dfca3d4804	fix typos in the ugni and vader btls This commit was SVN r28772.	2013-07-12 17:55:33 +00:00
Nathan Hjelm	1119cd3e8a	Merge branch 'vader_fix' This commit was SVN r28764.	2013-07-11 23:30:20 +00:00
Brian Barrett	2f19fc52de	use the same multi-md workaround the rest of the Portals code is using. This commit was SVN r28761.	2013-07-11 21:00:11 +00:00
Nathan Hjelm	b5281778b0	btl/vader: improve small message performance This commit improved the small message latency and bandwidth when using the vader btl. These improvements should make performance competative with other MPI implementations. This commit was SVN r28760.	2013-07-11 20:54:12 +00:00
Brian Barrett	bea54eeeb1	First take at a BTL for Portals 4 This commit was SVN r28759.	2013-07-11 20:47:08 +00:00
Jeff Squyres	baa3182794	Per RFC (http://www.open-mpi.org/community/lists/devel/2013/07/12534.php), remove a bunch of dead code. This commit was SVN r28756.	2013-07-11 17:34:28 +00:00
Rolf vandeVaart	858ef65142	Fix loop limit. This commit was SVN r28755.	2013-07-11 17:15:43 +00:00
Rolf vandeVaart	5051cd53fd	Use new API. This commit was SVN r28754.	2013-07-11 17:06:14 +00:00
Joshua Ladd	16beaa3878	This fixes the nasty configure.m4 hack that was added long ago and not removed. My fault for not catching earlier. I've also removed the '.ompi_ignore' in coll/hcoll. Throwing this to Nathan for review. Upon successful review, this should be added to cmr:v1.7:reviewer=hjelmn This commit was SVN r28753.	2013-07-11 09:55:46 +00:00
Jeff Squyres	28dac8010b	The hcoll component configure.m4 commits multiple sins, and breaks many builds. I am temporarily .ompi_ignore'ing this component until it can be fixed by its owner. * It calls AC_MSG_ERROR, which configure.m4 scripts are ''never'' supposed to do. If you don't want to build, then call $2. * All static and --disable-dlopen builds are broken; they fall afoul of whatever test configure.m4 is doing and therefore error out of configure entirely (vs. simply disabling the hcoll component). * There appear to be multiple shell scripting errors in the configure.m4. Here's the output of "./configure --disable-dlopen": {{{ --- MCA component coll:hcoll (m4 configuration macro) checking for MCA component coll:hcoll compile mode... static checking --with-hcoll value... simple ok (unspecified) ./configure: line 421: test: basic: integer expression expected configure: error: Can not use coll/hcoll and coll/ml (static build) simultaneously. You have two options: 1. Use static build & disable ml with: --enable-mpi-no-build=coll-ml 2. Use dso build for ML & disable ml at runtime: -mca coll self ./configure: line 310: return: basic: numeric argument required ./configure: line 320: exit: basic: numeric argument required }}} Finally, all of these configure.m4 errors aside, I don't understand why there is a ''compile-time'' exclusion between the hcoll and ml components. Why isn't this a ''run-time'' decision? Having what seems to be an unnecessary compile-time exclusion goes against the general Open MPI philosophy. Note: Open MPI 1.7 is also broken in all the same ways. I suggest that the RM's .ompi_ignore hcoll over there, too. Mellanox: please fix. This commit was SVN r28748.	2013-07-10 16:03:15 +00:00
Jeff Squyres	80145742a3	Fix typo in comment This commit was SVN r28747.	2013-07-10 15:13:08 +00:00
Jeff Squyres	ea94936531	First cut at assigning some fine-grained "levels" to MCA parameters for the SM and TCP BTLs, as well as the mca_btl_base_param_register() function (which registers MCA params for all BTLs). The guidelines in https://svn.open-mpi.org/trac/ompi/wiki/MCAParamLevels were used to pick these levels. This commit was SVN r28746.	2013-07-10 00:47:52 +00:00
Aurelien Bouteiller	e1066143a4	rename ompi_free_list operations to _mt, as per discussions at last face to face meeting This commit was SVN r28734.	2013-07-08 22:07:52 +00:00
Brian Barrett	ecbbf888d3	* Update Portals 4 MTL's multi-md code to be a bit cleaner (no if statements in the path) and not create MDs due to boundary crossing * Add the same logic to the Coll component This commit was SVN r28733.	2013-07-08 21:27:37 +00:00
Brian Barrett	84aeb6a6a5	Update request alloc to use free list get instead of free list wait. This commit was SVN r28729.	2013-07-05 20:24:43 +00:00
George Bosilca	dc9352faf6	Remove some unused variables. This commit was SVN r28726.	2013-07-05 13:31:54 +00:00
George Bosilca	8b01c3da33	Slightly reorder the code. This commit was SVN r28725.	2013-07-05 13:29:29 +00:00
Jeff Squyres	b417095639	Do not destroy the sub-communicator until we have freed its attributes, per the reason cited in the comment in the code. This commit was SVN r28724.	2013-07-05 12:15:03 +00:00
George Bosilca	483ed8da8c	Remove an unused variable resulting from the removal of the last parameter of the OMPI_FREE_LIST_GET macro. This commit was SVN r28723.	2013-07-04 09:19:00 +00:00
George Bosilca	c9e5ab9ed1	Our macros for the OMPI-level free list had one extra argument, a possible return value to signal that the operation of retrieving the element from the free list failed. However in this case the returned pointer was set to NULL as well, so the error code was redundant. Moreover, this was a continuous source of warnings when the picky mode is on. The attached parch remove the rc argument from the OMPI_FREE_LIST_GET and OMPI_FREE_LIST_WAIT macros, and change to check if the item is NULL instead of using the return code. This commit was SVN r28722.	2013-07-04 08:34:37 +00:00
Brian Barrett	d3b49535b5	Only allow communication from the same user, since we don't have job-level protection. This commit was SVN r28715.	2013-07-03 17:29:02 +00:00
Jeff Squyres	d1ce64f049	Fix some "malloc of 0 bytes" warnings This commit was SVN r28713.	2013-07-03 12:05:33 +00:00
Brian Barrett	81efd0e3cf	Properly shut down Portals collective component This commit was SVN r28707.	2013-07-02 22:07:27 +00:00
Brian Barrett	133dafd3dc	First take at Barrier and Ibarrier, both of which seem to work. This commit was SVN r28706.	2013-07-02 21:42:10 +00:00
Brian Barrett	c4577723ed	fix misuse of param api This commit was SVN r28705.	2013-07-02 21:41:42 +00:00
Brian Barrett	c9a8217af6	Portals 4 doesn't have a BTL, need to default to MTL, rather than finding some stupid slow BTL. THis selection logic sucks. This commit was SVN r28704.	2013-07-02 21:18:04 +00:00
Brian Barrett	e4698f5cd4	Shell of the Portals 4 collectives componetn This commit was SVN r28703.	2013-07-02 15:23:55 +00:00
George Bosilca	fe012cdc2b	Use the converted value instead of calling the macro again. This commit was SVN r28701.	2013-07-02 11:33:18 +00:00
Joshua Ladd	5d2d5e958c	Deleting garbage I accidentally committed. Thanks, Nathan\! This commit was SVN r28698.	2013-07-01 22:50:54 +00:00
Joshua Ladd	d7a50343bf	Per the details and schedule outlined in the attached RFC, Mellanox Technologies would like to CMR the new 'coll/hcoll' component. This component enables Mellanox Technologies' latest HPC middleware offering - 'Hcoll'. 'Hcoll' is a high-performance, standalone collectives library with support for truly asynchronous, non-blocking, hierarchical collectives via hardware offload on supporting Mellanox HCAs (ConnectX-3 and above.) To build the component, libhcoll must first be installed on your system, then you must configure OMPI with the configure flag: '--with-hcoll=/path/to/libhcoll'. Subsequent to installing, you may select the 'coll/hcoll' component at runtime as you would any other coll component, e.g. '-mca coll hcoll,tuned,libnbc'. This has been reviewed by Josh Ladd and should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28694.	2013-07-01 22:39:43 +00:00
George Bosilca	ae190246df	Oops, thanks Jeff for noticing. This commit was SVN r28693.	2013-07-01 17:51:52 +00:00
George Bosilca	e665cda6c2	Add the empty basic component where the function pointer from the base will be copied over. Without such a decoy component the entire framework will not function correctly. This commit was SVN r28692.	2013-07-01 17:47:44 +00:00
George Bosilca	dc1e68c3c1	Remove the item from the list before releasing it. This commit was SVN r28691.	2013-07-01 16:54:48 +00:00
George Bosilca	702e669636	Remove a [very] annoying warning. This commit was SVN r28690.	2013-07-01 16:49:13 +00:00
George Bosilca	5fae72b9aa	Add the MPI 2.2 MPI_Dist_graph functionality. This patch reshape the way we deal with topologies completely. Where our topologies were mainly storage components (they were not capable of creating the new communicator), the new version is built around a [possibly] common representation (in mca/topo/topo.h), but the functions to attach and retrieve the topological information are specific to each component. As a result the ompi_create_cart and ompi_create_graph functions become useless and have been removed. In addition to adding the internal infrastructure to manage the topology information, it updates the MPI interface, and the debuggers support and provides all Fortran interfaces. This commit was SVN r28687.	2013-07-01 12:40:08 +00:00
George Bosilca	b82abf6bef	Silence a compiler warning. This commit was SVN r28686.	2013-07-01 11:40:42 +00:00
Rolf vandeVaart	adda653fc1	Fix two bugs from previous commit. This commit was SVN r28684.	2013-06-28 16:32:51 +00:00
Rolf vandeVaart	850d325f32	Adjust how search is done for dynamic load of library. CUDA only. This commit was SVN r28683.	2013-06-27 22:13:25 +00:00
Jeff Squyres	e3d0782788	Move the assignment after the bozo check. This commit was SVN r28669.	2013-06-22 12:38:32 +00:00
Rolf vandeVaart	5ebb74bee3	Fix case where amount of data sent is less than expected. Otherwise, we will get hang when running the RGET protocol. Reviewed by hjelm,bosilca. This commit was SVN r28667.	2013-06-21 18:35:16 +00:00
Joshua Ladd	0b5c1f2ea8	Add 'generic' support for PMI2 (previously, we checked for PMI2 only on Cray systems.) If your resource manager (e.g. SLURM) has support for PMI2, then the --with-pmi configure flag will enable its usage. If you don't have PMI2, then you will fallback to regular old PMI1. This patch was submitted by Ralph Castain and reviewed and pushed by Josh Ladd. This should be added to cmr:v1.7:reviewer=jladd This commit was SVN r28666.	2013-06-21 15:28:14 +00:00
Jeff Squyres	2e5c18195b	We want to ignore this MPI extension in the general case -- it's just an example (and outputs stuff to stdout!). This commit was SVN r28654.	2013-06-19 16:01:45 +00:00
Mike Dubman	d1c82994be	fix: detect threading model to take appropriate flow in mxm This commit was SVN r28648.	2013-06-16 08:40:06 +00:00
Jeff Squyres	a0b27f5b28	Better comment than what was submitted in r28614. This commit was SVN r28631. The following SVN revision numbers were found above: r28614 --> open-mpi/ompi@9556310bd0	2013-06-13 20:52:44 +00:00
Matthias Jurenz	ebf441ba4b	Changes to VT: Fixed infinite recursion bug if the verbosity level (env. VT_VERBOSE) is higher or equal to 2 This commit was SVN r28624.	2013-06-13 07:33:22 +00:00
Jeff Squyres	34fb0712c4	Per https://svn.mpi-forum.org/trac/mpi-forum-web/ticket/256 , we need to set *flag=1 when source == MPI_PROC_NULL. cmr:v1.7.2:reviewer=dgoodell cmr:v1.6.5:reviewer=dgoodell This commit was SVN r28621.	2013-06-12 21:38:07 +00:00
Nathan Hjelm	db9bce0926	Add destructors for MPI_T error codes This commit was SVN r28618.	2013-06-12 14:58:14 +00:00
Matthias Jurenz	ba9bc238ee	attempt to fix #3627 : Pass all configure options from the OMPI top-level configure to the OTF sub-configure This commit was SVN r28616.	2013-06-12 13:39:21 +00:00
Mike Dubman	9556310bd0	cosmetic: add comment with rationale for malloc.h include This commit was SVN r28614.	2013-06-12 05:58:32 +00:00
Nathan Hjelm	9b1f32bf12	BTL: add flags for signaled BTL operations As per discussion in the June 2013 developer meeting these flags will be used by the PML in the future to request asynchronous progress on an operation. The naming was chosen to reflect that a BTL supports this mode (MCA_BTL_FLAG_SIGNALED) and that a descriptor should "signal" the remote side to wake up and progress the message (MCA_BTL_DES_FLAG_SIGNAL). Future commits will update OB1 to take advantage of this feature when performing the RDMA get or RDMA rendezvous protocols. This commit was SVN r28612.	2013-06-11 21:52:20 +00:00
Jeff Squyres	bf7f9b1f41	Fix minor typo in man page. This commit was SVN r28606.	2013-06-10 13:44:48 +00:00
Mike Dubman	d18b3ae1a7	fix malloc deprication error with gcc 4.6.3 on ubuntu/fedora This commit was SVN r28605.	2013-06-09 18:13:16 +00:00
George Bosilca	d789423d34	Typo. This commit was SVN r28603.	2013-06-08 10:44:02 +00:00
Vishwanath Venkatesan	0b727f84da	Avoid malloc of zero bytes, add a check and avoid it. This commit was SVN r28597.	2013-06-06 14:08:57 +00:00
Edgar Gabriel	2d4655a05a	Logic has been revised compared to the previous implementation. This commit was SVN r28594.	2013-06-05 23:47:42 +00:00
Edgar Gabriel	03c1db7a3a	fix the calculation of the UNIFORM flag. This commit was SVN r28593.	2013-06-05 23:18:50 +00:00
Vishwanath Venkatesan	7d6a05982a	Removing the gather_array based on the flag UNIFORM FVIEW for read all operations (dynamic/static), + Disabling Timing data extraction by default in dynamic write all This commit was SVN r28592.	2013-06-05 21:35:37 +00:00
Vishwanath Venkatesan	55878674d7	1. Removing the allgather_array based on the flag UNIFORM FVIEW. This is not really and optimization. 2. Fixing some of the debug printf's these are outdated. This commit was SVN r28591.	2013-06-05 21:30:15 +00:00
Jeff Squyres	713e3aa3db	Refs trac:3626: that ticket specifically refers to the v1.6 branch; this commit is the trunk version of what is needed for #3626. Add the "ignore_device" field to the INI file. This allows us to specifically list devices that should be ignored by the openib BTL (such as the Intel Phi, at least as of May 2013 -- see #3626). Also add the Intel Phi to the ini file, and set its ignore_device=1. Finally, add the concept of counting intentionally ignored verbs devices. Devices are ignored for one of two reasons: * If the number of allowed ports on that device is 0 (i.e., if if_include/if_exclude was set such that we're intentionally ignoring this device). * If the INI ignore_device field for this device is set to 1. Once we have the count of devices that were intentionally ignored, only show the "Hey, there's verbs devices that you're not using!" show_help message if there are devices that were ''unintentionally'' ignored. This commit was SVN r28589. The following Trac tickets were found above: Ticket 3626 --> https://svn.open-mpi.org/trac/ompi/ticket/3626	2013-06-05 12:12:09 +00:00
Jeff Squyres	3019b7a3f8	Oops! Remove duplicate registration. This commit was SVN r28588.	2013-06-05 11:55:19 +00:00
Jeff Squyres	1de00b17ad	Properly check the return status from registering the MCA params. This commit was SVN r28587.	2013-06-05 11:53:18 +00:00
Nathan Hjelm	e48bd9809e	Add useful messages for MPI_T error codes This commit was SVN r28584.	2013-06-04 23:18:44 +00:00
Jeff Squyres	d692aba672	Remove the DR PML. It was abondoned long ago. It had a nice life, a few papers, and now a decent demise with respect. This commit was SVN r28582.	2013-06-04 19:36:16 +00:00
Jeff Squyres	d1dc4da292	Fix typo (the debugger might not be TotalView). This commit was SVN r28577.	2013-05-31 00:39:05 +00:00
Edgar Gabriel	87b3782b7f	arghh, copy-and-paste error, status->_ucount has to be set to 0 not max_data for count=0. This commit was SVN r28576.	2013-05-30 22:00:29 +00:00
Edgar Gabriel	9daec82f17	- make a fileview of 0 bytes work in ompio - fixes the bug reported in ticket 3619 (which is already closed) also for ompio This commit was SVN r28575.	2013-05-30 21:33:13 +00:00
Rolf vandeVaart	3d1d158a80	Do not abort in BTL. Rather, callback into PML error function. Thanks George for review. This commit was SVN r28559.	2013-05-23 18:45:23 +00:00

1 2 3 4 5 ...

6628 Коммитов