Commit graph

5348 commits

Author SHA1 Message Date
Wesley Bland
5fde3e0e00 Move the resilient orte errmgr code into a separate errmgr for now while it's
still unstable. Reverted errmgr modules back to the original errmgr (with the
updates since the resilient code was brought into the trunk).

This commit was SVN r24958.
2011-07-28 21:24:34 +00:00
Matthias Jurenz
4ca70e5c91 Changes to OTF:
   - improved zlib compression
   - otfprofile-mpi:
      - fixed progress

Changes to VT:
   - fixed C++ linker issue for manual instrumentation of multiple files
   - fixed CUDA kernel launch configuration
   - process and thread buffer size can be explicitly specified by the user via the environment variables VT_BUFFER_SIZE and VT_THREAD_BUFFER_SIZE
   - fixed CUDA buffer management
   - vtfilter:
      - fixed progress
   - vtwrapper:
      - link CUPTI library, if available
   - vtsetup:
      - removed fixed path to *.dtd file in vtsetup-data.xml[.in] (fixes 'java.net.MalformedURLException')

This commit was SVN r24950.
2011-07-26 12:47:05 +00:00
Yevgeny Kliteynik
c1ab24c687 openib: added Mellanox ConnectX3 device ID to the device parameters ini file
This commit was SVN r24947.
2011-07-26 12:06:43 +00:00
Mike Dubman
aefffa073d initial implementation of MXM MTL layer
This commit was SVN r24946.
2011-07-26 04:36:21 +00:00
Mike Dubman
96ef2fc0e4 fix handling datatypes which have a gap in the beginning
This commit was SVN r24936.
2011-07-25 06:30:09 +00:00
Jeff Squyres
d6bc78920e Add a few manual cleanups that were missed (i.e., this is the
''other'' direction, so to speak, compared to r24921).

This commit was SVN r24924.

The following SVN revision numbers were found above:
  r24921 --> open-mpi/ompi@bd96d028de
2011-07-22 21:05:39 +00:00
Jeff Squyres
bd96d028de George identified some memory leaks and inconsistencies in the F77 API
when sizeof(int) != sizeof(MPI_Fint).  This commit should fix those
problems. 

This commit was SVN r24921.
2011-07-22 19:49:27 +00:00
Yevgeny Kliteynik
78ea8bcea2 Always defining OMPI_ENABLE_DYNAMIC_SL, not only when the feature is enabled.
Also, refactoring some code - make all the checks only when relevant.

This commit was SVN r24900.
2011-07-13 23:19:58 +00:00
Terry Dontje
fbda6aaf89 Fixes trac:2532 issues with 32-bit binaries
This commit was SVN r24891.

The following Trac tickets were found above:
  Ticket 2532 --> https://svn.open-mpi.org/trac/ompi/ticket/2532
2011-07-13 16:38:03 +00:00
Jeff Squyres
51ac69b05f Remove a now-nonexistent file
This commit was SVN r24874.
2011-07-11 23:51:41 +00:00
Abhishek Kulkarni
5501f83fb5 shmem fixes to make the trunk build with C/R flags on.
This commit was SVN r24871.
2011-07-10 23:32:23 +00:00
Jeff Squyres
08a05a1e35 Minor additions to make OMPI trunk compatible with the latest GNU
Autotools:

 * Autoconf 2.68
 * Automake 1.11.1
 * Libtool 2.4
 * m4 1.4.16

This commit was SVN r24867.
2011-07-10 12:11:47 +00:00
Jeff Squyres
b2b781e537 Fix a few miscellaneous memory leaks.
This commit was SVN r24865.
2011-07-08 16:39:58 +00:00
Mike Dubman
fd17f20ed5 Currently MTLs do not handle communicator contexts in any special way,
they only add the context id to the tag selection of the underlying
messaging mechanism.
 
 We would like to enable an MTL to maintain its own context data
per-communicator. This way an MTL will be able to queue incoming eager 
messages and rendezvous requests on a per-communicator basis.

 The MTL will be allowed to override comm->c_pml_comm member, 
since it's unused in pml_cm anyway. 

This commit was SVN r24858.
2011-07-06 18:25:49 +00:00
Shiqing Fan
1ed0f40d35 Fix a few type casts on Windows.
This commit was SVN r24857.
2011-07-06 08:08:53 +00:00
Yevgeny Kliteynik
4fbe68dd86 Removing trailing white spaces in all the openib btl code.
This commit was SVN r24855.
2011-07-04 14:00:41 +00:00
Yevgeny Kliteynik
5cae33503d Replacing the weird non-ASCII sign with '*'
This commit was SVN r24854.
2011-07-04 13:39:38 +00:00
Yevgeny Kliteynik
3e43b21d41 As suggested by Jeff, checking if dynamic SL support
was explicitly requested. If it was, but opensm-devel
package is not found, warn and abort.
Otherwise, doing the best effort: if opensm-devel found,
enable dynamic SL. If it's not found, disable dynamic
SL and build OMPI w/o it.

This commit was SVN r24852.
2011-07-03 14:02:50 +00:00
Matthias Jurenz
5e6919b4e1 Changes to OTF:
   otfprofile-mpi:
      - added progress display
      - added verbose messages
      - added functions to synchronize the error indicator to all worker ranks
        (enforces that all ranks will be terminated by calling MPI_Abort if anyone fails)
      - wrap def. comments after 80 characters
      - use pdf[la]tex instead of latex/dvipdf to convert TeX output to PDF
      - added configure checks for pdf[la]tex and PGFPLOTS v1.4
      - fixed function invocation statistics generated from summarized information (--stat)
      - fixed memory leak

Changes to VT:
   MPI wrappers:
      - fixed wrapper generation for MPI implementations which don't support the MPI-2 standard (e.g. MVAPICH, MPICH)
      - corrected IN_PLACE denotation for MPI_Alltoall* and MPI_Scatter*
   vtwrapper:
      - corrected detection of IBM XL's OpenMP flag -qsmp=*:omp:*
   vtunify:
      - fixed faulty cleanup of temporary files which occurred if VT is configured without trace compression support

This commit was SVN r24851.
2011-07-01 07:17:15 +00:00
Brian Barrett
a4b2bd903b * Implement long-ago discussed RFC to add a callback data pointer in the
request completion callback
* Use the completion callback pointer to remove all need for opal_progress
  calls in the one-sided layer

This commit was SVN r24848.
2011-06-30 20:05:16 +00:00
Rolf vandeVaart
e6295159ae Fix compilation of file due to some changes in btl structure.
This commit was SVN r24847.
2011-06-30 19:22:41 +00:00
Yevgeny Kliteynik
a09245e6c8 Changed default behavior when opensm-devel package not found - warn, not exit
This commit was SVN r24836.
2011-06-29 14:52:11 +00:00
Yevgeny Kliteynik
b05211148d Supporting dynamic SL (#2674)
- Added enable/disable configuration parameter for dynamic SL
 - All the dynamic SL code is conditionalized
 - Removed libibmad dependency
 - Using only one include - ib_types.h (part of opensm-devel package)
 - Removed all the macro and data types definitions, using the
   existing definitions from ib_types.h instead
 - general cleaning here and there

The async mode is not implemented yet - stay tuned...

This commit was SVN r24830.
2011-06-28 14:28:29 +00:00
Wesley Bland
84be81df95 Standardize the initialization of the EPOCH's.
Everyone will be starting at MIN anyway (until we implement restart of course)
so there's no reason to set the epoch to INVALID and then immediately reset them
to MIN. This way there's less room to make mistakes later.

This commit was SVN r24829.
2011-06-28 14:20:33 +00:00
Jeff Squyres
b05ca7c76b Only build the f90 ext stuff if we're also building the f90 bindings
(e.g., cover the case where the f90 bindings are not being built
because you don't have an f90 compiler)

This commit was SVN r24827.
2011-06-27 20:39:11 +00:00
Jeff Squyres
e32b9a8547 Fix minor typo -- f90, not f77
This commit was SVN r24826.
2011-06-27 20:38:30 +00:00
Jeff Squyres
2b2f6fc580 Fix typo noted by Zbigniew Koza.
This commit was SVN r24818.
2011-06-25 10:35:47 +00:00
Wesley Bland
e1ba09ad51 Add resilience to ORTE. Allows the runtime to continue after a process (or
ORTED) failure. Note that more work will be necessary to allow the MPI layer to
take advantage of this.

Per RFC:
http://www.open-mpi.org/community/lists/devel/2011/06/9299.php

This commit was SVN r24815.
2011-06-23 20:38:02 +00:00
Brian Barrett
e8817f3f63 * Don't send acks for expected triggered messages; still need to get the rest of the data
* Don't ask for UNLINK events for persistent long unexpected ME or the get MEs.

This commit was SVN r24814.
2011-06-23 16:21:10 +00:00
Matthias Jurenz
f97bf63d98 - removed invalid wildcard from EXTRA_DIST (fixes broken 'make distcheck')
- extended 'make uninstall' rules to remove symbolic links (e.g. vtcxx, vtCC, otfdecompress)

This commit was SVN r24811.
2011-06-23 07:46:07 +00:00
Matthias Jurenz
0ecce8e3dd Updated integrated VT to version 5.11.1
(waiting for successful sanity checks (MTT) before moving to v1.5)

This commit was SVN r24803.
2011-06-22 13:58:00 +00:00
Matthias Jurenz
103044d3b7 Added workaround for libtool bug:
Renamed internal library 'libutil' to avoid name conflicts with library dependencies (e.g. libutil of libc-dev)

This commit was SVN r24799.
2011-06-22 11:39:49 +00:00
Samuel Gutierrez
81f38b258a commit of new shared memory backing facility framework (shmem) and its components.
This commit was SVN r24795.
2011-06-21 15:41:57 +00:00
Jeff Squyres
3d8ef08912 Minor updates.
This commit was SVN r24791.
2011-06-20 17:59:37 +00:00
Jeff Squyres
c4f9debe21 Fix some names -- PTLs died a long time ago!
This commit was SVN r24787.
2011-06-20 17:28:27 +00:00
George Bosilca
65661a3cb4 Don't use a temporary string.
This commit was SVN r24786.
2011-06-20 09:29:19 +00:00
Brian Barrett
09d89242d6 Crank up the number of short receive blocks so that we're unlikely to hit the flow
control case.  Uses about the same amount of memory as the Portals 3.3 implementations

This commit was SVN r24782.
2011-06-16 21:58:53 +00:00
Brian Barrett
4fec0c198d update short recv blocks to properly setup for triggered operations (where
they also store the triggered start message)

This commit was SVN r24777.
2011-06-16 16:51:59 +00:00
Josh Hursey
0eb3b3b7b0 Fix missing functionality in MPI_Abort so that the group of peers defined by the communicator that should be aborted with this process are requested from the runtime before the local process exits.
Per RFC:
  http://www.open-mpi.org/community/lists/devel/2011/06/9335.php

This commit was SVN r24775.
2011-06-15 13:10:13 +00:00
Brian Barrett
83154af74d Check return codes a bit more closely
Fix broken debug output in any_source recv case
Other minor code cleanups

This commit was SVN r24774.
2011-06-13 15:18:55 +00:00
Jeff Squyres
90aa9ff09a Fix problem that causes make dist to barf
This commit was SVN r24773.
2011-06-11 18:11:17 +00:00
Josh Hursey
0833734fcf Add support for F77/F90 in the Open MPI extensions interface system.
Per RFC from OMPI Devel list:
 * http://www.open-mpi.org/community/lists/devel/2011/06/9324.php

This commit was SVN r24772.
2011-06-10 20:03:06 +00:00
Edgar Gabriel
0173a00f6b replace the switch-case statement on the basic datatypes by a series of
if-elseif statements to make it compile with OpenMPi again.

Fixes trac:2808

This commit was SVN r24768.

The following Trac tickets were found above:
  Ticket 2808 --> https://svn.open-mpi.org/trac/ompi/ticket/2808
2011-06-09 15:35:35 +00:00
Rolf vandeVaart
610421a0da Fix registration of common parameters in sm btl. This was broken by earlier checkin. Now we can adjust them via MCA parameters again and see the right values from ompi_info.
This commit was SVN r24763.
2011-06-09 13:57:46 +00:00
Brian Barrett
a7c682cdb0 Fix starting buffer point for triggered get. Should be after the eager part of the
message

This commit was SVN r24752.
2011-06-06 17:08:13 +00:00
Rolf vandeVaart
d1fdbadc91 Fix broken basic allocator. Not sure how this ever worked.
This commit was SVN r24746.
2011-06-03 14:43:54 +00:00
Brian Barrett
b778d785fb Add some debugging output and fix some places where the output id and
verbosity level were swapped

This commit was SVN r24740.
2011-06-01 17:20:18 +00:00
Brian Barrett
37d5c7e2ca * Add ability to set long protocol with MCA parameter
* Instead of static arrays of send/recv counts, put them in the endpoint

This commit was SVN r24735.
2011-05-26 21:53:39 +00:00
Brian Barrett
beb1bc70b2 * Add support for using modex to exchange NID/PID pairs when using Portals4.
Rather than try to support a bunch of lightweight environments like I did
  with the Portals3 code, always use the "modex" and hack the grpcomm for
  the SHMEM implementation to return the right nid/pid for a remote
  process by "magic".

This commit was SVN r24733.
2011-05-25 22:10:27 +00:00
Jeff Squyres
595dd60546 Fix off-by-one error in trimming space from the right of strings
This commit was SVN r24724.
2011-05-22 12:21:14 +00:00
Ralph Castain
ca5af216b6 Remove last vestige of orte-iof
This commit was SVN r24721.
2011-05-21 14:28:43 +00:00
Ralph Castain
8c08ee9c3d Remove stale tool
This commit was SVN r24720.
2011-05-21 00:38:35 +00:00
Ralph Castain
b47ec2ee87 Remove lingering references to opal_profile option
This commit was SVN r24709.
2011-05-18 18:27:29 +00:00
Ralph Castain
502cc0747f My my...cleanup a disconnect between the man pages and how we implemented comm_spawn_multiple. We allow an info key per executable. Also fix the -host and -add-host info keys - they are supposed to accept comma-separated lists.
This commit was SVN r24706.
2011-05-17 20:12:31 +00:00
Ralph Castain
d34bab541d Remove the ompi-profiler tool and its attendant ompi-probe program. Also remove the grpcomm basic component since its only function was to support profiled clusters, which nobody was doing. :-(
This commit was SVN r24704.
2011-05-17 03:30:25 +00:00
Mike Dubman
36db9c6233 * updated copyrights
* added support for non-contig data layout in FCA

This commit was SVN r24702.
2011-05-16 14:43:11 +00:00
Brian Barrett
d8b7ea315e First take at implementing rndv and triggered protocols
This commit was SVN r24699.
2011-05-13 05:57:16 +00:00
Brian Barrett
43902221cc * Fix bad argument to PtlGet in long receive
* Fix bad params when configuring ME for long unexpected

This commit was SVN r24698.
2011-05-13 03:56:03 +00:00
Brian Barrett
be8a126600 At Josh's request, make example MPI extension use the init/fini so that
the feature is actually documented.

This commit was SVN r24686.
2011-05-05 18:31:07 +00:00
Brian Barrett
8376e0e507 Use free list get instead of wait; this is a constrained resource that will never come back, as it scales with the number of windows and not some more dynamic resources...
This commit was SVN r24685.
2011-05-05 17:19:59 +00:00
Jeff Squyres
d1d2cd0a87 Make the description of mca_btl_openib_cq_size be more accurate of
what it really is/does.

cmr:v1.5.4:kliteyn cmr:v1.4.4:reviewer=kliteyn

This commit was SVN r24684.
2011-05-05 13:10:11 +00:00
Christopher Yeoh
bab59bda76 Fixes trac:2767: Recursive locking when ROMIO used with THREAD_MULTIPLE
This commit was SVN r24681.

The following Trac tickets were found above:
  Ticket 2767 --> https://svn.open-mpi.org/trac/ompi/ticket/2767
2011-05-04 06:31:42 +00:00
George Bosilca
c3c231b5ae Unsigned datatypes should be redirected to their unsigned counterparts
in the OPAL layer. Thanks to Yossi Etigin for the patch.

cmr:v1.5

This commit was SVN r24677.
2011-05-03 12:53:52 +00:00
Matthias Jurenz
a1e304b2d6 Removed redundant debug message
This commit was SVN r24638.
2011-04-26 08:02:46 +00:00
Jeff Squyres
ddc44cfbce Fix the types of the sendcounts and displs parameters to MPI_Scatterv.
Thanks to Stanislav Sazykin for identifying the issue.

This commit was SVN r24630.
2011-04-22 10:11:45 +00:00
Brian Barrett
3d4b7ecbaf updates for API changes
This commit was SVN r24628.
2011-04-20 16:48:27 +00:00
Brian Barrett
e1676fd61e Make the no-orte case compile again
This commit was SVN r24627.
2011-04-20 16:48:07 +00:00
Jeff Squyres
25a8944e09 Fixes trac:2776. Let the openib BTL auto-detect its bandwidth.
cmr:v1.5.4

This commit was SVN r24621.

The following Trac tickets were found above:
  Ticket 2776 --> https://svn.open-mpi.org/trac/ompi/ticket/2776
2011-04-19 16:31:36 +00:00
George Bosilca
971711474f Based on the patch submitted by Pascal Deveze, here is the memory leak fix
for the type indexed creation.

CMR v1.4 and v1.5.

This commit was SVN r24617.
2011-04-14 21:50:06 +00:00
Matthias Jurenz
fe8cc366c8 Don't try to rename the compiler output of an OPARI modified source file if it's specified by '-o'
This commit was SVN r24601.
2011-04-08 11:53:46 +00:00
Edgar Gabriel
725a0d2100 fix a formatting issue
This commit was SVN r24596.
2011-03-31 20:05:45 +00:00
Edgar Gabriel
ad9f793ce4 avoid calling omp_dpm.mark_dyncomm if the size of the local communicator
is zero. The routine assumes that at least one process is available in the
group, which led to a segfault when creating communicators with GROUP_EMPTY.

Fixes trac:2752

This commit was SVN r24595.

The following Trac tickets were found above:
  Ticket 2752 --> https://svn.open-mpi.org/trac/ompi/ticket/2752
2011-03-31 19:57:06 +00:00
Matthias Jurenz
53346a9c1a - fixed handling NULL value of pathname given to certain I/O calls (e.g. fopen, open, unlink)
- incremented version number

This commit was SVN r24575.
2011-03-25 11:15:49 +00:00
Josh Hursey
045035963a Fix return code from MPI_Probe and MPI_Iprobe.
Instead of returning MPI_SUCCESS every time they are called regardless of the status of the call, they should return a value representative of the action. So similar to MPI_Wait/MPI_Test they will return MPI_SUCCESS if the action was successful, or the value that matches status.MPI_ERROR for the operation if it is unsuccessful.

This was discussed on the [http://www.open-mpi.org/community/lists/devel/2011/03/9109.php ompi-devel list]

This commit was SVN r24551.
2011-03-22 13:29:29 +00:00
Eugene Loh
2770a12beb Continue clean up of thread options started in r22841, 22842, and 22849.
No need for any CMRs to 1.5... that was already done in CMR 2728.

This commit was SVN r24545.

The following SVN revision numbers were found above:
  r22841 --> open-mpi/ompi@b400b84162
2011-03-18 21:36:35 +00:00
Matthias Jurenz
c34eed80c6 Fixed typo in configure options
This commit was SVN r24544.
2011-03-18 14:42:49 +00:00
Jeff Squyres
82f9474fec Revert r24533 and r24507 until the compile errors can be fixed.
This commit was SVN r24541.

The following SVN revision numbers were found above:
  r24507 --> open-mpi/ompi@4ce1936fed
  r24533 --> open-mpi/ompi@3204af2d36
2011-03-18 13:33:02 +00:00
George Bosilca
13d2998d54 When the BTL TCP is trying to connect to a peer, output its process name
in addition to all the information.

This commit was SVN r24534.
2011-03-16 20:20:14 +00:00
Mike Dubman
3204af2d36 * temporary fix for ib btl compilation with old ofed versions 1.3.x.
This commit was SVN r24533.
2011-03-16 17:53:51 +00:00
Ralph Castain
3e2c836e51 Initial cut at integrating new mapper capabilities into comm_spawn. Support specification of a mapper to use, and setting of npernode value. Other info flags can also be defined, but these will serve as examples for now - someone who wants to extend this to all the available mapping controls is welcome to do so.
This commit was SVN r24522.
2011-03-12 15:39:56 +00:00
Samuel Gutierrez
0867454a06 Fixes CID #1665.
This commit was SVN r24519.
2011-03-12 03:41:49 +00:00
Samuel Gutierrez
5cff21842a a friday night in sf, nm. fixes CID 1666.
This commit was SVN r24517.
2011-03-12 02:39:31 +00:00
Doron Shoham
4ce1936fed Fix the following for dynamic SL patch:
* rename ib_path_rec_service_level -> ib_path_record_service_level
* use mad.h and ib_types.h
* free all resources
* move ibv_post_recv to be just before ibv_post_send
* cleanup and beautify code

This commit was SVN r24507.
2011-03-10 16:19:00 +00:00
Jeff Squyres
ec90a3ba6d Fix a few memory leaks, and ensure that coll sm is also registering
the common SM MCA params.

This commit was SVN r24497.
2011-03-08 17:36:59 +00:00
Jeff Squyres
2600672b31 Fix minor memory leak.
This commit was SVN r24494.
2011-03-08 15:21:33 +00:00
Jeff Squyres
79cf382ff3 Fix a few issues with error messages:
* If something goes wrong during ompi_mpi_init, don't erroneously
   report that it is illegal to invoke MPI_INIT* before MPI_INIT
 * Aggregate help messages when possible when something goes wrong
   during ompi_mpi_init

This commit was SVN r24492.
2011-03-07 16:45:45 +00:00
Ralph Castain
63f38e38bb Fix ompi-server: remove extra command flag in buffer being sent to mpirun, ensure that tools route messages thru a remote HNP
This commit was SVN r24491.
2011-03-05 17:12:46 +00:00
Jeff Squyres
67cdecf262 Add svn:ignore and a configure.m4 that always fails (because this
component is only built via cmake... right?).

This commit was SVN r24487.
2011-03-03 18:00:42 +00:00
Shiqing Fan
8cefb6e67f Fix the errors in Makefile, the source file list was not updated. Sorry about the MTT create failure.
This commit was SVN r24485.
2011-03-03 15:20:30 +00:00
Mike Dubman
70392ac1dc fca: broadcast comm_new return status from rank0 to all ranks prior to exiting with an error
This commit was SVN r24481.
2011-03-02 22:18:43 +00:00
George Bosilca
79b13f36ba darray and subarray are now first class citizens in Open MPI. They can be stored
in packed form and reloaded, as any other type (this is mainly for one sided).

This commit was SVN r24480.
2011-03-02 19:22:24 +00:00
Shiqing Fan
8c2865bd6a Add the new WinVerbs btl component.
This commit was SVN r24477.
2011-03-02 09:14:44 +00:00
George Bosilca
5721a06864 Indentation.
This commit was SVN r24467.
2011-02-25 20:51:16 +00:00
George Bosilca
27fecda12c Allow the one sided components to correctly retrieve the op to
be applied. Correct the MPI validation process of the
MPI_Accumulate arguments.

Fix another potential problem not yet reported. If we convert the
MPI datatypes directly into OPAL datatypes, we will restrict their
number to the locally different types. Which might not be identical
on the remote node, if we are in a heterogeneous environment. So,
for MPI One sided only deal with MPI level types, never simplify
them on OPAL types (at least on the args). The unfortunate
outcome is that we need to create the args for all datatypes.

This commit was SVN r24466.
2011-02-25 20:43:17 +00:00
George Bosilca
78151490d1 Just use the LIB definition.
This commit was SVN r24463.
2011-02-25 00:39:05 +00:00
George Bosilca
c66e454181 Make ompi_datatype_destroy a real function (instead of inline).
This commit was SVN r24462.
2011-02-25 00:37:52 +00:00
George Bosilca
4184baa67a Remove the proc_guid from the BTL proc structure. Instead use directly
the one stored in the ompi_proc_t.

This commit was SVN r24461.
2011-02-25 00:36:08 +00:00
George Bosilca
6fc4c22037 Pedantic.
This commit was SVN r24460.
2011-02-25 00:29:48 +00:00
George Bosilca
ceb519a026 Fix an annoying warning from gcc about uninitialized variables.
This commit was SVN r24459.
2011-02-25 00:29:20 +00:00
George Bosilca
87f3109df4 Cleanups.
This commit was SVN r24458.
2011-02-25 00:28:32 +00:00