Commit graph

16039 commits

Author SHA1 Message Date
Yevgeny Kliteynik
7068dc64eb Dynamic SL rework:
- Added dynamic SL support to xoob
- Fixed seg fault in finalization
- All the code has been moved to separate files: connect/btl_openib_connect_sl.{c,h}
- Compilation of the new files is conditional

This commit was SVN r24991.
2011-08-04 20:26:08 +00:00
Jeff Squyres
31311c981b Add note about C++ bindings cosmetic fix.
This commit was SVN r24990.
2011-08-04 15:44:04 +00:00
Jeff Squyres
d28564aa26 Per http://www.open-mpi.org/community/lists/devel/2011/08/9606.php,
comment out some unused parameter names.  I didn't use
__opal_attribute_unused__ because comm_inln.h is (eventually) included
by <mpi.h>, and therefore we don't have all the OPAL config stuff
available.  And it didn't seem worth it to add the optional
attribute_unused stuff to the top of mpi.h.

Thanks to Júlio Hoffimann for reporting the issue.

This commit was SVN r24989.
2011-08-04 15:39:12 +00:00
Jeff Squyres
ba432393d4 Remove some really old (internal) cruft that never ended up getting
used.

This commit was SVN r24988.
2011-08-04 15:24:37 +00:00
Rolf vandeVaart
3d3b3d4dad Add support for CUDA registering sm and openib buffers. Feature is disabled by default.
This commit was SVN r24987.
2011-08-04 10:15:45 +00:00
Mike Dubman
9928c33edd better description of MXM MTL
This commit was SVN r24986.
2011-08-04 07:57:46 +00:00
Jeff Squyres
288915ac6a Add svn:ignore
This commit was SVN r24985.
2011-08-03 23:38:12 +00:00
Jeff Squyres
294e1f50cd Remove compiler warning about nested comment
This commit was SVN r24984.
2011-08-03 18:30:56 +00:00
Jeff Squyres
50ab8d893c Recent (as of 3 Aug 2011) versions of LWP in Macports seem to have
broken SSL certificate verification.  The IU CA is in my Mac system
keychain (and has been there for quite a long time), but after a
recent ports update, LWP fails the SSL certificate verification.
Fine.  So we'll just turn it off, per
http://search.cpan.org/~gaas/libwww-perl-6.02/lib/LWP/UserAgent.pm.

This commit was SVN r24983.
2011-08-03 13:50:23 +00:00
Jeff Squyres
ecc7937584 Format the README a bit and shape up some of the text about MXM.
Still need a bit more, though.

This commit was SVN r24982.
2011-08-03 13:22:56 +00:00
Jeff Squyres
cebd1837e5 Add special token to gkcommit commit messages so that the SVN
pre-commit hook doesn't try to re-close tickets that are referred to
in the original SVN commit messages.

This commit was SVN r24981.
2011-08-03 13:02:45 +00:00
Mike Dubman
7b18ab2fa9 remove unused includes
This commit was SVN r24980.
2011-08-03 07:07:29 +00:00
Jeff Squyres
f539b20a8f Patch from ARM for assembly:
http://www.open-mpi.org/community/lists/devel/2011/08/9586.php

This commit was SVN r24979.
2011-08-02 19:15:24 +00:00
Mike Dubman
45ea375531 code and readme updates, some refactoring
This commit was SVN r24977.
2011-08-02 14:30:11 +00:00
Jeff Squyres
8f4ac54336 Fixes trac:2838: add a warning message and disqualify the TCP BTL if both
btl_tcp_if_include and btl_tcp_if_exclude are specified. 

This commit was SVN r24976.

The following Trac tickets were found above:
  Ticket 2838 --> https://svn.open-mpi.org/trac/ompi/ticket/2838
2011-08-01 23:30:33 +00:00
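
The rule described in commit 8f4ac54336 can be pictured with a small sketch. This is not the Open MPI implementation (which is written in C); the function name `check_tcp_if_params` and the warning text are hypothetical, and only the two parameter names come from the commit message.

```python
from typing import Optional, Tuple


def check_tcp_if_params(
    if_include: Optional[str], if_exclude: Optional[str]
) -> Tuple[bool, Optional[str]]:
    """Hypothetical helper: decide whether the TCP BTL stays usable.

    Returns (usable, warning). When both parameters are set, the
    component is disqualified and a warning message is returned.
    """
    if if_include and if_exclude:
        warning = (
            "btl_tcp_if_include and btl_tcp_if_exclude are both set; "
            "they are mutually exclusive, so the TCP BTL is disqualified"
        )
        return False, warning
    return True, None


if __name__ == "__main__":
    # Illustrative interface names only.
    print(check_tcp_if_params("eth0", "lo"))
    print(check_tcp_if_params("eth0", None))
```

Returning the warning instead of printing it keeps the check easy to test; a real component would presumably route the message through its own help or logging facility.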
Wesley Bland
87a96da99c Should fix some of the shutdown woes of the errmgr.
Correctly checks that the orted's job is completed.
Correctly tests to make sure that there is shutdown going on (doesn't rely on orte_orteds_term_ordered).
Adds a patch from Ralph to correctly check the status of processes.

This commit was SVN r24962.
2011-08-01 14:00:41 +00:00
Ralph Castain
42b125ef35 Move the debug so it more accurately reports
This commit was SVN r24961.
2011-07-29 20:48:46 +00:00
Ralph Castain
70bca4691f Add a new "sensor" module that supports fault tolerance tests - randomly kills local procs and/or the daemon itself
This commit was SVN r24960.
2011-07-29 20:48:22 +00:00
Ralph Castain
e88a6c93da Set properties
This commit was SVN r24959.
2011-07-28 22:03:31 +00:00
Wesley Bland
5fde3e0e00 Move the resilient orte errmgr code into a separate errmgr for now while it's
still unstable. Reverted errmgr modules back to the original errmgr (with the
updates since the resilient code was brought into the trunk).

This commit was SVN r24958.
2011-07-28 21:24:34 +00:00
Ralph Castain
6c879f87fb Add a new param "orte_remote_tmpdir_base" for those situations where the compute nodes require a different session directory head than the head node.
This commit was SVN r24956.
2011-07-27 19:37:17 +00:00
Ralph Castain
decab98fb2 Do a little better job of catching up on missed mcast messages, and provide a way out of scenarios where catch-up is impossible.
This commit was SVN r24955.
2011-07-27 14:58:30 +00:00
Ralph Castain
c3bc33b3fb Don't be so restrictive - accept "slots" as well as "slot" in rank file
This commit was SVN r24954.
2011-07-27 00:45:30 +00:00
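
As a rough illustration of what commit c3bc33b3fb relaxes, the sketch below accepts either keyword when reading one rank file line. It assumes the common `rank N=host slot=...` line layout and a hypothetical helper `parse_rank_line`; the real parser in Open MPI is a C lexer and handles more cases than this regular expression.

```python
import re
from typing import Optional, Tuple

# "slots?" matches both "slot" and "slots"; everything else is an
# assumed, simplified view of a rank file line.
_RANK_LINE = re.compile(
    r"^\s*rank\s+(\d+)\s*=\s*(\S+)\s+slots?\s*=\s*(\S+)\s*$"
)


def parse_rank_line(line: str) -> Optional[Tuple[int, str, str]]:
    """Hypothetical helper: return (rank, host, slot_spec) or None."""
    match = _RANK_LINE.match(line)
    if match is None:
        return None
    return int(match.group(1)), match.group(2), match.group(3)


if __name__ == "__main__":
    # Host names are made up for the example.
    for text in ("rank 0=node1 slot=1:0-2", "rank 1=node2 slots=0"):
        print(parse_rank_line(text))
```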
Wesley Bland
b972fd84e1 No longer sends extra FAILED_NOTIFICATION messages in the non-failure case.
Should reduce finalize complexity and avoid a race condition that has been
detected by a few users.

This commit was SVN r24952.
2011-07-26 20:47:44 +00:00
Matthias Jurenz
4ca70e5c91 Changes to OTF:
   - improved zlib compression
   - otfprofile-mpi:
      - fixed progress

Changes to VT:
   - fixed C++ linker issue for manual instrumentation of multiple files
   - fixed CUDA kernel launch configuration
   - process and thread buffer size can be explicitly specified by the user via the environment variables VT_BUFFER_SIZE and VT_THREAD_BUFFER_SIZE
   - fixed CUDA buffer management
   - vtfilter:
      - fixed progress
   - vtwrapper:
      - link CUPTI library, if available
   - vtsetup:
      - removed fixed path to *.dtd file in vtsetup-data.xml[.in] (fixes 'java.net.MalformedURLException')

This commit was SVN r24950.
2011-07-26 12:47:05 +00:00
Yevgeny Kliteynik
c1ab24c687 openib: added Mellanox ConnectX3 device ID to the device parameters ini file
This commit was SVN r24947.
2011-07-26 12:06:43 +00:00
Mike Dubman
aefffa073d initial implementation of MXM MTL layer
This commit was SVN r24946.
2011-07-26 04:36:21 +00:00
Ralph Castain
715f871605 Ignore the daemon job when reporting parseable output
This commit was SVN r24944.
2011-07-25 20:44:08 +00:00
Ralph Castain
db193555c2 Use non-blocking sends for recovering from lost multicast messages
This commit was SVN r24943.
2011-07-25 18:49:47 +00:00
Samuel Gutierrez
adde221413 use memcpy in ds_copy.
This commit was SVN r24942.
2011-07-25 17:16:29 +00:00
Mike Dubman
96ef2fc0e4 fix handling datatypes which have a gap in the beginning
This commit was SVN r24936.
2011-07-25 06:30:09 +00:00
Ralph Castain
199804fc35 complete implementation of parseable output
This commit was SVN r24929.
2011-07-23 22:23:24 +00:00
Ralph Castain
ffe6f5f40e Fix map pack/unpack so they match
This commit was SVN r24928.
2011-07-23 22:23:05 +00:00
Ralph Castain
00647fa342 Update orte-ps to add parseable output - not fully tested because I couldn't get other parts of the system to work.
This commit was SVN r24927.
2011-07-23 20:20:31 +00:00
Ralph Castain
869024f1c6 You have to initialize the daemon param -before- using it to get epoch!!
This commit was SVN r24926.
2011-07-23 20:19:43 +00:00
Ralph Castain
361bcef253 Close multicast before rml
This commit was SVN r24925.
2011-07-23 20:19:15 +00:00
Jeff Squyres
d6bc78920e Add a few manual cleanups that were missed (i.e., this is the
''other'' direction, so to speak, compared to r24921).

This commit was SVN r24924.

The following SVN revision numbers were found above:
  r24921 --> open-mpi/ompi@bd96d028de
2011-07-22 21:05:39 +00:00
Jeff Squyres
5fd57dad37 Add in missing ARM.asm file (this is in addition to r24875, which
included a missing ARM directory).

This commit was SVN r24923.

The following SVN revision numbers were found above:
  r24875 --> open-mpi/ompi@ceabe91484
2011-07-22 20:04:50 +00:00
Ralph Castain
8a7f9f8997 Hide libevent symbols when internal thread support enabled
This commit was SVN r24922.
2011-07-22 19:49:47 +00:00
Jeff Squyres
bd96d028de George identified some memory leaks and inconsistencies in the F77 API
when sizeof(int) != sizeof(MPI_Fint).  This commit should fix those
problems. 

This commit was SVN r24921.
2011-07-22 19:49:27 +00:00
Ralph Castain
3f0d13efe2 Fix libevent internal thread support
This commit was SVN r24920.
2011-07-22 19:18:49 +00:00
Jeff Squyres
352cd5bc62 Update svn:ignore
This commit was SVN r24917.
2011-07-22 13:56:31 +00:00
Jeff Squyres
d95f2361f8 Handle "svn st" output, even if it has a "+" in the middle of the line
This commit was SVN r24915.
2011-07-21 22:42:03 +00:00
Shiqing Fan
edaa7b96e4 This should not be commented out.
This commit was SVN r24914.
2011-07-21 12:56:18 +00:00
Shiqing Fan
cc4403a863 Remove two unused windows files.
This commit was SVN r24913.
2011-07-21 12:53:32 +00:00
Shiqing Fan
665d1284be Fix a bug where the wrong temp string was memcpy'ed.
This commit was SVN r24912.
2011-07-21 12:53:03 +00:00
Brian Barrett
3bd66a5932 * Remove unused Portals3.3 reference implementation support
This commit was SVN r24906.
2011-07-20 23:30:29 +00:00
Brian Barrett
cc660fa57a Rather than looking for any path, look for any non-absolute path starting
in contrib/platform, in addition to cwd

This commit was SVN r24905.
2011-07-20 23:28:17 +00:00
Eugene Loh
921852e1e5 Clean up the computations of num_procs_alive. Do some code
refactoring to improve readability and to compute num_procs_alive
correctly and to remove the use of loop iteration variables for
two loops nested one inside another (causing MPI_Comm_spawn_multiple
to fail).

This commit was SVN r24903.
2011-07-14 20:10:48 +00:00
Ralph Castain
6201581544 Fix the symbol visibility issue for libevent by renaming all visible libevent symbols
This commit was SVN r24902.
2011-07-14 07:10:52 +00:00