Jeff Squyres
631967759f
Fix argument type.
...
This commit was SVN r25872.
2012-02-07 15:59:43 +00:00
Jeff Squyres
eb35cae681
Make the new Mprobe and friends functions be compiled in Fortran.
...
This commit was SVN r25871.
2012-02-07 15:58:58 +00:00
Jeff Squyres
092448f19e
Doh! Waiting for the test compile to finish ''before'' committing
...
reduces the potential for silly typos like this. :-(
This commit was SVN r25870.
2012-02-07 15:27:50 +00:00
Jeff Squyres
3c3994ede9
Ensure PMPI_Message_c2f and PMPI_Message_f2c are compiled.
...
This commit was SVN r25869.
2012-02-07 15:22:19 +00:00
Brian Barrett
25d48e22fa
Implementation of the MPI-3 Matched Probe functionality. Currently only
...
implemented in the OB1 PML, will return NOT_SUPPORTED in other PMLs.
This commit was SVN r25865.
2012-02-06 17:35:21 +00:00
Jeff Squyres
4eb22530ea
Missed this in last night's checkin: without it, Fortran
...
MPI_GET_LIBRARY_VERSION will cause compiler breakage on platforms
without weak symbols.
This commit was SVN r25863.
2012-02-03 15:59:55 +00:00
Jeff Squyres
feb8eb8a90
Add the new MPI-3 function MPI_GET_LIBRARY_VERSION.
...
This commit was SVN r25860.
2012-02-02 23:57:09 +00:00
Jeff Squyres
e7f0d2e091
We've been 2.1 compliant for quite a while now. :-)
...
This commit was SVN r25859.
2012-02-02 22:47:45 +00:00
Mike Dubman
6188ab7317
* ep init refactoring
...
* split ep_info into fragments to fit PMI limit
This commit was SVN r25857.
2012-02-02 15:00:47 +00:00
Vishwanath Venkatesan
15ebe838e9
Modified implementation of two_phase read all similar to the changes for the write all
...
incorporating romio style partitioning.
This commit was SVN r25853.
2012-02-01 18:30:13 +00:00
Vishwanath Venkatesan
158374bdd0
Dynamic file write all algorithm optimized by using derived datatype for receiving actual data
...
thereby avoiding the merging step in the fbtl.
This commit was SVN r25852.
2012-02-01 18:20:44 +00:00
Vishwanath Venkatesan
b9026ccbd0
Fix for two-phase generating flattened datatype using decoded iovec for
...
handling non-contiguous memory and contiguous file cases.
This commit was SVN r25850.
2012-02-01 17:23:51 +00:00
Jeff Squyres
4f35b62154
Components should not be linking to top-level libraries -- and
...
definitely should not be linking to more than libmpi.la! (remember
that libmpi.la now wholly contains libopen-rte.la, which wholly
contains libopen-pal.la).
This commit was SVN r25843.
2012-01-31 20:43:27 +00:00
Matthias Jurenz
f2362262a9
Pass STL configure test on every valid C++ compiler (see #2982 )
...
This commit was SVN r25833.
2012-01-31 15:39:11 +00:00
Matthias Jurenz
10697ec859
Fixed some of Jeff's suggestions to #2982
...
This commit was SVN r25817.
2012-01-30 15:36:01 +00:00
Mike Dubman
92873872f5
revert r25813
...
This commit was SVN r25816.
The following SVN revision numbers were found above:
r25813 --> open-mpi/ompi@8ed781d7e9
2012-01-30 13:22:38 +00:00
Matthias Jurenz
d154e81668
If configure of a contributed component fails, abort if it's requested via command line switch
...
This commit was SVN r25814.
2012-01-30 12:09:06 +00:00
Mike Dubman
8ed781d7e9
add mca param to enable/disable mxm
...
This commit was SVN r25813.
2012-01-30 11:14:20 +00:00
Mike Dubman
9f0ca9dfc0
fix: extract source from imm request fields instead from depricated api
...
This commit was SVN r25812.
2012-01-30 10:37:37 +00:00
Jeff Squyres
8e31f29f07
Fixes trac:2967: pretty much copy the text from MPI-2.2 in the description
...
of the group argument to MPI_COMM_CREATE.
cmr:v1.5:reviewer=jjhursey
cmr:v1.4.5:reviewer=jjhursey
This commit was SVN r25810.
The following Trac tickets were found above:
Ticket 2967 --> https://svn.open-mpi.org/trac/ompi/ticket/2967
2012-01-28 12:53:00 +00:00
Ralph Castain
a0edae52f2
Ensure the wrapper flags get entered in the right order, with -lpmi coming before the alps util libs
...
This commit was SVN r25809.
2012-01-27 20:56:21 +00:00
Matthias Jurenz
d2e180ffa0
Added field for VampirTrace support indicating whether it's enabled or not
...
This commit was SVN r25805.
2012-01-27 14:11:33 +00:00
Matthias Jurenz
085e3dc470
Added configure test for suitable C++ STL from Oracle Studio compiler.
...
VampirTrace needs the STLport library due to missing standard functionality in the default C++ STL (libCstd).
This commit was SVN r25804.
2012-01-27 13:11:09 +00:00
Nathan Hjelm
97dad0ac49
ugni: don't release eager fragments until we get local smsg completion
...
This commit was SVN r25796.
2012-01-27 00:32:43 +00:00
Nathan Hjelm
669f0afd14
ugni: poll smsg mailbox until it is empty
...
This commit was SVN r25794.
2012-01-26 20:50:09 +00:00
Nathan Hjelm
2a83297f96
silence vader warnings
...
This commit was SVN r25793.
2012-01-26 20:07:33 +00:00
Mike Dubman
6c954ad43f
set mxm to call opal_progress in tight loops
...
This commit was SVN r25788.
2012-01-26 18:33:43 +00:00
Nathan Hjelm
3d9bc68435
reorder udcm init/finalize code. fixes trac:2973
...
This commit was SVN r25787.
The following Trac tickets were found above:
Ticket 2973 --> https://svn.open-mpi.org/trac/ompi/ticket/2973
2012-01-26 16:28:55 +00:00
Jeff Squyres
7d1523cb00
Update the text; it was left over from autogen.sh days (i.e., it had
...
not been updated when we upgraded to autogen.pl).
This commit was SVN r25784.
2012-01-26 13:11:36 +00:00
George Bosilca
a9511779a6
Combined patch from Fujitsu. Fixes a collections of typos over the
...
code and man pages.
cmr:v1.4:reviewer=jsquyres and cmr:v1.5:reviewer=jsquyres
This commit was SVN r25781.
2012-01-26 04:22:00 +00:00
Shiqing Fan
2c9a4beffd
Add and remove a few components for windows build.
...
This commit was SVN r25775.
2012-01-25 09:01:27 +00:00
Nathan Hjelm
7b9bf6fe41
ugni: remove another erroneous error message
...
This commit was SVN r25768.
2012-01-23 21:23:01 +00:00
Nathan Hjelm
f3b60062cb
ugni: remove erroneous error message
...
This commit was SVN r25767.
2012-01-23 21:05:24 +00:00
Nathan Hjelm
521546aaa3
bug fix: ugni: pack only as many bytes as the pml requested
...
This commit was SVN r25766.
2012-01-23 17:21:45 +00:00
Ralph Castain
f03b82ab0a
Don't fiddle with the port_name memory as, per standard, this is an input-only parameter
...
This commit was SVN r25756.
2012-01-20 13:15:41 +00:00
Jeff Squyres
e114bd8a7b
Oops -- changed a field name that should not have been changed.
...
This commit was SVN r25754.
2012-01-20 11:02:18 +00:00
Ralph Castain
be3dfb6a1a
Ensure that we only add -lpmi once to the wrapper compilers, no matter how many components might use it.
...
This commit was SVN r25753.
2012-01-20 04:56:38 +00:00
Jeff Squyres
3661fe7a4e
More fixes so that picky compilers shut up about a local variable
...
named "index" shadowing a global function named "index".
This commit was SVN r25752.
2012-01-20 02:10:57 +00:00
Jeff Squyres
c77325fe42
Fix prototypes that cause a compiler error.
...
This commit was SVN r25751.
2012-01-20 02:07:44 +00:00
Jeff Squyres
5164d89815
Rename "index" parameters to "idx" so that picky compilers shut up
...
about "index" shadowing a global symbol name (i.e., the function
index()). And remove an unused #define.
This commit was SVN r25750.
2012-01-20 02:06:21 +00:00
Ralph Castain
f5c43e8d60
Get the header files into the tarball
...
This commit was SVN r25746.
2012-01-19 21:02:20 +00:00
Ralph Castain
7b65af28c6
Correct ordering in MPI_Init so that we do the modex prior to attempting to bind ourselves in the direct launch case as the modex contains info required for self-binding.
...
This commit was SVN r25742.
2012-01-19 18:38:58 +00:00
Brian Barrett
b2411fe131
Add support for MPI-3's MPI_COMM_SPLIT_TYPE function
...
This commit was SVN r25738.
2012-01-18 23:35:21 +00:00
Nathan Hjelm
804a494036
zero out ugni fragments in constructor
...
This commit was SVN r25731.
2012-01-17 19:52:26 +00:00
Shiqing Fan
f57f873404
Disable the debugger support for Windows.
...
This commit was SVN r25725.
2012-01-17 16:21:33 +00:00
Vishwanath Venkatesan
1e95d8b1e2
remove the MPI functions used in these files by the OMPI internal corresponding functionality and also add error checking in these for functions which did not have them'
...
This commit was SVN r25723.
2012-01-13 17:21:51 +00:00
George Bosilca
d58468f759
Correctly compute the aligned address when packing the
...
datatype description. Thanks to Fujitsu for the patch.
This commit was SVN r25721.
2012-01-12 19:15:22 +00:00
Rolf vandeVaart
16d676aa5b
Fix minor issue with CUDA. Cannot register overlappiing regions.
...
This commit was SVN r25716.
2012-01-12 13:00:42 +00:00
Ralph Castain
bf103de66c
My apologies for doing this outside of the usual time restrictions, but we need to get this in so we can make progress.
...
Move the ORTE-level debugger code back into orterun and out of the ORTE library to resolve symbol conflicts.
This commit was SVN r25713.
2012-01-11 15:53:09 +00:00
Samuel Gutierrez
63869c431b
init seg_num_procs_inited to zero before the atomic add.
...
This commit was SVN r25710.
2012-01-11 03:37:23 +00:00
Nathan Hjelm
96c1df8d90
clean up vader registration code
...
This commit was SVN r25704.
2012-01-10 22:33:22 +00:00
Edgar Gabriel
fb4d1a7099
remove the MPI functions used in this file by the OMPI internal corresponding functionality.
...
This commit was SVN r25703.
2012-01-10 19:55:05 +00:00
Nathan Hjelm
f65f6f5c39
bugfix: ugni: increase smsg mailbox size to a multiple of 4096
...
This commit was SVN r25702.
2012-01-10 19:50:25 +00:00
Mike Dubman
37dc53bbc9
mxm: return the MXM_REQ_SEND_SYNC flag to mxm_req_send
...
This commit was SVN r25694.
2012-01-06 18:56:28 +00:00
Mike Dubman
3b97d609a8
mtl_mxm: fix double free
...
This commit was SVN r25693.
2012-01-06 16:22:58 +00:00
Samuel Gutierrez
d1a44ecd34
send packed buffers instead of using iovecs in common sm rml. this commit will
...
hopefully resolve the periodic bus errors that some mtt tests have been
encountering.
This commit was SVN r25692.
2012-01-05 00:11:59 +00:00
Samuel Gutierrez
b3bfae129b
Increase format width in example pi.F to avoid Intel compiler warnings. Thanks to Mark Debbage from QLogic for the patch.
...
This commit was SVN r25691.
2012-01-04 16:44:16 +00:00
Rolf vandeVaart
9441f33981
Improve an error message. Replace tabs with spaces.
...
This commit was SVN r25688.
2012-01-03 15:19:01 +00:00
Rolf vandeVaart
8073f5002a
Some additional CUDA specific code.
...
Adding a few more support functions that will be used in future development.
This commit was SVN r25684.
2011-12-29 12:31:54 +00:00
Edgar Gabriel
e0139a2d7e
provide descriptions about the functionality of these frameworks.
...
This commit was SVN r25682.
2011-12-22 19:42:00 +00:00
Vishwanath Venkatesan
0f928be1d5
Modifying selection logic back to select two-phase at the cases it should.
...
This commit was SVN r25681.
2011-12-22 01:01:32 +00:00
Vishwanath Venkatesan
37c8470e3d
modified implementation for two-phase write_all incorporating romio style domain partitioning
...
This commit was SVN r25680.
2011-12-22 00:16:29 +00:00
Vishwanath Venkatesan
738a67b704
Removing duplicate code while setting default file view and using internal file-set-view for setting the default file view
...
This commit was SVN r25679.
2011-12-21 21:50:47 +00:00
Rolf vandeVaart
6ca186fb64
Delay some initialization until needed. This eliminates some
...
warnings and removes need for CUDA init before MPI_Init.
This commit was SVN r25678.
2011-12-21 15:21:57 +00:00
Matthias Jurenz
3ed2329643
Changes to OTF:
...
- tests/thumbnail:
- removed unnecessary header include (stdbool.h) that breaks build on Solaris
Changes to VT:
- configure:
- fixed detection of Open64 compilers for automatic instrumentation
- VT libs:
- fixed non-increasing timestamps when flushing the trace buffer: check trace status after calling vt_update_counter() to prevent function leave events from recording, if maximum buffer flushes are reached
- calculate fixed record lengths only once when creating new buffer entries
- vtunify[-mpi]:
- minor code-optimization: use ++it instead of it++ in for-loops to prevent unnecessary copying of objects
This commit was SVN r25674.
2011-12-19 08:40:35 +00:00
Samuel Gutierrez
519f71ab7e
silences valgrind warning in common sm (Syscall param writev(vector[...]) points
...
to uninitialised byte(s)). probably also silences a large stack allocation
warning in coverity.
This commit was SVN r25666.
2011-12-16 23:17:48 +00:00
Samuel Gutierrez
0ca6603fa0
remove some unused cruft in shmem. minor common sm cleanup.
...
This commit was SVN r25665.
2011-12-16 22:43:55 +00:00
Jeff Squyres
3fecac10a9
Fix typos: these should be OMPI_DECLSPEC, not ORTE_DECLSPEC.
...
This commit was SVN r25661.
2011-12-15 18:40:25 +00:00
George Bosilca
e3a373335d
If threads are not supported but were required we will go for SERIALIZED,
...
otherwise whatever was required.
This commit was SVN r25653.
2011-12-15 02:23:21 +00:00
Matthias Jurenz
58eb61bc46
Changes to OTF:
...
- otfprofile[-mpi]:
- added missing header includes for WEXITSTATUS
This commit was SVN r25637.
2011-12-14 08:20:26 +00:00
Nathan Hjelm
71527c8058
minor ugni btl code cleanup
...
This commit was SVN r25618.
2011-12-10 08:20:46 +00:00
Nathan Hjelm
c8a4687402
don't set SIGSEGV to default
...
This commit was SVN r25610.
2011-12-09 21:54:05 +00:00
Nathan Hjelm
e03d23d96e
Intial support for Cray's uGNI interface (XE-6/XK-6)
...
This commit was SVN r25608.
2011-12-09 21:24:07 +00:00
Matthias Jurenz
701d4edc2b
Changes to VT:
...
- removed unnecessary trailing ';' from several macro definitions
This commit was SVN r25598.
2011-12-08 13:24:39 +00:00
Matthias Jurenz
b3f4fca3f4
Changes to VT:
...
- Use own implementation of assert() to work around a compiler bug (seen on MacOS using GCC v4.2.1):
The linker results in an undefined reference to ___builtin_expect() when using assert() within OpenMP-parallel regions.
This commit was SVN r25595.
2011-12-08 12:20:05 +00:00
Nathan Hjelm
87b7e85d53
rfc timeout. retry registration after removing old registration from lru
...
This commit was SVN r25587.
2011-12-07 18:20:44 +00:00
Josh Hursey
e56b4de2c9
Fixes trac:2550 : Cleanup comment in crcp_bkmrk_pml.h
...
This commit was SVN r25585.
The following Trac tickets were found above:
Ticket 2550 --> https://svn.open-mpi.org/trac/ompi/ticket/2550
2011-12-07 14:50:04 +00:00
Matthias Jurenz
05549e889b
Changes to VT:
...
- fixed a bug (potential segfault) in the MPI wrapper functions MPI_Gatherv and MPI_Scatterv which occurred due to illegal access to insignificant parameters on non-root ranks
- vtdyn:
- stop instrumenting if an error occurred during finalizing instrumentation set
- vtunify-mpi:
- added option '--stats' to unify only summarized information, no events
- reduced memory usage on rank 0: immediately send token translation tables to the corresponding worker ranks when they are complete
- send the "finished-flag" together with the last set of definitions read to rank 0 instead of sending an extra message
- OPARI:
- fixed detection of DO loop beginnings; If there is a variable which contains "do" in its name it was detected as DO loop :-(
- fixed processing of Fortran line-continuation appearing after a complete OpenMP directive
This commit was SVN r25584.
2011-12-07 12:26:38 +00:00
Jeff Squyres
c10f41c87e
Do not build these frameworks when --disable-mpi-io is specified.
...
Fixes some Cisco MTT MPI install errors.
This commit was SVN r25566.
2011-12-02 22:11:23 +00:00
Jeff Squyres
718e95194c
Minor typo in a comment
...
This commit was SVN r25565.
2011-12-02 20:46:38 +00:00
Ralph Castain
07655e2945
Handle the case where the allocator "fibs" to us about the node names. In some cases (ahem...you know who you are!), the allocator will tell us a node number (e.g., "16"). However, the daemon will return a node name (e.g., "nid0016") - leaving us not recognizing its location.
...
So provide a new parameter (can't have too many!) that handles this situation by stripping the prefix from the returned node name. Also do a little cleanup to ensure we cleanly exit from errors, without generating too many annoying messages.
This commit was SVN r25562.
2011-12-02 14:10:08 +00:00
Ralph Castain
357ac14530
Can't return a numerical value here
...
This commit was SVN r25559.
2011-12-02 10:36:57 +00:00
Nathan Hjelm
bb1fec0407
added put/get btl descriptor flags
...
This commit was SVN r25553.
2011-11-30 21:37:23 +00:00
Ralph Castain
c56acf60ca
Although we never really thought about it, we made an unconscious assumption in the mapper system - we assumed that the daemons would be placed on nodes in the order that the nodes appear in the allocation. In other words, we assumed that the launch environment would map processes in node order.
...
Turns out, this isn't necessarily true. The Cray, for example, launches processes in a toroidal pattern, thus causing the daemons to wind up somewhere other than what we thought. Other environments (e.g., slurm) are also capable of such behavior, depending upon the default mapping algorithm they are told to use.
Resolve this problem by making the daemon-to-node assignment in the affected environments when the daemon calls back and tells us what node it is on. Order the nodes in the mapping list so they are in daemon-vpid order as opposed to the order in which they show in the allocation. For environments that don't exhibit this mapping behavior (e.g., rsh), this won't have any impact.
Also, clean up the vm launch procedure a little bit so it more closely aligns with the state machine implementation that is coming, and remove some lingering "slave" code.
This commit was SVN r25551.
2011-11-30 19:58:24 +00:00
Jeff Squyres
d71492108c
(this is what r25545 should have been)
...
Per http://www.open-mpi.org/community/lists/users/2011/11/17862.php ,
to make MPI_IN_PLACE (and other sentinel Fortran constants) work on OS
X, we need to use the following compiler (linker) flag:
-Wl,-commons,use_dylibs
So if we're compiling on OS X, test to see if that flag works with the
compiler. If so, add it to the wrapper FFLAGS and FCFLAGS (note that
per a future update, we'll only have one Fortran compiler anyway).
Fixes trac:1982.
This commit was SVN r25547.
The following SVN revision numbers were found above:
r25545 --> open-mpi/ompi@7f9ae11faf
The following Trac tickets were found above:
Ticket 1982 --> https://svn.open-mpi.org/trac/ompi/ticket/1982
2011-11-29 23:28:38 +00:00
Jeff Squyres
6fbbfd0f7a
Gah! r25545 acidentally included ''waaaay'' more stuff than it was
...
supposed to. I.e., half-baked/not complete stuff.
This commit backs out all of r25545. Sorry folks!
This commit was SVN r25546.
The following SVN revision numbers were found above:
r25545 --> open-mpi/ompi@7f9ae11faf
2011-11-29 23:24:52 +00:00
Jeff Squyres
7f9ae11faf
Per http://www.open-mpi.org/community/lists/users/2011/11/17862.php ,
...
to make MPI_IN_PLACE (and other sentinel Fortran constants) work on OS
X, we need to use the following compiler (linker) flag:
-Wl,-commons,use_dylibs
So if we're compiling on OS X, test to see if that flag works with the
compiler. If so, add it to the wrapper FFLAGS and FCFLAGS (note that
per a future update, we'll only have one Fortran compiler anyway).
Fixes trac:1982.
This commit was SVN r25545.
The following Trac tickets were found above:
Ticket 1982 --> https://svn.open-mpi.org/trac/ompi/ticket/1982
2011-11-29 23:05:54 +00:00
Terry Dontje
5209de048c
add code to service_thread_start to handle EBADF returns from select. This commit fixes trac:2922.
...
This commit was SVN r25520.
The following Trac tickets were found above:
Ticket 2922 --> https://svn.open-mpi.org/trac/ompi/ticket/2922
2011-11-29 16:49:59 +00:00
Samuel Gutierrez
375162c693
this commit fixes a few things. 1. silence warning in common sm. 2. remove unneeded config code in common sm. 3. move opal_shmem_base_close to a better place in opal_finalize. 4. fix opal_path_nfs output.
...
This commit was SVN r25518.
2011-11-28 23:41:19 +00:00
George Bosilca
0bd2bf9aae
The number of segments accepted should be bounded by MCA_BTL_DES_MAX_SEGMENTS
...
and not by 2.
This commit was SVN r25515.
2011-11-28 17:19:12 +00:00
Nathan Hjelm
f8c8c641f1
added asserts to warn developers that ob1/csum match fragments do not support more than 2 segments
...
This commit was SVN r25514.
2011-11-28 16:12:25 +00:00
Samuel Gutierrez
b4edf0ff5c
getting ready for 1.5 port of the shared memory enhancements. remove some unused/unneeded stuff and minor style update.
...
This commit was SVN r25513.
2011-11-28 16:08:32 +00:00
Matthias Jurenz
6b879bcf6e
Changes to OTF:
...
- otfprofile[-mpi]:
- fixed compile error with the PGI compiler
Changes to VT:
- added support for LIBC [I/O] tracing on Cray XT platforms
- vtrun:
- do preload Dyninst runtime library (DYNINSTAPI_RT_LIB) when
instrumenting user functions by Dyninst
This commit was SVN r25505.
2011-11-25 11:51:08 +00:00
Ralph Castain
9b59d8de6f
This is actually a much smaller commit than it appears at first glance - it just touches a lot of files. The --without-rte-support configuration option has never really been implemented completely. The option caused various objects not to be defined and conditionally compiled some base functions, but did nothing to prevent build of the component libraries. Unfortunately, since many of those components use objects covered by the option, it caused builds to break if those components were allowed to build.
...
Brian dealt with this in the past by creating platform files and using "no-build" to block the components. This was clunky, but acceptable when only one organization was using that option. However, that number has now expanded to at least two more locations.
Accordingly, make --without-rte-support actually work by adding appropriate configury to prevent components from building when they shouldn't. While doing so, remove two frameworks (db and rmcast) that are no longer used as ORCM comes to a close (besides, they belonged in ORCM now anyway). Do some minor cleanups along the way.
This commit was SVN r25497.
2011-11-22 21:24:35 +00:00
George Bosilca
61f273b987
Do not tolerate uninitialized variables.
...
This commit was SVN r25489.
2011-11-18 10:19:24 +00:00
Matthias Jurenz
4ec105182e
Updated integrated VT to version 5.12
...
This commit was SVN r25483.
2011-11-17 10:02:38 +00:00
Ralph Castain
6310361532
At long last, the fabled revision to the affinity system has arrived. A more detailed explanation of how this all works will be presented here:
...
https://svn.open-mpi.org/trac/ompi/wiki/ProcessPlacement
The wiki page is incomplete at the moment, but I hope to complete it over the next few days. I will provide updates on the devel list. As the wiki page states, the default and most commonly used options remain unchanged (except as noted below). New, esoteric and complex options have been added, but unless you are a true masochist, you are unlikely to use many of them beyond perhaps an initial curiosity-motivated experimentation.
In a nutshell, this commit revamps the map/rank/bind procedure to take into account topology info on the compute nodes. I have, for the most part, preserved the default behaviors, with three notable exceptions:
1. I have at long last bowed my head in submission to the system admin's of managed clusters. For years, they have complained about our default of allowing users to oversubscribe nodes - i.e., to run more processes on a node than allocated slots. Accordingly, I have modified the default behavior: if you are running off of hostfile/dash-host allocated nodes, then the default is to allow oversubscription. If you are running off of RM-allocated nodes, then the default is to NOT allow oversubscription. Flags to override these behaviors are provided, so this only affects the default behavior.
2. both cpus/rank and stride have been removed. The latter was demanded by those who didn't understand the purpose behind it - and I agreed as the users who requested it are no longer using it. The former was removed temporarily pending implementation.
3. vm launch is now the sole method for starting OMPI. It was just too darned hard to maintain multiple launch procedures - maybe someday, provided someone can demonstrate a reason to do so.
As Jeff stated, it is impossible to fully test a change of this size. I have tested it on Linux and Mac, covering all the default and simple options, singletons, and comm_spawn. That said, I'm sure others will find problems, so I'll be watching MTT results until this stabilizes.
This commit was SVN r25476.
2011-11-15 03:40:11 +00:00
Jeff Squyres
e8dcad6017
This typo has been here since August 2005. :-)
...
This commit was SVN r25468.
2011-11-11 03:01:52 +00:00
Brian Barrett
45a27e4f9f
For now, ignore LINK event
...
This commit was SVN r25467.
2011-11-11 02:49:03 +00:00
Brad Benton
96395c916e
de-tab'd
...
This commit was SVN r25465.
2011-11-09 19:45:12 +00:00
Brad Benton
0712b911a5
Updated IBM copyright
...
This commit was SVN r25464.
2011-11-09 19:38:53 +00:00
Mike Dubman
00c27afd52
fix pid
...
This commit was SVN r25463.
2011-11-09 17:53:59 +00:00
Nathan Hjelm
d603f31976
removed ptr member from seg_key union
...
This commit was SVN r25460.
2011-11-08 15:44:54 +00:00
Mike Dubman
71398b658e
fix: OMPI_ERR_CONNECTION_FAILED available in v1.5, unavailable in trunk
...
This commit was SVN r25459.
2011-11-08 12:34:01 +00:00
George Bosilca
3d318a4c26
Put the interface of our MPIR support in sync with the document accepted by the MPI
...
Forum (http://www.mpi-forum.org/docs/mpir-specification-10-11-2010.pdf ).
This commit was SVN r25456.
2011-11-08 01:24:16 +00:00
Mike Dubman
4cf9e1323d
fix: return correct error on connection failure
...
This commit was SVN r25452.
2011-11-07 06:13:17 +00:00
Nathan Hjelm
8962ce25b0
fixed some compiler errors caused by seg_key changes. osc/rdma may need to be updated to use btls that use 128 bit segment keys
...
This commit was SVN r25448.
2011-11-06 20:19:14 +00:00
Samuel Gutierrez
e03bc93fb7
only use pmi grpcomm and pubsub during the direct launch case. use PMI environment variable to setup vpid in ess alps on cray xe systems. add pmi test code.
...
This commit was SVN r25447.
2011-11-06 17:28:40 +00:00
Nathan Hjelm
520a7c570e
changes to seg_key needed for a new btl
...
This commit was SVN r25445.
2011-11-06 16:19:09 +00:00
Jeff Squyres
38451d4972
Add the MPI API version to the ompi_info output. How did we never
...
have this in there before?
This commit was SVN r25437.
2011-11-04 23:30:59 +00:00
Rolf vandeVaart
f777fe8eba
Change tab to spaces.
...
This commit was SVN r25433.
2011-11-04 17:18:30 +00:00
Christopher Yeoh
fb57a74a40
Removes pointless memmove which because of a previous memcpy will always
...
have identical source and destination pointers. See #2871
Plugs a couple of minor memory leaks related to remote qp info
This commit was SVN r25431.
2011-11-04 00:15:08 +00:00
Jeff Squyres
1d6d39d2ea
Missed this free/re-strdup
...
This commit was SVN r25426.
2011-11-03 11:31:37 +00:00
Jeff Squyres
6139256e45
v may get incremented, so be sure to save the ''original'' strdup'ed
...
pointer and free ''that'' -- not the (possibly incremented) pointer
This commit was SVN r25425.
2011-11-03 11:23:17 +00:00
Mike Dubman
7595a80a63
fix self pid
...
This commit was SVN r25424.
2011-11-03 06:46:20 +00:00
Nathan Hjelm
211e2dbdf3
clean up tab characters
...
This commit was SVN r25413.
2011-11-02 15:07:57 +00:00
Jeff Squyres
12d4280d0b
Fix a bunch of memory leaks
...
This commit was SVN r25411.
2011-11-01 20:22:49 +00:00
Ralph Castain
14966e0f8f
Cleanup PMI startup - if a component isn't selected, it should finalize PMI IFF it started it. Otherwise, components that aren't selected can finalize PMI when it is in use by other parts of the system.
...
This commit was SVN r25407.
2011-11-01 16:25:12 +00:00
Mike Dubman
3edd77ea25
update mxm plugin to mxm api change: pass synchronous request as an opcode instead of a flag
...
This commit was SVN r25403.
2011-10-31 22:36:15 +00:00
Mike Dubman
6b50ba22a6
select mxm ptl based on user preferences
...
This commit was SVN r25401.
2011-10-31 10:17:43 +00:00
Samuel Gutierrez
0ba13e2f8e
fix typo. use PMI_Initialized for init status instead of PMI_Init.
...
This commit was SVN r25378.
2011-10-27 22:41:50 +00:00
Nathan Hjelm
ee087de073
added fast boxes to vader
...
This commit was SVN r25376.
2011-10-27 20:22:46 +00:00
Mike Dubman
f96ae43e23
pass jobid to mxm/sm module
...
This commit was SVN r25375.
2011-10-27 13:14:52 +00:00
Nathan Hjelm
82efe131dc
made btl_vader_max_inline_send a configurable parameter and updated and enabled sendi
...
This commit was SVN r25374.
2011-10-26 22:15:42 +00:00
Nathan Hjelm
033179d6ac
fixed bug in frag initialization
...
This commit was SVN r25373.
2011-10-26 19:29:37 +00:00
George Bosilca
6fdb040eef
ORTE_ERROR to OPAL_ERROR.
...
This commit was SVN r25372.
2011-10-26 15:59:43 +00:00
George Bosilca
9d8e84142f
Survivor!!!
...
This commit was SVN r25371.
2011-10-26 00:58:55 +00:00
Samuel Gutierrez
ae66347c7a
added GNI configure script.
...
This commit was SVN r25370.
2011-10-25 22:15:16 +00:00
Nathan Hjelm
05114ffb51
fixed off by one error
...
This commit was SVN r25369.
2011-10-25 22:07:47 +00:00
George Bosilca
72f731f25f
The SM2 collective component has not been updated in a long
...
time. Rich, the original developer, agrees with this removal.
This commit was SVN r25368.
2011-10-25 22:07:09 +00:00
Nathan Hjelm
e887d595c7
fix potential bug with non-contiguous sends
...
This commit was SVN r25367.
2011-10-25 19:21:45 +00:00
Nathan Hjelm
433cfa3665
use single copy for some sends
...
This commit was SVN r25365.
2011-10-25 18:38:42 +00:00
Mike Dubman
9ffeeb69d9
fix help message
...
This commit was SVN r25364.
2011-10-25 14:02:43 +00:00
Samuel Gutierrez
663f4546f5
fix define typo in psm mtl.
...
This commit was SVN r25362.
2011-10-24 18:38:12 +00:00
Ralph Castain
955d8e7d46
Allow apps to use pmi when launched by mpirun, if desired, without affecting daemons
...
This commit was SVN r25359.
2011-10-23 15:57:13 +00:00
Nathan Hjelm
fb19f56965
Cray doesn't define PMI2_SUCCESS
...
This commit was SVN r25354.
2011-10-21 16:34:22 +00:00
Nathan Hjelm
cd68dbe2b8
only try to build vader if xpmem is installed. unignore vader
...
This commit was SVN r25352.
2011-10-21 15:45:05 +00:00
Ralph Castain
3e72fccacf
Cray's PMI implementation is quite different from slurm's - they extended PMI-1 by adding some, but not all, of the PMI-2 APIs. So you can't just switch to using PMI-2 functions as it isn't a complete implementation. Instead, you have to selectively figure out which ones they have in PMI-2, and use any missing ones from PMI-1. What fun.
...
Modify the configure logic and the PMI components to accommodate Cray's approach. Refactor the PMI error reporting code so it resides in only one place. Cray actually decided -not- to define the PMI-2 error codes, so we have to use the PMI-1 codes instead. More fun.
This commit was SVN r25348.
2011-10-21 04:54:38 +00:00
Ralph Castain
e2adc8fa3a
Ignore until Nathan can fix - probably configure problem
...
This commit was SVN r25347.
2011-10-21 03:43:01 +00:00
Ralph Castain
5947f61b86
Remove windows reference for now
...
This commit was SVN r25346.
2011-10-21 01:19:03 +00:00
Nathan Hjelm
414677a082
default to no xpmem support
...
This commit was SVN r25345.
2011-10-20 22:13:45 +00:00
Nathan Hjelm
808a73a5c5
removed erroneous add of .deps
...
This commit was SVN r25343.
2011-10-20 21:41:51 +00:00
Nathan Hjelm
3dbaaf6879
initial commit of vader (xpmem) btl
...
This commit was SVN r25342.
2011-10-20 21:39:44 +00:00
Nathan Hjelm
586403f052
more pmi return code wtf
...
This commit was SVN r25337.
2011-10-20 17:53:04 +00:00
Ralph Castain
43e35486a4
Correct flag type - thanks George!
...
This commit was SVN r25335.
2011-10-20 04:00:13 +00:00
Nathan Hjelm
e1e8837992
add a uintptr_t to the seg_key union
...
This commit was SVN r25334.
2011-10-19 21:48:52 +00:00
George Bosilca
78751b3b2d
Put back the OPI errors after the ORTE one.
...
This commit was SVN r25333.
2011-10-19 20:57:13 +00:00
Ralph Castain
b44f8d4b28
Complete implementation of the ess.proc_get_locality API. Up to this point, the API was only capable of telling if the specified proc was sharing a node with you. However, the returned value was capable of telling you much more detailed info - e.g., if the proc shares a socket, a cache, or numa node. We just didn't have the data to provide that detail.
...
Use hwloc to obtain the cpuset for each process during mpi_init, and share that info in the modex. As it arrives, use a new opal_hwloc_base utility function to parse the value against the local proc's cpuset and determine where they overlap. Cache the value in the pmap object as it may be referenced multiple times.
Thus, the return value from orte_ess.proc_get_locality is a 16-bit bitmask that describes the resources being shared with you. This bitmask can be tested using the macros in opal/mca/paffinity/paffinity.h
Locality is available for all procs, whether launched via mpirun or directly with an external launcher such as slurm or aprun.
This commit was SVN r25331.
2011-10-19 20:18:14 +00:00
George Bosilca
1bc5da0911
These are supposed to be OPAL level errors.
...
This commit was SVN r25326.
2011-10-19 14:22:09 +00:00
Ralph Castain
72a4b0bd8a
Fix constants
...
This commit was SVN r25325.
2011-10-19 14:14:58 +00:00
George Bosilca
a5f24bcdcf
The error here is meaningless.
...
This commit was SVN r25324.
2011-10-19 13:04:46 +00:00
George Bosilca
efd88e10d7
Cleanup the error codes. Get rid of all the useless ones, and
...
mark the distinction between ORTE and OMPI errors.
This commit was SVN r25323.
2011-10-19 03:51:53 +00:00
George Bosilca
c453614f8b
A more meaningful name for this function (mpi_proc_complete_init
...
instead of ompi_proc_set_arch). Change the comment to reflect the
real behavior of the function.
This commit was SVN r25312.
2011-10-18 02:54:38 +00:00
Ralph Castain
0bf4f48aa3
Don't need priority in this framework
...
This commit was SVN r25308.
2011-10-17 22:39:15 +00:00
Ralph Castain
8f0ef54130
Complete implementation of pmi support. Ensure we support both mpirun and direct launch within same configuration to avoid requiring separate builds. Add support for generic pmi, not just under slurm. Add publish/subscribe support, although slurm's pmi implementation will just return an error as it hasn't been done yet.
...
This commit was SVN r25303.
2011-10-17 20:51:22 +00:00
Ralph Castain
e7f6be5385
Unused variable
...
This commit was SVN r25301.
2011-10-17 18:59:22 +00:00
Ralph Castain
2eaadcfab9
Remove unused variable
...
This commit was SVN r25284.
2011-10-14 15:32:18 +00:00
Vishwanath Venkatesan
8dd07bdceb
Removing .ompi_ignore and .ompi_unignore from fs/pvfs2 and fbtl/pvfs2
...
This commit was SVN r25283.
2011-10-14 00:40:11 +00:00
Vishwanath Venkatesan
8f6b29e95b
Fixing the default file view issue and merging contiguous lengths and offsets
...
for explicit offset case.
This commit was SVN r25281.
2011-10-13 19:50:45 +00:00
Jeff Squyres
2c6254b70d
Second change from Intel.
...
This commit was SVN r25279.
2011-10-12 23:26:34 +00:00
Jeff Squyres
28118d0611
Updte the parameters for the Intel iWARP devices, per request from
...
Faisal Latif <faisal.latif@intel.com>.
This commit was SVN r25278.
2011-10-12 22:58:30 +00:00
Brian Barrett
d8b5b544ad
Update list name to match change in spec
...
This commit was SVN r25273.
2011-10-12 20:09:39 +00:00
Rainer Keller
4e6a6fc146
- Check, whether the compiler supports __builtin_clz (count leading
...
zeroes);
if so, use it for bit-operations like opal_cube_dim and opal_hibit.
Implement two versions of power-of-two.
In case of opal_next_poweroftwo, this reduces the average execution
time from 83 cycles to 4 cycles (Intel Nehalem, icc, -O2, inlining,
measured rdtsc, with loop over 2^27 values).
Numbers for other functions are similar (but of course heavily depend
on the usage, e.g. opal_hibit() with a start of 4 does not save
much). The bsr instruction on AMD Opteron is also not as fast.
- Replace various places where the next power-of-two is computed.
Tested on Intel Nehalem Cluster with openib, compilers GNU-4.6.1 and
Intel-12.0.4 using mpi_testsuite -t "Collective" with 128 processes.
This commit was SVN r25270.
2011-10-11 22:49:01 +00:00
George Bosilca
74c88a9e48
This was never used (sm_ctl_header).
...
This commit was SVN r25267.
2011-10-11 20:37:00 +00:00
George Bosilca
ca6c282f23
Small cleanups in the SM BTL.
...
This commit was SVN r25266.
2011-10-11 20:32:10 +00:00
George Bosilca
3241bea696
Apply a patch provided by Sébastien Boisvert fixing an issue
...
with the probe fairness.
This commit was SVN r25265.
2011-10-11 20:28:33 +00:00
George Bosilca
4fd78c4683
Keep track of the last probe on each communicator, so we can probe all
...
peers in a round-robin fashion. A little bit more fair ...
This commit was SVN r25264.
2011-10-11 20:24:54 +00:00
Jeff Squyres
951c745590
We always have hwloc xml support (now that it's built into to hwloc
...
without needing libxml2). So OPAL_HAVE_HWLOC_XML is no longer
necessary.
This commit was SVN r25263.
2011-10-11 20:20:59 +00:00
George Bosilca
2fefd3a928
Don't forget to move the pointer back by the true_lb.
...
This commit was SVN r25262.
2011-10-11 20:15:49 +00:00
Yevgeny Kliteynik
a503bd53b7
Fixing the librdmacm dependency for build process
...
If a user specifically asks for rdmacm support in configure script and
librdmacm (usual and devel) libraries are not found, configure script
would abort.
If a user didn't specify anything, and rdmacm libraries are not found,
configure script will continue after issuing warning message:
"Please install librdmacm and librdmacm-devel or disable rdmacm support"
-- YK
This commit was SVN r25253.
2011-10-11 09:10:06 +00:00
George Bosilca
649af6c925
Enumerated mixed with another type (int) is tolerated but
...
easily fixable.
This commit was SVN r25241.
2011-10-09 03:54:52 +00:00
Rainer Keller
61ffabed44
- Fix typo in _OMPI_CXX_CHECK_BUILTIN_BACKEND:
...
frontend checks have_cxx-variant, yet the backend (if run)
defined have_-variant; therefore these were never set.
This commit was SVN r25233.
2011-10-05 08:21:59 +00:00
George Bosilca
07f6ce235f
Return an OMPI_ error not an ORTE_.
...
This commit was SVN r25232.
2011-10-04 14:57:24 +00:00
George Bosilca
ce7935c8fa
Obviously these were not needed.
...
This commit was SVN r25231.
2011-10-04 14:56:34 +00:00
George Bosilca
80c02647c8
Each level (OPAL/ORTE/OMPI) should only return it's own constants,
...
instead of the current mismatch.
This commit was SVN r25230.
2011-10-04 14:50:31 +00:00
Mike Dubman
7a9ae43276
added support for shared memory transport in mxm
...
This commit was SVN r25220.
2011-10-03 12:59:55 +00:00
Brian Barrett
fc29ffebdb
* remove two aborts that aren't necessary
...
This commit was SVN r25214.
2011-09-29 22:27:23 +00:00
Brian Barrett
14f32a1a54
* Clean up progress function
...
* Only print returnable errors when verbose=1. Still print errors when
we're going to abort, since those obviously aren't returnable
This commit was SVN r25213.
2011-09-29 22:26:33 +00:00
Brian Barrett
758f8a4d87
* More debugging output
...
* Make recv short block events use the callback mechanism so that can
add overflow debugging
This commit was SVN r25212.
2011-09-29 21:59:48 +00:00
Brian Barrett
c08ea5c0f5
Set options correctly for the two pts
...
This commit was SVN r25211.
2011-09-29 21:56:37 +00:00
Brian Barrett
05f800abae
Properly unpack data for long unexpected
...
This commit was SVN r25210.
2011-09-29 17:25:45 +00:00
Rolf vandeVaart
3d8c6b83a9
Make some error messages more helpful
...
This commit was SVN r25209.
2011-09-29 16:32:46 +00:00
Brian Barrett
bb9e73232a
* Leverage hdr_data and opcount to improve debugging
...
* Clean up handling of short synchronous messages
This commit was SVN r25208.
2011-09-28 21:18:47 +00:00
Brian Barrett
71d8300607
* Fix name clash with macros in mtl_portals4.h
...
* hdr_data now includes opcount and length for all messages, which is the match
bits for long and rndv messages
* Re-add probe implementation
This commit was SVN r25207.
2011-09-28 16:53:01 +00:00
Brian Barrett
2fb8045fad
clean up printfs
...
This commit was SVN r25206.
2011-09-28 15:28:46 +00:00
Brian Barrett
26e781f002
* Remove triggered code for now
...
* Move from per-endpoint send/recv count to just send side op count
This commit was SVN r25205.
2011-09-28 15:25:39 +00:00
Brian Barrett
592c1ab6db
* revert probe and size information changes, since it seems to break everything
...
This commit was SVN r25204.
2011-09-28 14:57:19 +00:00
Brian Barrett
211b5c7824
* Make triggered protocol only work for non-wildcard receives
...
* Always encode length in header data to make probe work
* General send/receive cleanups
* Implement iprobe
This commit was SVN r25197.
2011-09-27 22:45:00 +00:00
Brian Barrett
77c560be42
updates to match new api changes
...
This commit was SVN r25196.
2011-09-27 20:38:22 +00:00
Brad Benton
0f2475c554
Modified set_remote_info() to use memmove() instead of memcpy() when
...
copying rem_qp info. This avoids potential errors when src & dest overlap.
This is a workaround for the issue in #2871
This commit was SVN r25180.
2011-09-26 20:07:36 +00:00
Jeff Squyres
6a221af009
Minor text fix suggested by Jeremiah Willcock.
...
This commit was SVN r25173.
2011-09-21 20:05:19 +00:00
Vishwanath Venkatesan
2ee2b478d8
Modifying selection logic to select dynamic for cases where two_phase
...
fails.
This commit was SVN r25171.
2011-09-20 21:57:23 +00:00
Pavel Shamis
29c4981caa
Removing unused include from openib/ofud btls.
...
This include causes compilation failure on macos platform.
This commit was SVN r25170.
2011-09-20 19:25:59 +00:00
Rolf vandeVaart
0749a220e8
Add support for MPI_IN_PLACE to MPI_Exscan. Required for MPI 2.2 compliance.
...
Reviewed by Jeff Squyres. This fixes trac:2221.
This commit was SVN r25165.
The following Trac tickets were found above:
Ticket 2221 --> https://svn.open-mpi.org/trac/ompi/ticket/2221
2011-09-20 14:54:41 +00:00
Nathan Hjelm
98b56108c4
add unconnect datagram connection manager (udcm)
...
This commit was SVN r25160.
2011-09-19 21:24:58 +00:00
Nathan Hjelm
8cd550f49f
fixed error in last commit
...
This commit was SVN r25159.
2011-09-19 21:13:59 +00:00
Nathan Hjelm
de950959ee
print a more meaningful error message when ibv_create_qp fails
...
This commit was SVN r25158.
2011-09-19 21:12:22 +00:00
Josh Hursey
2d25d70a1c
Missing header for opal_timer_base_get_cycles
...
This commit was SVN r25157.
2011-09-19 19:52:58 +00:00
George Bosilca
9687e7f38e
This commit fixes trac:2679 and should be added to cmr:v1.4:reviewer=jsquyres
...
and cmr:v1.5:reviewer=jsquyres
This commit was SVN r25155.
The following Trac tickets were found above:
Ticket 2679 --> https://svn.open-mpi.org/trac/ompi/ticket/2679
2011-09-18 00:58:26 +00:00
Jeff Squyres
ecd603256a
* Rename opal_hwloc_components to opal_hwloc_base_components
...
* Fix some comments
This commit was SVN r25150.
2011-09-17 11:54:36 +00:00