Howard Pritchard
bd9d185951
pmix/cray: remove workaround for OBJ_RELEASE
...
Per feedback from rhc, manually set the base_ptr member
of the opal_buffer_t variable to NULL prior to calling
OBJ_RELEASE. A similar feature of opal_dss.load also
exists so likewise reset the base_ptr to NULL prior to
invoking it.
Hopefully the opal_buffer_t struct does not change
frequently.
Minor cleanups to reduce output when the pmix_base_verbose
MCA parameter is set.
2015-02-13 07:47:26 -08:00
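The base_ptr reset described in this commit can be sketched as follows. This is a minimal illustration, not the Open MPI code: `example_buffer_t`, `example_buffer_release`, and `example_release_without_freeing` are hypothetical stand-ins, and the sketch assumes the release path frees whatever `base_ptr` points at, which is presumably why the pointer is cleared first.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for opal_buffer_t; only the member named in
 * the commit message is modeled here. */
typedef struct {
    char *base_ptr;
} example_buffer_t;

/* Hypothetical stand-in for OBJ_RELEASE on a buffer. ASSUMPTION: the
 * release path frees the memory that base_ptr points at. */
void example_buffer_release(example_buffer_t *buf)
{
    free(buf->base_ptr);   /* free(NULL) is a no-op */
    free(buf);
}

/* Wrap memory that the caller still owns, then release the wrapper
 * without freeing that memory. Returns 0 on success, -1 on failure. */
int example_release_without_freeing(char *borrowed)
{
    assert(borrowed != NULL);

    example_buffer_t *buf = malloc(sizeof(*buf));
    if (buf == NULL) {
        return -1;
    }
    buf->base_ptr = borrowed;

    /* ... the buffer would be used here ... */

    buf->base_ptr = NULL;  /* detach so the release cannot free 'borrowed' */
    example_buffer_release(buf);
    return 0;
}
```

The trade-off is the one the commit message itself points out: code that reaches into a struct member like this depends on the struct layout staying stable.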
Nathan Hjelm
49ba150972
mca/base: fix path string parsing
...
CID 993709
2015-02-12 13:03:46 -07:00
Jeff Squyres
00c878957c
mca_base_var.c: add debug check for another programming error
...
Coverity alerted us to the fact that there are places where
the synonym_for param is hard-coded to -1 when calling
register_variable(). It would be a coding error if synonym_for==-1
and (flags & MCA_BASE_VAR_FLAG_SYNONYM)>0, so let's add that to the
debug-only check at the top of the function.
This was CID 993717.
2015-02-12 10:24:02 -08:00
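A debug-only check of the kind this commit describes might look like the sketch below. The flag value and both function names are hypothetical, chosen only for illustration; the real MCA_BASE_VAR_FLAG_SYNONYM and register_variable() are defined in Open MPI and are not reproduced here.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical flag value for illustration only; the real
 * MCA_BASE_VAR_FLAG_SYNONYM is defined in Open MPI's headers. */
#define EXAMPLE_VAR_FLAG_SYNONYM 0x0004u

/* A variable flagged as a synonym must name the variable it is a
 * synonym for, so synonym_for == -1 together with the flag is a
 * coding error. */
bool example_synonym_args_consistent(int synonym_for, unsigned int flags)
{
    if ((flags & EXAMPLE_VAR_FLAG_SYNONYM) != 0 && synonym_for == -1) {
        return false;
    }
    return true;
}

/* Hypothetical registration routine: the check sits at the top and
 * compiles away when NDEBUG is defined. */
int example_register_variable(int synonym_for, unsigned int flags)
{
    assert(example_synonym_args_consistent(synonym_for, flags));

    /* ... actual registration work would go here ... */
    return 0;
}
```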
Jeff Squyres
167d72ec68
net.c: ensure to free the args in the error case
...
This was CID 710643.
2015-02-12 10:24:02 -08:00
Jeff Squyres
332943f1c3
pstat linux: ensure to close the file
...
This was CID 71983.
2015-02-12 10:24:02 -08:00
Jeff Squyres
6a64fe85a1
pstat linux: ensure read() returns >=0
...
This was CID 71182.
2015-02-12 10:24:02 -08:00
Jeff Squyres
8be0e0b0ca
usnic: don't close fp upon error
...
Let the caller close fp. Properly check for errors when calling
subroutines.
This was Coverity CID 1269995.
2015-02-12 10:24:01 -08:00
Nathan Hjelm
1c8f8c6694
opal_fifo: add a couple of memory barriers to the cswap2 implementation
2015-02-12 11:01:40 -07:00
Howard Pritchard
0cf2b478e0
Merge pull request #391 from hppritcha/topic/cray_pmi_kvs
...
pmix/cray: initial kvs removal work
2015-02-11 19:55:34 -07:00
Howard Pritchard
9955834ff1
pmix/cray: initial kvs removal work
...
Remove use of the Cray PMI KVS - which is designed for a lightweight
MPI that exchanges only a minimal amount of connection info
(about 128 bytes per rank) - within cray/pmix. Use Cray PMI
collective extensions instead.
This is the first of several steps to accelerate launch of
Open MPI on Cray systems using either native aprun or nativized
slurm.
2015-02-11 15:14:55 -08:00
Rolf vandeVaart
08dceda2c0
Fix logic for handling priority and eager RDMA. There was some refactoring that was done
...
in this code and it ended up changing the logic that is used to set up eager RDMA.
Rather than setting up eager RDMA with a high priority message, it did it the other
way around. For some reason, CUDA-aware support did not like this. So, basically,
restore the logic to the way it was prior to the refactoring. The refactoring did not
intend to change this. Lightly reviewed by hjelmn.
2015-02-11 16:38:36 -05:00
Jeff Squyres
08285c6361
lt_interface: properly check OPAL_HAVE_LTDL_ADVISE
2015-02-11 12:25:20 -08:00
Jeff Squyres
4f1996df5d
various: remove $(LTDLINCL) from Makefile.am's that didn't need it
2015-02-11 12:25:20 -08:00
Ralph Castain
3de8c5c7c6
Cleanup the munge support - the credential cannot be reused for multiple connections
2015-02-10 20:34:35 -08:00
George Bosilca
e173f9b0c0
Somehow we lost one of the most critical parameters
...
allowing the PML to decide how to order the different
interconnects. Bring it back !
2015-02-10 20:32:05 -05:00
George Bosilca
7f4c5fa96f
Add the displacement of the element to the
...
safeguard check.
2015-02-10 20:13:36 -05:00
Ralph Castain
3ae3b96c17
Fix master compilation - a buried header dependency must have been removed.
2015-02-10 07:22:10 -08:00
Mike Dubman
6816e3421f
Merge pull request #377 from regrant/ib_wr_fix
...
fix problem with get_pathrecord posting too many recv requests
2015-02-10 08:47:23 +02:00
Ralph Castain
bef830efef
Fix debug output
2015-02-09 20:49:04 -08:00
Ralph Castain
07134f5b17
Add munge security
2015-02-09 20:49:03 -08:00
Ralph Castain
a3275aa867
Once again, fix the blasted singleton comm_spawn
2015-02-05 17:34:25 -08:00
Jeff Squyres
0dbbffb753
pmix_base_frame: use the "= { 0 }" initializer
...
Per open-mpi/ompi#381, convert the specific initialization of opal_pmix
to use the generic "= { 0 }" initializer. This form can be used to
initialize any type when the intent is just to zero out / assign
*some* value.
2015-02-05 17:51:06 -05:00
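The "= { 0 }" form mentioned in this commit can be illustrated with a small sketch. The struct below is hypothetical and only stands in for a module-style table of function pointers; it is not the actual opal_pmix type.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical module struct, standing in for a table of function
 * pointers plus some plain data members. */
typedef struct {
    int (*init)(void);
    int (*finalize)(void);
    const char *name;
    int priority;
} example_module_t;

/* The generic initializer: the first member is set to 0 explicitly and
 * every remaining member is zero-initialized implicitly, so no member
 * has to be listed by name. */
example_module_t example_module = { 0 };

/* Sanity check: every member starts out as NULL or 0. */
void example_verify_zeroed(void)
{
    assert(example_module.init == NULL);
    assert(example_module.finalize == NULL);
    assert(example_module.name == NULL);
    assert(example_module.priority == 0);
}
```

Because the initializer does not depend on the member list, it keeps working unchanged if members are added to or removed from the struct.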
Ralph Castain
f28238af59
Fix a race condition seen by Absoft during finalize. Stop the orte progress thread without cleaning it up, thus allowing the frameworks to still cancel their posted recv's. Then cleanup the memory footprint afterwards.
2015-02-05 11:41:37 -08:00
Ralph Castain
4d882796b6
Silence warnings
2015-02-05 11:41:00 -08:00
Howard Pritchard
e508a4078e
Merge pull request #376 from regrant/ib_error_fix
...
fixes OpenIB connect error reporting for ibv_* calls that return an errn...
2015-02-04 10:22:03 -07:00
Jeff Squyres
621af3aa07
pmix_base: fix global opal_pmix symbol for static linking on OS X
...
OS X has weirdness when static linking. If a symbol is not
initialized, it is put into the common block section, and Weird Things
happen (linking will fail when trying to use that global symbol).
If you initialize the variable, it goes into a different section (and
linking to it will work).
This link (that might go stale someday) has some information about OS
X linker scope and treatment of symbol definitions:
https://developer.apple.com/library/mac/documentation/DeveloperTools/Conceptual/MachOTopics/1-Articles/executing_files.html#//apple_ref/doc/uid/TP40001829-98432-TPXREF120
Fixes #375.
2015-02-04 12:12:31 -05:00
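The distinction this commit relies on is between a global defined without an initializer and one defined with an initializer. A minimal sketch with hypothetical variable names follows; which section each symbol actually lands in depends on the compiler, its flags, and the platform, so the comments restate the commit message rather than guarantee behavior.

```c
#include <assert.h>

/* Tentative definition: no initializer. Per the commit message, on
 * OS X such a symbol can end up in the common block, where static
 * linking against it failed. */
int example_common_candidate;

/* Definition with an explicit initializer: per the commit message,
 * this goes into a different section and links correctly. */
int example_initialized = 0;

/* Both globals still start at zero at run time; only the placement of
 * the symbol in the object file differs. */
void example_check_globals(void)
{
    assert(example_common_candidate == 0);
    assert(example_initialized == 0);
}
```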
Ryan Grant
de93497789
fix problem with get_pathrecord posting too many recv requests
2015-02-04 09:53:58 -07:00
Ryan Grant
5d5e9bc1f8
fixes OpenIB connect error reporting for ibv_* calls that return an errno
2015-02-04 09:09:14 -07:00
Jeff Squyres
a3728f09af
libfabric: add another missing file to the Makefile.am
2015-02-04 04:02:27 -08:00
Jeff Squyres
66a680879e
libfabric: fix header file name in Makefile.am
2015-02-03 19:41:25 -08:00
Jeff Squyres
cb7cc171f9
usnic: update README.txt notes
...
Update notes about copying the usnic BTL between master and the v1.8
branch.
2015-02-03 15:54:36 -08:00
Jeff Squyres
edf7232e00
usnic: enable building with an external libfabric
2015-02-03 13:46:06 -08:00
Jeff Squyres
bfa54d5d7b
usnic: update to match new libfabric
2015-02-03 13:46:06 -08:00
Jeff Squyres
d2490d2fd8
libfabric: update Makefile.am to match new libfabric drop
2015-02-03 13:46:05 -08:00
Jeff Squyres
3dc0abfbc4
libfabric: update to (just past) 1.0rc1
...
Updated to Github ofiwg/libfabric@6b005d0d19 .
2015-02-03 13:46:05 -08:00
Ralph Castain
d3267c200f
Add missing OMPI-changes to libevent 2.0.22
2015-02-02 20:57:40 -08:00
Jeff Squyres
965ccab6cc
libfabric: remove a few warnings
...
Embedding libfabric is a temporary measure; I'm removing some warning
notifications so that the output isn't so cluttered (we're getting
the real warnings fixed upstream, but the OMPI community doesn't
really care/need to see the warnings in the meantime).
2015-01-29 17:38:02 -08:00
Todd Kordenbrock
37e6096fe7
Copyright update.
2015-01-29 11:08:13 -06:00
Todd Kordenbrock
ca30e129e8
Add the option to use the Portals4 logical to physical table.
...
This commit adds an MCA variable to select Portals4 logical
addressing, populates the logical-to-physical mapping table and
initializes the NI in this mode.
2015-01-29 11:08:13 -06:00
George Bosilca
b9a63cbe7a
One less warning.
2015-01-27 13:25:55 -05:00
Ralph Castain
294ebc907a
Fix singleton operations so they can work inside a slurm environment
2015-01-27 09:29:42 -06:00
Ralph Castain
ba25e8a0ce
Fix singletons
2015-01-27 09:29:42 -06:00
Ralph Castain
028b00154d
Complete implementation of the schizo framework to support OMPI component
2015-01-27 09:29:42 -06:00
Jeff Squyres
436223959d
usnic: update to match new libfabric APIs
2015-01-24 05:49:36 -08:00
Jeff Squyres
7d5755f62b
libfabric: update to ofiwg/libfabric@b3f7af4c67
...
Pull down a new embedded copy of libfabric from
https://github.com/ofiwg/libfabric .
2015-01-24 05:48:48 -08:00
Howard Pritchard
4de512af66
Merge pull request #358 from hppritcha/topic/ugni_spawn_issue
...
btl/ugni: use PMIX_GLOBAL for modex_send in ugni
2015-01-22 12:55:46 -06:00
Howard Pritchard
056daa05bf
btl/ugni: use PMIX_GLOBAL for modex_send in ugni
...
Using PMIX_REMOTE is not the right thing for ugni
BTL when it's possible that spawned ranks end up
on the same node as some of the spawnee ranks.
2015-01-22 06:53:45 -08:00
Bert Wesarg
0d0a754c42
Remove VampirTrace.
2015-01-22 08:08:07 +01:00
Gilles Gouaillardet
9f80aa2d28
btl/openib: regression fix when rdmacm or udcm are disabled
...
This fixes a regression introduced in open-mpi/ompi@661c35ca67
Thanks to Mark Santcroos for reporting this issue
2015-01-20 11:31:50 +09:00
George Bosilca
da83b084f5
Shifting the datatype around should alter its true LB and UB.
2015-01-19 02:28:17 -05:00