Rolf vandeVaart
54ab0d1a51
Add config code to check for need of workaround. Add runtime way to turn it off just in case
2015-08-31 17:18:47 -04:00
Rolf vandeVaart
cb8c86910e
Add static definitions where needed and remove one unused definition
2015-08-13 14:59:07 -04:00
Jeff Squyres
7f857034d9
common verbs: check return value of sscanf()
...
Fixes CID 1304563.
2015-08-12 05:14:58 -07:00
Rolf vandeVaart
8cc6bef090
Refs open-mpi/ompi#627 . Fix support for multi-threads with CUDA 7.0
2015-08-10 10:22:45 -04:00
Rolf vandeVaart
1f32fa21ae
Fix arguments to error message, remove tabs and trailing spaces
2015-07-23 10:02:45 -04:00
Rolf vandeVaart
7703c96496
Add a workaroud for issue in libcuda.so library
2015-07-22 11:35:27 -04:00
Rolf vandeVaart
ae0f3cfee7
Make explicit call to initalize MCA parameters in common CUDA code. This allows us to view them with ompi_info and possibly modify with tools interface
2015-07-09 12:51:55 -04:00
Jeff Squyres
f1353947ff
libfabric: fix wrappers for static builds
...
Need to set the WRAPPER_EXTRA flags so that the wrappers for static
builds pull in -lfabric.
Also update/fix some comments.
2015-07-02 07:58:16 -07:00
Ralph Castain
ea0e21bb06
Add a common/libfabric component to the opal layer where we can place common functions
2015-06-25 11:04:00 -07:00
Nathan Hjelm
4d92c9989e
more c99 updates
...
This commit does two things. It removes checks for C99 required
headers (stdlib.h, string.h, signal.h, etc). Additionally it removes
definitions for required C99 types (intptr_t, int64_t, int32_t, etc).
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2015-06-25 10:14:13 -06:00
Ralph Castain
869041f770
Purge whitespace from the repo
2015-06-23 20:59:57 -07:00
Jeff Squyres
3e1b85ceb3
libfabric: remove embedded libfabric
...
OMPI now only builds against external libfabric installations.
2015-06-15 15:17:05 -07:00
Nathan Hjelm
3edb421adc
common/verbs: fix coverity issues
...
CID 1269864 Resource leak (RESOURCE_LEAK)
CID 1269865 Resource leak (RESOURCE_LEAK)
Slightly refactored the code to remove extra goto statements and
ensure the if_include_list and if_exclude_list are actually released
on success.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-05-28 08:38:10 -06:00
Rolf vandeVaart
b3e4ae71d5
Fix finalize code when cuda support is not fully initialized
2015-05-21 13:42:22 -04:00
Gilles Gouaillardet
69f900ab9d
libfabric: check the psm_epconn_t type is available before building the PSM provider
...
embedded libfabric configury does it its own way, so "backport" ofiwg/libfabric#1031
2015-05-18 14:04:41 +09:00
Jeff Squyres
e95010b095
common verbs: only install fake usnic driver when relevant
...
Only install the fake usnic libibverbs driver when there are actually
usnic kernel devices present. This prevents some run-time weirdness
on the Cray verbs emulation environment, where apparently
ibv_register_driver() either is not implemented or does not work
properly.
2015-05-11 12:57:06 -07:00
Howard Pritchard
94576993b0
Merge pull request #574 from hppritcha/topic/ugni_common_symbol
...
common/ugni: fix common symbol problem
2015-05-08 05:55:15 -06:00
Howard Pritchard
341b773724
common/ugni: fix common symbol problem
...
Stop nagging of common symbol detection script for ugni
stuff.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-08 04:27:01 -07:00
Jeff Squyres
7a577c0ed2
libfabric: delete tarball
...
Oops -- the tarball itself should not have been committed to the repo.
2015-05-08 03:24:09 -07:00
Jeff Squyres
a1770950c6
libfabric: update to 1.0.0
...
This is likely short-lived: now that libfabric has a 1.0.0 release
available, the embedded libfabric may disappear from the OMPI tree
sometime soon. However, we still need it for the time being...
2015-05-07 11:14:13 -07:00
Jeff Squyres
676673189b
Merge pull request #565 from jsquyres/pr/fake-usnic-ibv-driver
...
Squelch libibverbs complaints about lack of usnic userspace plugin
2015-05-05 10:27:33 -04:00
Howard Pritchard
b5fc5404c6
libfabric/embedded: add missing psmx_eq.c
...
The ompi libfabric/Makefile.am to build the libmca_component_libfabric
lib was missing a recently added psmx_eq.c in the list of source
files for the psm provider.
Fixes #569
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-05-04 14:54:59 -06:00
Jeff Squyres
a2b55e12f2
common verbs: insert fake usnic_verbs libibverbs driver
...
libibverbs will complain to stderr if it sees device entries in
/sys/class/infiniband for which it has no userspace plugins.
The Cisco usNIC device no longer exports a verbs interface, thereby
causing libibverbs to emit this annoying stderr warning.
To avoid this, use the public ibv API to register a "fake" libibverbs
driver at run-time (right after we call ibv_fork_init(), but --
critically -- *before* we call ibv_get_device_list()). The purpose of
this driver is solely to convince libibverbs that there *is* a driver
for /sys/class/infininband/usnic_verbs devices. ...although this
driver will never return a valid ibv context (and therefore will never
be used).
2015-04-29 18:07:12 -07:00
Jeff Squyres
4cc5c5261d
libfabric: disable all semblance of verbs
...
Including the usnic fake ibv verbs driver.
This fix was mistakenly not included in open-mpi/ompi@d0937c6 .
2015-04-29 17:46:12 -07:00
Jeff Squyres
d0937c6f42
libfabric: update to upstream c01338a53abf969799ac0722de152ca0bd96fa3c
...
Fixes a usnic bug with respect to porting to v1.8
2015-04-29 17:38:19 -07:00
Jeff Squyres
faf3324b0e
libfabric: update to upstream d4ab6e56e23124e565ada939054a159737e52102
...
Fix a critical usnic bug, and other misc updates.
2015-04-29 16:02:08 -07:00
Rolf vandeVaart
b260dc4228
Cleanup interface that handles events. No functional changes
2015-04-28 15:15:24 -04:00
Jeff Squyres
84f5c2bfb6
libfabric: update to upstream libfabric 1.0rc6
2015-04-27 10:18:49 -07:00
Rolf vandeVaart
030dddde23
Remove unneeded copy and minor clean up
2015-04-22 13:17:16 -04:00
Nathan Hjelm
bfacb5dd73
common/ugni: update for MCA 2.1
2015-04-17 08:09:18 -06:00
Rolf vandeVaart
b7913836fc
Initialize variables for safety
2015-04-09 12:58:55 -04:00
Rolf vandeVaart
d6d7184703
Enhance verbose message
2015-04-09 12:29:09 -04:00
Jeff Squyres
5aabee2644
libfabric: a few fixes since 1.0rc3
...
Including a critical atomic initialization fix for the usnic provider.
2015-04-02 15:54:01 -07:00
Jeff Squyres
d6d8ab01e5
libfabric: the fi_log.h file moved
2015-04-01 14:43:07 -07:00
Jeff Squyres
5e47eb81bf
libfabric: update component configury for new libfabric test
2015-04-01 06:43:08 -07:00
Jeff Squyres
a89a5872c2
libfabric: update to official 1.0.0rc3 release
...
One change was made to the 1.0.0rc3 tarball: remove an errand
debugging printf that accidentally made its way into the tarball (but
isn't in git).
2015-04-01 06:43:08 -07:00
Ralph Castain
79b90a54b6
Remove stale and unused component
2015-03-30 09:56:06 -07:00
Jeff Squyres
a85edb8ad4
libfabric: update to Github libfabric 0d7daf720f04
2015-03-26 14:40:46 -07:00
Rolf vandeVaart
dfb7e00ef5
Make sure context is still around when doing some other cleanup
2015-03-24 16:47:40 -04:00
Jeff Squyres
a1daa39425
libfabric: update to Github lifabric 90ac5a258418e
...
Update to latest upstream Github lifabric in order to fix some usnic
bugs.
2015-03-12 13:23:32 -07:00
Jeff Squyres
cbd99d5f60
libfabric: update to Github upstream 1b4bb2285b
...
Get a usnic bug fix.
2015-03-10 12:09:02 -07:00
Jeff Squyres
1a1be2efa0
libfabric: update to Github upstream 7095f3dc
2015-03-10 09:47:40 -07:00
Rolf vandeVaart
49b5eb6c91
Fix missing initialization of variable
2015-03-10 10:33:27 -04:00
Jeff Squyres
1995f6beba
cuda: convert to opal_dl interface
2015-03-09 08:18:13 -07:00
Rolf vandeVaart
237c268a09
Add extra check during cleanup to make sure we really should clean up the CUDA resources.
2015-03-06 13:12:19 -05:00
Alina Sklarevich
1560ed9761
initialize opal_common_verbs_want_fork_support to -1.
...
This way, if the call to ibv_fork_init() fails, the job will still
continue.
2015-03-05 14:29:09 +02:00
Mike Dubman
98503b56e0
Revert "create the opal_common_verbs_want_fork_support parameter."
2015-03-03 14:28:31 +02:00
Alina Sklarevich
8fe42f1bc1
create the opal_common_verbs_want_fork_support parameter.
...
call the opal_common_verbs_mca_register function to make sure that
opal_common_verbs_want_fork_support mca parameter is created and therefore
can be used to control the fork support.
2015-03-01 17:40:49 +02:00
George Bosilca
455b465329
Reflect in the naming the location of the variable.
2015-02-26 18:22:23 -05:00
George Bosilca
2c60c18e6f
A better fix for the want_fork_support issue. I noticed a naming
...
conflict where ompi was used down in OPAL. I correctly renamd the
MCA parameter, and created a deprecated synomym for the old
name.
2015-02-26 17:35:54 -05:00