Ralph Castain
4d882796b6
Silence warnings
2015-02-05 11:41:00 -08:00
Howard Pritchard
e508a4078e
Merge pull request #376 from regrant/ib_error_fix
...
fixes OpenIB connect error reporting for ibv_* calls that return an errn...
2015-02-04 10:22:03 -07:00
Jeff Squyres
621af3aa07
pmix_base: fix global opal_pmix symbol for static linking on OS X
...
OS X has weirdness when static linking. If a symbol is not
initialized, it is put into the common block section, and Weird Things
happen (linking when trying to using that global symbol will fail).
If you initialize the variable, it goes into a different section (and
linking to it will work).
This link (that might go stale someday) has some information about OS
X linker scope and treatment of symbol definitions:
https://developer.apple.com/library/mac/documentation/DeveloperTools/Conceptual/MachOTopics/1-Articles/executing_files.html#//apple_ref/doc/uid/TP40001829-98432-TPXREF120
Fixes #375 .
2015-02-04 12:12:31 -05:00
Ryan Grant
5d5e9bc1f8
fixes OpenIB connect error reporting for ibv_* calls that return an errno
2015-02-04 09:09:14 -07:00
Jeff Squyres
a3728f09af
libfabric: add another missing file to the Makefile.am
2015-02-04 04:02:27 -08:00
Jeff Squyres
66a680879e
libfabric: fix header file name in Makefile.am
2015-02-03 19:41:25 -08:00
Jeff Squyres
cb7cc171f9
usnic: update README.txt notes
...
Update notes about copying the usnic BTL between master and the v1.8
branch.
2015-02-03 15:54:36 -08:00
Jeff Squyres
edf7232e00
usnic: enable building with an external libfabric
2015-02-03 13:46:06 -08:00
Jeff Squyres
bfa54d5d7b
usnic: update to match new libfabric
2015-02-03 13:46:06 -08:00
Jeff Squyres
d2490d2fd8
libfabric: update Makefile.am to match new libfabric drop
2015-02-03 13:46:05 -08:00
Jeff Squyres
3dc0abfbc4
libfabric: update to (just past) 1.0rc1
...
Updated to Github ofiwg/libfabric@6b005d0d19 .
2015-02-03 13:46:05 -08:00
Ralph Castain
d3267c200f
Add missing OMPI-changes to libevent 2.0.22
2015-02-02 20:57:40 -08:00
Jeff Squyres
965ccab6cc
libfabric: remove a few warnings
...
Embedding libfabric is a temporary measure; I'm removing some warning
notifications so that the output isn't so cluttered (we're getting
the real warnings fixed upstream, but the OMPI community doesn't
really care/need to see the warnings in the meantime).
2015-01-29 17:38:02 -08:00
Todd Kordenbrock
37e6096fe7
Copyright update.
2015-01-29 11:08:13 -06:00
Todd Kordenbrock
ca30e129e8
Add the option to use the Portals4 logical to physical table.
...
This commit adds an MCA variable to select Portals4 logical
addressing, populates the logical-to-physical mapping table and
initializes the NI in this mode.
2015-01-29 11:08:13 -06:00
George Bosilca
b9a63cbe7a
One less warning.
2015-01-27 13:25:55 -05:00
Ralph Castain
294ebc907a
Fix singleton operations so they can work inside a slurm environment
2015-01-27 09:29:42 -06:00
Ralph Castain
ba25e8a0ce
Fix singletons
2015-01-27 09:29:42 -06:00
Ralph Castain
028b00154d
Complete implementation of the schizo framework to support OMPI component
2015-01-27 09:29:42 -06:00
Jeff Squyres
436223959d
usnic: update to match new libfabric APIs
2015-01-24 05:49:36 -08:00
Jeff Squyres
7d5755f62b
libfabric: update to ofiwg/libfabric@b3f7af4c67
...
Pull down a new embedded copy of libfabric from
https://github.com/ofiwg/libfabric .
2015-01-24 05:48:48 -08:00
Howard Pritchard
056daa05bf
btl/ugni: use PMIX_GLOBAL for modex_send in ugni
...
Using PMIX_REMOTE is not the right thing for ugni
BTL when its possible that spawned ranks end up
on the same node as some of the spawnee ranks.
2015-01-22 06:53:45 -08:00
Gilles Gouaillardet
9f80aa2d28
btl/openib: regression fix when rdmacm or udcm are disabled
...
This fixes a regression introduced in open-mpi/ompi@661c35ca67
Thanks to Mark Santcroos for reporting this issue
2015-01-20 11:31:50 +09:00
Rolf vandeVaart
66f6026214
Improve error message to help user figure out what to do
2015-01-16 13:55:27 -05:00
Jeff Squyres
65a279019e
usnic: fix typo in memchecker usage
2015-01-16 09:42:19 -08:00
Jeff Squyres
3969fe3a94
libfabric: ensure wrapper libs are loaded for static builds
...
For static builds, we need to also set
<framework>_<component>_WRAPPER_EXTRA_LIBS so that the wrappers know
what other libraries to add to link executables.
2015-01-16 09:29:52 -08:00
Gilles Gouaillardet
661c35ca67
cleanup dead code caused by the removal of the --with-threads configure option
2015-01-16 19:13:59 +09:00
Nathan Hjelm
006074c48d
Merge pull request #332 from hjelmn/openib_updates
...
Openib updates
2015-01-15 15:05:18 -06:00
Jeff Squyres
d13c14ec82
CSCus22527: fix off-by-one error in checking the number of VFs
...
Ensure to count *this* process when checking for how many VFs we need
on the local server.
(cherry picked from commit 386c01934e98cb8dcb48ff648ecdfb0c8677baa9)
2015-01-15 11:44:29 -08:00
Jeff Squyres
4685767b2d
libfabric: update usnic configury
...
Use new common m4 macro for choosing between libnl3 and libnl.
2015-01-15 07:12:39 -08:00
Jeff Squyres
400b02e566
libfabric: update to github:ofiwg/libfabric HEAD
...
Specifically: bbf0f3ea8e92c92a7cee56473ecdbbbb34cceb7d (15 Jan 2015)
2015-01-15 07:11:54 -08:00
Aurélien Bouteiller
f49981bb2a
Disable coalescing until pull request #332 gets in.
2015-01-14 14:12:47 -05:00
Nathan Hjelm
cf4975501d
rcache/vma: fix parent class of mca_rcache_vma_t
...
There was a mismatch between the structure for mca_rcache_vma_t and
the OBJ_CLASS_INSTANCE. One was opal_list_item_t and the other was
ompi_free_list_item_t. The super class in the structure looks like it
is the correct one. Changed the superclass in OBJ_CLASS_INSTANCE to
match.
2015-01-14 10:21:24 -07:00
Jeff Squyres
e4e5e7dbc0
usnic: ensure to clean up nicely in case of low resources
...
If there are not enough resources (e.g., low VFs), we can end up
calling finalize_one_channel() on the same channel multiple times. So
ensure to NULL out fields that we have freed already so that we do not
try to free them a second time.
Fixes CSCus26648.
2015-01-13 14:37:31 -08:00
Jeff Squyres
8807ae2497
usnic libfabric: also set the us_netmask_be field.
...
From libfabric upstream commit ofiwg/libfabric@3976745 .
Part of the fix for CSCus22495.
2015-01-13 12:04:57 -08:00
Jeff Squyres
d00cede718
usnic: fix if_include/exclude of CIDR-specified networks
...
Fix the ordering so that we obtain the usnic netmask information
*before* we do the filtering based on CIDR-specified networks.
Also requires upstream Github libfabric commit 3976745.
Fixes CSCus22495.
2015-01-13 12:04:51 -08:00
Jeff Squyres
a220b92cf8
usnic: fix function name in opal_output
2015-01-13 12:04:07 -08:00
Gilles Gouaillardet
955f3c2730
configury: check existence of the atomic_init function in libfabric
...
intel compilers implements atomic_init in c++ only,
so disable c11 atomic in libfabric for now
2015-01-13 16:39:41 +09:00
Gilles Gouaillardet
cbe0d26b2d
configury: do test the __STDC_NO_ATOMICS__ macro for libfabric
2015-01-13 16:06:37 +09:00
Jeff Squyres
5ed688a074
usnic: enusre that we only get "usnic"-named providers
...
Also, a minor update to a verbose message.
2015-01-12 13:21:22 -08:00
Jeff Squyres
881b1dcf19
usnic: document libfabric abstractions
...
Handy tips to remember the libfabric abstractions and what they
correspond to in usnic/VIC terms.
2015-01-09 15:21:51 -08:00
Gilles Gouaillardet
194d9f84d3
btl/usnic: move call to check_reg_mem_basics()
...
avoid annoying memlock related messages when there is no usnic device.
2015-01-09 11:37:45 +09:00
George Bosilca
1344097d35
Turn OFF the TCP dump mechanism.
2015-01-08 18:50:49 -05:00
George Bosilca
8ddd3b3b09
Cleanup the TCP dump mechanism.
2015-01-08 18:50:05 -05:00
Nathan Hjelm
c65f026fee
btl/vader: fix typo in xpmem setup
2015-01-08 12:52:38 -07:00
Gilles Gouaillardet
4c29d8e247
btl/openib: silence warning (unused code)
2015-01-08 17:18:07 +09:00
Gilles Gouaillardet
8ab605d9c5
btl/tcp: fix overflow in mca_btl_tcp_endpoint_dump()
2015-01-08 15:40:16 +09:00
Nathan Hjelm
7d206ae769
btl/ugni: fix a couple of bugs
...
Two fixes:
- Do not try to return a mailbox to the free list if one wasn't
allocated.
- Do not try to tear down IRQ CQs if they were not created.
2015-01-07 13:48:17 -07:00
Dave Goodell
49069bc661
usnic: fix fi_av_insert (ARP resolution) bugs
...
We had several problems in the old code:
1. We were specifying an arbitrary timeout (100 ms) and then abandoning
all remaining pending AV insert operations. We would then free the
endpoint buffer that we gave to fi_av_insert(), usually causing
libfabric's progress thread to write to a freed buffer.
2. We were claiming in a show_help message that the timeout was
controllable via an MCA parameter. This commit removes that
parameter, since there's no good method for us to specify a timeout
like this to libfabric right now.
3. We also weren't waiting for the correct number of fi_av_insert()
operations to complete. We were waiting for nprocs, which is
accidentally fine for 2 procs on separate hosts, but not for most
other proc counts.
Reviewed-by: Jeff Squyres <jsquyres@cisco.com>
2015-01-07 08:25:17 -08:00
Gilles Gouaillardet
06e071454e
btl/openib: cleanup duplicate code
2015-01-07 14:07:30 +09:00