Gilles Gouaillardet
b23126497c
Merge branch 'master' of https://github.com/open-mpi/ompi
2015-01-16 10:55:35 +09:00
Nathan Hjelm
006074c48d
Merge pull request #332 from hjelmn/openib_updates
...
Openib updates
2015-01-15 15:05:18 -06:00
Jeff Squyres
d13c14ec82
CSCus22527: fix off-by-one error in checking the number of VFs
...
Ensure to count *this* process when checking for how many VFs we need
on the local server.
(cherry picked from commit 386c01934e98cb8dcb48ff648ecdfb0c8677baa9)
2015-01-15 11:44:29 -08:00
Ralph Castain
e7ff21b3aa
The opal_stop_progress_thread function releases the event base, so don't do it again
2015-01-15 10:48:40 -08:00
Ralph Castain
9ac39b63cc
Use the opal_progress_threads support for the ORTE progress thread in applications
2015-01-15 07:55:19 -08:00
Jeff Squyres
9cc60b9e12
ofi mtl: update to new libfabric constant name
2015-01-15 07:12:39 -08:00
Jeff Squyres
4685767b2d
libfabric: update usnic configury
...
Use new common m4 macro for choosing between libnl3 and libnl.
2015-01-15 07:12:39 -08:00
Jeff Squyres
400b02e566
libfabric: update to github:ofiwg/libfabric HEAD
...
Specifically: bbf0f3ea8e92c92a7cee56473ecdbbbb34cceb7d (15 Jan 2015)
2015-01-15 07:11:54 -08:00
Gilles Gouaillardet
e89a2d2c02
make dist: portability fix
...
if ( -s xxx ) is not supported on Solaris, so use a portable syntax
2015-01-15 18:57:05 +09:00
Gilles Gouaillardet
bf6adedd70
atomic/ia32: silence warnings
2015-01-15 18:53:58 +09:00
Aurélien Bouteiller
f49981bb2a
Disable coalescing until pull request #332 gets in.
2015-01-14 14:12:47 -05:00
Nathan Hjelm
cf4975501d
rcache/vma: fix parent class of mca_rcache_vma_t
...
There was a mismatch between the structure for mca_rcache_vma_t and
the OBJ_CLASS_INSTANCE. One was opal_list_item_t and the other was
ompi_free_list_item_t. The super class in the structure looks like it
is the correct one. Changed the superclass in OBJ_CLASS_INSTANCE to
match.
2015-01-14 10:21:24 -07:00
Yohann Burette
8792450407
Merge pull request #344 from yburette/topic/ofimtl_fixwarnings
...
Fixed code around fi_av_insert().
2015-01-14 09:09:04 -08:00
Yohann Burette
bc93e04604
Fixed code around fi_av_insert().
2015-01-14 08:43:57 -08:00
Gilles Gouaillardet
6d84afc660
romio: update configure for Solaris 10 build
2015-01-14 18:25:55 +09:00
Jeff Squyres
e4e5e7dbc0
usnic: ensure to clean up nicely in case of low resources
...
If there are not enough resources (e.g., low VFs), we can end up
calling finalize_one_channel() on the same channel multiple times. So
ensure to NULL out fields that we have freed already so that we do not
try to free them a second time.
Fixes CSCus26648.
2015-01-13 14:37:31 -08:00
Jeff Squyres
8807ae2497
usnic libfabric: also set the us_netmask_be field.
...
From libfabric upstream commit ofiwg/libfabric@3976745 .
Part of the fix for CSCus22495.
2015-01-13 12:04:57 -08:00
Jeff Squyres
d00cede718
usnic: fix if_include/exclude of CIDR-specified networks
...
Fix the ordering so that we obtain the usnic netmask information
*before* we do the filtering based on CIDR-specified networks.
Also requires upstream Github libfabric commit 3976745.
Fixes CSCus22495.
2015-01-13 12:04:51 -08:00
Jeff Squyres
a220b92cf8
usnic: fix function name in opal_output
2015-01-13 12:04:07 -08:00
Gilles Gouaillardet
955f3c2730
configury: check existence of the atomic_init function in libfabric
...
intel compilers implements atomic_init in c++ only,
so disable c11 atomic in libfabric for now
2015-01-13 16:39:41 +09:00
Gilles Gouaillardet
cbe0d26b2d
configury: do test the __STDC_NO_ATOMICS__ macro for libfabric
2015-01-13 16:06:37 +09:00
Jeff Squyres
5ed688a074
usnic: enusre that we only get "usnic"-named providers
...
Also, a minor update to a verbose message.
2015-01-12 13:21:22 -08:00
Ralph Castain
d2938a144f
Use the proper interface index. Thanks to Mark Kettenis for spotting the problem and providing a patch
2015-01-12 05:31:02 -08:00
Jeff Squyres
881b1dcf19
usnic: document libfabric abstractions
...
Handy tips to remember the libfabric abstractions and what they
correspond to in usnic/VIC terms.
2015-01-09 15:21:51 -08:00
Gilles Gouaillardet
8c1698ae4a
coll/libnbc: enhance fix for MPI_Ireduce_scatter on single task communicator
...
this improves open-mpi/ompi@b9349d2eb9
2015-01-09 13:44:01 +09:00
Gilles Gouaillardet
194d9f84d3
btl/usnic: move call to check_reg_mem_basics()
...
avoid annoying memlock related messages when there is no usnic device.
2015-01-09 11:37:45 +09:00
George Bosilca
1344097d35
Turn OFF the TCP dump mechanism.
2015-01-08 18:50:49 -05:00
George Bosilca
8ddd3b3b09
Cleanup the TCP dump mechanism.
2015-01-08 18:50:05 -05:00
mjbhaskar
39f9880759
Fixed the data type argument in an all reduce operation to fix a bug
...
seen on 32 bit machines.
2015-01-08 14:18:54 -06:00
mjbhaskar
ba5dc660f7
Merge branch 'master' of https://github.com/open-mpi/ompi
2015-01-08 14:12:01 -06:00
Nathan Hjelm
c65f026fee
btl/vader: fix typo in xpmem setup
2015-01-08 12:52:38 -07:00
Nathan Hjelm
9f6faadd91
opal_fifo: add missing memory barrier in pop
...
Thanks to Adrian Reber for reporting this.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-01-08 09:14:56 -07:00
Gilles Gouaillardet
4c29d8e247
btl/openib: silence warning (unused code)
2015-01-08 17:18:07 +09:00
Gilles Gouaillardet
8ab605d9c5
btl/tcp: fix overflow in mca_btl_tcp_endpoint_dump()
2015-01-08 15:40:16 +09:00
Gilles Gouaillardet
b746a8f584
romio: compile openmpi mpi-io glue
2015-01-08 14:08:46 +09:00
Ralph Castain
4e592ac434
Fix the tarball by providing the correct list of headers in the Makefile.am
2015-01-07 18:37:26 -08:00
Nathan Hjelm
7d206ae769
btl/ugni: fix a couple of bugs
...
Two fixes:
- Do not try to return a mailbox to the free list if one wasn't
allocated.
- Do not try to tear down IRQ CQs if they were not created.
2015-01-07 13:48:17 -07:00
mjbhaskar
2d33b0a745
A fix for memory corruption seen on 32 bit machines
2015-01-07 14:41:44 -06:00
mjbhaskar
27dfcaaab2
Merge branch 'master' of https://github.com/open-mpi/ompi
2015-01-07 14:39:23 -06:00
mjbhaskar
74f8ba2acb
A fix for memory corruption problem
2015-01-07 14:34:38 -06:00
Howard Pritchard
f34dd5f5fd
plm/alps: update copyright
2015-01-07 12:33:38 -07:00
Howard Pritchard
c454d11b01
plm/alps: fix orted abort hang problem
...
Turns out the alps plm component wasn't changing the state
of the job upon terminating the orted's in the case of
an abnormal termination. This caused mpirun to hang
with a zommbie'd aprun process if an orted on a node
in the job was killed via signal.
2015-01-07 12:31:41 -07:00
Nathan Hjelm
81dc3a5db9
Merge pull request #335 from hjelmn/osc_updates
...
Osc updates
2015-01-07 11:16:55 -06:00
Dave Goodell
49069bc661
usnic: fix fi_av_insert (ARP resolution) bugs
...
We had several problems in the old code:
1. We were specifying an arbitrary timeout (100 ms) and then abandoning
all remaining pending AV insert operations. We would then free the
endpoint buffer that we gave to fi_av_insert(), usually causing
libfabric's progress thread to write to a freed buffer.
2. We were claiming in a show_help message that the timeout was
controllable via an MCA parameter. This commit removes that
parameter, since there's no good method for us to specify a timeout
like this to libfabric right now.
3. We also weren't waiting for the correct number of fi_av_insert()
operations to complete. We were waiting for nprocs, which is
accidentally fine for 2 procs on separate hosts, but not for most
other proc counts.
Reviewed-by: Jeff Squyres <jsquyres@cisco.com>
2015-01-07 08:25:17 -08:00
Gilles Gouaillardet
06e071454e
btl/openib: cleanup duplicate code
2015-01-07 14:07:30 +09:00
Gilles Gouaillardet
135ecce0eb
btl/openib: rename OPAL_HAVE_XRCD macro into OPAL_HAVE_CONNECTX_XRC_DOMAINS
2015-01-07 13:27:25 +09:00
Ralph Castain
e0927895db
Grrr...how many files did they forget?
2015-01-06 19:40:18 -08:00
Ralph Castain
84c41429e9
Add missing file
2015-01-06 18:41:11 -08:00
George Bosilca
bf62bed65f
Typo in the poll/epoll ops declaration.
2015-01-06 21:21:25 -05:00
Ralph Castain
a7c5ff2ace
Update to libevent 2.0.22-stable
2015-01-06 16:37:25 -08:00