Ralph Castain
b6aa691e0a
Fix incorrect implementation of new MCA param mca_base_env_list - it was not picking up envars and forwarding them, but only worked if you explicitly set a value for the envar. Ensure it works for both direct and indirect launch modes. Remove stale code as this replaced orte_forward_envars. Ensure it doesn't get passed to the ORTE daemons.
2014-10-16 12:58:56 -07:00
Aurelien Bouteiller
08d93bd4b9
Adding a platform file for Darter NICS CrayXC30
2014-10-16 14:22:02 -04:00
bosilca
d819939841
Merge pull request #233 from ggouaillardet/rfc/coll_module_disable
...
Provide a symmetric behavior for the activation/deactivation of collective modules.
2014-10-16 09:22:04 -04:00
Gilles Gouaillardet
27dcca0bb2
pmi/s1: fix large keys
...
do not overwrite the PMI key when pushing a message that does
not fit within 255 bytes
2014-10-16 13:29:32 +09:00
Gilles Gouaillardet
b5aea782ce
Revert "Fix heterogeneous support"
...
Per the discussion at http://www.open-mpi.org/community/lists/devel/2014/10/16050.php
This reverts commit c9c5d4011b
.
2014-10-16 12:24:38 +09:00
George Bosilca
8acadb279f
Merge branch 'topic/atomic-cleanups'
...
Various minor cleanups.
2014-10-15 21:50:42 -04:00
George Bosilca
63ba754f3f
Remove unnecessary includes from the datatype
2014-10-15 21:49:32 -04:00
George Bosilca
7541c03b4c
Mark all instances where atomic operations are used but their return value is unnecessary
2014-10-15 21:47:32 -04:00
Devendar Bureddy
8dfed1c657
oshmem: rename global variables with prefix
2014-10-15 20:30:37 +03:00
Jeff Squyres
dc66e197cc
var: fix segv in deprecated file var show_help()
...
Ensure to include the new variable filename in the show_help() output
when we load a deprecated MCA param from a file.
Fixes #236
2014-10-15 08:07:31 -07:00
Jeff Squyres
51027a6635
usnic: fix minor typo
...
Change harmless-but-weird comma to semicolon. Found during code
review.
2014-10-15 05:32:36 -07:00
Gilles Gouaillardet
c9c5d4011b
Fix heterogeneous support
...
* redefine orte_process_name_t so it can be converted
between host and network format as an opal_identifier_t
aka uint64_t by the OPAL layer.
* correctly send OPAL_DSTORE_ARCH key
2014-10-15 17:19:13 +09:00
Gilles Gouaillardet
5c81658d58
pmix: fix big endian arch
...
use the appropriate 64 bits type otherwise data gets incorrectly
truncated on big endian arch
2014-10-15 17:17:09 +09:00
Mike Dubman
ab22dcb875
Merge pull request #229 from nkogteva/master
...
oshmem mmap: new mca parameters were introduced - sshmem_mmap_anonymous,...
2014-10-15 10:24:29 +03:00
bureddy
3d77abaa1f
Merge pull request #234 from bureddy/master
...
OSHMEM: Fix application abort
2014-10-14 13:07:10 -07:00
Edgar Gabriel
0219c87039
set the fs_ptr to NULL in case of an error, to avoid a malicious free on file_close.
2014-10-14 13:09:06 -05:00
Devendar Bureddy
cbb3e95ce9
OSHMEM: Fix application abort
...
register on_exit() hook to know exit status inorder to
skip shmem_finalize destructor in case of non-zero exit status
2014-10-14 21:07:28 +03:00
Nathan Hjelm
083a659217
Correct some typos in Cray PMI detection
2014-10-14 10:28:36 -06:00
Alex Mikheev
314ba245e9
Merge branch 'topic/oshmem_spml_ikrit_hw_rdma_channel'
2014-10-14 16:21:06 +03:00
Alex Mikheev
643e64497d
OSHMEM: spml ikrit: hw rdma channel is disabled by default
2014-10-14 16:09:51 +03:00
Alex Mikheev
74ab30b738
OSHMEM: spml ikrit: improve mxm transport sanity check
...
Do not allow combination of transports that is not compliant with
shmem spec. Especially do not allow mix of hw and software atomic
ops
Issue: 4721
Change-Id: Ide382f7510495df3d385f2a5ae5f9def6ef5332c
2014-10-14 15:44:57 +03:00
Alex Mikheev
1bcc88cfb1
OSHMEM: spml ikrit: hardware rdma endpoint
...
Create additional endpoint that is capable of true
one sided RDMA transfers.
MXM atomics component now uses this endpoint
2014-10-14 15:31:09 +03:00
Alina Sklarevich
1eb6286547
OSHMEM: fix the makefile.
...
(oshmem/mca/sshmem/base/Makefile.am)
2014-10-14 11:57:46 +03:00
Gilles Gouaillardet
e3f74aca1c
Correctly mote the pointer back by the true_lb.
...
Fixes #231
2014-10-14 16:26:54 +09:00
Nadezhda Kogteva
b2a93943dc
oshmem mmap: set lvl4 for sshmem_mmap_anonymous and sshmem_mmap_fixed variables, define MAP_ANONYMOUS returned.
2014-10-14 08:54:44 +03:00
Gilles Gouaillardet
0f983d5a4f
add a disable function for coll module
2014-10-14 14:46:36 +09:00
Devendar Bureddy
7a6b4c36b0
HCOLL: Update the proc structure dereference
...
Update the proc structure dereference to reflect the new opal_proc_t
super field
2014-10-13 20:49:19 +03:00
Devendar Bureddy
b8d2a15be9
HCOLL: by default off
2014-10-13 20:49:09 +03:00
Mike Dubman
ec1f761d8e
OSHMEM: add missing help file, got lost during merge. Thanks to Yossi/Igor for finding it.
...
Change-Id: I466e40a3fea70e8045dd1e897edcc50ccf0451a3
Conflicts:
oshmem/mca/sshmem/base/Makefile.am
oshmem/mca/sshmem/base/help-oshmem-sshmem.txt
2014-10-13 16:58:35 +03:00
Alex Mikheev
8fcbcba516
Merge branch 'topic/oshmem_shared_mr_fix'
2014-10-13 15:24:12 +03:00
Alex Mikheev
cd67642183
OSHMEM: sshmem verbs: workaround shared_mr procfs bug
...
dereg shared_mr before doing dereg on its mr.
2014-10-13 15:14:34 +03:00
Nadezhda Kogteva
c68c4b45b5
Merge remote-tracking branch 'upstream/master'
2014-10-13 15:12:39 +03:00
Mike Dubman
a1db93077d
Merge pull request #230 from nkogteva/oshmem_refactor_macro_style
...
oshmem: refactor of oshmem/mca/sshmem/*.[ch] files to use #if MACRO style
2014-10-13 13:33:32 +03:00
Nadezhda Kogteva
de68d58a9e
oshmem: refactor of oshmem/mca/sshmem/*.[ch] files to use #if MACRO style
2014-10-13 13:12:16 +03:00
Nadezhda Kogteva
3e7002e8aa
oshmem mmap: copyrights for memheap_base_alloc.c files updated
2014-10-13 11:41:35 +03:00
Nadezhda Kogteva
ce4ee2aa8d
oshmem mmap: new mca parameters were introduced - sshmem_mmap_anonymous, sshmem_mmap_fixed and sshmem_base_backing_file_dir - for runtime mmap management.
...
(cherry picked up from Mellanox-v1.8 repo commit 4c391a)
2014-10-13 11:39:26 +03:00
Mike Dubman
6372ac926c
tools: fix cli args parsing
...
No need to "shift" if argument does not expect parameter on the command line.
2014-10-13 11:33:26 +03:00
Vasily Filipov
a215a4831d
MTL/MXM: disable "bulk_connect" by default.
2014-10-13 09:47:56 +03:00
Ralph Castain
3ef94a0675
Per email thread on devel list:
...
Revert "OPAL: drop dead with core on bad flow. rarely happens with helloworld on large scale."
This reverts commit 86f1d5af3e
.
Will be reconsidered via RFC as it represents a significant change in behavior
2014-10-12 21:13:42 -07:00
Mike Dubman
113f40b0ec
OSHMEM: sshmem verbs: allocate memory at fixed address
...
Use experimental verbs to allocate memory at fixed base
virtual address.
verbs will disqualify itself if shared_mr is disabled
or not supported and it is impossible to allocate memory
starting at fixed base virtual address.
verbs contig pages allocator did not guarantee fixed va, now it does.
(cherry picked from commit fd77ebd452
)
Apply Jeff`s comments
Update with Jeff commits
(cherry picked from commit open-mpi/ompi-release@4dc487fc3d )
2014-10-12 09:53:48 +03:00
Ralph Castain
4d27eb70f2
Extend the dstore framework to include a new "update_handle" API so the attributes of an existing handle can be changed. We can't just open a new handle as the upper layers won't know where to find the info. :-(
2014-10-10 12:40:32 -07:00
Ralph Castain
1ae34da5e5
Add an attributes parameter to the dstore.open function so we can pass directives to the active storage component. This can, for example, include the backing file info for a new shared memory segment.
2014-10-10 12:13:25 -07:00
Ralph Castain
63f619f871
Provide a mechanism by which an upstream project can rename the OPAL and ORTE libraries. This is required by projects such as ORCM that have their own ORTE and OPAL libraries in order to avoid library confusion. By renaming their version of the libraries, the OMPI applications can correctly dynamically load the correct one for their build.
2014-10-10 11:39:08 -07:00
Ralph Castain
1be1654e5f
Correctly identify the synonym for orte_direct_modex_cutoff as ompi_hostname_cutoff
2014-10-10 06:05:06 -07:00
Gilles Gouaillardet
8eb2d62919
coll/sm: fix an other memory leak
2014-10-10 19:54:45 +09:00
Gilles Gouaillardet
27e4389259
* comment on communicator creation in mca_topo_base_dist_graph_create(...)
...
* use accesors to retrieve topo info
2014-10-10 16:07:20 +09:00
Gilles Gouaillardet
5d44a30111
coll/sm: fix minor memory leaks
...
port 4488.1.patch attached in #196 to master
2014-10-10 14:21:34 +09:00
Ralph Castain
4fc4a8346b
Fix a couple of minor issues. Ensure usock isn't used if the session dirs aren't setup. Protect an oddball case where orte_xml_fp is NULL.
2014-10-09 20:58:46 -07:00
Gilles Gouaillardet
76204dfafe
coll/basic: fix segmentation fault in neighborhood collectives if the degree
...
of the topology is higher than the communicator size
It is possible to have a topology degree higher than the size of the communicator.
For example, a periodic cartesian communicator on MPI_COMM_SELF. This will leave
the neighborhood collectives with a request buffer that is too small.
This commits introduces a semantic change :
from now, c_topo must be set before invoking coll_select
2014-10-10 11:56:04 +09:00
Gilles Gouaillardet
2f67f29b85
Revert "coll/basic: fix segmentation fault in neighborhood collectives if the degree"
...
This reverts commit 9c788ff940
.
2014-10-10 11:29:06 +09:00