Rolf vandeVaart
e33d44a29a
Merge pull request #898 from rolfv/pr/add-empty-cache-feature
...
Add ability for user to empty the CUDA IPC registration cache when it is full
2015-09-21 08:50:04 -04:00
Jeff Squyres
73b399ab78
Merge pull request #913 from rhc54/topic/config
...
Do not use "==" in configure "test" calls
2015-09-21 09:09:58 +02:00
Ralph Castain
0b3f4c55f8
Do not use "==" in configure "test" calls
...
Thanks to Kevin Buckley for pointing it out
2015-09-20 21:34:27 -07:00
rhc54
13def2a69b
Merge pull request #911 from rhc54/topic/cleanup
...
Cleanup the odls "close file descriptor" commit to conform to OMPI co…
2015-09-20 07:01:39 -07:00
Howard Pritchard
1367a442b6
Merge pull request #910 from hppritcha/topic/odls_alps_use_907_stuff
...
odls/alps: do smarter close of fds in child
2015-09-20 07:37:55 -06:00
Ralph Castain
c167acc5a7
Cleanup the odls "close file descriptor" commit to conform to OMPI coding standards and remove memory leaks
2015-09-19 20:46:36 -07:00
rhc54
984418dd83
Merge pull request #907 from plesn/close-used-fds
...
odls: close only used file descriptors at fork/exec
2015-09-19 20:26:38 -07:00
Howard Pritchard
a31cc21bea
odls/alps: do smarter close of fds in child
...
Use a modified variant of #907 . Thanks to plesn
for noticing this.
Signed-off-by: Howard Pritchard <howardp@lanl.gov>
2015-09-19 14:17:05 -07:00
Jeff Squyres
7bf364ff04
usnic: create blocking libevent for progress thread emulation
...
In the v1.10 opal_progress_thread emulation, ensure to create a
blocking libevent timer far in the future. Without that,
opal_event_loop() will return immediately (and therefore the progress
thread spins hard, stealing CPU cycles).
2015-09-18 11:53:03 -07:00
Jeff Squyres
7cb6c2fcc2
usnic: put the HWLOC #if's back to preserve compat with v1.10
...
We try to keep the source code the same between master and v1.10. So
put the #if's back for OPAL_HAVE_HWLOC (and just hard-code it to 1 on
master) so that this code is also compilable in v1.10.
2015-09-18 11:53:03 -07:00
Piotr Lesnicki
1dd5487fae
odls: close only used file descriptors at fork/exec
2015-09-18 16:44:57 +02:00
Edgar Gabriel
01fcfb08fe
do not set the contigous flag in two_phase_file_read_all. This optimization
...
needs some more debugging for the two_phase component, and is disabled
for two_phase_file_write_all as well.
2015-09-18 09:30:50 -05:00
Edgar Gabriel
3734a38370
this file should have been part of the previous commit. for removeing io_ompio_nbc.[ch]
2015-09-18 09:28:25 -05:00
Edgar Gabriel
cf46a6bd4d
remove the io_ompio_nbc.[ch] files, they are not used anymore at this point in time.
2015-09-18 09:26:25 -05:00
Gilles Gouaillardet
a611274704
pml: fix commit open-mpi/ompi@6e6a3e965c
...
do not use the const modifier for allocator nor recv buffers
2015-09-18 09:54:18 +09:00
Rolf vandeVaart
7da614c75e
Add ability for user to empty the CUDA IPC registration cache when it is full
2015-09-17 16:42:16 -04:00
Jeff Squyres
567c9e3a5b
mtl_ofi_component.c: add missing argv.h header
2015-09-17 10:05:05 -07:00
Nathan Hjelm
dfbe584c92
ompi/group: fix typos in add_procs changes
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-17 09:21:32 -06:00
rhc54
6efe91a24b
Merge pull request #904 from ggouaillardet/topic/cpuset
...
hwloc: do not count not allowed cores in df_search_cores
2015-09-17 03:55:50 -07:00
Gilles Gouaillardet
975b6fd51b
hwloc: do not count not allowed cores in df_search_cores
2015-09-17 13:10:34 +09:00
Nathan Hjelm
131681acc6
Merge pull request #901 from hjelmn/comm_fix
...
ompi/comm: fix comm_[i]dup on intracommunicators
2015-09-16 12:43:19 -06:00
Nathan Hjelm
c84c05bab7
ompi/comm: fix comm_[i]dup on intracommunicators
...
The behavior of ompi_comm_set was changed to get the remote size from
the remote group. This broke how ompi_comm_[i]dup were using
ompi_comm_set. In order to adapt to the new behavior these functions
now pass NULL for the remote group if the communicator is not an
inter-communicator.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-16 10:31:18 -06:00
rhc54
55d40910ee
Merge pull request #899 from rhc54/topic/cov
...
Silence some warnings and address Coverity issues
2015-09-16 09:23:32 -07:00
Ralph Castain
1b7930ad52
Silence some warnings and address Coverity issues
2015-09-16 07:58:22 -07:00
Ralph Castain
8b88ea9b13
Fix singletons by removing stale code
2015-09-16 00:58:05 -07:00
George Bosilca
02624bd0b6
Fix all treematch issues idenfied by Coverity.
2015-09-15 23:49:11 -04:00
George Bosilca
6ab5f68fc3
indentation.
2015-09-15 22:46:13 -04:00
rhc54
5597416fe0
Merge pull request #897 from rhc54/topic/oob
...
Remove the last involvement of the OOB system from the MPI layer
2015-09-15 14:40:21 -07:00
Jeff Squyres
7cb546a221
core: yow; this should absolutely not be in the repo!
2015-09-15 16:15:04 -04:00
Ralph Castain
c1bbbb5e2f
Remove the last involvement of the OOB system from the MPI layer, remove the no-longer-needed usock/oob component, and have procs no longer open the RML, OOB, ROUTED, and GRPCOMM frameworks as PMIx now provides all required app-mpirun cmds
2015-09-15 13:08:35 -07:00
Rolf vandeVaart
555f14a479
Merge pull request #893 from rolfv/pr/more-verbose-fix
...
Cleanup handle verbose messages
2015-09-15 15:45:52 -04:00
rhc54
3b4e982f86
Merge pull request #896 from hjelmn/comm_set_fix
...
ompi/comm: fix bug in ompi_comm_set
2015-09-15 12:25:55 -07:00
Nathan Hjelm
9c45c63143
ompi/dpm: fix typo in dynamic communicator detection
...
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-15 12:42:58 -06:00
Nathan Hjelm
6379178046
ompi/comm: fix bug in ompi_comm_set
...
This commit updates the behavior of ompi_comm_set to explicitly take
either local/remote group(s) OR local/remote array(s). If array(s) are
in use the sizes will be taken from the appropriate group(s).
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-15 11:37:44 -06:00
Nathan Hjelm
ca4be77ff1
Merge pull request #894 from hjelmn/osh_memheap_fix
...
oshmem/memheap: correct usage of opal_dss functions
2015-09-15 08:05:39 -06:00
George Bosilca
0e7e14449f
Typo in the modex_recv.
2015-09-14 18:00:02 -04:00
Nathan Hjelm
69b9bc2269
oshmem/memheap: correct usage of opal_dss functions
...
Any buffer given to opal_dss.load becomes the responsibility of the
opal_buffer_t object. It will be freed automatically if either the
opal_buffer_t is released or opal_dss.load is called again on the
buffer. opal_dss.unload will not prevent this unless no unpacking
takes place between the .load and .unload calls.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-14 13:54:56 -06:00
Rolf vandeVaart
34fe2188cd
Cleanup handle verbose messages
2015-09-14 11:01:25 -04:00
Mike Dubman
6f82ce3fc8
Merge pull request #879 from igor-ivanov/pr/disable-oshmem-issue
...
Prevent oshmem related files inside install folder in case --disable-oshmem
2015-09-14 12:12:06 +03:00
Gilles Gouaillardet
d5af5d106c
btl/sm: mca_btl_sm_sendi: do not set *descriptor when descriptor is NULL
2015-09-14 14:04:40 +09:00
Nathan Hjelm
f29b65aa14
ompi/proc: fix typos CID 1323840
...
Signed-off-by: Nathan Hjelm <hjelmn@me.com>
2015-09-11 21:02:30 -06:00
rhc54
33f5e4c766
Merge pull request #892 from rhc54/topic/pmix
...
Fix the no-disconnect test by resolving a segfault on free - opal_dss…
2015-09-11 16:01:42 -07:00
Ralph Castain
fbcf819d2e
Remove unnecessary include
2015-09-11 15:53:00 -07:00
Nathan Hjelm
f798c909d1
Merge pull request #883 from hjelmn/comm_split_update
...
ompi/comm: improve comm_split_type scalability
2015-09-11 16:35:34 -06:00
Rolf vandeVaart
d78b954fd4
Merge pull request #891 from rolfv/pr/minor-cuda-verbosity-fixes
...
Fix cuda verbosity messages
2015-09-11 16:33:22 -04:00
Ralph Castain
22d7c0081a
Fix the no-disconnect test by resolving a segfault on free - opal_dss.unload will return the remaining unpacked portion of a buffer. As such, it cannot return the pointer to that info as it might be partway inside of a malloc'd region. So copy the data out of the buffer.
2015-09-11 13:01:35 -07:00
Ralph Castain
b60b03d613
It is okay not to get the hostname - we don't require that it be provided
2015-09-11 13:01:20 -07:00
Nathan Hjelm
c45789a222
ompi/comm: improve comm_split_type scalability
...
This commit includes two changes. First, the locality code has been
factored out to improve readability and maintainability. Second,
instead of looking up each proc using ompi_group_peer_lookup the code
now uses ompi_group_peer_lookup_existing. The code falls back on modex
if a proc doesn't exist. This will prevent MPI_Comm_split_type from
allocating ompi_proc_t's for every process in the job.
Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
2015-09-11 13:53:48 -06:00
Rolf vandeVaart
90dd1d264b
Fix cuda verbosity messages
2015-09-11 15:44:36 -04:00
Nathan Hjelm
1868b5937c
Merge pull request #889 from hjelmn/sentinel_update
...
Use the low instead of the high bit to indicate a proc is a sentinel
2015-09-11 12:30:27 -06:00