Ralph Castain
0442a807c0
Default the OOB to the "ud" component IFF the HNP finds itself on a node with a supported Infiniband device. Ensure that the daemons all pick the matching component by dictating the selection via mca param on the orted cmd line.
...
This commit was SVN r26582.
2012-06-08 01:23:08 +00:00
Ralph Castain
05122a2f93
Make debruijn the default routed component. Update the radix component to "short-circuit" the tree when the job size permits
...
This commit was SVN r26580.
2012-06-08 00:35:36 +00:00
Ralph Castain
ffcca0185a
Remove no longer needed component
...
This commit was SVN r26578.
2012-06-08 00:18:59 +00:00
Ralph Castain
980768965f
Remove unused and unsupported component
...
This commit was SVN r26577.
2012-06-07 23:48:06 +00:00
Ralph Castain
350900f70e
Remove unused and unsupported component
...
This commit was SVN r26576.
2012-06-07 23:47:35 +00:00
Nathan Hjelm
625c8078c3
oob/ud: fix typo
...
This commit was SVN r26569.
2012-06-07 19:21:23 +00:00
Ralph Castain
7a94a52420
No reason not to build this
...
This commit was SVN r26568.
2012-06-07 19:11:44 +00:00
Ralph Castain
5876496f4c
Enable orte progress threads and libevent thread support by default
...
This commit was SVN r26565.
2012-06-07 04:25:00 +00:00
Shiqing Fan
2abf783fa0
Remove a unnecessary definition before the real one.
...
This commit was SVN r26562.
2012-06-06 14:15:39 +00:00
Ralph Castain
166d254d4e
Add new routed component
...
This commit was SVN r26557.
2012-06-06 11:53:12 +00:00
Ralph Castain
d6279fc971
Fix the debugger daemon launch support to fit the new state machine. Treat debugger daemons just like any other job, except that we map them only to nodes where an app process currently exists (as opposed to every node in the system). Trigger breakpoint and rank0 release only after the debugger daemons are in position.
...
This commit was SVN r26556.
2012-06-06 02:01:23 +00:00
Jeff Squyres
0b8849e2c4
Make "mpirun --report-bindings" have a user-friendly output (i.e.,
...
readable by normal human beings, vs. having a bitmap of physical
PU's). Use the new hwloc base prettyprint functions to generate the
output.
This commit was SVN r26533.
2012-06-01 16:35:31 +00:00
Jeff Squyres
99c5afb397
Remove clang compiler warnings.
...
This commit was SVN r26523.
2012-05-29 23:36:06 +00:00
Ralph Castain
b0938a254e
Dont use mutex where it isn't needed
...
This commit was SVN r26521.
2012-05-29 20:21:11 +00:00
Ralph Castain
32b66c166b
Missed one blasted spot
...
This commit was SVN r26520.
2012-05-29 20:20:10 +00:00
Ralph Castain
9bedb25dda
Cleanup some compiler warnings, some of which are actual logic errors
...
This commit was SVN r26519.
2012-05-29 20:11:51 +00:00
Ralph Castain
d7ac424d8d
Silence optimized build warnings
...
This commit was SVN r26518.
2012-05-29 19:55:47 +00:00
Ralph Castain
bf5ec1ac0c
Silence optimized build warnings
...
This commit was SVN r26517.
2012-05-29 19:55:31 +00:00
Shiqing Fan
08d553d7bf
Add a file to the installation list.
...
This commit was SVN r26507.
2012-05-29 13:58:23 +00:00
Ralph Castain
9883f42caf
Add missing commit
...
This commit was SVN r26501.
2012-05-28 02:20:20 +00:00
Ralph Castain
e705de1ce6
Complete nidmap cleanup - we don't know our node until we have unpacked all the jobs since our job is always the last one, so wait until all jobs are unpacked before assigning locality
...
This commit was SVN r26500.
2012-05-27 18:37:57 +00:00
Ralph Castain
be6ed9c2df
Allow partial use of allocations by specifying the max number of daemons (i.e., max VM size) for the job
...
This commit was SVN r26499.
2012-05-27 16:48:19 +00:00
Ralph Castain
c69a04e16b
Cleanup the pidmap decoding for apps to avoid confusion
...
This commit was SVN r26498.
2012-05-27 16:21:38 +00:00
Ralph Castain
31beff6362
Oops - if we don't want the Java bindings, then we really shouldn't be building them :-/
...
Also ensure we don't try to build them if no Java support was found, and error out if the user requests the bindings and we didn't find Java support.
Add a configure flag to skip the Java tests and just force-set the Java support to "disabled"
This commit was SVN r26484.
2012-05-23 19:51:27 +00:00
Ralph Castain
7fb49b1559
Silence warning
...
This commit was SVN r26480.
2012-05-23 13:59:41 +00:00
Ralph Castain
da28a4b0e6
Silence warning
...
This commit was SVN r26479.
2012-05-23 13:59:22 +00:00
Jeff Squyres
7969faf372
Fixes trac:3057: minor update to the man page to state that slot locations
...
in rankfiles use ''physical'' device indexes (vs. logical indexes).
This commit was SVN r26478.
The following Trac tickets were found above:
Ticket 3057 --> https://svn.open-mpi.org/trac/ompi/ticket/3057
2012-05-23 11:43:33 +00:00
Nathan Hjelm
b9959a95cd
ack! one more
...
This commit was SVN r26472.
2012-05-22 20:52:52 +00:00
Nathan Hjelm
f2d4e95429
doh! add missing include
...
This commit was SVN r26471.
2012-05-22 20:49:13 +00:00
Nathan Hjelm
cdc3c87ba6
move pmi init/finalize into a common component
...
This commit was SVN r26470.
2012-05-22 15:15:39 +00:00
Nathan Hjelm
78b8b3cf76
bug fix: actually close ess components
...
This commit was SVN r26469.
2012-05-22 15:09:18 +00:00
Ralph Castain
b217124bd8
Symlink instead of copy
...
This commit was SVN r26464.
2012-05-21 23:07:48 +00:00
Ralph Castain
da3873af6f
Rename the mapreduce tool to "mr+" per the marketing types
...
This commit was SVN r26463.
2012-05-21 21:17:44 +00:00
Nathan Hjelm
6eeca66475
add an option to enable static ports. diabled by default
...
This commit was SVN r26462.
2012-05-21 19:56:15 +00:00
Ralph Castain
83d69b6c95
Enable the ORTE progress thread for apps (not needed in the tools as they already continuously loop in the event lib). This appears to be working, at least for MPI apps that only use shared memory (a simple "hello"). More testing is required to identify where problems will occur - this is only intended to allow further development.
...
In order to use the progress thread, you must configure with:
--enable-orte-progress-threads --enable-event-thread-support
This commit was SVN r26457.
2012-05-20 15:14:43 +00:00
Ralph Castain
c4f8043064
Per Nathan, with a little cleanup by me: update the PMI support to aggregate modex info, thus reducing the number of keys required so it fits within Cray default constraints
...
This commit was SVN r26456.
2012-05-19 16:12:52 +00:00
Ralph Castain
a526afae92
Ensure we always cleanup local procs, no matter how we exited.
...
This commit was SVN r26454.
2012-05-18 23:37:40 +00:00
Ralph Castain
12ebc0e269
Don't need this to be a bin program as the class is captured in the jar
...
This commit was SVN r26453.
2012-05-18 23:37:18 +00:00
Ralph Castain
b16e43f489
Silence a warning on Mac
...
This commit was SVN r26449.
2012-05-18 15:27:04 +00:00
Ralph Castain
ca1b325738
Tweak the java setup so it works better on Mac. Only build mapreduce and allocators if hadoop support was requested.
...
This commit was SVN r26448.
2012-05-18 01:02:01 +00:00
Jeff Squyres
cab31eafce
Revert r26413: it was causing too much confusion. When an MPI proc
...
exits with status 77, the whole job will be killed, but mpirun will
still return an exit status of 77, so MTT will report it as a skip
anyway.
This commit was SVN r26445.
The following SVN revision numbers were found above:
r26413 --> open-mpi/ompi@02aa36f2e5
2012-05-16 14:45:58 +00:00
Jeff Squyres
dab7d36a81
Fix location of the default hostfile. Thanks to Götz Waschk for
...
identifying the problem.
This commit was SVN r26441.
2012-05-15 16:13:39 +00:00
Jeff Squyres
2d78728d38
Fix the macro name in the comment: it's EXTRA_DIST, not EXTRA_SOURCES.
...
This commit was SVN r26429.
2012-05-10 14:07:36 +00:00
Jeff Squyres
b325c17c72
It's a little weird to put in a blank _SOURCES line for the
...
HDFSFileFinder PROGRAM, but if we don't put in a _SOURCES line at all,
Automake will default to "HDFSFileFinter_class_SOURCES =
HDFSFileFinder.c", which clearly will cause problems.
But we don't want to put the .java file in _SOURCES, either, because
we haven't configured Automake to handle Java (because current
versions of Automake only have GCJ, not other Java compilers). So set
HDFSFileFinder_class_SOURCES to blank and list the .java file in
EXTRA_SOURCES (so that they get picked up for "make dist").
This commit was SVN r26424.
2012-05-10 13:54:51 +00:00
Ralph Castain
b9d560263f
Ensure we properly handle systems that do not have a jdk installed
...
This commit was SVN r26421.
2012-05-10 12:06:59 +00:00
Ralph Castain
b143633593
Fix java config
...
This commit was SVN r26420.
2012-05-10 01:51:02 +00:00
Ralph Castain
640f0610aa
Fix the makefile to install the perl scripts properly
...
This commit was SVN r26416.
2012-05-09 14:06:02 +00:00
Ralph Castain
fd796cce0a
Add an allocator tool for finding HDFS file locations and obtaining allocations for those nodes (supports both Hadoop 1 and 2). Split the Java support into two parts: detection of Java support and request for Java bindings.
...
This commit was SVN r26414.
2012-05-09 01:13:49 +00:00
Jeff Squyres
02aa36f2e5
ORTE defaults to killing the entire job when any process exits with a
...
nonzero status (we polled other MPI implementations since one one in
the OMPI community had a concrete opinion on what behavior to do here
-- all other MPI's seem to adhere to this behavior, too).
This commit adds an MCA parameter that allows us to tell ORTE to
''not'' kill jobs when a process exits with a status of 77, meaning
the GNU testing standard of "this test was skipped". In all the OMPI
tests, all procs will either return 77 or not. So if they all return
77, mpirun won't consider it an error, but will still return an exit
status of 77 (so that MTT can know that the test was cleanly skipped).
This commit was SVN r26413.
2012-05-08 21:49:05 +00:00
Ralph Castain
84d031d6c1
Add daemon object to job array after creation
...
This commit was SVN r26406.
2012-05-08 13:39:20 +00:00