1
1
openmpi/orte/mca/odls
Josh Hursey e10f476c87 Bring over the jjh-filem branch which contains a non-blocking FileM interface
and implementation. This has shown drastic performance benefit when
transferring Many files at roughly the same time.

I tested this for many different filem operations and everything was working
fine. Let me know if you have any problems with this functionality.

Some Notes:
 - opal-checkpoint now has a 'quiet' flag to keep it from being too verbose.

 - FileM RSH component is fully non-blocking.

 - FileM RSH component has incomming connection throttling since by default
   ssh only allows 10 concurrent scp connections to any single host. This
   default can be adjusted via an MCA parameter.
    {{{-mca filem_rsh_max_incomming 10}}}

 - There is an MCA parameter for max outgoing connections, but it is currently
   not implemented. If someone needs it then it should not be hard to implement.
    {{{-mca filem_rsh_max_outgoing 10}}}

 - Changed the FileM request structure so that it is a bit more explicit and
   flexible.

 - Moved the 'preload-binary' and 'preload-files' functionality into odls/base
   allowing for code reuse in the 'process' and 'default' ODLS components.

 - Fixed a bug in the process name resolution which broke the 'preload-*'
   functionality due to GPR table structure changes.

 - The FileM RSH component might be able to see even more speedup from using a
   thread pool to operate on the work_pool structures, but that is for future
   work.

 - Added a 'opal-show-help' file to ODLS Base

This commit was SVN r16252.
2007-09-27 13:13:29 +00:00
..
base Bring over the jjh-filem branch which contains a non-blocking FileM interface 2007-09-27 13:13:29 +00:00
bproc reapply r15517 and r15520, which were removed in r15527 so that I could get 2007-07-20 02:34:29 +00:00
default Bring over the jjh-filem branch which contains a non-blocking FileM interface 2007-09-27 13:13:29 +00:00
process Bring over the jjh-filem branch which contains a non-blocking FileM interface 2007-09-27 13:13:29 +00:00
Makefile.am Here is the major MAD-cure commit. I have written plenty about it, so I refer you here to those messages for a description of everything that was done. 2006-09-14 21:29:51 +00:00
odls_types.h Bring in the generalized xcast communication system along with the correspondingly revised orted launch. I will send a message out to developers explaining the basic changes. In brief: 2007-06-12 13:28:54 +00:00
odls.h Fix a problem observed by Brian where processes launched local to mpirun lost their environment except for MCA params. 2007-07-13 15:47:57 +00:00