Rich Graham
27182afb67
get the timers in correctly.
...
This commit was SVN r17832.
2008-03-16 03:25:16 +00:00
Rich Graham
afcd1016fd
move temp buffer allocation out of the iteration loop - i.e. always use the
...
same temp loop. The algorithm is rather synchronous already...
This commit was SVN r17831.
2008-03-16 03:20:46 +00:00
Rich Graham
a1766b29f6
fix some barrier addressing errors.
...
This commit was SVN r17830.
2008-03-15 22:46:19 +00:00
Rich Graham
0453e7d2f4
bug in management memory allocation - too much memory allocated.
...
This commit was SVN r17829.
2008-03-15 18:12:20 +00:00
Rich Graham
3c2f1eb8bf
reduce the number of temp buffers used.
...
This commit was SVN r17828.
2008-03-15 17:23:04 +00:00
Rich Graham
0f9d642d51
temp buffer pointers are computed when they are set up. A bit more
...
efficient, but more important, it is much easier to play around with
memory layout now.
This commit was SVN r17827.
2008-03-15 16:36:35 +00:00
Rich Graham
e3e336b5ab
check point
...
This commit was SVN r17826.
2008-03-15 13:31:21 +00:00
Jeff Squyres
6c77c995c2
Add missing dependencies in the static build case.
...
This commit was SVN r17825.
2008-03-15 12:11:36 +00:00
George Bosilca
5e229fe688
Thanks Ma for the patch. Correct the multi-rail support and
...
rename some fields to something more clear.
This commit was SVN r17824.
2008-03-14 19:17:28 +00:00
George Bosilca
ecebd5ae77
Update the Elan BTL to take in account multiple networks, and correctly deal
...
with the node position in the network.
This commit was SVN r17822.
2008-03-14 17:32:35 +00:00
Gleb Natapov
772772b944
Remove unneeded include.
...
This commit was SVN r17813.
2008-03-12 10:01:20 +00:00
Gleb Natapov
90c70e37b9
Clean up SM btl startup code. Remove no longer needed code leftovers from two
...
BTL times. Remove old and no longer correct comment.
This commit was SVN r17805.
2008-03-11 14:39:10 +00:00
Gleb Natapov
3a9652ffc4
Endpoint array may not exist if in add_proc() we failed to find suitable
...
btl for communication with a proc. Don't segfault in this case.
This commit was SVN r17804.
2008-03-11 08:13:37 +00:00
Gleb Natapov
ffa09c44fd
Pass correct pointer to mpool_base function.
...
This commit was SVN r17795.
2008-03-09 13:22:12 +00:00
Gleb Natapov
b0b21c68b4
Remove trailing spaces from SM BTL.
...
This commit was SVN r17794.
2008-03-09 13:17:13 +00:00
Rich Graham
ebcf928c24
add some diagnostics.
...
This commit was SVN r17789.
2008-03-07 22:27:41 +00:00
Rich Graham
9131461511
move some test code to another machine.
...
This commit was SVN r17785.
2008-03-07 19:18:02 +00:00
Rich Graham
c230b65543
fix a couple of bugs. Recursive doubling seems to be working.
...
This commit was SVN r17777.
2008-03-07 02:51:38 +00:00
Rich Graham
70157166f9
checkpoint - compiles, now neeed to debug.
...
This commit was SVN r17775.
2008-03-07 00:39:59 +00:00
Ralph Castain
b110a247be
Fix comm_spawn (maybe).
...
Comm_spawn was sticking during spawn_multiple because of a problem in the dpm - the modex there is asking processes to talk to each other in an allgather_list operation, but the procs don't have the required contact info to do so. The solution here was to ensure that all parent procs have full contact info for procs in the child job.
Admittedly, this isn't the long-term answer. We would like to have the contact info given to only the parent procs that were involved in the comm_spawn. There is a way to do that, but this will suffice to keep things working until that can be implemented and tested.
This commit was SVN r17772.
2008-03-06 21:56:00 +00:00
Rich Graham
4eace9d020
starting to implement recursive doubling algorithm.
...
This commit was SVN r17765.
2008-03-06 18:38:58 +00:00
Tim Prins
5de3e1965e
Remove the orte_proc_table. Migrate all users of it to the opal_hash_table and a new name hash function in orte.
...
Everything should work, however I am unable to compile and test the sctp BTL.
This commit was SVN r17751.
2008-03-05 22:44:35 +00:00
Tim Prins
f9916811ae
Make it so we do not mangle the options the user passes to their executeable. Fixes trac:1124
...
The change also:
- cleans up and simplifies the command line processing code
- adds an error output if more than one hostfile passed for a single app context
- gets rid of the superfluous orte_app_context_map_t type, and instead use a simple argv of -host options
This commit was SVN r17750.
The following Trac tickets were found above:
Ticket 1124 --> https://svn.open-mpi.org/trac/ompi/ticket/1124
2008-03-05 22:12:27 +00:00
Donald Kerr
ef8f807c1c
was not passing correct variable to dat_strerror
...
This commit was SVN r17749.
2008-03-05 21:45:16 +00:00
Josh Hursey
612ebdc2ac
Cleanup some symbol visability issues.
...
This commit was SVN r17733.
2008-03-05 13:59:25 +00:00
Jeff Squyres
597266fdec
Present state of MPI debugger work:
...
* New/improved bootstrapping technique for DLLs
* First cut of the MPI handle debugging interface. It is still
evolving, but the interface is getting more stable.
* Some minor bugs were fixed in the unity topo component (brought to
light because of the new MPI handle debugging stuff).
Fixes trac:1209.
This commit was SVN r17730.
The following Trac tickets were found above:
Ticket 1209 --> https://svn.open-mpi.org/trac/ompi/ticket/1209
2008-03-05 12:22:34 +00:00
Josh Hursey
3b4073e32c
This commit fixes the checkpoint/restart functionality on the trunk. Included in this commit are:
...
* Extension to the ESS framework to support C/R
* Fixed support for {{{snapc_base_establish_global_snapshot_dir}}}
* Fixed FileM support
* Misc. minor code modifications
There are some outstanding visability issues that I want to fix next.
This commit was SVN r17725.
2008-03-05 04:57:23 +00:00
Jeff Squyres
ea5c0cb4a2
Now that the nightly tarball has safely been made, let's try this
...
commit again. Remove the svn:ignore from problematic directories and
try a merge from /tmp-public/plpa-merge-area2.
This commit was SVN r17718.
2008-03-05 02:45:15 +00:00
Galen Shipman
3a59cbd4a7
not sure how this got missed..
...
This commit was SVN r17710.
2008-03-05 01:23:43 +00:00
Christian Bell
987de57c9c
Looks like orte/ns is now gone
...
This commit was SVN r17706.
2008-03-05 00:55:43 +00:00
Jeff Squyres
8189fcc7d5
Back out r17702; it went very badly.
...
This commit was SVN r17704.
The following SVN revision numbers were found above:
r17702 --> open-mpi/ompi@3df754ebd7
2008-03-05 00:42:39 +00:00
Jeff Squyres
3df754ebd7
Bring over PLPA v1.1 from /tmp-public/plpa-v1.1 branch.
...
This commit was SVN r17702.
2008-03-05 00:16:49 +00:00
Christian Bell
c3d0a81cd3
Add new QLogic adapters to hca-params.init
...
This commit was SVN r17699.
2008-03-04 22:14:27 +00:00
Ralph Castain
55c727cea4
Fix compiler warning
...
This commit was SVN r17684.
2008-03-04 15:46:37 +00:00
Rich Graham
67ad9b6d6b
increase max data segments size.
...
This commit was SVN r17677.
2008-03-02 19:11:09 +00:00
Gleb Natapov
08abafdaa1
Initialize ib_pd to NULL.
...
This commit was SVN r17674.
2008-03-02 09:11:23 +00:00
Rich Graham
53126fa7bd
add calls to opal_progress()
...
This commit was SVN r17673.
2008-02-29 23:25:09 +00:00
Rich Graham
d37db14901
get the shared memory collectives working again with the new
...
version of orte.
This commit was SVN r17672.
2008-02-29 22:28:57 +00:00
Rich Graham
c253a7bda1
simplify the code abit.
...
This commit was SVN r17664.
2008-02-29 03:55:12 +00:00
Rich Graham
1632d8b299
revert to an older (not previosly checked in) version to get around a
...
regression.
This commit was SVN r17663.
2008-02-29 03:12:12 +00:00
Rich Graham
827e8d877e
fix bug in node type, and some memory copy optimizations.
...
This commit was SVN r17661.
2008-02-29 01:20:11 +00:00
Rich Graham
940d6732c9
remove compiler warnings.
...
This commit was SVN r17656.
2008-02-28 22:01:19 +00:00
Tim Prins
84b2099fe8
Remove the now-unused orte_value_array. As this is the last 'class' split between orte and ompi, remove the big comment about the split in ompi_bitmap.
...
Also, update some properties (source files should not be executeable...), and remove a couple unneeded inclusions of orte_proc_table.h
This commit was SVN r17655.
2008-02-28 21:39:42 +00:00
Rich Graham
2b5fab9d51
avoid 0 byte malloc.
...
This commit was SVN r17653.
2008-02-28 21:11:42 +00:00
Rich Graham
4b26adef00
remove some debug output.
...
This commit was SVN r17650.
2008-02-28 20:54:35 +00:00
Ralph Castain
48e5840c50
Restore a placeholder to make non-SVN SCM's happy.
...
This commit was SVN r17648.
2008-02-28 20:19:22 +00:00
Rich Graham
5df6c6d043
fix several race conditions.
...
This commit was SVN r17645.
2008-02-28 19:40:19 +00:00
George Bosilca
9d421bea2a
Replace all occurences of orte_pointer_array by opal_pointer_array. Remove the
...
implementation of orte_pointer_array.
This commit was SVN r17636.
2008-02-28 05:32:23 +00:00
George Bosilca
678e6c7f0d
This is a Mercurial file.
...
This commit was SVN r17635.
2008-02-28 05:18:06 +00:00
Ralph Castain
d70e2e8c2b
Merge the ORTE devel branch into the main trunk. Details of what this means will be circulated separately.
...
Remains to be tested to ensure everything came over cleanly, so please continue to withhold commits a little longer
This commit was SVN r17632.
2008-02-28 01:57:57 +00:00
Aurelien Bouteiller
76e6334a57
This change is a mistake. CONVERTOR METHOD does not work with unpatched trunk. Revert back to PACK_METHOD.
...
This commit was SVN r17629.
2008-02-27 20:02:25 +00:00
Aurelien Bouteiller
1d57b8b0e0
Replaced all the (long) cast by PRIsize_t. Should solve definitely compiler warnings that appeared from time to time depending on sizeof(size_t)...
...
This commit was SVN r17627.
2008-02-27 19:58:18 +00:00
Rich Graham
68aa691171
checkpoint work.
...
This commit was SVN r17620.
2008-02-27 14:56:36 +00:00
Galen Shipman
b378c8c12c
return success.
...
This commit was SVN r17612.
2008-02-27 02:15:53 +00:00
Galen Shipman
44003a41f2
Update common_portals to allow using portals interconnect with a modex rather
...
than relying on cnos to get the nid/pid map.
This commit was SVN r17588.
2008-02-25 19:17:21 +00:00
Rich Graham
b4bbb70bb7
got it all, but for the mem copies. Also, need to make sure volatile declarations are all inplace, as well as memory barriers.
...
This commit was SVN r17572.
2008-02-25 00:16:21 +00:00
Rich Graham
2d8c2420e8
checkpoint.
...
This commit was SVN r17571.
2008-02-24 20:54:16 +00:00
Rich Graham
771584bff5
generate reduction tree.
...
This commit was SVN r17569.
2008-02-24 03:25:40 +00:00
Brian Barrett
bc8d863ce3
* Make Portals BTL compile again (looks like the frag ownership stuff didn't
...
get copied well)
* Clean up a bunch of warnings
This commit was SVN r17562.
2008-02-23 01:45:36 +00:00
Donald Kerr
437e280829
removing a few superfluous casts when the base or super is available
...
This commit was SVN r17554.
2008-02-22 20:10:55 +00:00
Ralph Castain
b4ec81a9fd
Fix the Panasas support in ROMIO so it builds without complaints. Required a patch from Brian, plus a few edits by me to remove warnings.
...
NOTE: the code provided by PANASAS includes a "switch" that they left incomplete - it doesn't cover all possibilities. Since the value being switched is an enum, this causes problems for the compiler. I added the missing values, but - since Panasas felt they could be ignored - had the switch generate an error if those cases ever occurred.
This commit was SVN r17543.
2008-02-21 20:35:34 +00:00
Donald Kerr
fe51084d8e
fix compile warning by casting btl udapl module to base module before call to mca_btl_udapl_free
...
This commit was SVN r17541.
2008-02-21 16:19:06 +00:00
Pierre Lemarinier
2a99f89631
Modification of the mutex lock order to prevent races during connection stage.
...
This commit was SVN r17535.
2008-02-20 18:17:58 +00:00
Rich Graham
b9bb78484d
a bit of omptimization.
...
This commit was SVN r17528.
2008-02-20 16:19:49 +00:00
Pavel Shamis
a0d12a9c92
Adding support for APM over different ports
...
This commit was SVN r17521.
2008-02-20 13:44:05 +00:00
Rich Graham
09afc36f5f
correct addressing.
...
This commit was SVN r17519.
2008-02-20 01:12:43 +00:00
Rich Graham
b87b15580c
fix memory allocation error. Initialize pointer.
...
This commit was SVN r17514.
2008-02-19 20:01:42 +00:00
Gleb Natapov
60c151608c
Set flags inside fragment allocation function.
...
This commit was SVN r17508.
2008-02-19 12:26:45 +00:00
Nysal Jan
479f36adfc
Fix a SEGV on ppc64. size_t is 8 bytes on a 64-bit build
...
This commit was SVN r17507.
2008-02-19 11:01:21 +00:00
Jeff Squyres
5bb1e5151f
Suggestions/patches from Brian to make stuff better:
...
* Include all the stuff that is necessary for running autogen.sh in a
distribution tarball.
* Remove from config/Makefile.am's EXTRA_DIST that which is
automatically included in the tarball in recent versions of
Automake (i.e., all the m4 files that are acincluded).
* Make ROMIO's configure script look for something that is actually
included in the tarball.
Fixes trac:1025.
This commit was SVN r17505.
The following Trac tickets were found above:
Ticket 1025 --> https://svn.open-mpi.org/trac/ompi/ticket/1025
2008-02-19 01:49:52 +00:00
Jeff Squyres
f22f62ef1f
Fix typos.
...
This commit was SVN r17502.
2008-02-18 21:26:21 +00:00
Jeff Squyres
33a4aff18e
Make openib btl a bit more resillient in the face of driver errors --
...
return OMPI_ERR_UNREACH if the port returns an invalid speed or
width. OMPI_ERR_VALUE_OUT_OF_BOUNDS is reserved for when we exceed
the number of allowable BTLs.
This commit was SVN r17500.
2008-02-18 20:28:06 +00:00
George Bosilca
7a21d77b29
Remove some compilation warnings.
...
This commit was SVN r17498.
2008-02-18 18:55:32 +00:00
George Bosilca
fa31ec81d0
Add the ownership flags to the PML/BTL interface. The layer
...
owning the descriptor is responsible for releasing it once
the descriptor is not in use anymore.
This commit was SVN r17497.
2008-02-18 17:39:30 +00:00
Shiqing Fan
653857ddbe
Wrong function name was copied here.
...
This commit was SVN r17486.
2008-02-17 19:47:47 +00:00
Gleb Natapov
354c5bc5e1
Don't call progress() from OB1 fragment scheduling functions. They don't serve
...
any purpose and case recursion calls to progress engine.
This commit was SVN r17478.
2008-02-17 12:42:32 +00:00
Rich Graham
1cd8a2e578
checkpoint - works for 2 procs, but not more.
...
This commit was SVN r17477.
2008-02-17 05:21:58 +00:00
Rich Graham
8006927ae8
free buffer, rather than ask for another one, when done with the memory.
...
This commit was SVN r17468.
2008-02-15 04:21:58 +00:00
Rich Graham
2277b47ab9
register mca_coll_sm2_allreduce_intra - function still does not do any
...
reduction operations.
This commit was SVN r17467.
2008-02-15 04:13:00 +00:00
Rich Graham
9b0687e6df
add buffer allocation and deallocation calls to the allreduce routine, so
...
I can start debugging the memory management code. The allreduce fucntion
does nothing at this stage.
This commit was SVN r17466.
2008-02-15 03:59:14 +00:00
George Bosilca
be2579467a
With the new ompi_free_list this is not needed anymore.
...
This commit was SVN r17465.
2008-02-15 03:22:16 +00:00
Rich Graham
41943dbd76
adding missing files.
...
This commit was SVN r17462.
2008-02-15 00:59:28 +00:00
Rich Graham
41f4b06b39
buffer allocate/release code is fully written, and compiles. Now need to debug.
...
This commit was SVN r17461.
2008-02-15 00:57:44 +00:00
Rich Graham
7cc58768cd
checkpoint something that compiles
...
This commit was SVN r17460.
2008-02-15 00:33:14 +00:00
Rich Graham
292d930eea
check point.
...
This commit was SVN r17457.
2008-02-14 20:00:26 +00:00
Donald Kerr
58bf7f5a1d
add uintptr_t to prevent the possibility of a signed extension occuring
...
This commit was SVN r17456.
2008-02-14 19:16:34 +00:00
Aurelien Bouteiller
3ffe845187
Fixed warning.
...
This commit was SVN r17454.
2008-02-14 15:18:19 +00:00
Jeff Squyres
6420db7088
Add missing header file that caused compilation errors in the
...
rhc-step2b branch last night.
This commit was SVN r17453.
2008-02-14 14:10:27 +00:00
George Bosilca
255cd2186b
Improve the performance of the MX BTL. Correct the fake PUT
...
protocol.
This commit was SVN r17452.
2008-02-14 04:38:55 +00:00
Adrian Knoth
f1648f08df
Advanced address selection code from Thomas Peiselt. Re #1207 , #1027
...
This commit was SVN r17450.
2008-02-13 21:53:00 +00:00
Sharon Melamed
5b2dab2439
Reverted commit # r17443
...
This commit was SVN r17446.
The following SVN revision numbers were found above:
r17443 --> open-mpi/ompi@88ce5a2b73
2008-02-13 14:07:12 +00:00
Sharon Melamed
88ce5a2b73
Replaced PLPA to the latest PLPA (plpa-1.1a3r123)
...
This commit was SVN r17443.
2008-02-13 13:09:11 +00:00
Gleb Natapov
0a1fa2cb56
req_match_received is set inside MCA_PML_OB1_RECV_REQUEST_MATCHE().
...
This commit was SVN r17442.
2008-02-13 08:34:39 +00:00
Gleb Natapov
876f49f1a7
Remove unnecessary assignment. It is done later in the same function.
...
This commit was SVN r17441.
2008-02-13 08:28:25 +00:00
Jeff Squyres
17ede97ef8
Two fixes to revert some long-ago decisions that seemed like a good
...
idea at the time, but led to logistical difficulties in importing new
versions of ROMIO:
* We are effectively eliminating the ROMIO file prefix rule hacks in
the ROMIO component, which create symlinks from foo.c to
io_romio_foo.c. In reality, the file name conflict potential will
be small.
* Additionally, we are effectively eliminating the ROMIO function
prefix rule in the ROMIO component. This is another place where
there are generally problems with the merge up new versions of ROMIO
and/or patches from the user community (for their own local builds).
In reality, since other major MPI implementations provides the same
exact symbols, it won't cause any practical problems for users.
In return, we make it ''much'' simpler to apply ROMIO patches to Open
MPI. The problem right now is that any patch will have filenames such
as ad_panfs.c, but Open MPI will only have io_romio_ad_panfs.c, making
things extremely difficult for users. I believe, for example, that
this would make it possible for LANL to have applied their patches
without too much hassle on either their part or our part. It will
also make things easier for OMPI when we/they want to do the next
ROMIO upgrade (this was one of the sources of problems on each
upgrade).
This commit was SVN r17436.
2008-02-12 18:55:17 +00:00
Shiqing Fan
54c7b71cfd
Use the correct way of including memchecker.h, which will work with '--with-devel-headers'.
...
This commit was SVN r17435.
2008-02-12 18:01:17 +00:00
Rainer Keller
7621800477
- Fix and add comments -- output full name for pd
...
- Protect argument in macro...
This commit was SVN r17434.
2008-02-12 16:59:59 +00:00
Jeff Squyres
6adc5015f9
This file was accidentally re-introduced in r17409.
...
This commit was SVN r17428.
The following SVN revision numbers were found above:
r17409 --> open-mpi/ompi@98f70d6318
2008-02-12 13:07:44 +00:00
Shiqing Fan
f5792bbda5
merging the memchecker into trunk.
...
This commit was SVN r17424.
2008-02-12 08:46:27 +00:00
Gleb Natapov
cf801edfe5
Use carto topology framework to choose which HCAs to use.
...
This commit was SVN r17414.
2008-02-11 10:34:11 +00:00