45dfa0d0b1
- added CUDA stream reuse for both, CUDA tracing with CUPTI and CUDART wrapper - removed CUDA stream number from thread name, when CUDA stream reuse is enabled - disable tracking of MPI communicators, requests, windows, etc. if MPI is initialized with MPI_THREAD_SERIALIZED or MPI_THREAD_MULTIPLE (only MPI function enter/leave events will be recorded) - configure: - fixed detection of compiler instrumentation type on Cray platforms using the cc compiler wrapper - compiler wrappers: - fixed preprocessing source files to be parsed by OPARI (add path to empty omp.h to the preprocessor flags to avoid multiple declarations of OpenMP functions, types, etc.) - vtnvcc: Remove 'compinst' instrumentation type, if VT is configured with a non-GNU compiler instrumentation support (Fixed "unrecognized option" error) - vtdyn: - added support for instrumenting outer- and inner loops and its iterations (outer=loops within a function, inner=loops within outer loops) - try to get the full prototype of functions to be instrumented - consider default filter rules also if no filter file is given - fixed potential segfault if adding a filter rule w/o stack bounds - print verbose messages on stdout if vtdyn is started from the Dyninst attach library (libvt-dynatt) - vtunify: - print verbose messages on stderr if vtunify is started automatically from the VT library This commit was SVN r26836.