- 07 2月, 2023 1 次提交
-
-
由 Maksim Shabunin 提交于
-
- 17 5月, 2020 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 24 10月, 2019 1 次提交
-
-
由 Alexander Alekhin 提交于
- move TLS & instrumentation code out of core/utility.hpp - (*) TLSData lost .gather() method (to dispose thread data on thread termination) - use TLSDataAccumulator for reliable collecting of thread data - prefer using of .detachData() + .cleanupDetachedData() instead of .gather() method (*) API is broken: replace TLSData => TLSDataAccumulator if gather required (objects disposal on threads termination is not available in accumulator mode)
-
- 27 6月, 2019 1 次提交
-
-
由 Rostislav Vasilikhin 提交于
-
- 16 6月, 2019 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 09 4月, 2019 1 次提交
-
-
由 Alexander Alekhin 提交于
- added functionality to collect memory usage of OpenCL sybsystem - memory usage of fastMalloc() (disabled by default): * It is not accurate sometimes - external memory profiler is required. - specify common `CV_TEST_TAG_` macros - added applyTestTag() function - write memory usage / enabled tags into Google Tests output file (.xml)
-
- 07 3月, 2019 1 次提交
-
-
由 Alexander Alekhin 提交于
Lab/XYZ modes have been postponed (color_lab.cpp): - need to split code for tables initialization and for pixels processing first - no significant performance improvements for switching between SSE42 / AVX2 code generation
-
- 26 2月, 2019 1 次提交
-
-
由 Maksim Shabunin 提交于
-
- 16 11月, 2018 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 24 10月, 2018 1 次提交
-
-
由 maver1 提交于
* Updated ICV packages and IPP integration * core(test): minMaxIdx IPP regression test * core(ipp): workaround minMaxIdx problem * core(ipp): workaround meanStdDev() CV_32FC3 buffer overrun * Returned semicolon after CV_INSTRUMENT_REGION_IPP()
-
- 05 10月, 2018 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 28 9月, 2018 1 次提交
-
-
由 Rostislav Vasilikhin 提交于
Fixes for instrumentation of IPP and OCL (#12637) * fixed warning about re-declaring variable when both IPP and instrumentation are enabled * fixed segfault when no funName provided * compilation fixed when both OCL and instrumentation are enabled
-
- 14 9月, 2018 1 次提交
-
-
由 Hamdi Sahloul 提交于
-
- 12 9月, 2018 1 次提交
-
-
由 Hamdi Sahloul 提交于
-
- 16 4月, 2018 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 10 4月, 2018 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 06 4月, 2018 1 次提交
-
-
由 Maksim Shabunin 提交于
-
- 28 3月, 2018 2 次提交
-
-
由 Alexander Alekhin 提交于
CPATH entries are processed before any -isystem: https://gcc.gnu.org/onlinedocs/cpp/Environment-Variables.html
-
由 Alexander Alekhin 提交于
-
- 14 3月, 2018 1 次提交
-
-
由 Alexander Alekhin 提交于
To avoid compilation of this code: - buf = 0; This code can be received after refactoring of 1D cv::Mat to cv::AutoBuffer. - "cv_mat = 0" calls setTo(). - cv::AutoBuffer calls "allocate(0)" - this is wrong.
-
- 07 12月, 2017 1 次提交
-
-
由 Pavel Rojtberg 提交于
-
- 21 11月, 2017 3 次提交
-
-
由 Maksim Shabunin 提交于
-
由 Maksim Shabunin 提交于
-
由 Maksim Shabunin 提交于
-
- 08 9月, 2017 1 次提交
-
-
由 Pavel Vlasov 提交于
Manual IPP dispatcher simplification;
-
- 23 8月, 2017 1 次提交
-
-
由 Pavel Vlasov 提交于
- Optimizations set change. Now IPP integrations will provide code for SSE42, AVX2 and AVX512 (SKX) CPUs only. For HW below SSE42 IPP code is disabled. - Performance regressions fixes for IPP code paths; - cv::boxFilter integration improvement; - cv::filter2D integration improvement;
-
- 12 7月, 2017 1 次提交
-
-
由 Alexander Alekhin 提交于
NaN values handling issue
-
- 28 6月, 2017 1 次提交
-
-
由 Vadim Pisarevsky 提交于
* another round of dnn optimization: * increased malloc alignment across OpenCV from 16 to 64 bytes to make it AVX2 and even AVX-512 friendly * improved SIMD optimization of pooling layer, optimized average pooling * cleaned up convolution layer implementation * made activation layer "attacheable" to all other layers, including fully connected and addition layer. * fixed bug in the fusion algorithm: "LayerData::consumers" should not be cleared, because it desctibes the topology. * greatly optimized permutation layer, which improved SSD performance * parallelized element-wise binary/ternary/... ops (sum, prod, max) * also, added missing copyrights to many of the layer implementation files * temporarily disabled (again) the check for intermediate blobs consistency; fixed warnings from various builders
-
- 26 6月, 2017 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 25 4月, 2017 1 次提交
-
-
由 Pavel Vlasov 提交于
Updated integrations for: cv::split cv::merge cv::insertChannel cv::extractChannel cv::Mat::convertTo - now with scaled conversions support cv::LUT - disabled due to performance issues Mat::copyTo Mat::setTo cv::flip cv::copyMakeBorder - currently disabled cv::polarToCart cv::pow - ipp pow function was removed due to performance issues cv::hal::magnitude32f/64f - disabled for <= SSE42, poor performance cv::countNonZero cv::minMaxIdx cv::norm cv::canny - new integration. Disabled for threaded; cv::cornerHarris cv::boxFilter cv::bilateralFilter cv::integral
-
- 21 4月, 2017 1 次提交
-
-
由 Pavel Vlasov 提交于
-
- 05 4月, 2017 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 23 3月, 2017 1 次提交
-
-
由 Alexander Alekhin 提交于
- use suffixes like '.avx.cpp' - added CMake-generated files for '.simd.hpp' optimization approach - wrap HAL intrinsic headers into separate namespaces for different build flags - automatic vzeroupper insertion (via CV_INSTRUMENT_REGION macro)
-
- 01 12月, 2016 1 次提交
-
-
由 mshabunin 提交于
-
- 08 11月, 2016 1 次提交
-
-
由 Pavel Vlasov 提交于
--perf_instrument parameter now has int type and 0, 1, 2 modes (1 - simple trees, 2 - expanded trees for functions with same name but different calling address); Maximum depth limit var was added to the instrumentation structure; Trace names output console output fix: improper tree formatting could happen; Output in case of error was added; Custom regions improvements; Improved timing and weight calculation for parallel regions; New TC (threads counter) value to indicate how many different threads accessed particular node; parallel_for, warnings fixes and ReturnAddress code from Alexander Alekhin;
-
- 30 9月, 2016 1 次提交
-
-
由 sourin 提交于
-
- 08 9月, 2016 2 次提交
-
-
由 Alexander Alekhin 提交于
- enable C++11 - fix macro
-
由 Alexander Alekhin 提交于
- calculate ticksTotal instead of ticksMean - local / global width is based on ticksTotal value - added instrumentation for OpenCL program compilation - added instrumentation for OpenCL kernel execution
-
- 19 8月, 2016 1 次提交
-
-
由 Pavel Vlasov 提交于
-
- 17 8月, 2016 1 次提交
-
-
由 Pavel Vlasov 提交于
-