- 30 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Andrew Ryrie 提交于
* dnn: LSTM optimisation This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm. fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications: - Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned. - Allow for weight matrices where the number of columns is not a multiple of 8. I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on. * Fix warning about initialisation order * Remove C++11 syntax * Fix build when AVX(2) is not available In this case the CV_TRY_X macros are defined to 0, rather than being undefined. * Minor changes as requested: - Don't check hardware support for AVX(2) when dispatch is disabled for these - Add braces * Fix out-of-bounds access in fully connected layer The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway). This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems. * Improve tail mask handling - Use static array for generating tail masks (as requested) - Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs * Revert whitespace change * Improve readability of conditions for using AVX * dnn(lstm): minor coding style changes, replaced left aligned load
-
- 28 11月, 2021 3 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
- remove similar test from IE scope under HAVE_INF_ENGINE
-
由 yuki takehara 提交于
resolves #21038 * remove C assert * revert C header * fix several points in review * fix test_ds.cpp
-
- 27 11月, 2021 4 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-
- 26 11月, 2021 5 次提交
-
-
由 Christian Clauss 提交于
-
由 Alexander Alekhin 提交于
-
由 Christian Clauss 提交于
-
-
由 Alexander Alekhin 提交于
-
- 25 11月, 2021 2 次提交
-
-
由 Christian Clauss 提交于
Avoid `SyntaxWarning` on Python >= 3.8 ``` >>> "convolutional" == "convolutional" True >>> "convolutional" is "convolutional" <stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="? True ``` Related to #21121
-
由 Alexander Alekhin 提交于
-
- 24 11月, 2021 1 次提交
-
- 23 11月, 2021 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 20 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-
- 19 11月, 2021 5 次提交
-
-
由 Vincent Rabaud 提交于
In case of very small negative h (e.g. -1e-40), with the current implementation, you will go through the first condition and end up with h = 6.f, and will miss the second condition.
-
由 Alexander Alekhin 提交于
-
由 nickjackolson 提交于
issue #20617 addresses lack of warnings on seamlessClone() function when src is None. This commit adds source check using CV_Assert therefore debugging would be easier. Signed-off-by: Nnickjackolson <metedurlu@gmail.com>
-
由 nickjackolson 提交于
Add a warning message using CV_LOG__WARNING(). This way api behaviour is preserved. Outputs are the same but user gets an extra warning in case fopen() fails to access image file for some reason. This would help new users and also debugging complex apps which use imread() Signed-off-by: Nnickjackolson <metedurlu@gmail.com>
-
由 Alexander Alekhin 提交于
-
- 18 11月, 2021 1 次提交
-
-
由 Alexander Alekhin 提交于
-
- 17 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-
- 16 11月, 2021 1 次提交
-
-
由 Qiushi Zheng 提交于
QR code (encoding process) * add qrcode encoder * qr encoder fixes * qr encoder: fix api and realization * fixed qr encoder, added eci and kanji modes * trigger CI * qr encoder constructor fixes Co-authored-by: NAPrigarina <ann73617@gmail.com>
-
- 12 11月, 2021 1 次提交
-
-
由 Alexander Alekhin 提交于
* dnn(ocl4dnn): fix LRN layer accuracy problems - FP16 intermediate computation is not accurate and may provide NaN values * dnn(test): update tolerance for FP16
-
- 11 11月, 2021 2 次提交
-
-
由 tv3141 提交于
Fix seg fault houghlinespointset * Clarify parameter doc for HoughLinesPointSet * Fix seg fault. * Add regression test. * Fix latex typo
-
由 Alexander Alekhin 提交于
-
- 10 11月, 2021 2 次提交
-
-
由 Piotr Kubaj 提交于
1. Code uses PPC_FEATURE_HAS_VSX, but it's not checked similarly to PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00 for availability. FreeBSD has those macros in machine/cpu.h, but I went with the way chosen for PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00. Other than that, FreeBSD also has sys/auxv.h and that's where elf_aux_info() is defined. 2. getauxval() is actually Linux-only, but code checked for __unix__. It won't work on all UNIX, so change it back to __linux__. Add another code variant strictly for FreeBSD. 3. Update comment. This commit adds code for FreeBSD, but recently there appeared support for powerpc64 in OpenBSD.
-
由 ZaKiiiiiiiii 提交于
fix bug: wrong output dimension when "keep_dims" is false in pooling layer. * fix bug in max layer * code align * delete permute layer and add test case * add name assert * check other cases * remove c++11 features * style:add "const" remove assert * style:sanitize file names
-
- 06 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Nikolaos Pappas 提交于
-
- 05 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Vincent Rabaud 提交于
In practice, processors offering AVX2/AVX512 also FMA, that is why it got unnoticed.
-
- 04 11月, 2021 2 次提交
-
-
由 Alexander Alekhin 提交于
-
由 Alexander Alekhin 提交于
-