提交 · 17d99e6266270d06068b5b6f5dc03d984bcfd41d · OpenCV / opencv

30 11月, 2021 2 次提交

A

Merge pull request #21142 from alalek:dnn_two_inputs_ocl_fp16_3.4 · 17d99e62
由 Alexander Alekhin 提交于 11月 29, 2021

17d99e62

Merge pull request #20658 from smbz:lstm_optimisation · ea7d4be3

由 Andrew Ryrie 提交于 11月 29, 2021

* dnn: LSTM optimisation

This uses the AVX-optimised fastGEMM1T for matrix multiplications where available, instead of the standard cv::gemm.

fastGEMM1T is already used by the fully-connected layer. This commit involves two minor modifications:
- Use unaligned access. I don't believe this involves any performance hit in on modern CPUs (Nehalem and Bulldozer onwards) in the case where the address is actually aligned.
- Allow for weight matrices where the number of columns is not a multiple of 8.

I have not enabled AVX-512 as I don't have an AVX-512 CPU to test on.

* Fix warning about initialisation order

* Remove C++11 syntax

* Fix build when AVX(2) is not available

In this case the CV_TRY_X macros are defined to 0, rather than being undefined.

* Minor changes as requested:

- Don't check hardware support for AVX(2) when dispatch is disabled for these
- Add braces

* Fix out-of-bounds access in fully connected layer

The old tail handling in fastGEMM1T implicitly rounded vecsize up to the next multiple of 8, and the fully connected layer implements padding up to the next multiple of 8 to cope with this. The new tail handling does not round the vecsize upwards like this but it does require that the vecsize is at least 8. To adapt to the new tail handling, the fully connected layer now rounds vecsize itself at the same time as adding the padding(which makes more sense anyway).

This also means that the fully connected layer always passes a vecsize of at least 8 to fastGEMM1T, which fixes the out-of-bounds access problems.

* Improve tail mask handling

- Use static array for generating tail masks (as requested)
- Apply tail mask to the weights as well as the input vectors to prevent spurious propagation of NaNs/Infs

* Revert whitespace change

* Improve readability of conditions for using AVX

* dnn(lstm): minor coding style changes, replaced left aligned load

ea7d4be3

28 11月, 2021 3 次提交
- A
  
  dnn(DataLayer): fix CPU/OpenCL code paths for FP16 handling · 58b06222
  由 Alexander Alekhin 提交于 11月 28, 2021
  
  58b06222
- A
  dnn(test): add two_inputs test with FP32/U8 data types · 58dc3979
  由 Alexander Alekhin 提交于 11月 27, 2021
```
- remove similar test from IE scope under HAVE_INF_ENGINE
```
  58dc3979
- Y
  Merge pull request #21107 from take1014:remove_assert_21038 · a6277370
  由 yuki takehara 提交于 11月 28, 2021
```
resolves #21038

* remove C assert

* revert C header

* fix several points in review

* fix test_ds.cpp
```
  a6277370
27 11月, 2021 4 次提交
- A
  
  Merge pull request #21133 from alalek:dnn_test_ie_update_3.4 · b55d8f46
  由 Alexander Alekhin 提交于 11月 26, 2021
  
  b55d8f46
- A
  
  dnn(test): update InferenceEngine tests · 985aa042
  由 Alexander Alekhin 提交于 11月 25, 2021
  
  985aa042
- A
  
  Merge pull request #21131 from cclauss:codespell · c14a8dce
  由 Alexander Alekhin 提交于 11月 26, 2021
  
  c14a8dce
- A
  
  Merge pull request #21130 from cclauss:print-function · f5d45221
  由 Alexander Alekhin 提交于 11月 26, 2021
  
  f5d45221
26 11月, 2021 5 次提交
- C
  
  Fix typos discovered by codespell · ebe4ca6b
  由 Christian Clauss 提交于 11月 26, 2021
  
  ebe4ca6b
- A
  
  Merge pull request #21128 from cclauss:patch-3 · f159ed20
  由 Alexander Alekhin 提交于 11月 26, 2021
  
  f159ed20
- C
  
  Use print() function in both Python 2 and Python 3 · cdbb042c
  由 Christian Clauss 提交于 11月 26, 2021
  
  cdbb042c
- C
  CMakeLists.txt: Fix typo discovered by codespell · 23bbe511
  由 Christian Clauss 提交于 11月 26, 2021
```
https://pypi.org/project/codespell/
```
  23bbe511
- A
  
  Merge pull request #21123 from cclauss:patch-3 · 444218e7
  由 Alexander Alekhin 提交于 11月 25, 2021
  
  444218e7
25 11月, 2021 2 次提交
- C
  Use ==/!= to compare constant literals (str, bytes, int, float, tuple) · 9cc60c9d
  由 Christian Clauss 提交于 11月 25, 2021
```
Avoid `SyntaxWarning` on Python >= 3.8
```
  >>> "convolutional" == "convolutional"
  True
  >>> "convolutional" is "convolutional"
  <stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
  True
```
Related to #21121
```
  9cc60c9d
- A
  
  Merge pull request #21110 from alalek:update_libjpeg-turbo · 2c226d59
  由 Alexander Alekhin 提交于 11月 24, 2021
  
  2c226d59
24 11月, 2021 1 次提交
- A
  3rdparty: libjpeg-turbo 2.1.0 => 2.1.2 · c6ab32ff
  由 Alexander Alekhin 提交于 11月 24, 2021
```
https://github.com/libjpeg-turbo/libjpeg-turbo/releases/tag/2.1.2
```
  c6ab32ff
23 11月, 2021 1 次提交
- A
  
  Merge pull request #21092 from alalek:core_logger_show_timestamp · 101be77d
  由 Alexander Alekhin 提交于 11月 22, 2021
  
  101be77d
20 11月, 2021 2 次提交
- A
  
  core(logger): dump timestamp information with message · 61f1ee2d
  由 Alexander Alekhin 提交于 11月 20, 2021
  
  61f1ee2d
- A
  
  Merge pull request #21063 from vrabaud:3.4_h_clamping · 93b6e80c
  由 Alexander Alekhin 提交于 11月 19, 2021
  
  93b6e80c
19 11月, 2021 5 次提交

Fix H clamping for very small negative values. · d4741eec

由 Vincent Rabaud 提交于 11月 15, 2021

In case of very small negative h (e.g. -1e-40), with the current implementation,
you will go through the first condition and end up with h = 6.f, and will miss
the second condition.

d4741eec

A

Merge pull request #21067 from NickJackolson:nickjackolson/imread-warning · 585484cb
由 Alexander Alekhin 提交于 11月 18, 2021

585484cb

add !empty assertion in seamlessClone() · b696928a

由 nickjackolson 提交于 11月 17, 2021

issue #20617 addresses lack of warnings on
seamlessClone() function when src is None.
This commit adds source check using CV_Assert
therefore debugging would be easier.
Signed-off-by: Nnickjackolson <metedurlu@gmail.com>

b696928a

Add warning message to imread() · 79d4e865

由 nickjackolson 提交于 11月 14, 2021

Add a warning message using CV_LOG__WARNING().
This way api behaviour is preserved. Outputs are
the same but user gets an extra warning in case
fopen() fails to access image file for some reason.
This would help new users and also debugging
complex apps which use imread()
Signed-off-by: Nnickjackolson <metedurlu@gmail.com>

79d4e865

A

Merge pull request #21077 from alalek:js_test_pin_cli_table · 83808798
由 Alexander Alekhin 提交于 11月 18, 2021

83808798

18 11月, 2021 1 次提交
- A
  
  js(test): pin cli-table dependency · de7f8eec
  由 Alexander Alekhin 提交于 11月 18, 2021
  
  de7f8eec
17 11月, 2021 2 次提交
- A
  
  Merge pull request #21064 from alalek:doc_videoio_api_preference_3.4 · 49432009
  由 Alexander Alekhin 提交于 11月 16, 2021
  
  49432009
- A
  
  doc(videoio): fix apiPreference note, replace DSHOW(deprecated)->MSMF · 473f1087
  由 Alexander Alekhin 提交于 11月 16, 2021
  
  473f1087
16 11月, 2021 1 次提交

Merge pull request #17889 from ZhengQiushi:my_3.4 · 3e51448e

由 Qiushi Zheng 提交于 11月 16, 2021

QR code (encoding process)

* add qrcode encoder

* qr encoder fixes

* qr encoder: fix api and realization

* fixed qr encoder, added eci and kanji modes

* trigger CI

* qr encoder constructor fixes
Co-authored-by: NAPrigarina <ann73617@gmail.com>

3e51448e

12 11月, 2021 1 次提交

Merge pull request #21025 from alalek:issue_21004 · 8041ab8a

由 Alexander Alekhin 提交于 11月 12, 2021

* dnn(ocl4dnn): fix LRN layer accuracy problems

- FP16 intermediate computation is not accurate and may provide NaN values

* dnn(test): update tolerance for FP16

8041ab8a

11 11月, 2021 2 次提交
- T
  Merge pull request #21030 from tv3141:fix_seg_fault_houghlinespointset · cb286a66
  由 tv3141 提交于 11月 10, 2021
```
Fix seg fault houghlinespointset

* Clarify parameter doc for HoughLinesPointSet

* Fix seg fault.

* Add regression test.

* Fix latex typo
```
  cb286a66
- A
  
  Merge pull request #20870 from pkubaj:master · f33828a1
  由 Alexander Alekhin 提交于 11月 10, 2021
  
  f33828a1
10 11月, 2021 2 次提交

Add support for runtime CPU feature check on POWER on FreeBSD. · 68e425f8

由 Piotr Kubaj 提交于 10月 13, 2021

1. Code uses PPC_FEATURE_HAS_VSX, but it's not checked similarly to
PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00 for availability. FreeBSD has
those macros in machine/cpu.h, but I went with the way chosen for
PPC_FEATURE2_ARCH_3_00 and PPC_FEATURE2_ARCH_3_00. Other than that, FreeBSD also
has sys/auxv.h and that's where elf_aux_info() is defined.
2. getauxval() is actually Linux-only, but code checked for __unix__. It won't
work on all UNIX, so change it back to __linux__. Add another code variant
strictly for FreeBSD.
3. Update comment. This commit adds code for FreeBSD, but recently there
appeared support for powerpc64 in OpenBSD.

68e425f8

Merge pull request #20904 from Crayon-new:fix_bug_in_maxLayer · 98b6ce35

由 ZaKiiiiiiiii 提交于 11月 10, 2021

fix bug: wrong output dimension when "keep_dims" is false in pooling layer.

* fix bug in max layer

* code align

* delete permute layer and add test case

* add name assert

* check other cases

* remove c++11 features

* style:add "const" remove assert

* style:sanitize file names

98b6ce35

06 11月, 2021 2 次提交
- A
  
  Merge pull request #21005 from nikpappas:bug-samples-falsecolor-trackbar · 1ac7bace
  由 Alexander Alekhin 提交于 11月 06, 2021
  
  1ac7bace
- N
  
  Fix trackbar in falsecolor cpp sample · 968d94d4
  由 Nikolaos Pappas 提交于 11月 03, 2021
  
  968d94d4
05 11月, 2021 2 次提交
- A
  
  Merge pull request #21011 from vrabaud:3.4 · 2ce47fda
  由 Alexander Alekhin 提交于 11月 05, 2021
  
  2ce47fda
- V
  Only use fma functions when CV_FMA3 is set. · ffd01076
  由 Vincent Rabaud 提交于 11月 04, 2021
```
In practice, processors offering AVX2/AVX512 also FMA, that is why it got unnoticed.
```
  ffd01076
04 11月, 2021 2 次提交
- A
  
  Merge pull request #21007 from alalek:cmake_dnn_fix_wrong_tengine_order · edf533c8
  由 Alexander Alekhin 提交于 11月 04, 2021
  
  edf533c8
- A
  
  dnn(cmake): don't hijack OpenCL options with Tengine · c1d61c88
  由 Alexander Alekhin 提交于 11月 04, 2021
  
  c1d61c88

OpenCV / opencv 上一次同步 8 个月

OpenCV / opencv
上一次同步 8 个月