Commits · 9aa89b99e1c13f4d5a82dda60a9eb4de242a3388 · BaiXuePrincess / Paddle

21 Jun, 2022 2 commits
- [MLU] add deformable_conv kernel (#43630) · 827d9992
  Committed by fwenguang on Jun 21, 2022
- [MLU] add mlu kernel for elementwise_max_grad (#43608) · f586110d
  Committed by cambriconhsq on Jun 21, 2022
```
* [MLU] add mlu kernel for elementwise_max_grad

* [MLU] modify mlu kernel elementwise_min_grad impl
```
20 Jun, 2022 5 commits
- Add passes and plugins for distributed inference of NLU (#43049) · 007f3614
  Committed by whs on Jun 20, 2022
- add cross_op cuda kernel (#43558) · ec3e0a13
  Committed by zhangbopd on Jun 20, 2022
- Fix for oneDNN layernorm for begin_norm_axis != last_dim (#43476) · b6bc6f7a
  Committed by jakpiase on Jun 20, 2022
```
* fix for layer_norm

* minor fix
```
- Revert "Revert md-in-tensor refactoring (#43564)" (#43598) · 8727bb7c
  Committed by lidanqing on Jun 20, 2022
```
This reverts commit 1ec626b1.
```
- Support more dimensions in MMHA (#43612) · 03f9e598
  Committed by Zhang Zheng on Jun 20, 2022
```
* support more dimensions

* fix
```
18 Jun, 2022 1 commit
- remove unuse cuSparse function (#43626) · 4a08c781
  Committed by zhouweiwei2014 on Jun 18, 2022
17 Jun, 2022 9 commits
- [MLU] add truncated_gaussian_random kernel. (#43575) · 5a5649c2
  Committed by Chenxiao Niu on Jun 17, 2022
- Support optional residual add in fused_attention and fused_feedforward. (#43474) · 19e866f9
  Committed by Yiqun Liu on Jun 17, 2022
```
* Support optional residual add in fused_attention and fused_feedforward.

* Add checkpoint and add the check of add_residual when pre_layer_norm is false.

* Add TODO and change the python api to add add_residual argument.
```
- [MLU]add mlu kernel for where op (#43441) · 2540b023
  Committed by fuyou765 on Jun 17, 2022
- [MLU]add mlu kernel for tile op (#43389) · 539a9e60
  Committed by fuyou765 on Jun 17, 2022
- [MLU]add mlu kernel for expand_v2 op (#43353) · 6a179e48
  Committed by fuyou765 on Jun 17, 2022
- [MLU] add mlu kernel for iou_similarity (#43503) · f3a09de4
  Committed by cambriconhsq on Jun 17, 2022
- [MLU]add elementwise op (#43491) · 74cc73bb
  Committed by qipengh on Jun 17, 2022
- [MLU]: add shape kernel (#43347) · feebbe15
  Committed by zhaoying9105 on Jun 17, 2022
```
* [MLU]: add shape kernel

* [MLU]: set output from cpu to mlu in shape kernel
```
- Fix matrix rank name error (#43584) · 6f1d2483
  Committed by WangZhen on Jun 17, 2022
16 Jun, 2022 3 commits
- Revert md-in-tensor refactoring (#43564) · 1ec626b1
  Committed by joanna.wozna.intel on Jun 16, 2022
- fix for quant model (#43567) · 13ad8bde
  Committed by jakpiase on Jun 16, 2022
- remove fp16 support of depthwise_conv2d and add unittest for depthwise_conv2d, test=kunlun (#43483) · 6be3ee26
  Committed by zhangyikun02 on Jun 16, 2022
15 Jun, 2022 5 commits
- Optimize prod's python implementation for dygraph. (#43309) · 9b7126d0
  Committed by Yiqun Liu on Jun 15, 2022
```
* Optimize prod's python implementation for dygraph.

* Change key_dim to head_dim.

* Add comment in unittest.

* Disable TF32 in unittest.
```
- [MLU] add size kernel for mlu (#43450) · 4d0ca02b
  Committed by fwenguang on Jun 15, 2022
- [MLU] add bce kernel for mlu (#43467) · 1dfa2d49
  Committed by fwenguang on Jun 15, 2022
- [MLU] add bce kernel (#43435) · 669d8689
  Committed by fwenguang on Jun 15, 2022
- Refactor dynload/port.h (#43431) · 332fdd1e
  Committed by Ruibiao Chen on Jun 15, 2022
```
* Refactor port.h

* Remove some unnecessary code

* Fix CI errors
```
14 Jun, 2022 10 commits
- [MLU] add mlu kernel for depthwise conv2d op (#43359) · 077f3788
  Committed by cambriconhsq on Jun 14, 2022
- [MLU]: add elementwise_max mlu kernel (#43365) · ceb6b3f1
  Committed by zhaoying9105 on Jun 14, 2022
```
* [MLU]: add elementwise_max mlu kernel

* [MLU]: add int32 support for elementwise maxk MLU kernel
```
- [MLU]: add log log10 log2 MLU kernel (#43360) · 4642e8c4
  Committed by zhaoying9105 on Jun 14, 2022
- fix update loss scaling (#43487) · 0e6462d6
  Committed by sneaxiy on Jun 14, 2022
- [cuda graph] partial program with cuda graph under static mode (#43440) · d83d59dd
  Committed by Yuang Liu on Jun 14, 2022
- fix compiling werror (#43337) · c6421019
  Committed by Zhang Jun on Jun 14, 2022
- [ Make FLAGS_einsum_opt as default ] Einsum memory optimization (#43397) · 83abec60
  Committed by xiongkun on Jun 14, 2022
```
* change logic for optimize

* modifty

* optimize the backward speed of EinsumOp

* add cache optimizer for einsum op

* EinsumOp: fix new dygraph mode error

* fix bug

* change Cache->InnerCache

* fix code

* fix

* add nan inf utils for einsum op

* add as_extra

* memory optimizer for einsum

* update code
```
- 【code format check upgrade】 step3：enable clang-format sort these infrt files's headers (#43333) · 403b127b
  Committed by Sing_chan on Jun 14, 2022
- fix cmake-lint problems. (#43406) · 59f89236
  Committed by Wilber on Jun 14, 2022
```
* cmake-lint

* update
```
- fix bug of infer shape for slice (#43443) · e0a01461
  Committed by zyfncg on Jun 14, 2022
13 Jun, 2022 4 commits
- [MLU]add lookup_table_v2 op and fix amp feature of bert with mlu device (#43366) · 67bd5d9c
  Committed by qipengh on Jun 13, 2022
- add mlu interp_v2(nearest&bilinear). (#43383) · affe25b7
  Committed by Chenxiao Niu on Jun 13, 2022
- Disable oneDNN adaptive pooling exhaustive check (#43236) · 4af7ebf4
  Committed by piotrekobi on Jun 13, 2022
- Fix cmakelint errors for some files (#43428) · edf69ae0
  Committed by Ruibiao Chen on Jun 13, 2022
12 Jun, 2022 1 commit
- Fix the bug of slice op and optimize the code style of generate_proposals_v2... · 2d96801a
  Committed by Leo Guo on Jun 12, 2022
```
Fix the bug of slice op and optimize the code style of generate_proposals_v2 op for kunlun. *test=kunlun (#43380)
```
