提交 · 381410369bb0183d90609983ef1b51fc9e33bd77 · PaddlePaddle / Paddle

18 11月, 2021 6 次提交

L
polish unittest of test_pretrained_model (#37307) · 38141036
由 LielinJiang 提交于 11月 18, 2021
```
* fix cache

* Fix unittest
```
38141036
L
Fix the slow running speed of kl_div when option 'reduction' is set (#37283) · a6e9ff85
由 LielinJiang 提交于 11月 18, 2021
```
* Fix the slow running speed of kl_div when option reduction is set

* fix unittest coverage
```
a6e9ff85
L

Fix the issue of disordered loading cifar data (#37272) · 99909520
由 LielinJiang 提交于 11月 18, 2021

99909520
T
add benchmark ci(#37295) · 6a813d83
由 tianshuo78520a 提交于 11月 18, 2021
```
* add benchmark
```
6a813d83

Add the `GetFetchNames` method in CinnGraphSymbolization. (#37218) · 3ad495e8

由 Zhen Wang 提交于 11月 18, 2021

* Add the `GetFetchNames` method in CinnGraphSymbolization.

* Use unordered_set instead vector as the type of fetch_var_names.

* Reuse the definition of kCompilationKey.

* Use CompileOptions to set fetch_var_ids.

* Update the argument passing of GraphCompiler.Build.

* Fix some bugs in CinnGraphSymbolization::GetFetchIds.

3ad495e8

Opt topk (#37256) · c4862d99

由 zhangkaihuo 提交于 11月 18, 2021

topk中有cub和手写kernel两种实现，而cub是通过排序来获取topk，通过多组数据发现只有当input_width>=128且k超过input_width 75%的时候性能会比手写的更好。

c4862d99

17 11月, 2021 20 次提交

Replace custom IOHW -> OIHW reorder with build-in oneDNN reorder (#37175) · 162ac048

由 Sławomir Siwek 提交于 11月 17, 2021

* Use oneDNN reorder instead of custom one

* Fix whitespace typo

* Fix Code format error

* Incorporating feedback

* Remove unncessary reorder

* Support GIOHW format

* Fix code format error

162ac048

L
[new-exec] Refine standalone executor (#37278) · 6d6642c8
由 Leo Chen 提交于 11月 17, 2021
```
* init

* add feed ops in python side

* import LRScheduler

* update_feed

* refine code format
```
6d6642c8

Changed first batch of deprecated mkldnn headers and function names to new oneDNN names (#37040) · ce3ee9bb

由 piotrekobiIntel 提交于 11月 17, 2021

* Change first batch of mkldnn headers and namespace names to dnnl

* Revert changes to tensor.h, which require approval

* Format changes with pre-commit

* Add int32 tests

* Fix int32 tests and call GetDataFromTensor for int32

* Fix test

ce3ee9bb

N
Modify reduce_op.op.h for xpu2 with kernel primitive api (#36904) · 9c5d5665
由 niuliling123 提交于 11月 17, 2021
```
* Modify reduce_op.op.h for xpu2 with kernel primitive api
```
9c5d5665

Upgrade oneDNN to v2.4.4 (#36226) · d08753df

由 piotrekobiIntel 提交于 11月 17, 2021

* upgrade oneDNN to v2.4-rc

* Removed failing test

* Revert "Removed failing test"

This reverts commit 60e70e717fac2c86b7beb24dfa1343a5804ea455.

* Remove most tests for debugging purposes

* Update hash to oneDNN 2.4

* Revert test change

* Update oneDNN to 2.4.2

* Update oneDNN to 2.4.3

* Change oneDNN version to 2.3 for Jenkins test

* Revert "Change oneDNN version to 2.3 for Jenkins test"

This reverts commit 0b176defc3b63f65dd0ba85873a018534f287000.

* Update oneDNN to 2.4.4

* Change version of oneDNN to 2.3 for new Jenkins test

* Revert "Change version of oneDNN to 2.3 for new Jenkins test"

This reverts commit e005a0f78f2b41cdcf4d7de3a21df7f910b78268.

d08753df

A

Fix data transform bug in new executor (#37280) · 1460b761
由 Aurelius84 提交于 11月 17, 2021

1460b761
石

change the meta modification rules, test=develop (#37255) · 8c44ad47
由石晓伟提交于 11月 17, 2021

8c44ad47
S

avoid zero division problem (#37284) · ba1e0dd8
由 Sing_chan 提交于 11月 17, 2021

ba1e0dd8
Y
remove test_hapi_hub from mac (#37246) · 73203758
由 YUNSHEN XIE 提交于 11月 17, 2021
```
* remove test_hapi_hub from mac

* fix format error
```
73203758
C
[PTen] Add slice api implemention for Tensor (#37276) · 3328eb03
由 Chen Weihang 提交于 11月 17, 2021
```
* add slice api impl of Tensor

* fix test slice error
```
3328eb03
Z

update dataset (#37194) · ca8c4f3e
由 zhaocaibei123 提交于 11月 17, 2021

ca8c4f3e

[heterps]Refactor heterogenous worker (#37244) · 54d2626a

由 zmx 提交于 11月 17, 2021

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* refactor heter trainer. test=develop

* fix. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix. test=develop

* fix ut. test=develop

* fix ut. test=develop

* fix ut. test=develop

54d2626a

D

fix compile error when pslib use cpu branch;test=develop (#37248) · 0057c12d
由 danleifeng 提交于 11月 17, 2021

0057c12d
Z

add ut parallel (#37211) · 1223238f
由 zhangchunle 提交于 11月 17, 2021

1223238f
L
copy beta pow to same place when skip_update=1 (#37245) · 5e4b419b
由 Leo Chen 提交于 11月 17, 2021
```
* copy beta pow to same place when skip_update=1

* fix xpu
```
5e4b419b
Z

rename TensorBase interface data_type() to dtype() (#37257) · 1e9b3a3d
由 zyfncg 提交于 11月 17, 2021

1e9b3a3d
L

[Fleet Executor] Construct runtime graph (#37158) · 0daa69d4
由 LiYuRio 提交于 11月 17, 2021

0daa69d4
W

[npu][hybrid] support offload (#37224) · 762819a8
由 WangXi 提交于 11月 17, 2021

762819a8
T
[Einsum] correct output dimension errors. (#37222) · 5237cc05
由 Tongxin Bai 提交于 11月 17, 2021
```
* [Einsum] correct output dimension errors due to single element tensors.

* [Einsum] format polish.
```
5237cc05

Dependence analysis (#37231) · d943459b

由 xiongkun 提交于 11月 17, 2021

* add

* add BuildOperatorDependences

* fix bug

* add unittest for write after write

* fix merge bug

* fix

d943459b

16 11月, 2021 14 次提交
- C
  
  decrease pten log level (#37239) · d8982c52
  由 Chen Weihang 提交于 11月 16, 2021
  
  d8982c52
- A
  Added BF16 Pool2d grad (#37081) · f95d44a2
  由 arlesniak 提交于 11月 16, 2021
```
* Added BF16 Pool2d grad

* upstream pulled

* fix for CI

* fixes after review
```
  f95d44a2
- D
  
  [psgpu]fix pipe bug:save and pull overlap; test=develop (#37233) · 62ec644f
  由 danleifeng 提交于 11月 16, 2021
  
  62ec644f
- W
  
  Fix the logic of VarBase _to func (#37193) · f29a3c68
  由 Weilong Wu 提交于 11月 16, 2021
  
  f29a3c68
- Z
  
  refine pass by removing CommOpt, CalcOpt, ParallelOpt (#37206) · 4c160be2
  由 Zeng Jinle 提交于 11月 16, 2021
  
  4c160be2
- W
  
  Removed unnecessary ENFORCE statement (#37219) · 70b7c7ed
  由 Weilong Wu 提交于 11月 16, 2021
  
  70b7c7ed
- Y
  Add API and unit test for reshape (#37232) · 79b49c20
  由 YuanRisheng 提交于 11月 16, 2021
```
* reshape kernel refactor

* fix compile bugs when run ci

* support xpu for reshape

* fix bugs when run unittest in kunlun ci

* fix compile bugs when run kunlun

* perfect code according to suggestion

* add api and unit test for reshape
```
  79b49c20
- Z
  for pure fp16 (#37230) · 6ebc318e
  由 zhangkaihuo 提交于 11月 16, 2021
```
Add pure fp16 support for fused transformer.
```
  6ebc318e
- T
  
  test=document_fix (#37234) · 56810f45
  由 tianshuo78520a 提交于 11月 16, 2021
  
  56810f45
- Z
  Make Distributed Pass UT Timeout Smaller (#37199) · a01e27cc
  由 Zeng Jinle 提交于 11月 16, 2021
```
* make pass ut timeout smaller

* increate ut timeout
```
  a01e27cc
- Y
  Make FLAGS_determinstic effective in conv2d forward. (#37173) · ea47d211
  由 Yiqun Liu 提交于 11月 16, 2021
```
* Make FLAGS_determinstic effective in conv2d forward.

* Add call of SetCinnCudnnDeterministic in cinn_launch op.
```
  ea47d211
- S
  
  modify long time ut list (#37220) · 5091fed7
  由 Sing_chan 提交于 11月 16, 2021
  
  5091fed7
- J
  
  added onednn elu kernel (#37149) · ae40ee32
  由 jakpiase 提交于 11月 16, 2021
  
  ae40ee32
- L
  Fix attn_bias_add bug. (#37147) · a9e7a854
  由 Li Min 提交于 11月 16, 2021
```
fused_attention_op的实现中，使用了bias_add，且其实现是通过使用kernel primitive来实现的，之后kernel primitive的WriteData api接口及函数内部实现发生了更改，将判断越界的逻辑移到了template的参数中，使得调用的分支有错误，产生了越界赋值操作，污染了别的显存空间的内容。具体表现为：test_fused_attention_op_api.py 单次执行基本上不会报错，多次循环执行不同shape的输入，结果计算不对，具有偶发性，bug不易察觉。
```
  a9e7a854

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功