Commits · 5d6a8c7b73312f7d3ee224fc05a775df1cba6239 · PaddlePaddle / Paddle

23 Feb, 2021 13 commits
- A
  added support for fake_quantize_dequantize_abs_max op in quantization… (#30896) · 5d6a8c7b
  Committed by alncat on Feb 23, 2021
```
* added support for fake_quantize_dequantize_abs_max op in quantization inference pass

* remove const_cast to pass ci

* remove compare operator to pass ci-coverage

* added detailed error message for unregistered tensorrt_subgraph_pass
```
  5d6a8c7b
- C
  [CustomOp] Split test and add inference test (#31078) · e60fd1f6
  Committed by Chen Weihang on Feb 23, 2021
```
* split test & add inference test

* add timeout config

* change to setup install

* change to jit compile

* add verbose for test

* fix load setup name repeat

* polish details

* resolve conflict

* fix code format error
```
  e60fd1f6
- J
  
  Update of onednn to 2.2 (#31067) · d3f09ad7
  Committed by Jacek Czaja on Feb 23, 2021
  
  d3f09ad7
- G
  
  merge develop conflict (#31122) · 24ba5ee0
  Committed by Guanghua Yu on Feb 23, 2021
  
  24ba5ee0
- X
  Optimization of Transformer API (#30957) · edacb629
  Committed by xiemoyuan on Feb 23, 2021
```
* Support 'bool' and 'int' for attention mask.

* Update docs.

* Add unittest for Transformer.

* fix bugs.
```
  edacb629
- W
  Save load/save pickle protocol (#31044) · ee1801c1
  Committed by WeiXin on Feb 23, 2021
```
* add default argument for paddle.save/static.save

* edit documentation of

* Add comments for special processing for protocol=2 and protocol=3.

* Update python/paddle/fluid/io.py
Co-authored-by: lanxianghit <47554610+lanxianghit@users.noreply.github.com>
```
  ee1801c1
- Q
  
  [ROCM] update fluid operators for rocm (part1), test=develop (#31077) · cced930b
  Committed by Qi Li on Feb 23, 2021
  
  cced930b
- Y
  fix flops api (#31081) · 99fd9815
  Committed by yukavio on Feb 23, 2021
```
* remove PrettyTable dependence from paddle.flops

* fix bug in python2.7

* fix flops

* fix flops

* fix bug

* fix bug
```
  99fd9815
- W
  fix windows for optimization of elementwise_add Op (#31068) · 364cfa26
  Committed by wangchaochaohu on Feb 23, 2021
```
* fix windows for optimization of elementwise_add Op
```
  364cfa26
- J
  Unification of BF16 enablement process (#31034) · 781df300
  Committed by joanna.wozna.intel on Feb 23, 2021
```
* Unification of bfloat16 enablement process and refactor

* Remove unnecessary function

* Standardize the output name search
```
  781df300
- Z
  fix softmax cross entropy integer overflow (#30590) · 16fe11d7
  Committed by Zhong Hui on Feb 23, 2021
```
[BUG FIX] Fix softmax cross entropy overflow problem.
```
  16fe11d7
- Z
  
  fix UNIX cmake problem (#31113) · 44ee251f
  Committed by Zhou Wei on Feb 23, 2021
  
  44ee251f
- Q
  
  [ROCM] update fluid framework for rocm (part2), test=develop (#31010) · a60d93fb
  Committed by Qi Li on Feb 23, 2021
  
  a60d93fb
22 Feb, 2021 11 commits

T
support save multi sparse table in one path (#31108) · 565354f6
Committed by Thunderbrook on Feb 22, 2021
```
* save multi table one path

* format
```
565354f6
Q

[ROCM] update fluid framework for rocm (part3), test=develop (#31011) · 50967135
Committed by Qi Li on Feb 22, 2021

50967135

[Dy2stat] Refactoring tensor_shape_transformer.py to Fix Change after Assign Bug (#31082) · cf43a321

Committed by Huihuang Zheng on Feb 22, 2021

**Problem**
In the old shape transformer logic, if the user writes:
```
s = tensor.shape
...
y = paddle.some_api(s)
```
Dy2stat will change it to
```
...
y = paddle.some_api(convert_var_shape(tensor))
```
However, this causes a fatal bug if the user changes the shape of `tensor` after the assignment. For example:
```
s = tensor.shape
...
tensor = paddle.some_change_shape_api(tensor)
...
y = paddle.some_api(s)
```
Dy2stat then produces the wrong result, because the code is translated into:
```
tensor = paddle.some_change_shape_api(tensor)
...
y = paddle.some_api(convert_var_shape(tensor))  # tensor's shape has changed; this is no longer the original value of `s`
```

**Solution Logic**

This cannot be solved within the old logic, so I refactored the tensor_shape_transformer logic. Now `s` stores the shape attribute, and a generated variable `s__STATIC_CONVERT_VAR_SHAPE_SUFFIX` stores the result of the static shape API `shape(tensor)`. Given the original code:
```
s = tensor.shape
...
y = paddle.some_api(s)
```
Dy2stat will change it to
```
s = tensor.shape
s__STATIC_CONVERT_VAR_SHAPE_SUFFIX = shape(tensor)
...
y = paddle.some_api(choose_shape_attr_or_api(s, s__STATIC_CONVERT_VAR_SHAPE_SUFFIX))
```
In this case, the translated code is consistent with the original dygraph semantics, which fixes the change-after-assign bug.
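
The difference between the two translations can be illustrated in plain Python, without Paddle. This is a minimal sketch under stated assumptions: `FakeTensor`, `flatten`, and `shape_api` are hypothetical stand-ins for a tensor, a shape-changing API, and the static `shape(tensor)` call. It only shows why capturing the shape at the assignment preserves the original meaning.

```python
class FakeTensor:
    """Hypothetical stand-in for a tensor; it only carries a shape."""

    def __init__(self, shape):
        self.shape = list(shape)


def flatten(t):
    """Hypothetical shape-changing API: collapses all dims into one."""
    n = 1
    for d in t.shape:
        n *= d
    return FakeTensor([n])


def shape_api(t):
    """Stand-in for the static `shape(tensor)` call (an assumption)."""
    return list(t.shape)


def dygraph_semantics():
    tensor = FakeTensor([2, 3])
    s = tensor.shape            # s is bound to the shape at this point
    tensor = flatten(tensor)    # tensor is rebound to a new object
    return s


def old_translation():
    tensor = FakeTensor([2, 3])
    tensor = flatten(tensor)
    return shape_api(tensor)    # shape is read late, after the change


def new_translation():
    tensor = FakeTensor([2, 3])
    s_static = shape_api(tensor)  # shape is captured at the assignment
    tensor = flatten(tensor)
    return s_static
```

Here `old_translation()` reads the shape after `tensor` has been rebound, so it disagrees with `dygraph_semantics()`, while `new_translation()` agrees with it.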

**Code Key Note**

To help reviewers: the key change in this PR is that `self.name_to_var_shape` changes from "mapping a name to its shape node" to "mapping a name to its STATIC_CONVERT_VAR_SHAPE_SUFFIX name". If a variable name has the SUFFIX, we can choose between the shape attribute and the shape API. The other changes follow from this key change.

**Consideration**
The concern with this PR is that we store an extra static `shape` API result. Will this hurt the speed of Dy2stat? In some cases it will, but we argue that the benefit outweighs the cost.

1. The extra call to the static `shape` API happens when the user assigns one shape variable to another. Take the following dygraph code as an example:
```
s1 = tensor.shape
s2 = s1
s3 = s2
...
```
Here the static `shape` API is called again for each assignment; however, users seldom write code like this.

2. If the shape variable is used many times, for example:
```
s = tensor.shape
y1 = paddle.some_api1(s)
y2 = paddle.some_api2(s)
y3 = paddle.some_api3(s)
```
The old logic creates 3 shape API calls, but now there is just 1. This is the more common user code pattern. In fact, the current unit tests in this PR show that the op count decreases after this PR. So we argue that this PR can also improve speed for this code pattern.
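
The op-count argument for the second pattern can be checked with a small counting sketch. It is illustrative only: `ShapeOpCounter`, `old_style`, and `new_style` are hypothetical names, and the built-ins `len`, `sum`, and `max` merely stand in for `some_api1` to `some_api3`.

```python
class ShapeOpCounter:
    """Hypothetical stand-in that counts how often a shape op is created."""

    def __init__(self):
        self.calls = 0

    def shape(self, dims):
        self.calls += 1
        return list(dims)


def old_style(counter, dims, consumers):
    # Each use site of `s` is rewritten into its own shape call.
    return [consume(counter.shape(dims)) for consume in consumers]


def new_style(counter, dims, consumers):
    # The shape call is made once, at the assignment, and then reused.
    s_static = counter.shape(dims)
    return [consume(s_static) for consume in consumers]


consumers = [len, sum, max]  # stand-ins for some_api1..some_api3
old_counter, new_counter = ShapeOpCounter(), ShapeOpCounter()
old_results = old_style(old_counter, [2, 3, 4], consumers)
new_results = new_style(new_counter, [2, 3, 4], consumers)
```

Both styles produce the same results, but the old style creates one shape call per use while the new style creates a single call.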

cf43a321

fix dist fleet ctr ut (#31087) · 0e4b1542

Committed by tangwei12 on Feb 22, 2021

* fix dist fleet ctr ut

Change-Id: I59bf5123c7bd47bd0e8f1ca2a26295257597c0f5

* fix dist fleet ctr ut

Change-Id: Iafcdd172364be47fe67b753774ce09af050bcbce

* Update CMakeLists.txt

0e4b1542

Q

[ROCM] update fluid framework for rocm (part1), test=develop (#31009) · 8fe09faf
Committed by Qi Li on Feb 22, 2021

8fe09faf
Q

[ROCM] update fluid platform for rocm39 (part4), test=develop (#30936) · 33429630
Committed by Qi Li on Feb 22, 2021

33429630
S
update trt int8 calibrator to IEntropyCalibratorV2 (#31060) · a5c56d83
Committed by Shang Zhizhou on Feb 22, 2021
```
* update trt int8 calibrator to IEntropyCalibratorV2

* add delete opt_cache for trt_split_converter_test
```
a5c56d83

[2.0Custom OP]Support New Custom OP on Windows (#31063) · adaec007

Committed by Zhou Wei on Feb 22, 2021

* [2.0.1]Support New Custom OP on windows

* fix CI

* fix code style

* fix CI

* fix CI

* fix coverage

* fix CI

* fix CI

adaec007

C

add optional for param attr args, test=document_fix (#31105) · 2168f08a
Committed by Chen Weihang on Feb 22, 2021

2168f08a

[ROCM] update fluid imperative for rocm (part1), test=develop (#31017) · 1d996637

Committed by Qi Li on Feb 22, 2021

* [ROCM] update fluid imperative for rocm (part1), test=develop

* [ROCM] update reducer.cc after merge, test=develop

* update reducer cmake after merge, test=develop

1d996637

J

fix the bug in backward OP of index_sample. (#31026) · b95eb38b
Committed by JamesLim on Feb 22, 2021

b95eb38b

20 Feb, 2021 12 commits
- C
  Remove PE special profiler (#30886) · 6b3371e0
  Committed by Chengmo on Feb 20, 2021
```
* remove pe special profiler

* add profiler info
```
  6b3371e0
- C
  [CustomOp] Add more dispatch macro for users (#31058) · 6beeafe7
  Committed by Chen Weihang on Feb 20, 2021
```
* add more dispatch macro

* add more dispatch macro

* add more tests

* revert unneeded change

* add timeout for test dispatch

* add float and complex test

* remove and macro
```
  6beeafe7
- T
  add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel... · d5323dab
  Committed by TTerror on Feb 20, 2021
```
add squeeze_op/unsqueeze_op on kunlun;fix conv op and parallel executor;optimize lookup_table op (#31056)

* add squeeze_op/unsqueeze_op on kunlun; fix conv op and parallel executor on kunlun; optimize lookup_table op on kunlun

* update squeeze/unsqueeze op
```
  d5323dab
- 1
  test=develop, save/load, shrink (#30625) · 16b4260b
  Committed by 123malin on Feb 20, 2021
```
* test=develop, save/load, shrink
Co-authored-by: seiriosPlus <tangwei12@baidu.com>
```
  16b4260b
- S
  export paddle.static.normalize_program method. (#31072) · 4424aac6
  Committed by Shibo Tao on Feb 20, 2021
```
* export paddle.static.normalize_program method. test=develop

* fix ut coverage.test=develop
```
  4424aac6
- J
  
  hide useless headers and add complex support (#31074) · 628451af
  Committed by Jiabin Yang on Feb 20, 2021
  
  628451af
- W
  update paddle_fluid.so to paddle_inference.so (#30850) · 463eae03
  Committed by Wilber on Feb 20, 2021
```
* update paddle_fluid.so to paddle_inference.so
```
  463eae03
- T
  change fleet reviewer (#31069) · a2170a08
  Committed by tangwei12 on Feb 20, 2021
```
* change reviewer, test=document

Change-Id: I7592ee5c93bd580300ce39df885b603597b09026

* Update check_file_diff_approvals.sh

test=document_fix
```
  a2170a08
- L
  [static setitem] Support the index is Tensor; step>1; step<0 .(#30949) · 5b367dab
  Committed by liym27 on Feb 20, 2021
```
* [static setitem] support the index step > 1. tensor_a[::3] = value

* [static setitem] support the index step < 0. Eg: tensor_a[::-3] = value

* [static setitem] support the index is Tensor. eg: tensor_a[tensor_3:0:-1] = value

* Add op version.
```
  5b367dab
- Q
  
  [ROCM] update fluid inference for rocm (part1), test=develop (#31018) · eb3050fa
  Committed by Qi Li on Feb 20, 2021
  
  eb3050fa
- J
  
  add detail about states index in rnn result, test=document_fix (#31048) · 6df1ca54
  Committed by Jack Zhou on Feb 20, 2021
  
  6df1ca54
- H
  Fix that convert_var_shape doesn't support slice like [0:], test=develop (#31051) · ef627ac5
  Committed by Huihuang Zheng on Feb 20, 2021
```
As the title says, when a slice_node such as 1:3 is passed as the idx of convert_var_shape, it causes a syntax error, because a function call cannot take a bare slice as an argument. This PR fixes it.
```
  ef627ac5
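
The syntax error described in the last commit above (ef627ac5) can be reproduced in plain Python. The sketch below is an assumption-laden illustration, not the PR's implementation: `convert_var_shape_sketch` is a hypothetical stand-in for `convert_var_shape`, and passing a `slice` object is just one way to express `1:3` as a function argument. The commit message does not say which approach the PR takes.

```python
def convert_var_shape_sketch(shape, idx=None):
    """Hypothetical stand-in for convert_var_shape; not Paddle's code."""
    if idx is None:
        return list(shape)
    return shape[idx]  # idx may be an int or a slice object


def is_syntax_error(source):
    """Return True if `source` does not compile as a Python expression."""
    try:
        compile(source, "<sketch>", "eval")
    except SyntaxError:
        return True
    return False


shape = [8, 3, 224, 224]

# A bare slice is only valid inside subscript brackets, not in a call.
bare_slice_fails = is_syntax_error("convert_var_shape_sketch(shape, 1:3)")

# A slice object carries the same information and is a valid argument.
middle = convert_var_shape_sketch(shape, slice(1, 3))    # like shape[1:3]
rest = convert_var_shape_sketch(shape, slice(0, None))   # like shape[0:]
```

An open-ended slice such as `[0:]` maps to `slice(0, None)`, which is why that form needs no special handling in this sketch.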
19 Feb, 2021 4 commits
- J
  Added reshape grad bf16 (#31035) · f7465641
  Committed by Jacek Czaja on Feb 19, 2021
```
* - added Reshape grad bf16

* - Added reshape grad bf16

* - cosmetics in py
```
  f7465641
- A
  [CustomOp] Refine name argument in setup (#31049) · 4dbe16c4
  Committed by Aurelius84 on Feb 19, 2021
```
* refine setup name usage

* fix unittest failed
```
  4dbe16c4
- A
  
  [CustomOp] Support output dtypes in generated Python API (#31045) · f2dc29a9
  Committed by Aurelius84 on Feb 19, 2021
  
  f2dc29a9
- W
  Modify relu native implementation 2 (#30996) · 615d8a22
  Committed by Wojciech Uss on Feb 18, 2021
```
* Modify relu native implementation

* fix GPU performance
```
  615d8a22

PaddlePaddle / Paddle synced successfully about 1 year ago