提交 · 110febdc541db8dd7e75fc3aeb614dff0fede4b7 · 机器未来 / Paddle

16 11月, 2020 3 次提交

Fix gradients with ignore_idx in softmax_with_cross_entropy (#28622) · 110febdc

由 Guo Sheng 提交于 11月 16, 2020

* Fix gradients with ignore_idx in softmax_with_cross_entropy.
test=develop

* Fix gradients with ignore_idx in softmax_with_cross_entropy on cpu.
Remove softmax_with_cross_entropy from op_threshold_white_list.
test=develop

* Fix test_softmax_cross_entropy_op.py.
test=develop

110febdc

L

Fix cudnn workspace limit in cudnn-8 (#28611) · f962bd34
由 Leo Chen 提交于 11月 16, 2020

f962bd34
L
Register op_version for new attribute use_addto (#28463) · 90805e2d
由 Leo Chen 提交于 11月 16, 2020
```
* register op_version for addto

* upgrade pass capability

* change eq to le

* change eq to le

* fix merge
```
90805e2d

13 11月, 2020 3 次提交
- L
  add send and recv ops (#28590) · ed9dd7c9
  由 lilong12 提交于 11月 13, 2020
```
* update, test=develop
```
  ed9dd7c9
- Z
  register the op version for some ops · a829357e
  由 Zhong Hui 提交于 11月 13, 2020
```
register the op version for some ops
```
  a829357e
- Z
  updata 2.0 API english doc (#28525) · bf6e7cba
  由 Zhou Wei 提交于 11月 13, 2020
```
* make Numpy version is below 1.19.3

* fix 2.0 doc
```
  bf6e7cba
12 11月, 2020 2 次提交
- S
  裁剪transformer模型trt支持；修复tensorRT不支持DeletePass的bug (#28517) · 8699f38d
  由 Shang Zhizhou 提交于 11月 12, 2020
```
* skip_layernorm_op done

* add unittest

* slice op convertor support trt < 6

* skip_layernorm only work in ernie
```
  8699f38d
- J
  add log2 operator (#28319) · 08d24131
  由 joejiong 提交于 11月 12, 2020
```
As the title
```
  08d24131
11 11月, 2020 2 次提交
- W
  
  fix the GetKernelTypeForVar of input for fluid.gather (#28534) · c52fe48f
  由 wangchaochaohu 提交于 11月 11, 2020
  
  c52fe48f
- W
  Checkout point add (#28488) · d7cfee9b
  由 wangchaochaohu 提交于 11月 11, 2020
```
* upgrade pass capability
```
  d7cfee9b
10 11月, 2020 1 次提交
- Z
  
  fix softmax unittest float16 random error (#28480) · 47cbf61d
  由 zhupengyang 提交于 11月 10, 2020
  
  47cbf61d
09 11月, 2020 1 次提交
- W
  
  refine the performance of gather Op (#28458) · e14ed71c
  由 wangchaochaohu 提交于 11月 09, 2020
  
  e14ed71c
08 11月, 2020 1 次提交

exec ut no more than 15s 1 (#28439) · ba075632

由 YUNSHEN XIE 提交于 11月 08, 2020

* disable ut test_parallel_executor_fetch_isolated_var,test=document_fix

* test for limiting ut exec time as 15S

* fix an error caused by cannot find ut

* fix some error

* can not find test_transformer

* fix error caused by ut not run in windows

* fix error caused by Compiler Options

* fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt

* setting timeout value to 120s for old ut

* add the timeout value setting

* fix error caused by ut only run in coverage_ci

* add analyzer_transformer_profile_tester

* fix some error

* fix some error

* fix error with inference option

* fix error with inference option setting as ON_INFER

* add some ut to set timeout

* modified some option

* fix error

* fix some timeout error

* fix error

* fix error

* fix timeout for test_analyzer_bfloat16_resnet50

* fix error

* setting timeout properity for some ut

* first pr for new ut timeout as 15S

ba075632

06 11月, 2020 3 次提交
- T
  
  fix crash in adam in xpu, *test=kunlun (#28433) · fad4744a
  由 taixiurong 提交于 11月 06, 2020
  
  fad4744a
- Q
  fix batch_norm_xpu bug & remove xpusimulator dependence (#28430) · 6bba8e57
  由 QingshuChen 提交于 11月 06, 2020
```
*test=kunlun
```
  6bba8e57
- J
  Add bfloat16 softmax and gelu (#28394) · 7821759d
  由 joanna.wozna.intel 提交于 11月 06, 2020
```
* Add bfloat16 softmax and gelu

* Add pass attr bfloat16_enabled_op_types

* Changes from review
```
  7821759d
05 11月, 2020 2 次提交
- 石
  
  check op_version_registry in CI test, test=develop (#28402) · c41fd033
  由石晓伟提交于 11月 05, 2020
  
  c41fd033
- J
  [oneDNN]Sum bf16 kernel (#28382) · ca415414
  由 Jacek Czaja 提交于 11月 05, 2020
```
* - Added sum bf16 oneDNN

test=develop

* - Fix to UT of sum bf16

test=develop
```
  ca415414
04 11月, 2020 2 次提交

Add broadcast_shape api (#28257) · 8b2436a7

由 Leo Chen 提交于 11月 04, 2020

* add broadcast_shape api

* add ut

* follow comments

* add example code, test=dodument_fix

* update example code, test=document_fix

8b2436a7

石

enhance the op_version_registry, test=develop (#28347) · 21a63f6f

由石晓伟提交于 11月 04, 2020

* enhance the op_version_registry, test=develop

* add unittests, test=develop

* enhance the op_version_registry, test=develop

* fix bugs, test=develop

* revert pybind_boost_headers.h, test=develop

* fix a attribute bug, test=develop

21a63f6f

03 11月, 2020 5 次提交
- S
  TensorRT中ernie模型推理性能优化，支持变长输入 (#28367) · ea851796
  由 Shang Zhizhou 提交于 11月 03, 2020
```
* fp16 result ok

* change -DWITH_NVINFER_PLUGIN toconfig.EnableTensorRtOSS

* auto detect special slice op converter for ernie with trt oss

* ernie oss only support fp16

* fix special_slice_plugin serialize bug

* matmul in tensorrt ok

* ernie unittest ok

* add matmul tensorrt unittest

* remove demo code
```
  ea851796
- J
  
  [oneDNN] sum op refactor (#28318) · 84cc61b2
  由 Jacek Czaja 提交于 11月 03, 2020
  
  84cc61b2
- W
  
  Paddle support compile on sw (#27858) · 09fd2b2a
  由 Wilber 提交于 11月 03, 2020
  
  09fd2b2a
- L
  Pool2d cuda kernel supports fp16 (#28316) · 6115c14f
  由 Leo Chen 提交于 11月 02, 2020
```
* pool2d cuda kernel supports fp16

* fix compile issue of template

* add ut
```
  6115c14f
- G
  Add rnn_op (#28197) · 9a600df3
  由 Guo Sheng 提交于 11月 03, 2020
```
* Add rnn_op.
test=develop

* Fix rnn_op grad maker's drop_empty_grad.
test=develop
```
  9a600df3
02 11月, 2020 1 次提交
- W
  add generate_proposals_v2 op (#28214) · 5262b025
  由 wangguanzhong 提交于 11月 02, 2020
```
* add generate_proposals_v2 op
```
  5262b025
29 10月, 2020 2 次提交
- J
  
  Add bf16 transpose2, reshape2, concat ops (#28195) · 571a63e7
  由 joanna.wozna.intel 提交于 10月 29, 2020
  
  571a63e7
- G
  Enhance multiclass_nms op to support LoD for dygraph mode (#28276) · e8f2614d
  由 Guanghua Yu 提交于 10月 29, 2020
```
* Enhance multiclass_nms to support LoD for dygraph mode

* fix some error in multiclass_nms

* update GetLodFromRoisNum to GetNmsLodFromRoisNum
```
  e8f2614d
28 10月, 2020 5 次提交
- L
  
  Fix transpose in conv cudnn kernel when addto enabled (#28295) · 89530384
  由 Leo Chen 提交于 10月 28, 2020
  
  89530384
- T
  
  fix conv mkldnn build error (#28288) · e1e666a0
  由 Tao Luo 提交于 10月 28, 2020
  
  e1e666a0
- J
  - sum (#28233) · 0b678d40
  由 Jacek Czaja 提交于 10月 28, 2020
```
test=develop
```
  0b678d40
- J
  
  [oneDNN ] conv2d fwd&bwd optimization (#27871) · c11d9b30
  由 Jacek Czaja 提交于 10月 28, 2020
  
  c11d9b30
- W
  update matrix nms op to api 2.0 (#28265) · 41d26a82
  由 wangxinxin08 提交于 10月 28, 2020
```
* update matrix nms op to api 2.0

* modify code according to review
```
  41d26a82
27 10月, 2020 4 次提交
- L
  
  fill_constant op supports NINF (#28270) · 7fcb32dd
  由 Leo Chen 提交于 10月 27, 2020
  
  7fcb32dd
- W
  
  refine yolo box Op for performace optimization (#28155) · 6905608c
  由 wangchaochaohu 提交于 10月 27, 2020
  
  6905608c
- W
  
  refine temporal_shift_op for performance optimization using gpu kernel config (#28114) · cdadc8f0
  由 wangchaochaohu 提交于 10月 27, 2020
  
  cdadc8f0
- Z
  add Fuse bn add act pass (#28196) · fdc06f21
  由 Zhang Ting 提交于 10月 27, 2020
```
* add fuse_bn_add_act pass
```
  fdc06f21
23 10月, 2020 1 次提交

Add compile limit for PADDLE_ENFORCE without error message (#28221) · 2babd6ff

由 Chen Weihang 提交于 10月 23, 2020

* add compile limit for paddle enforce

* polish elementwise_op_function.cu.h

* fix failed unittest

* fix windows compile failed

* detail polish

* revert no type constructor

2babd6ff

22 10月, 2020 2 次提交
- D
  
  fix wrong data type, test=develop (#28203) · 2db77be4
  由 Double_V 提交于 10月 22, 2020
  
  2db77be4
- F
  fix strided_slice_op's GetExpectedKernelType (#28192) · efe6e284
  由 Feiyu Chan 提交于 10月 22, 2020
```
* fix strided_slice_op's GetExpectedKernelType when input tensor is at CUDAPinnedPlace

* add unittest for tensors in cuda pinned place

* skip test for cuda pinned place on cpu machines
```
  efe6e284

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致