提交 · 606939de76af62afc1d4170b6b2e53e4ba743a74 · Crayon鑫 / Paddle

15 6月, 2021 1 次提交

Support reduce_sum_op float16 (#32966) · 606939de

由 jiangcheng 提交于 6月 15, 2021

* add reduce_sum_op by add self-kernel

* set all ReduceKernel MPType for accuracy

* add float16 test script which input is integer number

* solve reduce sum float16 check_grad problem

* solve conflict and change test script for CI

* change kernel register for CI

* remove all useless template

606939de

28 5月, 2021 1 次提交
- C
  
  modify to complex template types in reduce_sum OP and rewrite it's IdentityFunctor struct (#33164) · 5756d3e5
  由 chentianyu03 提交于 5月 28, 2021
  
  5756d3e5
23 3月, 2021 1 次提交
- Q
  
  [ROCM] fix reduce_sum nan in ROCM platform, test=develop (#31780) · 46dd1d4a
  由 Qi Li 提交于 3月 23, 2021
  
  46dd1d4a
08 3月, 2021 1 次提交
- Q
  
  [ROCM] fix dropout and remove hipcub, test=develop (#31455) · f9377965
  由 Qi Li 提交于 3月 08, 2021
  
  f9377965
01 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid operators for rocm (part2), test=develop (#31211) · 9b016c7c
  由 Qi Li 提交于 3月 01, 2021
  
  9b016c7c
17 4月, 2020 1 次提交
- Z
  OP error message enhancement of l2_normalize, matmul, mean, etc · 361c6ccc
  由 Zhong Hui 提交于 4月 17, 2020
```
* fix error message of l2_normalize, matmul, mean, etc. 
* add the test case for those ops
```
  361c6ccc
05 4月, 2020 1 次提交
- W
  Add the sum op to API 2.0， add some parameters for new api · 6577f91b
  由 wawltor 提交于 4月 05, 2020
```
* Add the sum op to API 2.0, test=develop
* Fix the import meesage in common_ops_import
```
  6577f91b
12 11月, 2019 1 次提交

fix the computation for dx (grad for x) for prelu operation. (#20949) · e249d9a3

由 lilong12 提交于 11月 12, 2019

* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop

e249d9a3

17 10月, 2019 1 次提交

Refine reduce codes to save compiling time and binary size (#20676) · 34e3adae

由 Zeng Jinle 提交于 10月 17, 2019

* refine reduce code to save compiling time and binary sizes, test=develop

* add reduce rank check to avoid bug, test=develop

34e3adae

05 9月, 2019 1 次提交
- T
  paddle::framework::vectorize() templatization (#19627) · d6c85c96
  由 Tao Luo 提交于 9月 05, 2019
```
test=develop
```
  d6c85c96
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

09 10月, 2018 1 次提交
- Q
  Fix bug in reduce_op caused by PR #13534 (#13748) · 6094a723
  由 qingqing01 提交于 10月 09, 2018
```
* Fix bug in reduce_op caused by PR #13534
* Fix output shape and enhance unit test.
test=develop
```
  6094a723
29 9月, 2018 1 次提交

Optimization of Kernels that related to DeepLabv3+ (#13534) · 161c3e31

由 Dun 提交于 9月 29, 2018

* refine reduce by cub
* optimize KernelDepthwiseConvFilterGrad
* optimize depthwise conv and reduce mean and reduce sum
* fix bug: dilation
* cuda arch and cuda 8 compatible

161c3e31

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致