提交 · 676995c86cb4b49f9a41c7a32c5e054b16201753 · 机器未来 / Paddle

22 2月, 2019 1 次提交

Optimze Gelu with MKL Erf function (#15770) · 676995c8

由 Yihua Xu 提交于 2月 22, 2019

* Optimize for gelu operator

* Set up the low accuracy mode of MKL ERF function.

test=develop

* Only enable MKLML ERF when OS is linux

* Use the speical mklml version included vmsErf function to verify gelu mkl kernel.

test=develop

* Add the CUDA macro to avoid NVCC's compile issue.

test=develop

* Add the TODO comments for mklml library modification.

test=develop

* Clean Code

test=develop

* Add the comment of marco for NVCC compiler.

test=develop

676995c8

12 12月, 2018 1 次提交
- Y
  Fix the gelu backward to avoid nan (#14857) · 6951ef9a
  由 Yibing Liu 提交于 12月 12, 2018
```
* Fix the gelu backward to avoid nan

test=develop

* Remove unnecessary calls

test=develop
```
  6951ef9a
05 12月, 2018 1 次提交

Fix clip.py (#14718) · 04539d4c

由 chengduo 提交于 12月 05, 2018

* expose square
test=develop

* fix activation
test=develop

* Add square API
test=develop

* add necessary op

* code refine

* fix API.spec
test=develop

* fix unit test
test=develop

* add unit test sparse_grad_clip
test=develop

* fix API.spec
test=develop

* remove mac test for test_gradient_clip
test=develop

* remove selectedrows_mul_tensor
test=develop

04539d4c

27 11月, 2018 1 次提交
- C
  
  Add activation gelu (#14569) · 6c71c1f8
  由 Clementine 提交于 11月 27, 2018
  
  6c71c1f8
26 11月, 2018 1 次提交
- M
  Revert the changes of VLOG · 53433d7f
  由 minqiyang 提交于 11月 26, 2018
```
test=develop
```
  53433d7f
08 11月, 2018 1 次提交
- M
  Change the origin VLOG level to 10 times · 0c3227a5
  由 minqiyang 提交于 11月 08, 2018
```
Fix code to support cpplint syntax check

test=develop
```
  0c3227a5
07 11月, 2018 1 次提交

Add fp16 backward support (#14202) · a9b5d42d

由 chengduo 提交于 11月 07, 2018

* add fp16 backward support
test=develop

* add sum_op fp16 test

* disable test_dist_save_load
test=develop

* add check_grad for sum

* add unit test for softmax_grad fp16
test=develop

* add scale_op unit test

* add mul_grad_op unit test for fp16

* add cross_entropy_grad and eman_grad unit test for fp16
test=develop

* fix cross_entropy unit test

* add pool2d fp16 unit test

* refine conv2d fp16 unit test
test=develop

* refine activation unit test
test=develop

* fix ci
test=develop

* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop

a9b5d42d

03 9月, 2018 1 次提交
- D
  
  fix windows compile (#13147) · e722f683
  由 dzhwinter 提交于 9月 03, 2018
  
  e722f683
25 8月, 2018 1 次提交
- D
  
  more platform is done · d7f98f37
  由 dzhwinter 提交于 8月 25, 2018
  
  d7f98f37
17 8月, 2018 1 次提交
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 1 次提交

"cherry picked operators changes" (#12184) · bf3c3496

由 dzhwinter 提交于 8月 16, 2018

* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"

bf3c3496

25 6月, 2018 1 次提交
- S
  Revert "refine ZeroGradFunctor in activation_op.h" · 748e204e
  由 sneaxiy 提交于 6月 25, 2018
```
This reverts commit 1eeb11ef.
```
  748e204e
12 6月, 2018 1 次提交
- S
  
  refine ZeroGradFunctor in activation_op.h · 1eeb11ef
  由 sneaxiy 提交于 6月 12, 2018
  
  1eeb11ef
16 4月, 2018 1 次提交
- D
  
  "move to a new PR" · e54f203c
  由 dzhwinter 提交于 4月 15, 2018
  
  e54f203c
10 4月, 2018 1 次提交
- K
  
  add fp16 support to activation op (#9769) · 0f38bb45
  由 Kexin Zhao 提交于 4月 09, 2018
  
  0f38bb45
09 4月, 2018 1 次提交
- A
  
  Fix cpplint issues in some operators · 2e2726f1
  由 Abhinav Arora 提交于 4月 08, 2018
  
  2e2726f1
29 3月, 2018 1 次提交
- C
  
  add sin · bdda08d9
  由 chengduoZH 提交于 3月 28, 2018
  
  bdda08d9
28 3月, 2018 1 次提交
- C
  
  add cos · 2e577379
  由 chengduoZH 提交于 3月 28, 2018
  
  2e577379
23 3月, 2018 3 次提交
- K
  
  Fixed tests · d8bd436f
  由 Krzysztof Binias 提交于 3月 21, 2018
  
  d8bd436f
- K
  
  Correcting for PR comments · a64b312e
  由 Krzysztof Binias 提交于 3月 20, 2018
  
  a64b312e
- K
  
  MKLDNN Relu Tanh Sqrt Abs activations added · 4466f0be
  由 Krzysztof Binias 提交于 3月 14, 2018
  
  4466f0be
21 3月, 2018 1 次提交
- K
  
  inital commit · d60180af
  由 Kexin Zhao 提交于 3月 20, 2018
  
  d60180af
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
29 1月, 2018 1 次提交
- Q
  
  fix floor_op (#7926) · 59357f4f
  由 Qiao Longfei 提交于 1月 29, 2018
  
  59357f4f
03 1月, 2018 1 次提交
- Y
  
  Update · 5a4367bb
  由 Yang Yu 提交于 1月 03, 2018
  
  5a4367bb
26 12月, 2017 2 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
- F
  
  change activations · e0be63bf
  由 fengjiayi 提交于 12月 26, 2017
  
  e0be63bf
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

07 12月, 2017 1 次提交
- A
  
  Swish activation operator (#6358) · 113c026d
  由 Abhinav Arora 提交于 12月 07, 2017
  
  113c026d
26 11月, 2017 1 次提交

"add floor, ceil, round op" (#5898) · 513b1e01

由 dzhwinter 提交于 11月 26, 2017

* "add floor, ceil, round op"

* "reuse zero gradient"

* "fix divide zero"

* "fix numpy floor error"

513b1e01

03 11月, 2017 1 次提交
- K
  
  small fix · 81ba077e
  由 Kexin Zhao 提交于 11月 02, 2017
  
  81ba077e
31 10月, 2017 1 次提交

refine square_error_cost layer (#5216) · 669786bf

由 QI JUN 提交于 10月 30, 2017

* reimplement pow operator

* add pow_grad operator

* fix code style

* fix build error

* fix op_test bug

* revert pow operator

* add FIXME comment

669786bf

27 10月, 2017 1 次提交

Gradient check use graph (#5027) · be00b0c4

由 Yu Yang 提交于 10月 26, 2017

* Simplize Gradient Check

* Stash

* Extract apply_backward_pass to backward.py

Rename apply_backward_pass to append_backward_ops

* Use graph API to check gradient

* Fix ci

* Fix CI

* Fix backward for double precision

* Stash

* Fix CI

* Fix ci

* Ignore GRU test

* Ignore xe op

* Fix CI

* Fix softmax with xe gradient

The correct equation should be IG = OG * (d_softmax_with_xe())

* Fix typo

* Fix merge error

* Disable LRN

be00b0c4

13 10月, 2017 1 次提交

Adding Hard Sigmoid Activation (#4771) · 3b954e1d

由 Abhinav Arora 提交于 10月 12, 2017

* Adding Hard Sigmoid Activation

* Adding a comment for slope to be only positive

* Fixing grammatical mistake in comment

3b954e1d

12 10月, 2017 1 次提交
- A
  Adding the Thresholded Relu Op (#4685) · b504a234
  由 Abhinav Arora 提交于 10月 11, 2017
```
* Adding thresholded_relu op
* Adding test for thresholded relu op
```
  b504a234
11 10月, 2017 3 次提交
- K
  Implementing Softplus operator (#4690) · 9995aed1
  由 kexinzhao 提交于 10月 10, 2017
```
* implementing softplus

* small fix

* small fix

* small fix

* small fix
```
  9995aed1
- K
  Implemented the hardShrink activation (#4653) · 1397e17f
  由 kavyasrinet 提交于 10月 10, 2017
```
* Implemented the hardShrink activation

* Fixing the unit test
```
  1397e17f
- S
  Add logsigmoid (numerically stable) and softshrink (#4663) · 6604d7cd
  由 Siddharth Goyal 提交于 10月 10, 2017
```
* Add numerically-stable logsigmoid activation

* Add softshrink operator

* Adjust relative tolerance for grad-check

* Address review comments
```
  6604d7cd

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致