Commits · 9adb158e5bcc6ef5ce3abd977bb6b403268d8937 · Crayon鑫 / Paddle

10 Jan, 2019 1 commit

[Feature] support mix precision training for resnet (#14899) · fd854183

Committed by Wu Yi on Jan 10, 2019

* clip softmax for fp16

* updates

* fuse xent support fp16 test=develop

* wip

* wip

* add simple row reduce

* wip fp16 accurate softmax

* add accurate softmax kernel for fp16 test=develop

* update test=develop

* fix cpu build test=develop

* update api.spec test=develop

* follow comments test=develop

* fix build test=develop

* fix trt build test=develop

* fix inference build test=develop

* fix merge test=develop

* update test=develop

* try fix build test=develop

* fix build test=develop

* rename real_exp test=develop

* fortest

* remove hacky kernels test=develop

* clean up test=develop

fd854183
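
The bullets above mention a row reduce and an "accurate" softmax kernel for fp16, but the log does not show the code. Purely as an illustration, the sketch below shows one common way to keep a reduced-precision softmax accurate: subtract the row maximum, then compute the exponentials and their sum in a wider type before casting back. `StableRowSoftmax`, `Storage`, and `Acc` are hypothetical names; standard C++ has no portable half type, so `float` and `double` stand in for fp16 and fp32 here. Nothing in this block is taken from the commit itself.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper, not Paddle's kernel: softmax over a single row.
// Storage is the narrow element type; Acc is the wider type used for the
// max-subtraction, the exponentials, and the running sum.
template <typename Storage, typename Acc = float>
std::vector<Storage> StableRowSoftmax(const std::vector<Storage>& row) {
  std::vector<Storage> out(row.size());
  if (row.empty()) return out;

  // Step 1: row reduce to find the maximum, so every exponent is <= 0.
  Acc row_max = static_cast<Acc>(row[0]);
  for (const Storage& v : row) {
    row_max = std::max(row_max, static_cast<Acc>(v));
  }

  // Step 2: exponentiate the shifted values and sum them in the wide type.
  std::vector<Acc> exps(row.size());
  Acc sum = 0;
  for (std::size_t i = 0; i < row.size(); ++i) {
    exps[i] = std::exp(static_cast<Acc>(row[i]) - row_max);
    sum += exps[i];
  }

  // Step 3: normalize and cast back to the storage type.
  for (std::size_t i = 0; i < row.size(); ++i) {
    out[i] = static_cast<Storage>(exps[i] / sum);
  }
  return out;
}
```

Calling `StableRowSoftmax<float, double>({1000.0f, 1000.0f})` stays finite because the largest exponent is zero, whereas a naive `exp(1000)` would overflow even in double precision.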

18 Nov, 2018 1 commit
- Removing partial specialization of softmax for inference for GPU · 9b0eae30
  Committed by Jacek Czaja on Nov 18, 2018
```
test=develop
```
  9b0eae30
14 Nov, 2018 2 commits
- Softmax for Inference is enabled when ON_INFER is set · b361579f
  Committed by Jacek Czaja on Nov 14, 2018
```
test=develop
```
  b361579f
- Revert "Softmax op optimization for inference " · 5b9c62fa
  Committed by Tao Luo on Nov 14, 2018

  5b9c62fa
09 Nov, 2018 1 commit
- Noise adding removed for Test phase of softmax · c1fccc29
  Committed by Jacek Czaja on Nov 08, 2018

  c1fccc29
15 Mar, 2018 1 commit

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

Committed by dzhwinter on Mar 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision"

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018

  24509f4a
10 Feb, 2018 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018

  fc374821
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018

  90648f33
12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

The main fixes are as follows:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95
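
The first item in the list above, parameterizing math functors on `DeviceContext` rather than `Place`, can be pictured with a small sketch. The types below are hypothetical stand-ins written for this example, not Paddle's real classes; `ScaleFunctor` and `Name()` are made up, and a real CUDA specialization would launch a kernel rather than loop on the host.

```cpp
#include <string>

// Hypothetical stand-ins for illustration only; not Paddle's real classes.
struct CPUDeviceContext {
  std::string Name() const { return "cpu"; }
};
struct CUDADeviceContext {
  std::string Name() const { return "cuda"; }
};

// The functor is parameterized on the device context type instead of a
// Place type, so the caller hands over the context object itself and a
// specialization could reach context-owned resources directly.
template <typename DeviceContext, typename T>
struct ScaleFunctor {
  void operator()(const DeviceContext& ctx, T factor, T* data, int n) const {
    (void)ctx;  // Unused in this host-only sketch.
    for (int i = 0; i < n; ++i) {
      data[i] *= factor;
    }
  }
};
```

A caller would then write something like `ScaleFunctor<CPUDeviceContext, float>()(ctx, 2.0f, data, n)`, choosing the implementation by context type.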

13 Nov, 2017 1 commit
- Fix compiling for softmax_with_cross_entropy_op. · 91d4fc69
  Committed by dangqingqing on Nov 13, 2017

  91d4fc69
29 Sep, 2017 1 commit
- refine SoftmaxFunctor · 84ff7e97
  Committed by qijun on Sep 28, 2017

  84ff7e97
28 Sep, 2017 1 commit
- Add SoftmaxGradFunctor, and use SoftmaxGradFunctor in softmax_op instead. · 05ed8ee8
  Committed by Liu Yiqun on Sep 28, 2017

  05ed8ee8
26 Sep, 2017 2 commits
- add negative clipping for softmax. · 3d77360b
  Committed by caoying03 on Sep 26, 2017

  3d77360b
- fix implementations of supporting soft labels. · 8b8ad6b1
  Committed by caoying03 on Sep 25, 2017

  8b8ad6b1
22 Sep, 2017 1 commit
- support soft labels. · f1d5fb3b
  Committed by caoying03 on Sep 21, 2017

  f1d5fb3b
13 Sep, 2017 1 commit
- softmax as functor. · c6366c81
  Committed by caoying03 on Sep 12, 2017

  c6366c81
12 Sep, 2017 1 commit
- softmax as function. · c0cef849
  Committed by caoying03 on Sep 12, 2017

  c0cef849
04 Aug, 2017 1 commit
- Add cpplint for *.h and cuda *.cu · b58725bd
  Committed by liaogang on Aug 04, 2017

  b58725bd
03 Aug, 2017 1 commit
- Simplify building process of gradient operator · ab18947e
  Committed by fengjiayi on Aug 02, 2017

  ab18947e
26 Jul, 2017 1 commit
- Refining Unittest · 831d4e1c
  Committed by Yu Yang on Jul 26, 2017

  831d4e1c
24 Jul, 2017 1 commit

Change gradient Op registry mechanism · 77af58f8

Committed by fengjiayi on Jul 24, 2017

OLD: op_type -> grad_op_creator

NEW: grad_op_type -> grad_op_creator
     op_type -> grad_op_type

77af58f8
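
The OLD/NEW mapping in this commit message can be read as replacing a single lookup with two: a forward op type first resolves to its gradient op type, and the gradient op type then resolves to a creator. The sketch below is one hypothetical way to model that with two maps. `GradOpRegistry`, `Op`, `OpCreator`, `Register`, and `CreateGradOp` are illustrative names, not the identifiers used in the commit.

```cpp
#include <functional>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Hypothetical minimal operator type for the example.
struct Op {
  std::string type;
};

using OpCreator = std::function<std::unique_ptr<Op>()>;

// Two-level registry mirroring the NEW scheme in the commit message:
//   op_type      -> grad_op_type
//   grad_op_type -> grad_op_creator
class GradOpRegistry {
 public:
  void Register(const std::string& op_type, const std::string& grad_op_type,
                OpCreator creator) {
    grad_op_type_of_[op_type] = grad_op_type;
    creators_[grad_op_type] = std::move(creator);
  }

  // Resolves the forward op type to its gradient op type, then builds the
  // gradient op. std::map::at throws std::out_of_range for unknown keys.
  std::unique_ptr<Op> CreateGradOp(const std::string& op_type) const {
    const std::string& grad_op_type = grad_op_type_of_.at(op_type);
    return creators_.at(grad_op_type)();
  }

 private:
  std::map<std::string, std::string> grad_op_type_of_;
  std::map<std::string, OpCreator> creators_;
};
```

One consequence of this layout is that a gradient op has its own type name, so it can be looked up and created like any other op without going through the forward op.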

20 Jul, 2017 3 commits
- Fix compile errors · 9418717f
  Committed by fengjiayi on Jul 20, 2017

  9418717f
- Fix some compile errors · 8a5ee462
  Committed by fengjiayi on Jul 20, 2017

  8a5ee462
- Refactor the implementation of gradient Op creating · e192d0fd
  Committed by fengjiayi on Jul 20, 2017

  e192d0fd

Crayon鑫 / Paddle is up to date with its fork source project