提交 · 82d31503643d5a01bab87f1a916498740258725b · PaddlePaddle / Paddle

06 5月, 2019 1 次提交

Add use_cuda to inplace pass (#17205) · ee2028a1

由 Zeng Jinle 提交于 5月 05, 2019

* add use_cuda to inplace pass,test=develop

* add test softmax_with_xe_inplace test,test=develop

ee2028a1

21 4月, 2019 1 次提交

Refine model gpu memory (#16993) · 1202d3fc

由 Zeng Jinle 提交于 4月 21, 2019

* speedup gc and inplace softmax_with_cross_entropy_grad
test=develop

* refine models gpu mem
Merge skip vars and warning messages of mem opt
remove relu mem opt
test=develop

* follow comments
test=develop

1202d3fc

11 4月, 2019 1 次提交
- P
  softmax corss entropy support high rank · bbfc82cc
  由 phlrain 提交于 4月 11, 2019
```
test=develop
```
  bbfc82cc
03 4月, 2019 1 次提交
- M
  
  Polish code · 61fe139f
  由 minqiyang 提交于 4月 03, 2019
  
  61fe139f
02 4月, 2019 1 次提交
- M
  Add UT for most layers without params · e377d759
  由 minqiyang 提交于 4月 02, 2019
```
test=develop
```
  e377d759
19 3月, 2019 1 次提交
- Z
  add allocator flags · 22715487
  由 zhhsplendid 提交于 3月 19, 2019
```
test=develop
```
  22715487
17 3月, 2019 1 次提交
- C
  Fix cross_entropy bug (#16236) · efca4de7
  由 chengduo 提交于 3月 17, 2019
```
test=develop
```
  efca4de7
10 1月, 2019 1 次提交

[Feature] support mix precision training for resnet (#14899) · fd854183

由 Wu Yi 提交于 1月 10, 2019

* clip softmax for fp16

* updates

* fuse xent support fp16 test=develop

* wip

* wip

* add simple row reduce

* wip fp16 accurate softmax

* add accurate softmax kernel for fp16 test=develop

* update test=develop

* fix cpu build test=develop

* update api.spec test=develop

* follow comments test=develop

* fix build test=develop

* fix trt build test=develop

* fix inference build test=develop

* fix merge test=develop

* update test=develop

* try fix build test=develop

* fix build test=develop

* rename real_exp test=develop

* fortest

* remove hacky kernels test=develop

* clean up test=develop

fd854183

11 12月, 2018 1 次提交
- Y
  Fix Eigen macro when using GPU · 7604b1ad
  由 Yu Yang 提交于 12月 11, 2018
```
The macro should be defined by compiler rather than by source.

test=develop
```
  7604b1ad
30 10月, 2018 1 次提交
- S
  
  test=develop · 5e5d2223
  由 sneaxiy 提交于 10月 26, 2018
  
  5e5d2223
13 9月, 2018 1 次提交
- B
  
  code fix (#13365) · e69d9c84
  由 Bai Yifan 提交于 9月 13, 2018
  
  e69d9c84
11 9月, 2018 1 次提交
- B
  Add ignore_index in cross_entropy op (#13217) · faf8ad24
  由 Bai Yifan 提交于 9月 11, 2018
```
* add ignore index

* update api.spec

* enhance softmax_with_cross_entropy
```
  faf8ad24
08 8月, 2018 1 次提交
- S
  
  refine softmax_with_cross_entropy · 1b4515f6
  由 sneaxiy 提交于 8月 06, 2018
  
  1b4515f6
15 3月, 2018 1 次提交
- Q
  Fix a critical bug in softmax_with_cross_entropy_op backward. (#9120) · b5a16dca
  由 qingqing01 提交于 3月 15, 2018
```
* Fix a critical bug in softmax_with_cross_entropy_op, which will lead to the wrong gradients.

* Enhance unit testing.
```
  b5a16dca
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

06 11月, 2017 1 次提交
- C
  
  fix softmax with cross entropy op. · 6f4bf505
  由 caoying03 提交于 11月 06, 2017
  
  6f4bf505
27 10月, 2017 1 次提交

Gradient check use graph (#5027) · be00b0c4

由 Yu Yang 提交于 10月 26, 2017

* Simplize Gradient Check

* Stash

* Extract apply_backward_pass to backward.py

Rename apply_backward_pass to append_backward_ops

* Use graph API to check gradient

* Fix ci

* Fix CI

* Fix backward for double precision

* Stash

* Fix CI

* Fix ci

* Ignore GRU test

* Ignore xe op

* Fix CI

* Fix softmax with xe gradient

The correct equation should be IG = OG * (d_softmax_with_xe())

* Fix typo

* Fix merge error

* Disable LRN

be00b0c4

20 10月, 2017 1 次提交

Remove template parameter for Tensor methods (#4937) · c532b967

由 Yu Yang 提交于 10月 19, 2017

* Remove template parameter for Tensor methods

* Also check the type is correct when data()
* Simplize holder_

* Fix accuracy_op

* Register Code

c532b967

17 10月, 2017 1 次提交
- Y
  Change Name convention of operator attributes (#4807) · 75d0c790
  由 Yu Yang 提交于 10月 16, 2017
```
* Change dataType to data_type

Follow PEP8

* Change name_convention to fit PEP8
```
  75d0c790
29 9月, 2017 2 次提交
- Q
  
  fix gpu build error · b611a479
  由 qijun 提交于 9月 28, 2017
  
  b611a479
- Q
  
  refine SoftmaxFunctor · 84ff7e97
  由 qijun 提交于 9月 28, 2017
  
  84ff7e97
28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
27 9月, 2017 1 次提交
- C
  
  cross entropy as a functor to avoid duplicated codes. · 97509b68
  由 caoying03 提交于 9月 26, 2017
  
  97509b68
26 9月, 2017 1 次提交
- C
  
  fix implementations of supporting soft labels. · 8b8ad6b1
  由 caoying03 提交于 9月 25, 2017
  
  8b8ad6b1
22 9月, 2017 1 次提交
- C
  
  support soft labels. · f1d5fb3b
  由 caoying03 提交于 9月 21, 2017
  
  f1d5fb3b
18 9月, 2017 1 次提交
- C
  
  fix implementations. · 8f8ea005
  由 caoying03 提交于 9月 15, 2017
  
  8f8ea005
15 9月, 2017 1 次提交
- C
  
  finish implementation and fix unittest. · efa4526c
  由 caoying03 提交于 9月 13, 2017
  
  efa4526c
11 9月, 2017 1 次提交
- C
  
  softmax with cross entropy as a cost operator. · 513bc997
  由 caoying03 提交于 9月 08, 2017
  
  513bc997
08 8月, 2017 1 次提交
- D
  
  "fix clang format" · 22f03c39
  由 dongzhihong 提交于 8月 08, 2017
  
  22f03c39
07 8月, 2017 1 次提交
- D
  
  "remove a lot alias" · 610801b5
  由 dongzhihong 提交于 8月 07, 2017
  
  610801b5
04 8月, 2017 3 次提交
- L
  
  ClangFormat for proto and cuda · 1d4fa243
  由 liaogang 提交于 8月 04, 2017
  
  1d4fa243
- L
  
  fix softmax_op code line > 80 · c6186120
  由 liaogang 提交于 8月 04, 2017
  
  c6186120
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
03 8月, 2017 1 次提交

Softmax grad op (#3164) · d953611e

由 Qiao Longfei 提交于 8月 03, 2017

* init softmax grad op

* add compute code

* export Backward to python

* update test ,export op.type to python

* update python test, fix compute bug

* update unit test

* use eigen

* optimize eigen code

* add gpu test

* register softmax_grad GPU kernel and fix test bug

* typo

* follow comments

d953611e

31 7月, 2017 1 次提交
- Q
  
  add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  由 qijun 提交于 7月 31, 2017
  
  61f94f00
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功