提交 · 28d69d710a00a04423b0a28a0c72ac2690d1f641 · PaddlePaddle / Paddle

28 4月, 2019 1 次提交

Refine dropout gpu memory (#17095) · 28d69d71

由 Zeng Jinle 提交于 4月 28, 2019

* refine_dropout_mem,test=develop

* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)

# This is the 2nd commit message:

Fleet unify distributed training (#16791)

* implement distributed transpiler with fleet
# This is the 3rd commit message:

ParallelDyGraph with GPU collective mode (#16827)

implement dygraph.parallel.DataParallel to hook reduce op.

# This is the 4th commit message:

Init mixed precision training interface (#16856)

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

# This is the 5th commit message:

fix reference_count_pass,test=develop (#17060)

test=develop
# This is the 6th commit message:

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

# This is the 7th commit message:

remove unnecessary prepare_data (#17080)

test=develop
# This is the 8th commit message:

fix interpolate cu. test=develop (#17101)

# This is the 9th commit message:

test=develop, double backward leaky_relu (#17067)

backward of backward: leaky_relu
# This is the 10th commit message:

fix fuse optimizer ops (#17102)

test=develop
# This is the 11th commit message:

truncated_gaussian_random supported in distributed training, test=develop (#17091)

# This is the 12th commit message:

 Detailed coordinate description for yolov3 loss (#17007)

* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop

# This is the 13th commit message:

fix test_weight_decay (#17109)

test=develop
# This is the 14th commit message:

Path flag (#17105)

* fix python/paddle/fluid/__init__.py detecting problems

28d69d71

30 1月, 2019 1 次提交
- Y
  Some improvements to support bert mixed precision training (#15585) · 170842cb
  由 Yibing Liu 提交于 1月 30, 2019
```
* Some improvements to support bert mixed precision training

test=develop

* Revert the cast in layer_norm

test=develop
```
  170842cb
11 12月, 2018 1 次提交
- Y
  Fix Eigen macro when using GPU · 7604b1ad
  由 Yu Yang 提交于 12月 11, 2018
```
The macro should be defined by compiler rather than by source.

test=develop
```
  7604b1ad
24 10月, 2018 1 次提交
- P
  
  modify dropout att; test=develop · a6e6bc45
  由 phlrain 提交于 10月 24, 2018
  
  a6e6bc45
23 10月, 2018 1 次提交
- P
  
  add dropout attr; test=develop · ffb24a73
  由 phlrain 提交于 10月 23, 2018
  
  ffb24a73
20 4月, 2018 1 次提交
- Y
  Revert "accelerate dropout (#9902)" (#10082) · f2e400d6
  由 Yu Yang 提交于 4月 20, 2018
```
* Revert "accelerate dropout (#9902)"

This reverts commit 2e331c65.

* Correct discard
```
  f2e400d6
19 4月, 2018 1 次提交

accelerate dropout (#9902) · 2e331c65

由 dzhwinter 提交于 4月 19, 2018

* accelerate dropout

* accelerate dropout

* "fix the dropout test"

* "rerun ci"

* "fix ci"

* "rerun ci"

* "fix ci"

* "fix"

* "stage"

* disable

2e331c65

27 3月, 2018 1 次提交
- G
  
  Add drop_out_op unit test (#9364) · e0b5691e
  由 gongweibao 提交于 3月 27, 2018
  
  e0b5691e
22 3月, 2018 1 次提交
- D
  
  "fast hack" · e33af241
  由 dzhwinter 提交于 3月 22, 2018
  
  e33af241
20 3月, 2018 2 次提交
- K
  
  remove AttrType · d03dbb97
  由 Kexin Zhao 提交于 3月 19, 2018
  
  d03dbb97
- K
  
  initial commit · 05ad1583
  由 Kexin Zhao 提交于 3月 19, 2018
  
  05ad1583
12 2月, 2018 2 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
- D
  Memory/dropout4 (#8407) · 07923ba0
  由 dzhwinter 提交于 2月 12, 2018
```
* "merge random generator kernel and mul"

* "fix dropout"
```
  07923ba0
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
30 1月, 2018 1 次提交
- C
  
  fix the bug that dropout always use a fixed seed. · 7d303bdc
  由 caoying03 提交于 1月 30, 2018
  
  7d303bdc
26 12月, 2017 2 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
- C
  
  refine · 52119d62
  由 chengduoZH 提交于 12月 25, 2017
  
  52119d62
21 12月, 2017 1 次提交
- Y
  
  Correct the dropout_op's computation in test · c2b1ddb6
  由 Yibing Liu 提交于 12月 20, 2017
  
  c2b1ddb6
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

24 11月, 2017 1 次提交

support testing when training and handle dropout and batch_norm operator in testing mode (#5734) · 3a76062c

由 QI JUN 提交于 11月 24, 2017

* is_training to is_test in dropout op

* handle dropout and batch_norm operator when prune pdesc in testing mode

* handle dropout and batch_norm operator when prune pdesc in testing mode

* add get_inference_program method

* fix dropout op

* fix ci

* test data after each batch training

* refine code

* refine test_book3

* fix ci

* follow comments

3a76062c

28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
20 9月, 2017 1 次提交
- D
  
  Add bool type for attribute and use it in dropout_op. · 72ba0270
  由 dangqingqing 提交于 9月 20, 2017
  
  72ba0270
19 9月, 2017 2 次提交
- X
  
  Remove unnecessary mask operations in test phase for dropout operator. · ffeeef82
  由 Xinghai Sun 提交于 9月 19, 2017
  
  ffeeef82
- X
  Add is_training attr and testing phrase compuation to dropout operator. · 585d12a3
  由 Xinghai Sun 提交于 9月 19, 2017
```
Change type of dropout_prob to template typename.
```
  585d12a3
16 9月, 2017 1 次提交
- X
  
  Move dropout gpu kernel to dropout_op.cu. · 32645b52
  由 Xinghai Sun 提交于 9月 16, 2017
  
  32645b52
03 9月, 2017 1 次提交
- X
  
  Fixed SEGFAULT of dropout operator in GPU. · b1a18552
  由 Xinghai Sun 提交于 9月 03, 2017
  
  b1a18552
02 9月, 2017 1 次提交
- X
  
  Add dropout operator. · 9a44f3d6
  由 Xinghai Sun 提交于 9月 02, 2017
  
  9a44f3d6
08 8月, 2017 1 次提交
- D
  
  "fix clang format" · 22f03c39
  由 dongzhihong 提交于 8月 08, 2017
  
  22f03c39
07 8月, 2017 1 次提交
- D
  
  "remove a lot alias" · 610801b5
  由 dongzhihong 提交于 8月 07, 2017
  
  610801b5
04 8月, 2017 1 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
02 8月, 2017 1 次提交
- D
  
  Add sigmoid backward implenmention. · 0560733c
  由 dangqingqing 提交于 8月 02, 2017
  
  0560733c
31 7月, 2017 1 次提交
- Q
  
  add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  由 qijun 提交于 7月 31, 2017
  
  61f94f00
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4
19 7月, 2017 1 次提交
- Q
  
  replace Tensor::tensor to EigenTensor::From · 736d078c
  由 qijun 提交于 7月 19, 2017
  
  736d078c
18 7月, 2017 1 次提交
- Q
  
  implement some basic OpKernel · b6c07552
  由 qijun 提交于 7月 18, 2017
  
  b6c07552
17 7月, 2017 1 次提交
- Y
  Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops · 1ed237c1
  由 Yu Yang 提交于 7月 17, 2017
```
* Implement InferShape and register them, give a stub Kernel method
  by LOG(INFO)
```
  1ed237c1

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功