提交 · e8ac7fc30a25e5d4626d8b483bf936bb9abe2e93 · BaiXuePrincess / Paddle

10 2月, 2022 1 次提交

[bf16] add bf16 kernel: dropout & reshape & slice (#39395) · e8ac7fc3

由 zhangbo9674 提交于 2月 10, 2022

* add dropout

* add reshape

* add slice

* refien slice unittest

* refine slice unittest

* add cpu bf16 kernel

e8ac7fc3

18 11月, 2021 1 次提交
- L
  fix bug to support dropout eval grad computing. (#37305) · c3d3001f
  由 Li Min 提交于 11月 18, 2021
```
* fix bug to support dropout eval grad computing.

* Remove useless code.
```
  c3d3001f
15 9月, 2021 1 次提交
- L
  
  Refactor dropout cuda impl for code reuse. (#35621) · 2b88057f
  由 Li Min 提交于 9月 15, 2021
  
  2b88057f
03 9月, 2021 1 次提交
- Y
  
  Unify the implementation of AlignedVector and simplify the codes of dropout and cast. (#35373) · c171eca2
  由 Yiqun Liu 提交于 9月 03, 2021
  
  c171eca2
03 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid operators for rocm (part7), test=develop (#31307) · 3b9db171
  由 Qi Li 提交于 3月 03, 2021
  
  3b9db171
16 12月, 2020 1 次提交
- Z
  improve dropout grad (#29605) · 1e9127f6
  由 Zhang Ting 提交于 12月 16, 2020
```
* improve grad perf
```
  1e9127f6
11 12月, 2020 1 次提交

improve dropout (#29465) · 6702040e

由 Zhang Ting 提交于 12月 11, 2020

* improve drop out

* add VectorizedRandomGeneratorWithGenerator

* fix bug

* modify according to comments

6702040e

04 9月, 2020 1 次提交
- Y
  
  add cuda generator (#26786) · 7f3e6ca5
  由 yaoxuefeng 提交于 9月 04, 2020
  
  7f3e6ca5
13 4月, 2020 1 次提交
- M
  add cuda kernel for seed, test=develop (#23749) · 6b4a51ba
  由 mapingshuo 提交于 4月 13, 2020
```
* add cuda kernel for seed, test=develop
```
  6b4a51ba
10 12月, 2019 1 次提交
- M
  Dropout with seed (#21590) · e2d849b9
  由 mapingshuo 提交于 12月 10, 2019
```
* add seed op
```
  e2d849b9
03 9月, 2019 1 次提交
- T
  refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603) · 75d15719
  由 Tao Luo 提交于 9月 03, 2019
```
test=develop
```
  75d15719
20 8月, 2019 1 次提交

optimize the realization of cuda dropout (#19136) · 6e326ca2

由 wangchaochaohu 提交于 8月 20, 2019

* cuda optimie for dropout

* remove tmp swp file

* fix compile error test=develop

* test=develop optimize the cuda realization of dropout op

* remove unsed code test=develop

* remove tmp file test=develop

6e326ca2

28 4月, 2019 1 次提交

Refine dropout gpu memory (#17095) · 28d69d71

由 Zeng Jinle 提交于 4月 28, 2019

* refine_dropout_mem,test=develop

* # This is a combination of 14 commits.
# The first commit's message is:
remove ut test_dist_word2vec in mac ci, will fix it in private, test=develop (#17066)

# This is the 2nd commit message:

Fleet unify distributed training (#16791)

* implement distributed transpiler with fleet
# This is the 3rd commit message:

ParallelDyGraph with GPU collective mode (#16827)

implement dygraph.parallel.DataParallel to hook reduce op.

# This is the 4th commit message:

Init mixed precision training interface (#16856)

* Init mixed precision training interface

* Add fp16 test script

test=develop

* All initializers support float16

test=develop

* Code cleanup & add more code annotations

test=develop

* Update API spec

test=develop

* Add usage example in doc

test=develop

# This is the 5th commit message:

fix reference_count_pass,test=develop (#17060)

test=develop
# This is the 6th commit message:

Speedup roi_perspective_transform op by caching the information of linear interpolation in forward (#17090)

* Cache the information of linear interpolation in forward and use it in backward.
test=develop

* Fix cuda kernel.
test=develop

# This is the 7th commit message:

remove unnecessary prepare_data (#17080)

test=develop
# This is the 8th commit message:

fix interpolate cu. test=develop (#17101)

# This is the 9th commit message:

test=develop, double backward leaky_relu (#17067)

backward of backward: leaky_relu
# This is the 10th commit message:

fix fuse optimizer ops (#17102)

test=develop
# This is the 11th commit message:

truncated_gaussian_random supported in distributed training, test=develop (#17091)

# This is the 12th commit message:

 Detailed coordinate description for yolov3 loss (#17007)

* Detailed coordinate description for yolov3 loss

test=develop

* modified api.spec

test=develop

* modified loss name

* fix api.spec

test=develop

* polish description

test=develop

* modified api.spec

test=develop

# This is the 13th commit message:

fix test_weight_decay (#17109)

test=develop
# This is the 14th commit message:

Path flag (#17105)

* fix python/paddle/fluid/__init__.py detecting problems

28d69d71

30 1月, 2019 1 次提交
- Y
  Some improvements to support bert mixed precision training (#15585) · 170842cb
  由 Yibing Liu 提交于 1月 30, 2019
```
* Some improvements to support bert mixed precision training

test=develop

* Revert the cast in layer_norm

test=develop
```
  170842cb
11 12月, 2018 1 次提交
- Y
  Fix Eigen macro when using GPU · 7604b1ad
  由 Yu Yang 提交于 12月 11, 2018
```
The macro should be defined by compiler rather than by source.

test=develop
```
  7604b1ad
24 10月, 2018 1 次提交
- P
  
  modify dropout att; test=develop · a6e6bc45
  由 phlrain 提交于 10月 24, 2018
  
  a6e6bc45
23 10月, 2018 1 次提交
- P
  
  add dropout attr; test=develop · ffb24a73
  由 phlrain 提交于 10月 23, 2018
  
  ffb24a73
20 4月, 2018 1 次提交
- Y
  Revert "accelerate dropout (#9902)" (#10082) · f2e400d6
  由 Yu Yang 提交于 4月 20, 2018
```
* Revert "accelerate dropout (#9902)"

This reverts commit 2e331c65.

* Correct discard
```
  f2e400d6
19 4月, 2018 1 次提交

accelerate dropout (#9902) · 2e331c65

由 dzhwinter 提交于 4月 19, 2018

* accelerate dropout

* accelerate dropout

* "fix the dropout test"

* "rerun ci"

* "fix ci"

* "rerun ci"

* "fix ci"

* "fix"

* "stage"

* disable

2e331c65

27 3月, 2018 1 次提交
- G
  
  Add drop_out_op unit test (#9364) · e0b5691e
  由 gongweibao 提交于 3月 27, 2018
  
  e0b5691e
22 3月, 2018 1 次提交
- D
  
  "fast hack" · e33af241
  由 dzhwinter 提交于 3月 22, 2018
  
  e33af241
20 3月, 2018 2 次提交
- K
  
  remove AttrType · d03dbb97
  由 Kexin Zhao 提交于 3月 19, 2018
  
  d03dbb97
- K
  
  initial commit · 05ad1583
  由 Kexin Zhao 提交于 3月 19, 2018
  
  05ad1583
12 2月, 2018 2 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
- D
  Memory/dropout4 (#8407) · 07923ba0
  由 dzhwinter 提交于 2月 12, 2018
```
* "merge random generator kernel and mul"

* "fix dropout"
```
  07923ba0
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
30 1月, 2018 1 次提交
- C
  
  fix the bug that dropout always use a fixed seed. · 7d303bdc
  由 caoying03 提交于 1月 30, 2018
  
  7d303bdc
26 12月, 2017 2 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
- C
  
  refine · 52119d62
  由 chengduoZH 提交于 12月 25, 2017
  
  52119d62
21 12月, 2017 1 次提交
- Y
  
  Correct the dropout_op's computation in test · c2b1ddb6
  由 Yibing Liu 提交于 12月 20, 2017
  
  c2b1ddb6
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

24 11月, 2017 1 次提交

support testing when training and handle dropout and batch_norm operator in testing mode (#5734) · 3a76062c

由 QI JUN 提交于 11月 24, 2017

* is_training to is_test in dropout op

* handle dropout and batch_norm operator when prune pdesc in testing mode

* handle dropout and batch_norm operator when prune pdesc in testing mode

* add get_inference_program method

* fix dropout op

* fix ci

* test data after each batch training

* refine code

* refine test_book3

* fix ci

* follow comments

3a76062c

28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
20 9月, 2017 1 次提交
- D
  
  Add bool type for attribute and use it in dropout_op. · 72ba0270
  由 dangqingqing 提交于 9月 20, 2017
  
  72ba0270
19 9月, 2017 2 次提交
- X
  
  Remove unnecessary mask operations in test phase for dropout operator. · ffeeef82
  由 Xinghai Sun 提交于 9月 19, 2017
  
  ffeeef82
- X
  Add is_training attr and testing phrase compuation to dropout operator. · 585d12a3
  由 Xinghai Sun 提交于 9月 19, 2017
```
Change type of dropout_prob to template typename.
```
  585d12a3
16 9月, 2017 1 次提交
- X
  
  Move dropout gpu kernel to dropout_op.cu. · 32645b52
  由 Xinghai Sun 提交于 9月 16, 2017
  
  32645b52
03 9月, 2017 1 次提交
- X
  
  Fixed SEGFAULT of dropout operator in GPU. · b1a18552
  由 Xinghai Sun 提交于 9月 03, 2017
  
  b1a18552
02 9月, 2017 1 次提交
- X
  
  Add dropout operator. · 9a44f3d6
  由 Xinghai Sun 提交于 9月 02, 2017
  
  9a44f3d6

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致