提交 · 908a381de9be00037a62fb2640591dc19dce2cf1 · BaiXuePrincess / Paddle

07 11月, 2022 1 次提交

[Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d

由 HongyuJia 提交于 11月 07, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

* Call SetDnnFallback function in the base class

* activation fallback to plain kernel

* fix default GetExpectedKernelType find wrong kernel

* search cudnn kernel instead of fallback

* fix cudnn_handle bug

* remove tanh use_cudnn

* restore tanh use_cudnn

* debug tanh

* fix tanh bug

* delete activation cudnn kernel

* polish code

908a381d

02 11月, 2022 1 次提交
- H
  Revert "[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325)" (#47582) · a57a19ea
  由 HongyuJia 提交于 11月 02, 2022
```
This reverts commit f9134045.
```
  a57a19ea
01 11月, 2022 1 次提交

[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045

由 HongyuJia 提交于 11月 01, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

f9134045

25 10月, 2022 1 次提交
- H
  [CUDNN hardcode] Opt CUDNN hardcode of sequence_softmax (#47319) · ff07f8a2
  由 HongyuJia 提交于 10月 25, 2022
```
* opt cudnn hardcode of sequence_softmax

* fix grad datatype
```
  ff07f8a2
17 10月, 2022 1 次提交
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
26 9月, 2022 1 次提交
- Z
  
  clear extra atts of sequence_softmax in opmaker (#46457) · 159f10e3
  由 zyfncg 提交于 9月 26, 2022
  
  159f10e3
01 8月, 2022 1 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

02 7月, 2022 1 次提交

unify cpu context, part2 (#44012) · 755438a7

由 Leo Chen 提交于 7月 02, 2022

* fix init()

* delete test_device_context

* replace CPUDeviceContext with CPUContext

* fix test_scalar

* remove dot_op.cc

* fix compile

755438a7

26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
10 9月, 2021 1 次提交
- add the extra for op rnn/sequence_conv/sequence_pool/sequence_softmax (#35554) · d8bfe83d
  由 zhouweiwei2014 提交于 9月 10, 2021
  
  d8bfe83d
01 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid operators for rocm (part2), test=develop (#31211) · 9b016c7c
  由 Qi Li 提交于 3月 01, 2021
  
  9b016c7c
13 5月, 2020 1 次提交

API/OP (Some SL API) error message enhancement (#24441) · 05d20e57

由 Chen Weihang 提交于 5月 13, 2020

* polish some sl api error message, test=develop

* polish python input check of stride slice, test=develop

* fix unittest bugs, test=develop

05d20e57

25 3月, 2020 1 次提交
- Z
  
  rename no_need_buffer_vars_macro, test=develop (#23159) · b8886bf1
  由 Zeng Jinle 提交于 3月 25, 2020
  
  b8886bf1
31 10月, 2019 1 次提交

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

28 10月, 2019 1 次提交

Replace risky GetInputType method with secure IndicateVarDataType interface (#20668) · 26cc1fe5

由 Chen Weihang 提交于 10月 28, 2019

* replace part of the old implementation, test=develop

* restore concat op, test=develop

* update all ops implemention & delete GetDataTypeOfVar func, test=develop

26cc1fe5

08 10月, 2019 1 次提交
- Z
  
  refine sequence_softmax grad maker, test=develop (#20127) · 3eebd5b3
  由 Zeng Jinle 提交于 10月 08, 2019
  
  3eebd5b3
12 12月, 2018 1 次提交
- Y
  Change tensor uses proto::VarType::type · 9bd70a1e
  由 Yu Yang 提交于 12月 11, 2018
```
test=develop
```
  9bd70a1e
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

10 10月, 2018 1 次提交

Set the right shape of selected_rows (#13723) · e1761709

由 chengduo 提交于 10月 10, 2018

* set the right shape of selected_rows
test=develop

* enhance check

* fix activation_op

* remove cast

* use ShareDimInfo replace SetDim and ShareLod

* use ShareDimAndLod
test=develop

* follow comment

test=develop

* check whether the input has lod
test=develop

* Split ShareDimAndLod

test=develop

* checkout clip.py
test=develop

e1761709

08 5月, 2018 1 次提交

Clean OpProtoAndCheckerMaker · 0e78cb69

由 Yu Yang 提交于 5月 08, 2018

Do not use ctor

* Reduce line of codes.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.

0e78cb69

19 4月, 2018 1 次提交
- Y
  add semicolon to op registry (#10034) · e04c43d5
  由 Yang Yang(Tony) 提交于 4月 18, 2018
```
* script to add semicolon

* fix typo
```
  e04c43d5
17 4月, 2018 2 次提交
- Y
  
  fix duplication · 411e888c
  由 Yang Yang 提交于 4月 17, 2018
  
  411e888c
- Y
  
  script to fix all · ce7c2e86
  由 Yang Yang 提交于 4月 16, 2018
  
  ce7c2e86
14 4月, 2018 1 次提交

Fix CPPLint errors in operators (#9826) · 7b86da71

由 Abhinav Arora 提交于 4月 13, 2018

* Fix CPPLint errors in operators

* Fix cast in softmax

* Fix softmax_mkldnn

* Fix send_recv_op_test

* Send_recv

* Fix softmax mkldnn

7b86da71

15 3月, 2018 1 次提交

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

由 dzhwinter 提交于 3月 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
21 12月, 2017 1 次提交
- W
  
  Fix equation of sequence_softmax_op. (#6810) · f04f4f9a
  由 whs 提交于 12月 21, 2017
  
  f04f4f9a
20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

05 11月, 2017 1 次提交

Fixing documentation for operators (#5373) · 2ac5d7d0

由 kavyasrinet 提交于 11月 04, 2017

* Adding documentation for seq_expand

* Adding documentation for seq_concat_op

* Adding documentation for sequence_conv

* Adding sequence_pool

* Fixing review comment

* Adding sequence_softmax

* Updating doc for sigmoid_cross_entropy_with_logits

2ac5d7d0

17 10月, 2017 1 次提交
- Y
  Correct OpWithKernel's infershape (#4847) · 73a8b78a
  由 Yu Yang 提交于 10月 16, 2017
```
They are public now
```
  73a8b78a
07 10月, 2017 1 次提交
- Q
  
  rename InferShapeContextBase to InferShapeContext · c0a34e1c
  由 qiaolongfei 提交于 10月 07, 2017
  
  c0a34e1c
28 9月, 2017 1 次提交
- L
  
  Finish the SequenceSoftmaxGradKernel, using SoftmaxGradFunctor. · 03897f25
  由 Liu Yiqun 提交于 9月 28, 2017
  
  03897f25
25 9月, 2017 1 次提交
- L
  
  Correct the forward of sequence_softmax_op. · 12f2b8eb
  由 Liu Yiqun 提交于 9月 25, 2017
  
  12f2b8eb
21 9月, 2017 1 次提交
- L
  
  Initialize the sequence softmax operator. · f14a7966
  由 Liu Yiqun 提交于 9月 21, 2017
  
  f14a7966

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致