Commits · 9aecf286cf898cb70be79ea82c3ed08b02fd6cae · BaiXuePrincess / Paddle

15 Aug, 2022 · 1 commit
- convert_fp16 support multi block (#45050) · 9aecf286
  Committed by Wilber on Aug 15, 2022
```
* convert_fp16 support multi block

* update

* update
```
01 Aug, 2022 · 1 commit

Committed by Leo Chen on Aug 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

08 Jul, 2022 · 1 commit
- conv_fusion_fp16 (#44173) · 9900b42b
  Committed by xiaoxiaohehe001 on Jul 08, 2022
  
06 Jul, 2022 · 1 commit
- [Paddle Inference] Add conv_elementwise_act. (#43871) · 4c269ccb
  Committed by xiaoxiaohehe001 on Jul 06, 2022
```
* conv_fusion
```
26 Jun, 2022 · 1 commit
- format all files in fluid using new config (#43776) · 576236a0
  Committed by Sing_chan on Jun 26, 2022
  
05 Jun, 2022 · 1 commit
- 【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  Committed by Sing_chan on Jun 05, 2022
  
12 May, 2022 · 1 commit
- Fix some typos in paddle/. (#42408) · 2012672c
  Committed by Shuangchi He on May 12, 2022
  
03 Mar, 2022 · 1 commit
- [phi] transfer pad kernel into phi and pass the test_pad_op (#40012) · 9f74b84e
  Committed by xiongkun on Mar 03, 2022
```
* add pad forward

* fix error

* transfer pad and pass the test_pad_op
```
20 Feb, 2022 · 1 commit

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

Committed by Chen Weihang on Feb 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

19 Feb, 2022 · 1 commit

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

Committed by Aurelius84 on Feb 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

15 Feb, 2022 · 1 commit

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

Committed by Aurelius84 on Feb 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: xiongkun <xiongkun03@baidu.com>

03 Dec, 2021 · 1 commit
- refine structure for cuda and rocm (#37202) · a6d2fddb
  Committed by ronnywang on Dec 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
06 May, 2021 · 1 commit

[ROCM] bugfix for unittest (#32392) · 31392627

Committed by ronnywang on May 06, 2021

* fix test_unpool_op

* fix test_inplace_addto_strategy

* fix test_conv2d_fusion_op

* fix test_imperative_lod_tensor_to_selected_rows, test_imperative_selected_rows_to_lod_tensor

* fix test_dot_op

* fix test_correlation_op

* fix tracer

* fix test_memcpy_op

15 Apr, 2021 · 1 commit
- Correct typos (#32288) · 825d4957
  Committed by AshburnLee on Apr 15, 2021
  
11 Jan, 2021 · 1 commit
- Add tf32 switch for cuDNN (#29192) · 924aac22
  Committed by AshburnLee on Jan 11, 2021
  
23 Sep, 2020 · 1 commit
- [bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436) · c17f9cf2
  Committed by Shang Zhizhou on Sep 23, 2020
```
* [bug fix]:Memory increases after adapting the cudnn version to 8

* [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
```
10 Aug, 2020 · 1 commit
- fix cudnn workspace size problem during inference. (#26021) · 50f149a4
  Committed by Zhaolong Xing on Aug 10, 2020
```
test=develop
```
05 Aug, 2020 · 1 commit
- [CUDNN8 support] : support CUDNN8 (#25664) · 358bc06c
  Committed by Zhaolong Xing on Aug 05, 2020
```
* cunn8 support
test=develop

* fix ci error
test=develop
```
21 Apr, 2020 · 1 commit
- fix conv_fusion_op conflict,test=develop (#24020) · 76d78c63
  Committed by Zhou Wei on Apr 21, 2020
  
20 Apr, 2020 · 1 commit
- Op(conv2d_fusion) error message enhancement. (#23596) · 8d0b0cb4
  Committed by Yiqun Liu on Apr 20, 2020
  
12 Apr, 2020 · 1 commit
- fix bug for exhaustive_search in conv_fusion_op, test=develop (#23727) · b4b6763a
  Committed by zhongpu on Apr 12, 2020
  
03 Apr, 2020 · 1 commit

support Exhaustive search in dygraph (#23415) · dbfbd7ea

Committed by zhongpu on Apr 03, 2020

* use global conv cache; test=develop

* use singleton cache; test=develop

* fix format error; test=develop

* add cudnn helper header; test=develop

* fix header error; test=develop

* fix mac unitest; test=develop

* fix mac unitest; test=develop

* fix file format; test=develop

* fix include file error, test=develop

* remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop

* fix test_elementwise_mul_op_dim, test=develop

* fix compile error, test=develop
Co-authored-by: phlrain <phliuhongyu@126.com>

02 Apr, 2020 · 2 commits

Revert "Exhaustive search (#22821)", test=develop (#23401) · bfb07aaf
Committed by zhongpu on Apr 02, 2020
```
This reverts commit 48144e40.
```

Exhaustive search (#22821) · 48144e40

Committed by zhongpu on Apr 02, 2020

* use global conv cache; test=develop

* use singleton cache; test=develop

* fix format error; test=develop

* add cudnn helper header; test=develop

* fix header error; test=develop

* fix mac unitest; test=develop

* fix mac unitest; test=develop

* fix file format; test=develop

* fix include file error, test=develop

* remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop

* fix test_elementwise_mul_op_dim, test=develop
Co-authored-by: phlrain <phliuhongyu@126.com>

07 Jan, 2020 · 2 commits
- Fix windows build not kernel issue, test=develop (#22105) · 3dbd4087
  Committed by zhaoyuchen2018 on Jan 07, 2020
```
windows conv_fusion failed as no kernel, explicit declare lambda
Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
```
- replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109) · ba8414d3
  Committed by Chen Weihang on Jan 07, 2020
  
12 Nov, 2019 · 1 commit

Add Asypadding for conv fusion. (#21041) · 4a544762

Committed by zhaoyuchen2018 on Nov 12, 2019

* Add Asypadding for conv fusion.

test=develop

reference: pr/20042

* Fix eigen build link error

* Change back file mode

* Use math function & add more checks.

30 Oct, 2019 · 1 commit

Move the codes of fused operators to operators/fused directory. (#20881) · 03ba0fda

Committed by Yiqun Liu on Oct 30, 2019

* Move the codes of fused operators to operators/fused directory.
test=develop

* Correct the op name in cmake.

* Change the use of PADDLE_ENFORCE.
test=develop

05 Sep, 2019 · 1 commit
- paddle::framework::vectorize() templatization (#19627) · d6c85c96
  Committed by Tao Luo on Sep 05, 2019
```
test=develop
```
16 Aug, 2019 · 1 commit
- move_flags_to_unified_files_for_management, test=develop (#19224) · 708bd979
  Committed by Zeng Jinle on Aug 16, 2019
  
19 Jun, 2019 · 1 commit

fix spelling errors (#17941) · 802ea509

Committed by 翟飞跃 on Jun 19, 2019

* fix spelling errors; test=develop

* Update API.spec

update md5

* Update API.spec

* change the order of api;test=develop

28 Apr, 2019 · 1 commit

Use CudnnWorkspaceHandle in exhaustive search (#17082) · b9494058

Committed by Huihuang Zheng on Apr 28, 2019

1. Use CudnnWorkspaceHandle in exhaustive search of conv_cudnn.
2. For Ops using CudnnWorkspaceHandle in exhaustive search, release their GPU memory after exhaustive search.

test=develop

23 Apr, 2019 · 1 commit
- Make conv cudnn workspace size configurable (#17036) · 0c335dcd
  Committed by Zeng Jinle on Apr 23, 2019
```
* make_conv_cudnn_ws_size_configurable, test=develop

* change std::max to std::min
test=develop
```
25 Feb, 2019 · 1 commit
- polish · 5dd281f7
  Committed by Xin Pan on Feb 25, 2019
```
test=develop
```
21 Feb, 2019 · 1 commit
- add per kernel config and remove const_cast. · 5eb87506
  Committed by Xin Pan on Feb 21, 2019
```
test=develop
```
25 Jan, 2019 · 1 commit

Revert conv transpose cudnn (#15514) · f8f91fb4

Committed by chengduo on Jan 24, 2019

* Revert "set constant for loss"

This reverts commit 167933f6.

* Revert "remove workspace_handle"
test=develop
This reverts commit b4aca8ed.

22 Jan, 2019 · 1 commit
- Remove workspace_handle (#15376) · 5a8bd82c
  Committed by chengduo on Jan 22, 2019
```
* remove workspace_handle
test=develop

* set constant for loss
test=develop
```
28 Dec, 2018 · 1 commit

Inception fusion operator. (#14968) · 6f0a1d7b

Committed by qingqing01 on Dec 28, 2018

* Inception fusion operator.
* Support horizontal layer fusion in conv_fusion_op.
* Search conv algo strategy for variable-length input.
   search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
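
The caching strategy described in the last bullet can be illustrated with a minimal sketch. This is not Paddle code: `ClosestAreaAlgoCache`, `search_fn`, and `max_searches` are hypothetical names, and it assumes that "search N times" means running the algorithm search for the first N distinct input shapes, after which any unseen shape reuses the algo cached for the shape with the closest area.

```python
from typing import Callable, Dict, List, Tuple

Shape = Tuple[int, int]  # (height, width) of the input; an assumption for this sketch


class ClosestAreaAlgoCache:
    """Hypothetical cache: search the first `max_searches` distinct shapes,
    then reuse the algo of the cached shape whose area is closest."""

    def __init__(self, search_fn: Callable[[Shape], int], max_searches: int) -> None:
        self._search_fn = search_fn          # stand-in for an expensive algo search
        self._max_searches = max_searches    # the "N" in "search N times"
        self._algos: Dict[Shape, int] = {}   # shape -> chosen algo id

    def get(self, shape: Shape) -> int:
        # Exact hit: reuse the algo found earlier for this shape.
        if shape in self._algos:
            return self._algos[shape]
        # Search budget not exhausted: run the search and cache the result.
        if len(self._algos) < self._max_searches:
            algo = self._search_fn(shape)
            self._algos[shape] = algo
            return algo
        # Budget exhausted: pick the cached shape with the closest area.
        area = shape[0] * shape[1]
        nearest = min(self._algos, key=lambda s: abs(s[0] * s[1] - area))
        return self._algos[nearest]


# Usage with a fake search function that records which shapes were searched.
calls: List[Shape] = []


def fake_search(shape: Shape) -> int:
    calls.append(shape)
    return len(calls)  # pretend algo id: 1 for the first search, 2 for the second


cache = ClosestAreaAlgoCache(fake_search, max_searches=2)
cache.get((32, 32))  # searched, cached as algo 1
cache.get((64, 64))  # searched, cached as algo 2
cache.get((60, 60))  # not searched; area 3600 is closest to 64x64
```

Capping the number of searches bounds the one-time search cost for variable-length inputs, at the price of possibly reusing a non-optimal algo for shapes that were never searched.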

26 Dec, 2018 · 1 commit
- Fix conv_elementwise_add2_act pass · 956cf921
  Committed by hjchen2 on Dec 26, 2018
```
test=develop
```
25 Dec, 2018 · 1 commit
- add affine_channel fuse. · ce3782c1
  Committed by nhzlx on Dec 25, 2018
```
fix conv+elemenwise fuse bug.
```