Commits · dcfe198631058dbcd4fe6e887a4e514008ed1e68 · PaddlePaddle / Paddle

20 Feb, 2022 1 commit

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

Committed by Chen Weihang on Feb 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

15 Feb, 2022 1 commit

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

Committed by Aurelius84 on Feb 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu successfully

* fix conflict

* fix conflict

Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

27 Jan, 2022 1 commit

refactor elementwise sub grad (#39225) · 7a1e1193

Committed by YuanRisheng on Jan 27, 2022

7a1e1193

21 Jan, 2022 2 commits

[PTen]Separate origin Kernel and add Kernel for C++ API (#39002) · a0f586bc

Committed by YuanRisheng on Jan 21, 2022

* add kernel for c++ api

* fix compile bugs

* fix kunlun compile bugs

* perfect cmake

* fix compile bugs when run ci-inference

* fix compile bugs

* add non-raw kernel for fluid op

* fix compile bugs

* fix compile bugs

* fix unit test bug

a0f586bc

[PTEN] Add cpu context (#38979) · 064bc4b8

Committed by Wilber on Jan 21, 2022

* add cpu_context.

* update

* update

* update

* update

* update

* fix ci problem

* fix npu ci problem

* update

* fix ci compile

064bc4b8

11 Jan, 2022 1 commit

Remove useless headers for some grad ops (#38823) · 9f34a070

Committed by limingshu on Jan 11, 2022

* fix the wrong filename

* first commit

* first commit

* remove rest useless headers

* for ci approval

9f34a070

06 Jan, 2022 2 commits

Revert "Remove useless headers for some grad ops (#38732)" (#38743) · fc990d08

Committed by limingshu on Jan 06, 2022

This reverts commit c0e2b98e.

fc990d08

Remove useless headers for some grad ops (#38732) · c0e2b98e

Committed by limingshu on Jan 06, 2022

* fix the wrong filename

* first commit

c0e2b98e

05 Jan, 2022 1 commit

implementation of broadcast div backward by reduce (#38044) · 55cd9cb8

Committed by crystal on Jan 05, 2022

* add elementwise div

* move mul and div grad functor

* Combine multiple CUDA kernels

* Update the reduce interface call

* add multi-output

* add multi-output div

* add branch judge

* Package branch

* Combine the x and y functions into one

55cd9cb8
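
For readers unfamiliar with what "broadcast div backward by reduce" refers to, the following is a minimal CPU sketch of the underlying math only. It is not the CUDA code from this commit. It assumes `x` has shape `[rows, cols]` and `y` has shape `[cols]`, so that `y` is broadcast over rows, and the names `DivGrads` and `BroadcastDivBackward` are hypothetical. The gradient for `y` has to be summed (reduced) over the broadcast axis.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical container, not a Paddle type: gradients of out = x / y.
struct DivGrads {
  std::vector<double> dx;  // shape [rows, cols], same as x
  std::vector<double> dy;  // shape [cols], same as y
};

// Hypothetical helper, not Paddle's API. x and dout are row-major
// [rows, cols]; y is [cols] and is broadcast across the row axis.
DivGrads BroadcastDivBackward(const std::vector<double>& x,
                              const std::vector<double>& y,
                              const std::vector<double>& dout,
                              std::size_t rows, std::size_t cols) {
  DivGrads g{std::vector<double>(rows * cols),
             std::vector<double>(cols, 0.0)};
  for (std::size_t r = 0; r < rows; ++r) {
    for (std::size_t c = 0; c < cols; ++c) {
      const std::size_t i = r * cols + c;
      // Elementwise part: d(x / y) / dx = 1 / y.
      g.dx[i] = dout[i] / y[c];
      // d(x / y) / dy = -x / y^2, accumulated over the broadcast axis.
      // This accumulation is the "reduce" step.
      g.dy[c] += -dout[i] * x[i] / (y[c] * y[c]);
    }
  }
  return g;
}
```

On a GPU the accumulation into `dy` would typically be done with a parallel reduction rather than a serial loop; the sketch only shows which values are summed.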

31 Dec, 2021 1 commit

[Pten]Move math to new directory and change 「math」 to 「math_kernel」 (#38604) · e76087ad

Committed by YuanRisheng on Dec 31, 2021

* change 'math' to 'math_kernel'

* fix compile bugs

* merge develop

* fix compile bugs

e76087ad

16 Dec, 2021 1 commit

[Pten]Modify registered kernel name (#38109) · be874c08

Committed by YuanRisheng on Dec 16, 2021

* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile

* modify register name

* fix compile bugs

be874c08

23 Nov, 2021 1 commit

[PTen]Elementwise_div Kernel Refactor (#37418) · 32d9beef

Committed by YuanRisheng on Nov 23, 2021

* elementwise_div refactor

* fix compile bugs in windows ci

32d9beef

15 Sep, 2021 1 commit

Unify the functor definition of elementwise add, sub, mul, div, floordiv, max, min. (#35684) · 2367cca6

Committed by Yiqun Liu on Sep 15, 2021

2367cca6

25 May, 2021 1 commit

modify complex template for elementwise ops (#33071) · dbc08d69

Committed by chentianyu03 on May 25, 2021

* modify complex template for elementwise ops

* modify mul, div grad struct

* add complex template for CudaShuffleDownSync CudaShuffleXorSync funcs and fix the bug when delete cuda<9000

* fix shuffle func args bug

* fix shuffle func args bug

* fix shuffle func args bug

dbc08d69

03 Mar, 2021 1 commit

[ROCM] update fluid elementwise op for rocm (part10), test=develop (#31361) · 7cdf6ea7

Committed by Qi Li on Mar 03, 2021

* [ROCM] update fluid elementwise op for rocm (part10), test=develop

* update, test=develop

* address review comments, test=develop

7cdf6ea7

25 Jan, 2021 1 commit

More precise mkldnn kernel rules in GetExpectedKernelType (#29840) · 5bf25d1e

Committed by arlesniak on Jan 25, 2021

* More precise mkldnn kernel choice in GetExpectedKernelType

* Fixes after review

* Refresh develop for CI

* CI experiment

* get back from CI exper

5bf25d1e

11 Jan, 2021 1 commit

type promotion for grad (#30177) · c7371b7b

Committed by chentianyu03 on Jan 11, 2021

* type promotion for grad

* add type promotion for div op

c7371b7b

22 Dec, 2020 1 commit

change the grad of div when complex types (#29804) · 2a260d9b

Committed by chentianyu03 on Dec 22, 2020

* change the grad of div when complex types

* fix the grads of inputs args order not match bug

2a260d9b

27 Nov, 2020 1 commit

Fixes mkldnn dygraph learning rate scheduler crashes (#28988) · bc902044

Committed by arlesniak on Nov 27, 2020

bc902044

30 Dec, 2019 1 commit

fix broadcast bug;test=develop (#21898) · b7697f62

Committed by danleifeng on Dec 30, 2019

b7697f62

20 Nov, 2019 1 commit

edit elementwise_mul doublegrad inplace (#21245) · 6fc3e8ec

Committed by danleifeng on Nov 20, 2019

6fc3e8ec

19 Nov, 2019 1 commit

extend elementwise broadcast function (#20957) · 0e7baabe

Committed by danleifeng on Nov 19, 2019

0e7baabe

28 Oct, 2019 1 commit

Replace risky GetInputType method with secure IndicateVarDataType interface (#20668) · 26cc1fe5

Committed by Chen Weihang on Oct 28, 2019

* replace part of the old implementation, test=develop

* restore concat op, test=develop

* update all ops implementation & delete GetDataTypeOfVar func, test=develop

26cc1fe5

30 Sep, 2019 1 commit

Improve elementwise operators performance in same dimensions. (#19763) · 425279a5

Committed by danleifeng on Sep 30, 2019

Improve elementwise operators performance in same dimensions

425279a5

18 Sep, 2019 1 commit

Update elementwise double grad to save gpu memory (#19509) · 982e61f5

Committed by Leo Chen on Sep 18, 2019

* update elementwise double grad to save gpu memory, test=develop

* update elementwise_mul/div_grad_grad to save memory, test=develop

* remove eval function in eigen statement to save memory, test=develop

* add unittest for elementwise_div_grad_grad without dout, test=develop

* add unittest for elementwise_add_grad_grad without ddx, test=develop

* add float16 cuda kernel for elementwise double grad op, test=develop

982e61f5

20 May, 2019 1 commit

Double backward elementwise div (#17416) · 10b23a72

Committed by lvmengsi on May 20, 2019

* double backward, elementwise_div

* fix dx empty. test=develop

* bug fix (#17392)

fix secure bug

* Enable stack operator for a Ngraph, test=develop (#17406)

* fix sqrt_grad_grad unittest. test=develop (#17410)

* fix sqrt_grad_grad unittest. test=develop

* disable sqrt_grad_grad unittest. test=develop

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix unittest

* test=develop, fix bug

* fix unittest. test=develop

* fix unittest dx. test=develop

* tmp fix! for test... test=develop

* reduce tmp, test=develop

* test=develop, reduce tmp

* fix broadcast unittest. test=develop

* fix format. test=develop

* refine code. test=develop

* refine code. test=develop

* refine GetDoubleGradSafeTensor. test=develop

* fix format. test=develop

10b23a72

03 Apr, 2019 1 commit

Fix some grad op desc makers (#16633) · 1c526e1d

Committed by Zeng Jinle on Apr 02, 2019

* fix some grad op desc maker
test=develop

* fix grad op desc makers
test=develop

1c526e1d

16 Nov, 2018 1 commit

Refine operator cmake (#14413) · a2d9b344

Committed by Wu Yi on Nov 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

08 Nov, 2018 1 commit

Fix input<tensor> (#14208) · c5b6573a

Committed by chengduo on Nov 08, 2018

* fix input<tensor>
test=develop

* fix split_ids
test=develop

* ElementwiseMul should not support SelectedRows

* fix scale op
test=develop

* change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()

* fix operator

* refine MultiOutput

* fix MultiOutput
test=develop

* disable test_dist_save_load
test=develop

* fix elementwise_op
test=develop

* add get_sparse_as_op
test=develop

* add info for check
test=develop

* rename get_sparse_as_op with extract_rows_as_op.
test=develop

* elementwise doesn't support selected_rows

* fix regularizer

* remove extract_rows_as
test=develop

* fix ci
test=develop

* add test for sum_op

* fix regularizer
test=develop

*  test=develop

* fix pserver weight decay multi inputs test=develop

c5b6573a

22 Aug, 2018 1 commit

Process elemwise grad op's lod. mul_op's lod · 211d8186

Committed by Yu Yang on Aug 22, 2018

211d8186

07 Mar, 2018 1 commit

refine elementwise sub,div,min,max · 8b30fada

Committed by chengduoZH on Mar 07, 2018

8b30fada

12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

24509f4a

10 Feb, 2018 2 commits

Correct #include path · fc374821

Committed by Yi Wang on Feb 09, 2018

fc374821

Move file to fluid/; Edit CMakeLists.txt · 90648f33

Committed by Yi Wang on Feb 09, 2018

90648f33

03 Feb, 2018 1 commit

Add layer norm [GPU] · 76e188e5

Committed by chengduoZH on Feb 02, 2018

76e188e5

02 Feb, 2018 1 commit

refine elementwise_op · affce733

Committed by chengduoZH on Feb 02, 2018

affce733

17 Jan, 2018 1 commit

make elementwise op support scalar as input Y · 14f6fa34

Committed by fengjiayi on Jan 17, 2018

14f6fa34

15 Jan, 2018 1 commit

remove unnecessary functor1 · 6ee8a2e1

Committed by fengjiayi on Jan 15, 2018

6ee8a2e1

26 Dec, 2017 1 commit

unify the indentation of license · 761b3297

Committed by Luo Tao on Dec 26, 2017

761b3297

12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95
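
The first item in the commit message above, taking `DeviceContext` rather than `Place` as the template parameter of functors and kernels, can be illustrated roughly as follows. This is a sketch under stated assumptions, not code from the commit: `CPUContext`, `ScaleFunctor`, and `ScaleKernel` are hypothetical stand-ins and do not mirror Paddle's real class definitions.

```cpp
#include <vector>

// Hypothetical stand-in for a device context class (not Paddle's real type).
// A real context would carry device handles, streams, allocators, and so on.
struct CPUContext {};

// Primary template: the functor is parameterized on the device context type,
// so each specialization receives the matching context object directly.
template <typename DeviceContext, typename T>
struct ScaleFunctor;

// CPU specialization of the hypothetical functor.
template <typename T>
struct ScaleFunctor<CPUContext, T> {
  void operator()(const CPUContext& ctx, T factor,
                  std::vector<T>* data) const {
    (void)ctx;  // unused in this toy example
    for (auto& v : *data) v *= factor;
  }
};

// A kernel templated on the same DeviceContext type forwards its context
// to the functor without any Place-to-device conversion step.
template <typename DeviceContext, typename T>
struct ScaleKernel {
  void Compute(const DeviceContext& ctx, T factor,
               std::vector<T>* data) const {
    ScaleFunctor<DeviceContext, T>()(ctx, factor, data);
  }
};
```

A caller would instantiate `ScaleKernel<CPUContext, float>` and pass the context object to `Compute`. A GPU build would add a second specialization of the functor for its own context type.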

PaddlePaddle / Paddle synced successfully about 1 year ago
