Commits · 92568edbf7a6023f897b8d7e5f9f1ea985f28fa2 · BaiXuePrincess / Paddle

01 Nov, 2021 1 commit

Paddle Tensor Operation Library initial implementation (#34425) · b9fdd3bc

Committed by Chen Weihang on Nov 01, 2021

* initial tensor design & sign kernel demo

* add move constructor for meta & add lodtensor

* add dirs & sign xpu kernel

* add mean cpu&cuda kernel impl

* move sign & mean xpu & npu kernel

* add selected_rows basic impl

* refactor design, BaseTensor to DenseTensor, etc.

* add scale mkldnn kernel

* polish xpu & npu impl details

* fix mkldnn reuse compile failed

* change tensor operation lib name

* rename util filename

* add more comments

* change TensorImplInterface to TensorInterface

* add kernel key and factory

* remove MKLDNNTensorMeta, add MKLDNNDenseTensor

* change XXDeviceContext to XXContext

* add base kernel registrar utils & test on sign

* replace boost::any by paddle::any

* fix several ci failed

* fix npu compile error

* add ordered map util

* fix multiple ordered_map compile errors

* move dev into include dir

* support sign op in static op run

* fix static op run error

* fix new executor compile failed

* add dygraph branch & remove sign_op.h

* fix test_infer_no_need_buffer_slots

* fix rocm compile link error

* fix unitybuild error & clear glog

* fix npu compile failed

* skip quant trans test

* fix part windows compile problem

* fix xpu enforce error

* fix inference test failed

* remove ordered_map to solve quant failed

* fix part of rocm compile failed

* add more register kernels

* revert scale kernel temporarily

* fix code format error

* add new kernel registrar macro

* rename top to tcmpt

* revert xpu, npu, mkldnn impl & remove op def

* add kernel args parse functor to auto parse args

* revert some change & add scale kernels

* add op proto in dygraph kernelcontext building

* polish kernel dispatch logic & naming rule

* fix scale kernel match error

* fix scale test failed

* add mean API and unittest

* test mean api success

* add branch to solve compiled error

* skip clang format error

* add mean skip rule in op_library

* add dot kernel, api and unittest (#6)

* remove old kernel and add symbol link

* fix dot compiled failed

* add macro for module declare

* fix npu and xpu compile error

* revert sign, mean, scale, dot kernel removing

* add comment for keeping old kernel impl

* fix mutable_data error

* fix bfloat16 conflict

* fix inference undef error

* adapt to msvc compile rules

* polish comment for template inst

* add cmake template instantiation for win

* fix backend to place device id bug

* fix ifdef error

* Op2functor (#7)

* add kernel args maker class

* make args maker non-const

* remove debug log

* modify codes by review options

* split constructPrKernelContext function

* fix output name bug

* fix test_mean_op test_sign_op failed

* fill_any_like kernel refactor (#10)

* fill_any_like kernel refactor

* remove useless code of full_like c++ api

* skip dtype for fill_any_like

* add attrs for kernel key construct

* add use_pt_kernel Flags to control whether to use pt kernel (#13)

* add use_pt_kernel Flags to control whether to use pt kernel

* change the default value to true for checking pt kernels

* fix mutable_data cuda place error

* move high level apis into hapi

* remove selectedrows adapting temporarily

* Support Scalar in Tensor Compute Library (#14)

* fill_any_like kernel refactor

* remove useless code of full_like c++ api

* Support Scalar in Tensor Compute Library

* add scalar in dygraph and static graph mode

* keep the basic type for attr, instead of using scalar for all

* merge the code

* remove mkldnn tensor & polish details

* use flat_hash_map and small_vector in kernel factory

* Refactor flatten kernel (#12)

* refactor flatten kernel

* update infershape function

* fix compile bugs

* fix bugs when merge

* fix compiler bugs

* fix bugs when run test_flatten_api

* fix bugs when run test

* Revert "use flat_hash_map and small_vector in kernel factory"

This reverts commit 23091495cfdd3df8cc1be592d30f09ea66a7c72b.

* Move cpu, cuda and other device code into kernels (#15)

* fill_any_like kernel refactor

* remove useless code of full_like c++ api

* Support Scalar in Tensor Compute Library

* add scalar in dygraph and static graph mode

* keep the basic type for attr, instead of using scalar for all

* merge the code

* start refactor matmul

* move cpu, cuda and other device modules into kernels

* merge code

* polish code in operator.cc

* Perfect unittests (#16)

* perfect unittest

* update license

* replace with flat_hash_map, small_vector (#19)

* fix small_vector build error on windows platform

* replace with flat_hash_map, small_vector

* remove todo

* Perfect unittests (#20)

* perfect unittest

* update license

* fix bug when run tcmpt_utils_test

* refactor execution adapting impl

* fix insert conflict

* Fix CI bug of test_yolov3 (#21)

* fill_any_like kernel refactor

* remove useless code of full_like c++ api

* Support Scalar in Tensor Compute Library

* add scalar in dygraph and static graph mode

* keep the basic type for attr, instead of using scalar for all

* merge the code

* start refactor matmul

* move cpu, cuda and other device modules into kernels

* merge code

* polish code in operator.cc

* Fix CI bug of test_yolov3

* add the tensor base class, test=develop (#17)

* update the tensor base class, test=develop

* remove two funcs, test=develop

* update the error msg, test=develop
Co-authored-by: Chen Weihang <chenweihang@baidu.com>

* [no-verify] commit backend and tensor signature changes

* Rename tcmpt to pten (#23)

* rename tcmpt to pten

* update omitted files for rename to pten

* update omitted file for rename to pten

* remove k of all enum var

* remove kernel_instantiate (#26)

* remove symbols and spatial_tensor

* change common to functions

* readd share tensor impl methods

* add a candidate dense tensor class, test=develop (#28)

* change all Pt to Pten

* resolve conflict with xiaowei

* Op2functor opt1 (#27)

* replace to small vector and change to const &

* add std::move
Co-authored-by: Chen Weihang <chenweihang@baidu.com>

* polish kernel factory and kernel registry

* fix operator test error msg mismatch

* remove tensor signature and backend set member

* move scalar and polish enforce

* revert dtype layout change to fix error

* fix enum operator override error

* add several base unittests

* add pten utils tests

* polish some details

* Dev/op2func refactor 3 (#30)

* add a candidate dense tensor class, test=develop

* remove TensorBase::backend(), test=develop

* remove some ops, test=develop

* cherry-pick the pr of tensor meta, test=develop

* moves the dense tensor and some ops, test=develop

* update the linalg operator, test=develop

* update other operators, test=develop

* fix errors, test=develop

* fix bugs, test=develop

* try to resolve the problem of windows ci, test=develop

* updates codes, test=develop

* fix the tensor_utils.cc, test=develop

* modify the dense tensor, test=develop

* fix the data type, test=develop
Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

* polish some details

* polish kernel signature details

* fix a bug about offsets of the tensor, test=develop (#31)
Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

* polish some details
Co-authored-by: chentianyu03 <ctychentianyu@gmail.com>
Co-authored-by: zyfncg <1370305206@qq.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: 石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>

22 Aug, 2020 1 commit

【API】Add sign and tanh api (#26357) · ed102ea1

Committed by WangXi on Aug 22, 2020

12 Apr, 2020 1 commit

update error info of ops, add some test cases for raise message (#23750) · ac4da77a

Committed by Steffy-zxf on Apr 12, 2020

1. update error info of the ops (abs, acos, asin, atan, ceil, cos, exp, floor, log, pow, reciprocal, round, rsqrt, sin, sqrt, square, tanh)
2. add the unittests of the above referred ops (test error info)

20 Dec, 2019 1 commit

Update the precision of some op tests from fp32 to fp64 (#21847) · f4379a91

Committed by juncaipeng on Dec 20, 2019

09 Dec, 2019 1 commit

Fix unit tests to avoid check_grad checking failures (#21554) · 548efcd2

Committed by Zhang Ting on Dec 09, 2019

* fix python API tests that do not need to inherit OpTest, test=develop

* fix fp16 cases that will only be enabled in GPU mode, test=develop

* remove TestSoftmaxFP16Op from test cases of softmax_mkldnn_op, test=develop

* fix tests so that the cases are only created in GPU mode, test=develop

11 Oct, 2019 1 commit

fix sign op input error check on float16 (#20472) · eb526e3f

Committed by wawltor on Oct 11, 2019

```
fix sign op input error check
test=develop
```

09 Oct, 2019 1 commit

Fix api, add input type and dtype check for sign_op (#20138) · 08c8f0c5

Committed by wawltor on Oct 09, 2019

* test=develop
Add input type and dtype check for sign_op.

* test=develop
Fix the api text format in sign op.

* test=develop
Fix the api examples in sign op and update the api.spec.

15 Aug, 2018 1 commit

Add print_function for all python files · 99d3f089

Committed by minqiyang on Aug 15, 2018

26 Jul, 2018 2 commits

Remove python3 relative import of unittest · 9fc13fde

Committed by minqiyang on Jul 26, 2018

Change iter_parameters back and port unittests code to Python3 · 35e6abd7

Committed by minqiyang on Jul 26, 2018

24 Feb, 2018 1 commit

move Fluid API code out of V2 API code · b11956a0

Committed by Luo Tao on Feb 24, 2018

13 Feb, 2018 1 commit

Run Python OP tests in a single Python process to improve test time. (#8362) · cde6241a

Committed by Xin Pan on Feb 13, 2018

Currently, our tests run with 2 GPUs, the init time is absurdly long:
about 4s for each process.  Currently, we run each OP test on
different processes. This PR:

1. create cmake function py_test_modules which will generate the
Makefile that runs a list of Python unittest modules in a single Python
process.

2. move all "python unittest compatible" files (i.e., those that use the
unittest package, not just regular python files) from fluid/tests to
fluid/tests/unittests.

3. cmake now will run all OP tests in fluid/tests/unittests in a
single process, except the time-consuming tests, they are separated
into different processes to utilize parallelism. Please make sure to
use the unittest package if you put the python test file in
fluid/tests/unittests

4. remove all exit(0) from fluid/tests/unittests/*.py, exit(0) is used
to disable unittest, we can not do it when running all tests in a
single process since it will terminate the process without running the
other tests. Instead, the test is disabled in
fluid/tests/unittests/CMakeLists.txt. FIXME is added for each disabled
item. Please disable the unittest from
fluid/tests/unittests/CMakeLists.txt, instead of adding exit(0) to the
Python file, for all Python files in fluid/tests/unittests/.

5. add an option WITH_FAST_BUNDLE_TEST. When OFF, will run the unit
tests in separate process so that they can be tested individually.
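
The single-process idea described in this commit message can be illustrated with a minimal Python sketch. It is not the cmake `py_test_modules` implementation from the PR; the helper names `run_modules_in_one_process` and `make_demo_module`, and the module names `demo_op_test_a` / `demo_op_test_b`, are hypothetical and exist only for illustration. The sketch assumes each test module follows the standard `unittest` conventions, which is the requirement item 3 states.

```python
import sys
import types
import unittest


def run_modules_in_one_process(module_names, verbosity=1):
    """Load each named unittest module and run all of them in this process."""
    loader = unittest.TestLoader()
    suite = unittest.TestSuite()
    for name in module_names:
        # loadTestsFromName imports the module and collects its TestCase classes.
        suite.addTest(loader.loadTestsFromName(name))
    return unittest.TextTestRunner(verbosity=verbosity).run(suite)


def make_demo_module(name):
    """Register a tiny in-memory test module so this sketch is self-contained."""

    class DemoTest(unittest.TestCase):
        def test_abs(self):
            self.assertEqual(abs(-1), 1)

    module = types.ModuleType(name)
    module.DemoTest = DemoTest
    sys.modules[name] = module
    return module


for demo_name in ("demo_op_test_a", "demo_op_test_b"):
    make_demo_module(demo_name)

# Both modules run inside the current interpreter; no subprocess is started,
# so any per-process initialization cost is paid only once.
result = run_modules_in_one_process(["demo_op_test_a", "demo_op_test_b"])
```

Because every module shares one interpreter, a module-level `exit(0)` in any test file would stop the whole run, which is why item 4 moves the disabling of tests into CMakeLists.txt.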

12 Feb, 2018 1 commit

Fix the grammar in copyright. (#8403) · 24509f4a

Committed by qingqing01 on Feb 12, 2018

21 Jan, 2018 1 commit

"fix decode bug" (#7711) · e983cc90

Committed by dzhwinter on Jan 21, 2018

* "fix decode bug"

* "follow comment"

* "fix error"

* "fix hook bug"

* fix based comment

* fix copyright

* fix based on comment

15 Jan, 2018 1 commit

Feature/hooks (#7513) · b9b75377

Committed by dzhwinter on Jan 15, 2018

* add copyright hook

* add copyright hook

* refine copyright hook

* "test copyright hook"

* fix check style

* fix ci

14 Nov, 2017 1 commit

Change framework to fluid (#5637) · 4adc8a7a

Committed by Qiao Longfei on Nov 14, 2017

```
* init commit

* change some dir name
```

28 Oct, 2017 1 commit

Adding the Sign Op for L1 Weight Decay Regularization (#5138) · 1a26f5a5

Committed by Abhinav Arora on Oct 27, 2017
  
15 Sep, 2017 1 commit

Add the check of inputs and outputs in all operators. · eef1ccbf

Committed by Liu Yiqun on Sep 15, 2017
  
11 Sep, 2017 1 commit

init refine op python tests · 2d807f2b

Committed by qijun on Sep 11, 2017
  
07 Sep, 2017 1 commit

Correct the definition of Operator in TestFCGradOp, and rename the output name · 734a9eea

Committed by Liu Yiqun on Sep 07, 2017

```
of identity to Y.
```

21 Aug, 2017 2 commits

Change IdentityOp to ScaleOp · d3f219aa

Committed by Yu Yang on Aug 21, 2017
  
Identity operator and its gradient · c108d610

Committed by Yu Yang on Aug 21, 2017
  
17 Aug, 2017 1 commit

Add MeanOp's Gradient Test And Fix Mean Op Gradient · 7f8c3f82

Committed by Yu Yang on Aug 17, 2017
  
04 Aug, 2017 1 commit

Refine unit test in op_test_util · c540aa04

Committed by dangqingqing on Aug 04, 2017
  
01 Aug, 2017 1 commit

Add mean op unit test in python · 1e676f68

Committed by liaogang on Aug 01, 2017
  
31 Jul, 2017 1 commit

reduce gpu memory allocation in op_test · cf5ac588

Committed by qijun on Jul 31, 2017
  
21 Jul, 2017 1 commit

add unittest for some basic OpKernels · 06acd6d0

Committed by qijun on Jul 21, 2017
  
