Commits · 5f5f626b22324329c3d2d6b3415cc1a4b3d53b11 · TonyTonyFun / Paddle

12 Jan, 2022 1 commit
- [PTen] Remove hybird dir (#38863) · 5f5f626b
  Committed by Chen Weihang on Jan 12, 2022
```
* remove hybird dir

* resolve conflit
```
20 Dec, 2021 1 commit
- [MLU]add mlu backend (#38207) · 76514a1f
  Committed by fwenguang on Dec 20, 2021
09 Dec, 2021 2 commits
- add ipu device p2 (#37840) · cb636a48
  Committed by jianghaicheng on Dec 09, 2021
- adjust main dir (#37916) · 1911b6f0
  Committed by Chen Weihang on Dec 08, 2021
25 Nov, 2021 1 commit

Added GradTensorHolder to Eager Dygraph (#37458) · bc9f9f43

Committed by Zhanlue Yang on Nov 25, 2021

* Added GradTensorHolder to Eager Dygraph

* Added accumulation codes to Eager Dygraph

* Fix windows-ci issue

* Fix NPU-CI issue

* Fixed CI-Coverage issue

13 Sep, 2021 1 commit
- Support int16_t in fill_constant_op (#35619) · 4b6f8099
  Committed by Zhang Zheng on Sep 13, 2021
01 Jun, 2021 1 commit

replace and remove complex64/128 types in custom OP and other files (#33195) · 06c63ca0

Committed by chentianyu03 on Jun 01, 2021

* replace and remove complex64/128 types in custom OP and other files

* fix custom_tensor_test fail bug

* fix custom_conj_test fail bug

* fix dispatch_test_op build fail bug

20 May, 2021 1 commit

Add complex template type (#32857) · 738bf20e

Committed by chentianyu03 on May 20, 2021

* add complex template file

* add numtraits for complex template

* add complex template type register

* modify specify template of complex

* modify specify template of complex

* modify specify template of complex

* modify specify template of complex

* make TensorCheckerVisitor support complex type

* fix operator= error

* add complex template

* add complex template type

* add complex template type to pyarray transform

* add complex template type to pyarray transform

* remove complex type for dlpack register

* set dlpack supprot complex type

* set dlpack supprot complex type

* set dlpack supprot complex type

* remove explict for complex constructor

* add complex unit test file

12 May, 2021 1 commit
- [NPU] Support npu pinned allocator and manage Tensor on NPUPinnedPlace (#32840) · 6b3bb796
  Committed by liym27 on May 12, 2021
19 Apr, 2021 1 commit
- Add BF16 Constant Initializer and support for other initializer (#31935) · 76cb83e8
  Committed by joanna.wozna.intel on Apr 19, 2021
09 Apr, 2021 1 commit

[NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d

Committed by Leo Chen on Apr 09, 2021

* [feature] support npu allocator (#30840)

[feature] support npu allocator

* [feature] support npu operator (#30951)

[feature] support npu operator

* [feature] support npu allocator, part 2 (#30972)

* support npu allocator

* add npu device context

* fix some compile problem

* fix some compile problem

* add npu info

* compile ok

* fix include dir

* support naive_best_fit_allocator

* run ut ok, bug failed to exit

* call aclrtResetDevice before exit

* fix aclFinilize

* add system allocatot test

* add selected_gpus in gtest

* add tensor_test for npu

* support npu op, initial commit

* add npu stream

* add elementwise_add_op

* compile ok

* fix typo

* fix elementwise_add_op_npu_test

* support op run

* test can run but failed

* change aclopExecuteV2 to aclopCompileAndExecute

* support parsing ascend rank table file (#31000)

support parsing ascend rank table file

* Fix reshape on GE graph. (#31084)

Fix reshape on GE graph

* add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)

* add npu sub op

* fix typo

* rename test

* fix bug

* fix bug

* add fp16 kernel

* fix typo

* support sub grad op

* support elementwise_sub_grad op
Co-authored-by: frankwhzhang <frankwhzhang@126.com>

* Fix compilation problem (#31100)

Fix compilation problem (#31100)

* fix compile

* fix code stype

* remove const_cast

* support adding correct npu op in pybind.h (#31143)

* support adding correct npu op in pybind.h

* refine code

* [NPU] Support executor with NPU (#31057)

* [NPU] Support executor with NPU

* Fix code according to reviews

* Fix code

* Add unittest for sub op npu

* refactor npu device manager (#31154)

refactor npu device manager (#31154)

* fix selected npus

* fix compile

* fix reading flags from env

* format
Co-authored-by: xiayanming <41795079@qq.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: frankwhzhang <frankwhzhang@126.com>
Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>

01 Apr, 2021 1 commit
- Support uint8_t for fill_constant_op (#31911) · 980227f9
  Committed by Zhang Zheng on Apr 01, 2021
02 Mar, 2021 1 commit

[ROCM] update fluid operators for rocm (part5), test=develop (#31258) · 65bcaeb0

Committed by Qi Li on Mar 02, 2021

* [ROCM] update fluid operators for rocm (part5), test=develop

* address review comments, test=develop

* fix typo, test=develop

15 Dec, 2020 1 commit

Add complex dtype op (add) test example (#29603) · f02aece1

Committed by Chen Weihang on Dec 15, 2020

* add op test case for complex

* polish code details

* add xpu set constant support

* fix argument rror

* remove useless pyc file

01 Dec, 2020 1 commit

add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199) · 8f45d142

Committed by chentianyu03 on Dec 01, 2020

* add complex64 and complex128 type; add +-*/@ and slice opreator for complex types

* add test cases for complex elementwise, matmul and getitem unittest

* add test cases for complex types

* add test cases for complex matmul unittest

25 Nov, 2020 1 commit
- remove eigen threadpool for the speed up · b2c8a007
  Committed by wawltor on Nov 25, 2020
```
remove eigen threadpool for the speed up
```
14 Oct, 2020 1 commit
- xpu support for fill_constant Op (#27675) · c5fcc96d
  Committed by wangchaochaohu on Oct 14, 2020
17 Sep, 2020 1 commit
- enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  Committed by Jack Zhou on Sep 17, 2020
```
enhance reduce op which can reduce tensor with arbitrary rank
```
14 Sep, 2020 1 commit
- Error description optimize for math dir · 9437ce36
  Committed by Jack Zhou on Sep 14, 2020
```
Error description optimize for math dir
```
03 Sep, 2020 1 commit
- Add bfloat16 data type (#25402) · 95e1434b
  Committed by joanna.wozna.intel on Sep 03, 2020
21 Aug, 2020 1 commit

support Baidu Kunlun AI Accelerator (#25959) · 138ecf24

Committed by QingshuChen on Aug 21, 2020

* support Baidu AI Accelerator
  * test=kunlun

* minor
 * test=kunlun

* support xpu op in separate file
 * test=kunlun

* update XPU error message and remove duplicated code

 * test=kunlun

* minor
 * test=kunlun

* minor
 * test=kunlun

03 Jun, 2020 1 commit

Support gradient accumulation of fp16 in imperative mode (#24823) · b67ded04

Committed by Leo Chen on Jun 03, 2020

* support gradient accumulation of fp16 in imperative mode, test=develop

* enhance coverage test, test=develop

* follow comments, test=develop

12 Dec, 2018 1 commit
- Change tensor uses proto::VarType::type · 9bd70a1e
  Committed by Yu Yang on Dec 11, 2018
```
test=develop
```
30 Sep, 2018 1 commit

"fix compile error" (#13579) · 26771f41

Committed by dzhwinter on Sep 30, 2018

* "fix compile error"

* "fix ci"

* rerun ci
test=develop

* test=develop

rerun ci

03 Sep, 2018 1 commit
- squash commit · 379b471e
  Committed by dzhwinter on Sep 03, 2018
31 Aug, 2018 1 commit
- Feature/template (#13093) · ab1097cd
  Committed by dzhwinter on Aug 31, 2018
```
* remove template operator

* "fix compile"

* "fix ci"

* "fix ci"
```
27 Aug, 2018 1 commit
- Support data type int8_t . (#12841) · 1f09bc32
  Committed by qingqing01 on Aug 27, 2018
```
* Support int8 type.
```
20 Jun, 2018 1 commit
- fix a compile error · 12619fcf
  Committed by fengjiayi on Jun 20, 2018
16 May, 2018 1 commit
- Make tensor support uint8 · fd2b4b47
  Committed by yuyang18 on May 16, 2018
04 May, 2018 1 commit
- Clean and extract blas · ef6ea790
  Committed by Yu Yang on May 04, 2018
03 May, 2018 1 commit
- Clean MatMul · 815d8884
  Committed by Yu Yang on May 03, 2018
28 Apr, 2018 1 commit
- Refactor GEMM in blas · c888e016
  Committed by Yu Yang on Apr 28, 2018
25 Apr, 2018 2 commits
- Fix compile when there is no mkl · 580dad0c
  Committed by Yu Yang on Apr 25, 2018
- Fix batch_gemm bugs · 2a06e307
  Committed by Yu Yang on Apr 25, 2018
```
stride should be int64_t, not int
```
27 Mar, 2018 1 commit
- Add CUDAPinnedPlace · ab601c19
  Committed by chengduoZH on Mar 26, 2018
17 Mar, 2018 1 commit
- initial commit · 39c676e2
  Committed by Kexin Zhao on Mar 16, 2018
16 Mar, 2018 1 commit
- Finish adaption for backward. · bf3f56e8
  Committed by yangyaming on Mar 15, 2018
09 Mar, 2018 1 commit

Add float16 GEMM math function on GPU (#8695) · 90215b78

Committed by kexinzhao on Mar 08, 2018

* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* initial commit

* fix error

* small fix

* add more gemm fp16 tests

* fix error

* add utility function

07 Mar, 2018 1 commit

Integrate float16 into data_type_transform (#8619) · 266ccaa8

Committed by kexinzhao on Mar 06, 2018

* test cpu float16 data transform

* add isnan etc

* small fix

* fix containsNAN test error

* add data_type transform GPU test

* add float16 GPU example

* fix error

* fix GPU test error

* add context wait

12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
