Commits · bf30503335c2c8015dd20f991ef4480af9b5898d · BaiXuePrincess / Paddle

11 Feb, 2022 · 1 commit
- Support different dtypes of inputs for elementwise ops (#38859) · bf305033
  Committed by Zhang Ting on Feb 11, 2022
```
* improve backward performance

* support different dtypes for elementwise ops
```
08 Feb, 2022 · 1 commit
- Rename partial function name TensorReduceFunctorImpl to TensorReduceImpl. (#39388) · f71241b9
  Committed by Yiqun Liu on Feb 08, 2022
06 Feb, 2022 · 1 commit
- [PTEN] Add Gpu context (#39305) · a821c4a9
  Committed by Wilber on Feb 06, 2022
26 Jan, 2022 · 1 commit

[pten] remove deprecated fluid op kernel for pten (#38842) · 3ab9aef1

Committed by Leo Chen on Jan 26, 2022

* update cmake file to remove fluid kernel

* add pten declaration.h to where pybind.h used

* fix sync_bn and tensorrt_engine

* refine detection_library

* fix interpreter_core

* support eager legacy

* fit eager legacy for pten

* fall back to cpu if not found kernel

* fix compile problem

* fix compile problem

* refine fallback logic

* fit operator.run()

* fix xpu compile

* fit for new_exec

* add REGISTER_OP_WITHOUT_GRADIENT

* un-cache pt_kernel_context

* fix compile

* fix cudnn

* fix compiling with on_infer

* fix mkldnn

* fix isfinite_v2

* fix xpu problem

* fix op_device

* refine fallback for xpu

* fix xpu compile

* merge develop

* refine code format

* fix compile

* fix compile

* add data_transfer

* fix PreparePtenData

* fix cpu context

* merge develop

* fix compile

* fix error device context

* fix xpu

* fix dev_ctx

24 Jan, 2022 · 1 commit
- unify compare functor (#39024) · def81b4f
  Committed by Zhang Ting on Jan 24, 2022
20 Jan, 2022 · 1 commit
- [Pten] Migrate bfloat16/float16/complex from paddle::platform into pten::common (#39044) · f1143f0c
  Committed by Aurelius84 on Jan 20, 2022
```
* Migrate bfloat16/float16/complex from platform into pten::common

* fix typo

* fix code style
```
18 Jan, 2022 · 2 commits

[Unify Tensors PR #8] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

Committed by Zhanlue Yang on Jan 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

Unify the functor of elementwise and logical ops. (#35767) · b1365d25

Committed by Yiqun Liu on Jan 18, 2022

17 Jan, 2022 · 1 commit

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

Committed by Wilber on Jan 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

15 Jan, 2022 · 1 commit

[Unify Tensors PR #7] Merged LoDTensor with Tensor, test=allcases (#38880) · 88966b28

Committed by Zhanlue Yang on Jan 15, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Fixed example code failure

* Polished function names, removed duplicated forward declarations

12 Jan, 2022 · 1 commit
- [part 3]change type of function args (#38887) · 0efcae86
  Committed by Zhang Ting on Jan 12, 2022
```
* code clean

* [part 3]change type of function args
```
17 Dec, 2021 · 1 commit
- Delete cub_reduce.h and modified the TensorReduce to TensorReduceFunctorImpl (#38197) · 9a8a4c77
  Committed by niuliling123 on Dec 17, 2021
03 Dec, 2021 · 1 commit
- refine structure for cuda and rocm (#37202) · a6d2fddb
  Committed by ronnywang on Dec 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
27 Nov, 2021 · 1 commit

[NPU] reorganization for device API abstraction (#37110) · 72241a6a

Committed by Aganlengzi on Nov 27, 2021

* [NPU] reorganization for device API abstraction

* [NPU] delete old files

* [NPU] fix npu_collective_helper

* [NPU] fix collective_helper

* [NPU] fix ut

* [NPU] mod memory allocation and hccl_helper

* [NPU] fix place_type

* [NPU] split enfoce.h

* move acl* call into npu_info

* merge conflict

* fix merge

* merge conflict

* merge conflict

24 Nov, 2021 · 1 commit
- Fix lod in fetch_v2 (#37514) · acbf9974
  Committed by Aurelius84 on Nov 24, 2021
22 Nov, 2021 · 1 commit
- [new feature] add local scope for interpretercore (#37379) · 1f0512be
  Committed by Leo Chen on Nov 22, 2021
02 Nov, 2021 · 1 commit
- fix some bug, test=develop (#36888) · b0941102
  Committed by wanghuancoder on Nov 02, 2021
29 Oct, 2021 · 1 commit
- fix some bug in new executor (#36822) · b5af9575
  Committed by wanghuancoder on Oct 29, 2021
```
* fix some bug in new executor, test=develop

* fix error message, test=develop
```
25 Oct, 2021 · 1 commit

add some ops to train ssd on kunlun (#36407) · 50778ad6

Committed by TTerror on Oct 25, 2021

* add some ops to train ssd on kunlun

* add some ops to train ssd on kunlun

* add some ops to train ssd on kunlun

* update cast op unittest

* update cast op unittest

* update cast op unittest

* update xpu cmake

* update cast unittest

20 Oct, 2021 · 1 commit

Add FasterTokenizer Operator (#34491) · 3f2d6a3f

Committed by Steffy-zxf on Oct 20, 2021

Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.

* support the text string as an input Tensor
* support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
* Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
* It first applies basic tokenization, followed by wordpiece tokenization.
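
The two-stage pipeline described above (basic tokenization, then wordpiece tokenization against a vocabulary map) can be sketched in a few lines of Python. This is only an illustration of the general BERT-style greedy longest-match technique, not the FasterTokenizer operator's actual code. The function names, the `##` continuation prefix, the `[UNK]` token, and `DEMO_VOCAB` are assumptions made for this example.

```python
import unicodedata


def basic_tokenize(text):
    """Lowercase, split on whitespace, and emit each punctuation mark as its own token."""
    tokens, current = [], []
    for ch in text.lower():
        if ch.isspace():
            if current:
                tokens.append("".join(current))
                current = []
        elif unicodedata.category(ch).startswith("P"):
            if current:
                tokens.append("".join(current))
                current = []
            tokens.append(ch)
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    """Greedy longest-match-first split of one word into vocabulary pieces."""
    pieces, start = [], 0
    while start < len(word):
        end = len(word)
        match = None
        while start < end:
            piece = word[start:end]
            if start > 0:
                piece = "##" + piece  # assumed marker for a non-initial piece
            if piece in vocab:
                match = piece
                break
            end -= 1
        if match is None:
            return [unk]  # no prefix matched: the whole word is unknown
        pieces.append(match)
        start = end
    return pieces


def tokenize(text, vocab):
    """Text string -> list of token ids, looked up in a str -> int vocabulary."""
    return [
        vocab[piece]
        for word in basic_tokenize(text)
        for piece in wordpiece_tokenize(word, vocab)
    ]


# Hypothetical toy vocabulary, for demonstration only.
DEMO_VOCAB = {"[UNK]": 0, "un": 1, "##aff": 2, "##able": 3, "!": 4, "hello": 5}

if __name__ == "__main__":
    print(tokenize("Unaffable hello!", DEMO_VOCAB))  # [1, 2, 3, 5, 4]
```

Running the same function in both training and prediction is what keeps the two phases consistent, which is the motivation the commit message gives for providing the tokenizer as an operator.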

14 Sep, 2021 · 1 commit
- Implement FunctionTraits to support two kinds of elementwise functor and... · 12bf0502
  Committed by Yiqun Liu on Sep 14, 2021
```
Implement FunctionTraits to support two kinds of elementwise functor and remove some old codes for broadcast. (#35688)
```
13 Sep, 2021 · 2 commits
- Revert "Implement FunctionTraits to support two kinds of elementwise functor... · 40d4a295
  Committed by Yiqun Liu on Sep 13, 2021
```
Revert "Implement FunctionTraits to support two kinds of elementwise functor and remove some old codes for broadcast. (#35487)" (#35686)
```
- Implement FunctionTraits to support two kinds of elementwise functor and... · d4f84d46
  Committed by Yiqun Liu on Sep 13, 2021
```
Implement FunctionTraits to support two kinds of elementwise functor and remove some old codes for broadcast. (#35487)
```
08 Sep, 2021 · 1 commit
- mark WhileOp AsExtra attribute (#35499) · ce7c18f6
  Committed by CtfGo on Sep 08, 2021
```
* mark WhileOp AsExtra attribute

* revert kX and kOutputs
```
01 Sep, 2021 · 1 commit
- modify fetch logic, use D2H Stream (#35191) · c56d6978
  Committed by wanghuancoder on Sep 01, 2021
```
* modify fetch logic, use D2H Stream, test=develop

* refine, test=develop
```
31 Aug, 2021 · 1 commit
- Add AsExtra() for conditional_block_op.h (#35268) · 2100816c
  Committed by Huihuang Zheng on Aug 31, 2021
```
As the title, see details at the PR description.
```
24 Aug, 2021 · 1 commit

add fetch, test=develop (#35019) · a5060b55

Committed by wanghuancoder on Aug 24, 2021

* add fetch, test=develop

* fix fetch2op, test=develop

* fix fetch2op, test=develop

* refine, test=develop

* fix fetch ctx, test=develop

* add wait, test=develop

* rename fetch2 to fetch_v2, test=develop

* merge, test=develop

11 Aug, 2021 · 1 commit
- [NPU] add while, read_from_array and write_to_array npu op (#34755) · 234c21ac
  Committed by pangyoki on Aug 11, 2021
```
* add while read_from_array write_to_array npu op

* optimize unittest
```
05 Aug, 2021 · 1 commit

add not_equal NPU op (#34560) · 7e707ce8

Committed by baoachun on Aug 05, 2021

* add not_equal NPU op

* add not_equal NPU op

* add not_equal NPU op

* add not_equal NPU op

28 Jul, 2021 · 1 commit

[NPU] add NPU ops of compare, test=develop (#34365) · 68b4a2c3

Committed by Aganlengzi on Jul 28, 2021

* [NPU] add NPU ops&uts of compare, test=develop

* testing

* try style-format

* [NPU] update compare_op_npu uts

* [NPU] fix code sytle of test_compare_op_npu.py

26 Jul, 2021 · 1 commit
- [NPU] fix logcial op on NPU, test=develop (#34371) · d3d174f7
  Committed by Qi Li on Jul 26, 2021
23 Jul, 2021 · 1 commit

Logical Ops support more data types (#34141) · 27417f1f

Committed by will-jl944 on Jul 23, 2021

* logical ops support int8, int16, int32, int64, float, double

* update docs of logical ops

* fix npu and xpu logical ops

* fix npu and xpu logical ops

* fix bug in xpu logical op code

* update test_logical_op_npu and test_logical_op_xpu

* correct error type

15 Jul, 2021 · 2 commits
- [NPU] add ops of bce_loss logical_and logical_or, test=develop (#34159) · 9e18114f
  Committed by Qi Li on Jul 15, 2021
- Upgrade Executor into ParallelExcutor to apply Graph Optimization in @to_static (#32283) · 2850391d
  Committed by Aurelius84 on Jul 15, 2021
```
* Refine Constructor logic of ParallelExecutor

* Replace executor into ParallelExecutor in run_program_op
```
05 Jul, 2021 · 1 commit
- Replace usage of elementwise cuda forward kernel in Compare_all_op (#33754) · ea1a0d45
  Committed by limingshu on Jul 05, 2021
29 Jun, 2021 · 1 commit
- [NPU] remove duplicated stream sync in fetch op (#33819) · 0d3de8d0
  Committed by Leo Chen on Jun 29, 2021
16 Jun, 2021 · 1 commit
- Add bitwise_and/or/xor/not OP/API and unittest (#33524) · ecc05377
  Committed by Zhou Wei on Jun 16, 2021
15 Jun, 2021 · 1 commit
- add the support for the bool in compare ops · 1f8de080
  Committed by wawltor on Jun 15, 2021
```
add the support for the bool in compare ops
```
04 Jun, 2021 · 2 commits
- fix inference prepare data bug (#33305) · dd181238
  Committed by wenbin on Jun 04, 2021
```
* fix inference prepare data bug

* rename functions

* typo

* typo

* typo

* UT correct

* correct condition

* correct condition

* ci coverage

* morelines

* fix ci coverage
```
- Reimplement logical functors with the new optimized elementwise function (#33089) · 941308c2
  Committed by limingshu on Jun 04, 2021

BaiXuePrincess / Paddle is in sync with the fork's source project.