提交 · d6d0820e1b6a53eae186f88a96fc98c577c545c6 · Crayon鑫 / Paddle

18 2月, 2022 2 次提交
- R
  
  [CustomRuntime] add pten::Backend support (#39606) · d6d0820e
  由 ronnywang 提交于 2月 18, 2022
  
  d6d0820e
- A
  [IPU] Update IpuStrategy (#39644) · 46161679
  由 Allen Guo 提交于 2月 18, 2022
```
* Update IpuStrategy

* fix ci

* rerun ci
```
  46161679
17 2月, 2022 5 次提交

L
[new-exec] refactor code of interpretercore gc (#39617) · c3135426
由 Leo Chen 提交于 2月 17, 2022
```
* relocate code of interpretercore gc
```
c3135426

[bugfix] to concat input squash (#39593) · f29da150

由 Sylwester Fraczek 提交于 2月 17, 2022

* fix and add more tests

* remove unwanted changes

* check only concat and elementwise

* move check to a function

* add todo comment

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.

f29da150

T
save the name lists of variables of a cinn subgraph as its attributes (#39622) · a1ad003c
由 TeFeng Chen 提交于 2月 17, 2022
```
* save the name lists of the input,internal and output variables of a subgraph as its attribute

* fix compile error
```
a1ad003c

adaptive pool2d pass fix (#39600) · c1c5c1fc

由 wenbin 提交于 2月 17, 2022

* first commit

* teller fix

* bug fix

* enable for pool2d only

* fix global_pooling issue

* pooling_type

* fix test

c1c5c1fc

B
update inference ut to support nhwc format (#39551) · b4d3597a
由 baoachun 提交于 2月 17, 2022
```
* update inference ut to support nhwc format

* update ut and pass OpCompat

* update ut

* update ut
```
b4d3597a

16 2月, 2022 4 次提交

[bf16] pten matmul cuda kernel support bf16 (#39485) · d5a0d31a

由 Leo Chen 提交于 2月 16, 2022

* pten matmul cuda kernel support bf16

* fix pten kernel name

* add matmul_grad bf16 kernel

* add emptylike bf16 kernel

* fix compile

* suppport rocm

* fix error

* fix rocm

* add bf16 header file

* fix compile

d5a0d31a

[PTen] Add attr support for infershape utils (#39513) · 6eb95caf

由 Chen Weihang 提交于 2月 16, 2022

* add attr support for infershape

* add unittest for coverage

* add unittest for coverage

* polish unittest detail

* fix windows test failed

6eb95caf

A
[Pten]Modify framework::VisitDataType into Pten::VisitDataType (#39550) · 6b756fb7
由 Aurelius84 提交于 2月 16, 2022
```
* Modify framework::VisitDataType into Pten::VisitDataType

* migrate unittest
```
6b756fb7

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

由 YuanRisheng 提交于 2月 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according comment

c6478270

15 2月, 2022 5 次提交

[Paddle-Inference] support preln_ernie: add... · 2bc91cc5

由 Wangzheee 提交于 2月 15, 2022

[Paddle-Inference] support preln_ernie: add preln_embedding_eltwise_layernorm_fuse_pass, preln_skip_layernorm_fuse_pass (#39508)

* support preln_ernie

* support preln_ernie

2bc91cc5

[PluggableDevice] Add custom runtime support (#38740) · 3e7825f3

由 ronnywang 提交于 2月 15, 2022

* [CustomRuntime] Add DeviceManager

* [CustomRuntime] Add DeviceInterface

* [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager

* [CustomRuntime] Add plug-in device

* [CustomRuntime] Memory module support PluggableDevice

* [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option

* update

* [API] update API doc based on comments, test=develop
Co-authored-by: Nqili93 <qili93@qq.com>

3e7825f3

Add cinn_instruction_run_op for launching execution of a cinn instruction (#39435) · 9d0baeab

由 TeFeng Chen 提交于 2月 15, 2022

* add cinn_instruction_run_op for launching execution of a cinn instruction

* fix multi definition compilation error

* update cmake

* fix bug at infershape

* fix compile error due to lacking header file

9d0baeab

move histogram to pten (#39496) · 556f6eb0

由 hong 提交于 2月 15, 2022

* move histogram to pten; test=develop

* fix format error; test=develop

* fix histogram kernel format; test=develop

556f6eb0

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

14 2月, 2022 5 次提交

[Bug fix] prevent squashing pair u8 dequantize -> s8 quantize (#39346) · 66b5348e

由 Sylwester Fraczek 提交于 2月 14, 2022

* prevent squashing pair u8 dequantize -> s8 quantize

* add relu op to check for uint8

* fix ptq fc attr name fuse_activation->activation_type

* fix

* add unit test

* remove unused variable

* test fix unsuccessful

* fix test and logic

* multiline comment

* remove cout

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.

* fix ptq fc attr name fuse_activation->activation_type

66b5348e

W
context add generator (#39475) · 463e31f4
由 Wilber 提交于 2月 14, 2022
```
* context add generator

* update
```
463e31f4

[NewExe] Ignore eof Exception(#39487) · 2f642159

由 liutiexing 提交于 2月 14, 2022

* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* add log for Executor

* Avoid thread reconsruction when EOF
Co-authored-by: Nliutiexing <liutiexing@google.com>

2f642159

C
[PTen] Add HasAttr for ArgumentMappingContext (#39464) · ddb1e23f
由 Chen Weihang 提交于 2月 14, 2022
```
* add has_attr for arg map context

* skip useless attr now

* skip attr if not exists

* fix typo
```
ddb1e23f

[pten] add split kernel (#39060) · d0df5632

由 chentianyu03 提交于 2月 14, 2022

* add split kernel

* add split kernel signature

* fix split bug

* modify MakePtenScalarArrayFromVarList

* modify MakePtenScalarArrayFromVarList

* fix split windows register error

* add test case for split kernel

* replace raw split kernel with pten kernel

* fix makeScalar/ScalarArray bug

* remove debug log

* remove int64_t type in buildPtcontext

* update by code review

* fix split dev test failed

* change DenseTensorMeta to MetaTensor

* change split api code from auto gen to manual

* split cuda kernel support bfloat16 type

* fix conflict

* rm raw split kernel

* merge develop branch

* change to pten::errors

d0df5632

11 2月, 2022 6 次提交

F
[Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
d25a7f9e

[PTen] Remove pten core's dependency on fluid xxx_info.h (#39401) · d763a91a

由 Chen Weihang 提交于 2月 11, 2022

* ermove xxx_info include

* fix namespace error

* resolve conflict

* skip xpu context in registry

* fix macro error

* resolve conflict

* resolve conflict

* revert xpu convert

* remove trans to fluid place

* remove useless headers

d763a91a

[PTen] Move grad GetExpectedPtenKernelArgs into pten (#39418) · 667bd962

由 Chen Weihang 提交于 2月 11, 2022

* move grad get expected pten kernel args

* fix reduce sum error

* fix element_sub_grad failed

* revert kernel judge change

667bd962

[Paddle Inference] support ernie quant model with interleaved (#39424) · 1c44d3e2

由 Wangzheee 提交于 2月 11, 2022

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

* support ernie quant model with interleaved

1c44d3e2

Add log for executor (#39459) · 7e52beae

由 liutiexing 提交于 2月 11, 2022

* add align for WorkQueue

* add spinlock

* merge develop

* merge

* Add EventsWaiter

* Revert "Add EventsWaiter"

This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2.

* add log for Executor
Co-authored-by: Nliutiexing <liutiexing@google.com>

7e52beae

L

[new-exec] set type of op-kernel op by place (#39458) · 7392578d
由 Leo Chen 提交于 2月 11, 2022

7392578d

10 2月, 2022 2 次提交

share MemOptVarInfos of external variables into cinn_launch subgraph (#39209) · 35b03e1c

由 TeFeng Chen 提交于 2月 10, 2022

* add a graph pass to share MemOptVarInfos of external variables into subgraph

* update pass name

* fix compile failed

* add share_mem_opt_info_to_subgraph_pass test

* share_mem_opt_info_to_subgraph_pass_test pass

* modify some codes for better style and more robust

* update cmake

35b03e1c

A

[PluggableDevice] custom kernel supports multi cpp_dtype registering (#39385) · 63d2333e
由 Aganlengzi 提交于 2月 10, 2022

63d2333e

09 2月, 2022 6 次提交
- W
  [Paddle-Inference] rebuild matmul pass: trt and gpu_cpu (#39369) · db7d129e
  由 Wangzheee 提交于 2月 09, 2022
```
* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu

* rebuild matmul pass: trt and gpu_cpu
```
  db7d129e
- [MLU] add mlu kernel for c_comm_init op (#39364) · 1bd7a143
  由 mhhhh1 提交于 2月 09, 2022
  
  1bd7a143
- C
  [CustomOp] Fix slice bug of custom op (#39393) · 91b074a2
  由 Chen Weihang 提交于 2月 09, 2022
```
* fix slice bug of cusstom op

* add offset in check
```
  91b074a2
- C
  
  move stream into pten (#39392) · 266955a9
  由 Chen Weihang 提交于 2月 09, 2022
  
  266955a9
- H
  update basic infrastructure (#39383) · b12e7a17
  由 hong 提交于 2月 09, 2022
```
* update basic infrastructure; support string,  suport vecotr<int>, add tensor args type index; test=develop

* remove useless code; test=develop

* fix bug; test=develop

* polish code; test=develop
```
  b12e7a17
- T
  
  Fix operator== for float16 (#39400) · e606b44a
  由 Tomasz Socha 提交于 2月 09, 2022
  
  e606b44a
08 2月, 2022 5 次提交

S
Make Embedding layer support more int ids type (#39381) · 60f1461a
由 sneaxiy 提交于 2月 08, 2022
```
* add more int id type support for embedding

* add ut

* add more ut

* fix ci error
```
60f1461a

Add FuseOptimizerPass and test_dist_fuse_adam_pass unittest. (#39208) · ccdcfa2d

由 hlygit66666 提交于 2月 08, 2022

* add fuse_relu_depthwise_conv_pass unittest

* fix atol and rtol

* fix according to review

* Add FuseOptimizerPass and fuse_adam_pass unittest

* add sgd and momentum unittest

* add fuse_optimizer_pass

* close amp

* close amp

* update

* fix run on two cards

* Update test_dist_fuse_adam_pass.py

* Update test_dist_fuse_momentum_pass.py

* Update test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Create test_dist_fuse_sgd_pass.py

* Update test_dist_fuse_adam_pass.py

* Update test_dist_fuse_momentum_pass.py

* Update test_dist_fuse_sgd_pass.py

ccdcfa2d

J
[Bug fix] Fixed handling of one of the cases in the quantization process (#39342) · e4d475ea
由 joanna.wozna.intel 提交于 2月 08, 2022
```
* Fix quantization next op findings

* Corrections according to the review
```
e4d475ea

Fix to #38126 (#39097) · f884edb9

由 Jacek Czaja 提交于 2月 08, 2022

* - 38126 potential fix

* - fix

* - build fix

* - another candidate fix

* - compilation fix

* - another fix

* - Fix to activation of NHWC being first oneDNN op in chain on oneDNN ops

* - compilation fix

* - added NHWC reotating for elementwise being first op

* - compilation fix

* - compilation fix

* - Added UT

* - cosmetic fixes

f884edb9

Update op support gpu impl (#39386) · ba882657

由 hong 提交于 2月 08, 2022

* find gpu kernel in pten factory; test=develop

* check in functional kernel first; test=develop

ba882657

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致