Commits · d70104e5358cfd788730aea89ea667a5c6f10b29 · 机器未来 / Paddle

20 Apr, 2022 1 commit

enable auto-tune when using cinn (#41795) · d70104e5

Committed by TeFeng Chen on Apr 20, 2022

* optimize preparation overhead before executing cinn compiled program

* update code notes

* fix flag annotation

* enable auto-tune when using CINN

* update cinn commit tag

* skip test

* fix lacking header file

d70104e5

19 Apr, 2022 1 commit

[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from... · 2cb19d8f

Committed by YuanRisheng on Apr 19, 2022

[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel (Part1) (#41806)

* separate add/div/sub/mul from elementwise

* delete code

* fix compile bugs

* deal with conflict

* fix bugs when compile

* fix windows unit test bug

* fix ci coverage bugs

2cb19d8f

18 Apr, 2022 1 commit

cinn_launch_op: optimize the overhead of preparing variables before executing... · 2d4fe163

Committed by TeFeng Chen on Apr 18, 2022

cinn_launch_op: optimize the overhead of preparing variables before executing cinn compiled program (#41777)

* optimize preparation overhead before executing cinn compiled program

* update code notes

* fix flag annotation

* add a flag of auto-tune feature beforehand

2d4fe163

11 Mar, 2022 1 commit

[Phi] Remove needless deps in unittests (#40256) · 89ed57e2

Committed by Chen Weihang on Mar 11, 2022

* remove needless deps in unittests

* add gpu macro

* fix other unittests

* fix kernel name error

* fix test_prepare_op

* fix failed dygraph unittests

* fix gpu failed tests

* fix cinn test failed

* fix cinn test failed

* fix dropout tests

89ed57e2

03 Mar, 2022 1 commit

cinn_launch_op: switch to execution by PE (#39911) · 167d511f

Committed by TeFeng Chen on Mar 03, 2022

* switch to PE execution in cinn launch

* fix outer variables erased

* skip the map bug temporarily for test

* temporary solution for batch_norm bug

* update comment

* fix compile error

* cinn_instruction_run_op_test: update code to skip external alloc/free instructions generated

167d511f

20 Feb, 2022 1 commit

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

Committed by Chen Weihang on Feb 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 Feb, 2022 1 commit

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

Committed by Aurelius84 on Feb 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile successfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix test file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

18 Feb, 2022 1 commit

cinn_instruction_run_op test (#39576) · fdc4fe3b

Committed by TeFeng Chen on Feb 18, 2022

* add cinn_instruction_run_op test code

* update several interfaces of CinnLaunchContext

* update several interfaces and add detail comments in CinnLaunchContext class

* to skip the bug of error message check

* fix unit tests that failed because a dependent interface was updated

fdc4fe3b

16 Feb, 2022 1 commit

[Pten]Remove reshape and elementwise_add's registry code in Fluid (#39317) · c6478270

Committed by YuanRisheng on Feb 16, 2022

* remove reshape and elementwise_add registry

* delete code

* fix bugs when run ci ut

* remove log

* fix bugs when run unit test

* fix bugs when run unit test

* fix bugs when run cinn

* fix bugs when run ci-mac-python3

* fix compile bugs

* fix compile bugs

* fix compile bugs

* fix bugs when run kunlun

* fix bugs when compile

* update code according to review comments

c6478270

18 Jan, 2022 1 commit

[Unify Tensors PR ] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3

Committed by Zhanlue Yang on Jan 18, 2022

* Merged LoDTensor with Tensor,test=allcases

* Patched python level LoDTensor

* Patched python level LoDTensor

* Merge Tensor into DenseTensor

* Fixed namespace issues,test=allcases

* Fixed merge issues

* Fixed inference issues

* Fixed NPU test issues

* Fixed merge issues

2052f1e3

09 Dec, 2021 1 commit

cache scope and place on CinnLaunchContext and pass them to callback (#37983) · 151c5d74

Committed by CtfGo on Dec 09, 2021

cinn_launch_op: cache scope and place on CinnLaunchContext to skip duplicate alloc/free callback construction

151c5d74

08 Dec, 2021 1 commit

add a subdirectory named cinn in operators and move related files into it (#37938) · 9cb637ed

Committed by CtfGo on Dec 08, 2021

1. add a subdirectory named `cinn` in the `paddle/fluid/operators` directory and move related files into it
2. separate the CinnLaunchContext class from `cinn_launch_op.h` and put it in a new independent file named `cinn_launch_context.h`, so that it can be included by others clearly.

9cb637ed

01 Dec, 2021 1 commit

Modify ShareTensorWithCinnBuffer by callback to save memory (#37493) · 661dbdbe

Committed by Huihuang Zheng on Dec 01, 2021

Modify ShareTensorWithCinnBuffer by callback to save memory

661dbdbe

19 Nov, 2021 1 commit

Optimize cinn_cache_key by replacing GraphToProgram with Dot string (#37317) · edc3496f

Committed by jiangcheng on Nov 19, 2021

* optimize cache-key by replacing GraphToProgram with Dot string

* fix compile failure bug

edc3496f

13 Nov, 2021 1 commit

cinn_launch_op: skip checking input variables must be used (#37119) · 228eb898

Committed by CtfGo on Nov 13, 2021

Modify several implementation details of CinnLaunchOp:
1. Skip checking that input variables must be used
2. Move the current helper functions to a CinnLaunchContext

228eb898

05 Nov, 2021 1 commit

cinn_launch_op: remove the check on extracting temporary variables (#36997) · 7ee727a8

Committed by CtfGo on Nov 05, 2021

cinn_launch_op: remove the check on extracting temporary variables

7ee727a8

03 Nov, 2021 1 commit

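improve CinnLaunchOpKernel implementation (#36936) · 0590277a

Committed by CtfGo on Nov 03, 2021

1. No functional change; simplify the CinnLaunchOpKernel implementation. Variable information is now obtained through the standard interface of the ExecutionContext parameter instead of directly from the Scope, which simplifies the logic and, accordingly, the helper functions. The former cinn_launch_op_helper was rather redundant, so unnecessary interfaces are removed and the remaining ones are moved into cinn_launch_op.cc.
2. Fix the check in CinnLaunchOp InferShape for whether outputs are specified: HasOutput -> HasOutputs
3. Add detailed comments and debug information to ease troubleshooting and code maintenance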

0590277a

01 Nov, 2021 1 commit

add cinn_launch_op for using CINN to optimize graph (#36600) · 0a963ee9

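Committed by CtfGo on Nov 01, 2021

Add CinnLaunchOp, which executes the result of compiling a CINN subgraph. Key points:
1. In the subgraph-partitioning BuildCinnPass, each subgraph is replaced in the original graph by a CinnLaunchOp, which calls CINN to compile and execute the subgraph.
2. The inputs/outputs of CinnLaunchOp are the subgraph's inputs and outputs. It also has a `compilation_key` attribute, used as the key to fetch the subgraph object and the compilation result from the global cache; BuildCinnPass sets this attribute when it creates the op.
3. CinnLaunchOp works as follows:
        - Fetch the subgraph object from the global cache
        - Fetch the subgraph's compilation result from the global cache, compiling just in time on a cache miss
        - Initialize the runtime data and allocate host/device memory according to the variable information (data type, shape) in the compilation result
        - Pack the runtime data as arguments and call CINN's executable object, the runtime program, to perform the computation
        - Sync the subgraph's results to the Paddle-side tensors through the argument pointers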
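
The five steps above can be sketched in a few lines of plain C++. This is only an illustration of the control flow (cache lookup by key, compile on a miss, allocate from the compiled variable info, pack arguments, run, copy out), not the code from this commit. Every name in it (`GlobalCache`, `Subgraph`, `CompiledProgram`, `Launch`) is a hypothetical stand-in rather than a Paddle or CINN API, and the "subgraph" is a toy that doubles its input.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <map>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Every type and function here is a hypothetical stand-in for illustration;
// none of them is a real Paddle or CINN API.
using Tensor = std::vector<float>;

// Toy subgraph with one input and one output; it computes output = 2 * input.
struct Subgraph {
  std::string input_name;
  std::string output_name;
};

// Toy compilation result: per-variable element counts plus an executable.
struct CompiledProgram {
  std::map<std::string, std::size_t> var_numel;
  std::function<void(const std::map<std::string, Tensor*>&)> run;
};

class GlobalCache {
 public:
  void RegisterGraph(const std::string& key, Subgraph graph) {
    graphs_[key] = std::move(graph);
  }

  const Subgraph& GetGraph(const std::string& key) const {
    return graphs_.at(key);
  }

  // Returns the cached program for `key`, compiling it on a cache miss.
  const CompiledProgram& GetOrCompile(const std::string& key,
                                      std::size_t numel) {
    auto it = programs_.find(key);
    if (it == programs_.end()) {
      ++compile_count_;
      it = programs_.emplace(key, Compile(graphs_.at(key), numel)).first;
    }
    return it->second;
  }

  int compile_count() const { return compile_count_; }

 private:
  static CompiledProgram Compile(const Subgraph& graph, std::size_t numel) {
    CompiledProgram program;
    program.var_numel[graph.input_name] = numel;
    program.var_numel[graph.output_name] = numel;
    const std::string in = graph.input_name;
    const std::string out = graph.output_name;
    program.run = [in, out](const std::map<std::string, Tensor*>& args) {
      const Tensor& x = *args.at(in);
      Tensor& y = *args.at(out);
      for (std::size_t i = 0; i < x.size(); ++i) y[i] = 2.0f * x[i];
    };
    return program;
  }

  std::unordered_map<std::string, Subgraph> graphs_;
  std::unordered_map<std::string, CompiledProgram> programs_;
  int compile_count_ = 0;
};

Tensor Launch(GlobalCache& cache, const std::string& compilation_key,
              const Tensor& input) {
  // Step 1: fetch the subgraph object by key.
  const Subgraph& graph = cache.GetGraph(compilation_key);
  // Step 2: fetch the compilation result, compiling on a cache miss.
  const CompiledProgram& program =
      cache.GetOrCompile(compilation_key, input.size());
  // Step 3: allocate runtime buffers from the compiled variable info.
  std::map<std::string, Tensor> buffers;
  for (const auto& var : program.var_numel) {
    buffers[var.first].assign(var.second, 0.0f);
  }
  assert(input.size() == program.var_numel.at(graph.input_name));
  buffers[graph.input_name] = input;
  // Step 4: pack buffer pointers as arguments and run the program.
  std::map<std::string, Tensor*> args;
  for (auto& buf : buffers) args[buf.first] = &buf.second;
  program.run(args);
  // Step 5: copy the result back to the caller's tensor.
  return buffers.at(graph.output_name);
}
```

Calling `Launch` twice with the same key compiles only once in this sketch; the second call reuses the cached program, which is the point of keying the cache on `compilation_key`.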

0a963ee9

机器未来 / Paddle is in sync with the fork source project
