提交 · aaa71ea43cdef2ba1297cbe8f6b10b1ef651dc5e · PaddlePaddle / Paddle

18 3月, 2022 2 次提交
- W
  support register with attr (#40564) · 755a6c53
  由 Wilber 提交于 3月 18, 2022
```
* support register with attr

* add infrt_with_gpu macor
```
  755a6c53
- 王
  [infrt] rename pd dialect from mlir to infrt. (#40651) · ef4ef154
  由王明冬提交于 3月 18, 2022
```
* [infrt] rename pd dialect from mlir to infrt. test=develop

* [infrt] fix the kernel signature generator bug.
```
  ef4ef154
17 3月, 2022 2 次提交
- 王
  
  [infrt] move pd_ops.td to pd floder. test=develop (#40613) · 4c01763c
  由王明冬提交于 3月 17, 2022
  
  4c01763c
- 王
  
  [infrt] move pd dialect position. test=develop (#40616) · 3a256637
  由王明冬提交于 3月 17, 2022
  
  3a256637
15 3月, 2022 2 次提交

Skip infrt when checking log fatal (#40529) · c9f3ad03

由 Chen Weihang 提交于 3月 15, 2022

* skip infrt when checking log fatal, test=document_fix

* remove test=document_fix

* update commit

c9f3ad03

[Phi]Move Tanh/BRelu/LeakyRelu/ThresholdedRelu Kernels to Phi (#40385) · d7112180

由 YuanRisheng 提交于 3月 15, 2022

* move activation op

* adjust code format

* fix compile bugs

* fix ci bugs

* code format adjust

* code format adjust2

* activate ci status

* modify according to comment

* move activation kernel

* revert relu6

* reduce add code

* perfect use_phi_functor

* completing func name

* fix bugs when run ci

* fix bugs when run infr

* modifpy infrt get kernel signature

d7112180

14 3月, 2022 1 次提交
- H
  
  [infrt] add skip list (#40450) · 95a526b2
  由 huzhiqiang 提交于 3月 13, 2022
  
  95a526b2
10 3月, 2022 1 次提交

Add trt execute (#40224) · e72ef603

由 Shang Zhizhou 提交于 3月 10, 2022

* add trt.execute

* merge trt.engine type

* update return op

* update comments

* fix style

* fix style

e72ef603

09 3月, 2022 2 次提交

H

[Infrt]Update kernel dialect (#40141) · 767647ce
由 huzhiqiang 提交于 3月 09, 2022

767647ce

build documents if public apis modified, meanwhile their samplecodes should be tested (#39728) · 041c4bca

由 Ren Wei (任卫) 提交于 3月 09, 2022

* run document_preview when samplecodes be tested

* run document_preview when samplecodes be tested

* sphinx-build symbol link; and build-doc default

* FLUIDDOCDIR typo

* download the required configirations and some other scripts

* install required python packages.

* clone specified branch of docs repo, and if failed, clone the default branch

* clean workspace for docs repo

* use the conf.py imported by https://github.com/PaddlePaddle/docs/pull/4222/

* download and install the boscmd

* Optimaze the code comments.

* specify the pypi index server

* only do doc-build when running in cpu mode

* pull docs pr

git log

paddle_pr_info

* install jq

* force using sphinx-build under py3.7

* using our new domain name for preview

* install python package error

* don't build doc default

041c4bca

07 3月, 2022 2 次提交

王

[infrt] fold the infrt.cvtTensorOp. test=develop (#40214) · b798fb07
由王明冬提交于 3月 07, 2022

b798fb07

cuBlasLt Epilogue To Fuse Linear + ReLU|GeLU (#39437) · 2a3d9eca

由 Ming-Xu Huang 提交于 3月 07, 2022

* Added cuBlasLtHandle_t to device context.

* Added fused_gemm_epilogue op.

1. Added fused_gemm_epilogue op to leverage cuBlastLt Epilogue.
2. Support fusion Act(X*Y + bias), X'dims >=2 and Y'dims shoule be 2.
2. Act currently only be supported ReLU. (Will add GeLU in the future).

* Added UT to fused_gemm_epilogue op.

* Added LinearAct Pattern

1. Added LinearAct into graph_pattern_detector.* to define (2.)'s
pattern.
2. LinearAct is used to detect act(element_add(matmul_v2(x, w), bias)).
3. act currently only support ReLU (Will support GeLU in the future).

* Added FuseGemmEpiloguePass

1, Added FuseGemmEpiloguePass to handle nn.Linear + Act{ReLU}
fusion (GeLU will be supported in the future).
2. Only support matmul_v2 from nn.Linear.

* Added pybind to BuildStrageter.fuse_gemm_epilogue_.

* Added UT for fuse_gemm_epilogue_pass.

* GeLU support and EpilogueSingleton

1. Added GeLU support to fused_gemm_epilogue op.
2. Added EpilogueSingleton to cache auxiliary pointer.
3. Added related UTs.

* Rename cublaslt_epilogue_opto gemm_epilogue_op.*.

* Added both train and infer pattern to LinearAct.

1. Added support of fwd graph with grap_ops linking to LinearAct.
2. Added related changes to fuse_gemm_epilogue_pass for above
modification.

* Changed CUDA requirement from 11.4 to 11.6 for fuse_gemm_epilogue_pass.

* Added identity activation support to gemm_epilogue_op.

* Added Linear Fusion (matmul_v2 + ele_add)

1. Added matmul_v2 + ele_add pattern to LinearActPattern.
2. Added matmul_v2 + ele_add support to fuse_gemm_epilogue_pass.

* Rename gemm_epilogue_op.* to fused_gemm_epilogue_op.*

* Add fused_gemm_epilogue_grad op.

1. Added fused_gemm_epilogue_grad to support backward epilogue fusion.

* Add UTs to fused_gemm_epilogue_grad_op.

* Change attribute name in fused_gemm_epilogue_grad_op for clearing.

* Allow DX and DBias be dispensable to fused_gemm_epilogue_grad op.

* Added ElementwiseAdd+Matmul+Act graph pattern detection.

* Fuse backward of Linear( Act(x))

1. Added backward fusion pass to Linear( Act(x)).
2. Added backward fusion pass to Linear(x).

* Added UTs to backward fusion of Linear(Act(x)).

* Complete document of arguments to fused_gemm_epilogue_op.

* Made arguments of some functions pass by reference.

* Modify code with review comments.

1. Made arguments of some function pass by reference.
2. Removed redundant code.
3. Followed Google code style to change code.

* Made 'const' code style be consistent

* Fixed random seed of python UTs.

* Set Compiling constrains to cuBlasLt

1. Require CUDA 11.6+
2. Remove fuse_gemm_epilogue related tests when CUDA < 11.6.

* Code Reivew from Paddle

1. Changed arguments name is_first_gemm to without_x_gradient for
clearing.
2. Applied PADDLE_THROW in fused_gemm_epilogue_op.

* Remove EpilogueSingleton

1. Applied ReserveSpace to replace Epilogue for passing auxiliary
pointers between FWD and BWD.

* Fix a logical error and enhance UTs.

1. Added act op count checking in UTs.
2. Fix issue to fuse backward or ReLU(Linear(X)).
3. TODO: solve GELU fusion issues.

* Fix Linear and GeLU fusion issues.

1. Modified graph_detech_pattern to fit with both linear wiht gelu or
relu.
2. Modified data range in Uts to allow negative values.

* Removed fused_gemm_epilogue_op.h.

* Rename namespace pten to phi.

* Rename name of arguments in fused_gemm_epilogue_op

1. bias -> Bias.
2. out -> Out.
3. reserve_space -> ReserveSpace.

* Change EpiloguePassActivationCache as local variable.

1. Removed singleton in EpiloguePassActivationCache.
2. Made EpiloguePassActivationCache as an argument to each pass
functions.

2a3d9eca

04 3月, 2022 1 次提交
- 王
  
  [infrt] add ir for convert pd dilect to phi dialect. test=develop (#40104) · 3ac9bc95
  由王明冬提交于 3月 04, 2022
  
  3ac9bc95
03 3月, 2022 1 次提交
- 石
  mlir attr types for infrt place, test=develop (#40087) · b1d38dea
  由石晓伟提交于 3月 03, 2022
```
* mlir attr types for infrt place, test=develop

* fix a bug, test=develop
```
  b1d38dea
02 3月, 2022 3 次提交
- A
  [IPU] update dockerfile (#40061) · 7ef61789
  由 Allen Guo 提交于 3月 02, 2022
```
* update dockerfile for ipu

* update comments, test=document_fix
```
  7ef61789
- P
  support checking `phi` directory in CI op benchmark (#40026) · f30b3f81
  由 pangyoki 提交于 3月 02, 2022
```
* support phi checking in CI op benchmark

* add sparse/gpu

* remove h file in cpu directory
```
  f30b3f81
- H
  
  [Infrt]add phi kernel dialect (#39726) · 07dad6d6
  由 huzhiqiang 提交于 3月 02, 2022
  
  07dad6d6
01 3月, 2022 2 次提交
- W
  remove conv_affine_channel_fuse_pass (#39817) · fc06be9d
  由 wenbin 提交于 3月 01, 2022
```
* remove

* pass

* more pass
```
  fc06be9d
- P
  
  change tests_v2 to dynamic_tests_v2 in CI op benchmark (#39995) · 4204b97a
  由 pangyoki 提交于 3月 01, 2022
  
  4204b97a
28 2月, 2022 2 次提交
- T
  
  Change CI-Build build develop (#39863) · 61443a0e
  由 tianshuo78520a 提交于 2月 28, 2022
  
  61443a0e
- W
  
  infrt add trt engine (#39885) · 27536a32
  由 Wilber 提交于 2月 28, 2022
  
  27536a32
22 2月, 2022 4 次提交
- 王
  
  add pten convert pass.test=develop (#39664) · a6abb6e7
  由王明冬提交于 2月 22, 2022
  
  a6abb6e7
- C
  [pten]add check for using HostAlloc (#39771) · 12c6d06a
  由 chentianyu03 提交于 2月 22, 2022
```
* add check for using HostAlloc

* add check for using HostAlloc
```
  12c6d06a
- Z
  
  update precision catalog (#39717) · df1dbff1
  由 zhangchunle 提交于 2月 22, 2022
  
  df1dbff1
- C
  [PTen->Phi PR2] Rename PT_REGISTER macro to PD_REGISTER (#39790) · 4a338796
  由 Chen Weihang 提交于 2月 22, 2022
```
* unify register macro

* rename declare macro

* fix infrt error
```
  4a338796
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

18 2月, 2022 1 次提交

Infrt registers pten kernels (#39588) · dc39eb18

由 Wilber 提交于 2月 18, 2022

* the mlir representation of pten, test=develop

* fixes an error, test=develop

* infrt registers pten kernels
Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

dc39eb18

17 2月, 2022 3 次提交
- Y
  
  add fluid/eager dirs to coverage ci (#39599) · 2129b300
  由 YUNSHEN XIE 提交于 2月 17, 2022
  
  2129b300
- Q
  update kunlun label_smooth unitest (#39611) · 1f7f8561
  由 QingshuChen 提交于 2月 17, 2022
```
* update kunlun label_smooth unitest
*test=kunlun

* minor
*test=kunlun
```
  1f7f8561
- H
  refine data loader api in infrt (#39580) · 1035d21f
  由 huzhiqiang 提交于 2月 17, 2022
```
* update generate_pd_op_dialect_from_paddle_op_maker.py

* update mlir tensor load interface

* refine

* fix bug

* fix

* refine

* fix

* 3

* fix

* codestyle
Co-authored-by: weishengying <1343838695@qq.com>
```
  1035d21f
16 2月, 2022 2 次提交
- S
  
  update tools for infrt build (#39552) · a7d4ddc4
  由 Shang Zhizhou 提交于 2月 16, 2022
  
  a7d4ddc4
- C
  [pten]change ci using mutable_data() check's directions from pten to pten/kernels (#39597) · d4144616
  由 chentianyu03 提交于 2月 16, 2022
```
* change ci using mutable_data() check's directions from paddle/pten to paddle/pten/kernels

* change echo info from paddle/pten to paddle/pten/kernels
```
  d4144616
15 2月, 2022 1 次提交
- Y
  
  add pten dirs to coverage ci (#39379) · a094c4e7
  由 YUNSHEN XIE 提交于 2月 15, 2022
  
  a094c4e7
14 2月, 2022 2 次提交
- C
  
  [pten] add CI check for using DenseTensor::mutable_data() in pten directions (#39467) · 14049ae5
  由 chentianyu03 提交于 2月 14, 2022
  
  14049ae5
- Q
  
  [Approver Update] update check approver of qili93, test=document_fix (#39483) · db11357c
  由 Qi Li 提交于 2月 14, 2022
  
  db11357c
11 2月, 2022 1 次提交
- Z
  
  get build time (#39368) · 72ad280b
  由 zhangchunle 提交于 2月 11, 2022
  
  72ad280b
09 2月, 2022 1 次提交
- H
  
  convert paddle model to mlir paddle dialect (#39216) · 2be20e20
  由 huzhiqiang 提交于 2月 08, 2022
  
  2be20e20
07 2月, 2022 1 次提交
- Y
  
  INFRT/Refine TensorMap (2nd PR) (#39262) · ed0990e7
  由 Yan Chunwei 提交于 2月 07, 2022
  
  ed0990e7
29 1月, 2022 2 次提交
- Z
  
  Removed approval request for tensor/lod_tensor modifications (#39326) · 984b16fc
  由 Zhanlue Yang 提交于 1月 29, 2022
  
  984b16fc
- Q
  fix kunlun2 softmax unitest bug (#39274) · 23bb2836
  由 QingshuChen 提交于 1月 29, 2022
```
* fix kunlun2 softmax unitest bug
*test=kunlun

* minor
```
  23bb2836

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功