提交 · d4ca7ffbdf77232c75a96aa9ae396e7b33278cdb · Crayon鑫 / Paddle

05 8月, 2022 13 次提交
- Z
  
  Add feed&fetch as default deny ops. (#44708) · d4ca7ffb
  由 Zhen Wang 提交于 8月 05, 2022
  
  d4ca7ffb
- S
  Merge matmul_v1 and matmul_v2 fuse passes (#44870) · d0cf9d9d
  由 Sławomir Siwek 提交于 8月 05, 2022
```
* remove v2_transpose_reshape

* matmul_transpose_reshape

* reshape_transpose_matmul

* restore ut

* adjust old ut

* restore parallel UT ruels

* feedback from review
```
  d0cf9d9d
- O
  
  modify pybind.cc (maybe a typo) (#44883) · 1f7e9546
  由 OccupyMars2025 提交于 8月 05, 2022
  
  1f7e9546
- J
  
  Add int8 support for matmulV2 (#44908) · f3c14762
  由 joanna.wozna.intel 提交于 8月 05, 2022
  
  f3c14762
- Q
  
  [DCU] fix hipDeviceAttributeManagedMemory not support on DTK, test=develop (#44816) · 075d7219
  由 Qi Li 提交于 8月 05, 2022
  
  075d7219
- Y
  
  fix bugs when clip xshape (#44898) · df790b9b
  由 YuanRisheng 提交于 8月 05, 2022
  
  df790b9b
- D
  migrate kernel (#44841) · 62a98130
  由 duanboqiang 提交于 8月 05, 2022
```
* migrate kernel

* fix sig order

* remove header files

* remove header

* remove header

* modify logits grad
```
  62a98130
- C
  enhance fused_multi_transformer_op(post_layer_norm) (#44789) · 643c94e4
  由 carryyu 提交于 8月 05, 2022
```
* add fused_multi_transformer post_layer_norm

* add test post_layer_norm
```
  643c94e4
- Z
  update trt workspace size param (#44469) · bdce552b
  由 Zhang Jun 提交于 8月 05, 2022
```
* update trt workspace size param

* update

* update

* update

* use int64_t

* use int64_t

* upate

* update
```
  bdce552b
- Z
  
  refactor xpu tests for squeeze/unsqueeze, *test=kunlun (#44812) · 54d98963
  由 zhangxiaoci 提交于 8月 05, 2022
  
  54d98963
- F
  move fft kernels to phi (#44714) · 153f1138
  由 Feiyu Chan 提交于 8月 05, 2022
```
* move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
* make stft_op use fft from phi/kernels/funcs, clean code
```
  153f1138
- R
  
  Skip distributed Program for standalone executor (#44897) · 0cbc870e
  由 Ruibiao Chen 提交于 8月 05, 2022
  
  0cbc870e
- A
  [IPU] restore dockerfile to gcc8.2 (#44909) · 206102af
  由 Allen Guo 提交于 8月 05, 2022
```
* restore to gcc8.2

* test=document_fix
```
  206102af
04 8月, 2022 26 次提交
- S
  Matmuls with activation and elementwise_add fuses (#44655) · 0420d514
  由 Sławomir Siwek 提交于 8月 04, 2022
```
* Add unit tests

* matmul_v2 + activation

* matmuls + elementwise_add

* matmul_v2 postops

* transform matmul to v2

* opcompat

* fix fusing matmul with multipe outs

* add shape constraints

* remove unused vars

* change pass order

* - Unit tests to be debugged

- fix

- refactor

- diagnostic

- more diagnostic

- fix

- Fix number two

- fix

- fix

- fix

- alpha added

- more fixes

- compilation fix

- removed diagnostic code

- cosmetic fixes

* lint

* add alpha constraint

* merge matmul refactor

* trigger CI

* - fix

* - another fix

* code style

* add support for matmul+elementwise_add+activation

* code style

* fix bfloat16 bugs

* change append_binary to append_sum
Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
```
  0420d514
- W
  
  Set ALL_BACKEND for shape cpu kernel (#44884) · 5e74ba33
  由 WangZhen 提交于 8月 04, 2022
  
  5e74ba33
- N
  [Docs][en] adjust code example format (#44679) · d5de7886
  由 Nyakku Shigure 提交于 8月 04, 2022
```
* add name attribute to code-block, test=document_fix

* remove redundant labels, test=document_fix

* remove redundant labels (from upstream), test=document_fix

* more COPY-FROM (try multiple code example), test=document_fix

* empty commit, try to trigger PR-CI-build

* fix some `Examples:` format issues

* fix some ci errors
```
  d5de7886
- Z
  [Paddle-TRT] add Rnn (#44678) · ffc8defa
  由 zhoutianzi666 提交于 8月 04, 2022
```
* add rnn
```
  ffc8defa
- J
  
  added conv and conv_tranpose support for md (#44677) · b2727020
  由 jakpiase 提交于 8月 04, 2022
  
  b2727020
- L
  Addition of fp16 type support for Compare OP (#44405) · 6506668e
  由 limingshu 提交于 8月 04, 2022
```
* first commit

* add fp16 ctest files for compare op

* add cpu register of float16 for compare ops
```
  6506668e
- C
  
  fix bug (#44875) · c693a027
  由 ccrrong 提交于 8月 04, 2022
  
  c693a027
- D
  [XPU] add merged_momentum including fp32 and fp16 (#44824) · 4922376c
  由 dongfangshenzhu 提交于 8月 04, 2022
```
* add merged_momentum *test=kunlun

* add merged_momentum *test=kunlun

* add fp16 to merged_momentum,*test=kunlun
```
  4922376c
- W
  [Eager] fix slice's input mistake (#44855) · cfc9bf76
  由 Weilong Wu 提交于 8月 04, 2022
```
* [Eager] fix slice's input mistake

* add tests for slice
```
  cfc9bf76
- Z
  phi_fill_diagonal_tensor (#44649) · 2140e825
  由 zhiboniu 提交于 8月 04, 2022
```
* phi_fill_diagonal_tensor

* delete extra lines

* update

* add legacy api test

* rename sig
```
  2140e825
- Z
  Phi generate_proposals_v2 (#44436) · 566c80ff
  由 zhiboniu 提交于 8月 04, 2022
```
* phi_generate_proposals_v2

* remove old kernels

* optest add eager_check

* del lod

* update

* update

* update test_detection with_lod

* update nms_util

* remove old nms_util.h
```
  566c80ff
- X
  mv fold & unpool to phi (#44836) · e9994f2e
  由 xiaoting 提交于 8月 04, 2022
```
* fix conflicts

* mv unused file

* revert backward.h

* revert lu_unpack kernel

* rm .cu file

* Update lu_unpack_kernel.cc

* format phi yaml
```
  e9994f2e
- W
  convert support multi block. (#44866) · b4a4eef2
  由 Wilber 提交于 8月 04, 2022
```
* convert support multi block.

* update
```
  b4a4eef2
- A
  
  phi fix header (#44873) · f9e7fe66
  由 Aganlengzi 提交于 8月 04, 2022
  
  f9e7fe66
- W
  [JitLayer]Move Function classes to a sub dir (#44844) · 882053dc
  由 WangZhen 提交于 8月 04, 2022
```
* Move Function classes to a sub dir

* Format code
```
  882053dc
- K
  
  fix logger pollution (#44857) · c3a2cdcc
  由 kuizhiqing 提交于 8月 04, 2022
  
  c3a2cdcc
- K
  
  launch no python script (#44849) · a3f3172c
  由 kuizhiqing 提交于 8月 04, 2022
  
  a3f3172c
- Z
  
  fix bug of kernel signatore code-gene (#44859) · 58d8ead2
  由 zyfncg 提交于 8月 04, 2022
  
  58d8ead2
- L
  
  change approver (#44325) · 725c33f5
  由 Leo Chen 提交于 8月 04, 2022
  
  725c33f5
- J
  
  add DCINN_GIT_TAG cmake flag to set cinn version (#44868) · edc482a2
  由 jiangcheng 提交于 8月 04, 2022
  
  edc482a2
- H
  [XPU] fleet dist_model support xpu (#44854) · 7335b679
  由 houj04 提交于 8月 04, 2022
```
* [XPU] fleet dist_model support xpu. test=kunlun

* [XPU] fleet dist_model support xpu. test=kunlun

* move unittest file location. test=kunlun
```
  7335b679
- 王
  
  add xpu garbage collector for standalone executor. (#44572) · 0e26361c
  由王明冬提交于 8月 04, 2022
  
  0e26361c
- C
  
  take back unnecessary header file (#44831) · cd55385a
  由 Chen Weihang 提交于 8月 03, 2022
  
  cd55385a
- S
  
  opt allreduce (#44843) · 1f9e2742
  由 sneaxiy 提交于 8月 04, 2022
  
  1f9e2742
- A
  
  [PHI] Add ones/zeros in tensor_compat.h (#44860) · d3e90680
  由 Aurelius84 提交于 8月 04, 2022
  
  d3e90680
- Y
  Set the lr var's dtype to fp32 when create a fp16 lr var in optimizer if user... · 9e39d746
  由 Yuang Liu 提交于 8月 04, 2022
```
Set the lr var's dtype to fp32 when create a fp16 lr var in optimizer if user not mean to use global fp16. (#44840)
```
  9e39d746
03 8月, 2022 1 次提交
- W
  
  update test_uniform_random_inplace_op.py (#44852) · 9a17f05f
  由 wuyefeilin 提交于 8月 03, 2022
  
  9a17f05f

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致