提交 · cadbd2cc72b29e4509b81d2b581c3ed5b1833034 · Crayon鑫 / Paddle

11 10月, 2022 6 次提交
- 傅
  Fix set_value failure when source tensor is fp16 Dtype (#46801) · 2341ed5e
  由傅剑寒提交于 10月 11, 2022
```
* add fp16 data type for set_value

* cancel flip modification

* add fp16 dtype support for set_value
```
  2341ed5e
- H
  [Opt transpose2] Opt GetExpectedKernelType code of transpose2 (#46692) · 98e00793
  由 HongyuJia 提交于 10月 11, 2022
```
* solve transpose2, follow #22402

* fix CI cmake

* update REGISTER_OP_KERNEL of transpose2
```
  98e00793
- H
  
  fix typo (#46814) · 46595d6b
  由 HongyuJia 提交于 10月 11, 2022
  
  46595d6b
- Z
  Fix some bugs hidden in build_cinn_pass. (#46843) · a19b082e
  由 Zhen Wang 提交于 10月 11, 2022
```
* Fix some bugs hidden in build_cinn_pass.

* Update codes about OpTransInfo.

* Only support for the static reshape/reshape2 op.
```
  a19b082e
- N
  
  Update layout autotune for module with no modified (#46541) · 3da3462f
  由 niuliling123 提交于 10月 11, 2022
  
  3da3462f
- W
  
  [DOC] update docs of activation op (#46556) · 20eb6e00
  由 wuyefeilin 提交于 10月 11, 2022
  
  20eb6e00
10 10月, 2022 22 次提交
- T
  Add libpaddle.so log. (#46589) · e5bcfacc
  由 tianshuo78520a 提交于 10月 10, 2022
```
* Add libpaddle.so log

* Add libpaddle.so log
```
  e5bcfacc
- T
  [CodeStyle][F541] Convert f-strings without curly braces to normal strings (#46700) · c64e1dcf
  由 Tony Cao 提交于 10月 10, 2022
```
* Update README.md

* Update README.md

* Fix F541 by converting f-string to normal strings
```
  c64e1dcf
- Y
  [PHI]Add RNN yaml (#46812) · ab60fd8b
  由 YuanRisheng 提交于 10月 10, 2022
```
* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta

* WIP: move rnn kernrl to phi

* Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.

* fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.

* fix ci bugs

* fix ci bugs

* fix ci bugs

* modify code according comment
Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>
```
  ab60fd8b
- L
  reduce time cost on atomic in interpretercore (#46688) · dd3d45de
  由 Leo Chen 提交于 10月 10, 2022
```
* reduce time cost on atomic in interpretercore

* clear code of PrepareAtomic in interpretercore

* refine threadpool cache
```
  dd3d45de
- Z
  
  [inference] CPU-> GPU async io copy for TensorRT using ShareExternalData API (#46636) · c333af2f
  由 Zhang Jun 提交于 10月 10, 2022
  
  c333af2f
- S
  Add fc residual pattern (#46757) · 0c789ae5
  由 Sylwester Fraczek 提交于 10月 10, 2022
```
* fix fc pattern

remove use_bias
add residual input switch
fix references to pattern

* review fixes
```
  0c789ae5
- R
  
  remove comment (#46827) · 8a5f17e8
  由 Rayman 提交于 10月 10, 2022
  
  8a5f17e8
- S
  add function FindInputNameByVarName (#46759) · 8eaff62d
  由 Sylwester Fraczek 提交于 10月 10, 2022
```
* Add methods that find input or output name by var name

* kind of bugfix - initialize variables

* ci fix

* review fixed
```
  8eaff62d
- Z
  
  [Paddle-TRT] support new quant format from slim (#46022) · 7987a905
  由 zhoutianzi666 提交于 10月 10, 2022
  
  7987a905
- W
  [Paddle Inference]fix embedding fused (#46789) · 6512e087
  由 Wangzheee 提交于 10月 10, 2022
```
* fix embedding fused
```
  6512e087
- W
  preln_res_bias_layernorm half2 bugfix and unroll opt (#46619) · ae6b4713
  由 Wang Bojun 提交于 10月 10, 2022
```
* preln_res_bias_layernorm bugfix unroll opt

* code style refine

* NOLINT for codestyle
```
  ae6b4713
- C
  make fused_multi_transformer support dynamically set the cache_kvs' shape and... · 9ea279a4
  由 carryyu 提交于 10月 10, 2022
```
make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches. (#46777)

* make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches.
```
  9ea279a4
- H
  
  delete_activation_headerfile (#46690) · c4bbe5d9
  由 HongyuJia 提交于 10月 10, 2022
  
  c4bbe5d9
- H
  
  delete_multi_gru_headerfile (#46689) · 749da9a9
  由 HongyuJia 提交于 10月 10, 2022
  
  749da9a9
- H
  [MKLDNN] Delete mkldnn headerfile in quantize and requantize (#46676) · 8ec3b737
  由 HongyuJia 提交于 10月 10, 2022
```
* delete_quantize_headerfile

* delete_requantize_headerfile
```
  8ec3b737
- H
  
  delete_gaussian_random_mkldnn_headerfle (#46669) · 26d1d83e
  由 HongyuJia 提交于 10月 10, 2022
  
  26d1d83e
- P
  [PHI] transpose2_grad op migration (#46139) · e3407a80
  由 Paulina Gacek 提交于 10月 10, 2022
```
* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed
```
  e3407a80
- L
  
  Move group and all reduce from collective to communication (#45848) · a0dffd39
  由 LiYuRio 提交于 10月 10, 2022
  
  a0dffd39
- F
  fix:gather op (#46779) · 45b93325
  由 feng_shuai 提交于 10月 10, 2022
```
* fix:gather op

* add ut
```
  45b93325
- R
  
  【Hackathon No.36】优化 lerp_grad op 在 GPU 上的计算性能 (#45946) · ef61df30
  由 Rayman 提交于 10月 10, 2022
  
  ef61df30
- R
  【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111) · 5e0614a1
  由 Rayman 提交于 10月 10, 2022
```
support fp16 for deformable conv
```
  5e0614a1
- R
  Delete gpu_memory information of every unnitest to less time consuming (#46472) · a7e1b9d2
  由 risemeup1 提交于 10月 10, 2022
```
* it is a test

* it is a test

* it is a test,test=coverage

* optimizing the proceess of generating ut_file_map.json

* optimizing the proceess of generating ut_file_map.json

* optimizing the proceess of generating ut_file_map.json

* optimizing the proceess of generating ut_file_map.json
```
  a7e1b9d2
09 10月, 2022 9 次提交
- Z
  
  add sync_batch_norm_kernel (#46430) · 5cd6a707
  由 zhangkaihuo 提交于 10月 09, 2022
  
  5cd6a707
- W
  fix fp16 (#46713) · 9a849a37
  由 Wang Bojun 提交于 10月 09, 2022
```
* fix fp16

* remove debug info

* code style refine
```
  9a849a37
- Z
  
  [Sparse] Add a batch_norm kernel (#46359) · 888223b7
  由 zhangkaihuo 提交于 10月 09, 2022
  
  888223b7
- Z
  
  Update device_worker.cc (#46723) · 57cdde13
  由 zmxdream 提交于 10月 09, 2022
  
  57cdde13
- S
  
  add seed check (#46747) · 97ec57fe
  由 Sławomir Siwek 提交于 10月 09, 2022
  
  97ec57fe
- S
  Enable hard_swish_grad unit test (#46621) · ff0171e4
  由 Sławomir Siwek 提交于 10月 09, 2022
```
* enable hard_swish_grad unit test

* remove unused argument
```
  ff0171e4
- H
  
  [Dygraph] Fix Perf of FusedFeedForward and FusedAttention with AllReduce (#46780) · 078e8c78
  由 Haohongxiang 提交于 10月 09, 2022
  
  078e8c78
- Z
  
  interpretercore thread not always spin (#46687) · 2e217dbb
  由 zhangbo9674 提交于 10月 09, 2022
  
  2e217dbb
- R
  
  [MLU] fix cmake error (#46772) · 4df12303
  由 ronnywang 提交于 10月 09, 2022
  
  4df12303
08 10月, 2022 3 次提交
- H
  
  fix typo (#46680) · 6e9bb9f9
  由 HongyuJia 提交于 10月 08, 2022
  
  6e9bb9f9
- H
  
  [Dygraph] Fix performance of pp+mp by using send/recv_calc_stream instead of send/recv (#46116) · 8c0529fd
  由 Haohongxiang 提交于 10月 08, 2022
  
  8c0529fd
- C
  
  [MLU] add fluid MLUOps prior_box (#46585) · ff37e48e
  由 cifar10 提交于 10月 08, 2022
  
  ff37e48e

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致