Commits · 206a33b3ae0caeb6d676d21334d084601457a2b7 · PaddlePaddle / Paddle

14 Dec, 2021 · 11 commits
- B
  add conv_gelu_mkldnn_fuse_pass (#38107) · 206a33b3
  Committed by baoachun on Dec 14, 2021
```
* add conv_gelu_mkldnn_fuse_pass

* add post ops
```
  206a33b3
- A
  
  Add const in GetInput/OutputVarPtrs in InferShapeContext (#38066) · 22f14e74
  Committed by Aurelius84 on Dec 14, 2021
  
  22f14e74
- W
  
  modify the fix_seed attribute in dropout op is a def attribute.test=develop (#38100) · f44add7b
  Committed by weishengying on Dec 14, 2021
  
  f44add7b
- Y
  
  remove KernelName (#38082) · 8198cad7
  Committed by YuanRisheng on Dec 14, 2021
  
  8198cad7
- Y
  
  [fleet_executor] Take task node from python side (#38083) · 7eb121df
  Committed by Yuang Liu on Dec 14, 2021
  
  7eb121df
- Y
  [PTen] Reduce reshape kernel functions in pten (#38055) · a3c8abc7
  Committed by YuanRisheng on Dec 14, 2021
```
* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile
```
  a3c8abc7
- F
  Mkldnn depthwise conv pass (#37798) · 19a833c8
  Committed by feng_shuai on Dec 14, 2021
```
* test_mkldnn_depthwise_conv_pass

* test: add TimeOut

* sset TIMEOUT

* fix:add random num for dilation and group
```
  19a833c8
- Z
  Handled Dispensable Inputs/Outputs in Eager AutoCodeGen (#37959) · f2043bd1
  Committed by Zhanlue Yang on Dec 14, 2021
```
* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
```
  f2043bd1
- H
  add layer_norm_fuse_pass test case (#37830) · b95c9cf2
  Committed by heliqi on Dec 14, 2021
```
* add layer_norm_fuse_pass test case

* restore cmakelist code

* Merge branch 'develop' into layer_norm_fuse_pass

* Merge branch 'develop' into layer_norm_fuse_pass

* add bad case test
```
  b95c9cf2
- W
  
  fix generate_proposals op doc (#38048) · c117dfba
  Committed by wangguanzhong on Dec 14, 2021
  
  c117dfba
- S
  add reshape+transpose+matmul_v2 only (#37847) · a922168a
  Committed by Sylwester Fraczek on Dec 14, 2021
```
* reshape+transpose+matmul_v2

* in_name->input_name

* fix pr-ci-static-check
```
  a922168a
13 Dec, 2021 · 14 commits
- Z
  update 3 tests (#37922) · 33fbb66e
  Committed by zhenlin on Dec 13, 2021
```
* update 3 tests

* fix typo error
```
  33fbb66e
- W
  disable bad case for shuffle pass (#38072) · e7f5d325
  Committed by wenbin on Dec 13, 2021
```
* disabled bad case

* int to size_t
```
  e7f5d325
- J
  
  add popart_canonicalization p4 (#37967) · 69252fd8
  Committed by jianghaicheng on Dec 13, 2021
  
  69252fd8
- T
  
  update xpu_memcpy (#38049) · bdf5834e
  Committed by taixiurong on Dec 13, 2021
  
  bdf5834e
- X
  fix single card 8 unittests in new executor (#37957) · 9a4eec98
  Committed by xiongkun on Dec 13, 2021
```
* fix single card 8 unittests in new executor

* fix

* fix
```
  9a4eec98
- N
  
  [pnorm] Optimize p_norm op for special cases (#37685) · 10d9ab4b
  Committed by Noel on Dec 13, 2021
  
  10d9ab4b
- C
  
  fix custom op infershape error (#38045) · 3a339cc0
  Committed by Chen Weihang on Dec 13, 2021
  
  3a339cc0
- W
  
  fix mac import hang, test=develop (#38051) · d3569c7e
  Committed by wanghuancoder on Dec 13, 2021
  
  d3569c7e
- W
  add logit API (#37844) · b197bfe6
  Committed by wangzhen38 on Dec 13, 2021
```
* add Logit API

* add unittest

* conflict

* pull conflit

* pull conflit logit

* fix unititest

* fix code style

* update docs style of

* update en doc

* fix docs en style

* fix docs en style1

* fix docs en style2

* fix docs en style3

* fix docs en style4

* fix docs en style5

* fix docs en style6

* fix docs en style7

* fix docs en style8

* update by review

* fix nan bug
```
  b197bfe6
- C
  complement deps on cinn_launch_context cmake (#38031) · cba84f88
  Committed by CtfGo on Dec 13, 2021
```
complement deps of cmake files under WITH_CINN compilation
```
  cba84f88
- Z
  【PTen】Add variadic args kernel for PTen API to replace KernelContext (#37942) · b76ef045
  Committed by zyfncg on Dec 13, 2021
```
* add variadic_args kernel in pten

* merge develop code

* add variadic_args kernel and benchmark

* change dynamic_cast to static_cast for DeviceContext

* merge the code

* modify code format

* refactor variadic kernel function
```
  b76ef045
- S
  fix reduce_max bug (#38026) · 512e4339
  Committed by Shang Zhizhou on Dec 13, 2021
```
* fix reduce_max bug

* add unittest
```
  512e4339
- Z
  
  fix trt de/serialization and refine the data type selection (#38057) · 92ad682f
  Committed by zlsh80826 on Dec 13, 2021
  
  92ad682f
- Z
  [Paddle-TRT] Fix trt dynamic shape ernie unit test on V100 (#38056) · 099cb75a
  Committed by zlsh80826 on Dec 13, 2021
```
* add restriction on plugin supportsFormat to eliminate errors from TensorRT8

* ernie-varlen is only supported on architecture >= sm75
```
  099cb75a
10 Dec, 2021 · 15 commits
- L
  
  fix int32 overflow in cuda kernel loop (#38007) · 37f43ebc
  Committed by Leo Chen on Dec 10, 2021
  
  37f43ebc
- P
  
  fix dygraph_grad_maker to support set_value (#38014) · dabf8152
  Committed by pangyoki on Dec 10, 2021
  
  dabf8152
- C
  
  rename TensoCopy (#38036) · 8f2b0860
  Committed by chentianyu03 on Dec 10, 2021
  
  8f2b0860
- Z
  fix pscore geo&lr_decay (#37995) · 513d1f97
  Committed by zhaocaibei123 on Dec 10, 2021
```
* fix

* modify log

* fix batch_size
```
  513d1f97
- K
  
  fix ndiv for npu (#37998) · 11c785a4
  Committed by kuizhiqing on Dec 10, 2021
  
  11c785a4
- Y
  [PTen]Add alias name for matmul and remove redundant member in kernel factory (#38011) · c5a7da4b
  Committed by YuanRisheng on Dec 10, 2021
```
* add alias kernel name

* modify code as suggestions

* add alias name for matmul and remove redundant member in kernel factory
```
  c5a7da4b
- F
  add as_complex and as_real op (#37784) · ae40370d
  Committed by Feiyu Chan on Dec 10, 2021
```
* add as_complex and as_real op
```
  ae40370d
- L
  git ignore eager_op_function_impl.h (#38030) · 01b6bdf4
  Committed by Leo Chen on Dec 10, 2021
```
* git ignore eager_op_function_impl.h

* test=document_fix
```
  01b6bdf4
- C
  [PTen]fix pten::Copy use error (#37982) · 2360406d
  Committed by chentianyu03 on Dec 10, 2021
```
* fix pten::Copy use error in redcue_impl

* remove in_dtype args in reduce kernel

* fix copy error

* fix copy error
```
  2360406d
- S
  
  make cuda graph thread local allocator (#37814) · 62b1f38c
  Committed by sneaxiy on Dec 10, 2021
  
  62b1f38c
- J
  
  support pylayer with different input dtype (#37974) · c732c831
  Committed by Jiabin Yang on Dec 10, 2021
  
  c732c831
- Y
  
  [fleet_executor] Fix overlap hang (#38024) · b4e44b0a
  Committed by Yuang Liu on Dec 10, 2021
  
  b4e44b0a
- C
  
  change serval variable name and usage related cinn_launch (#38022) · a9bd6f0c
  Committed by CtfGo on Dec 10, 2021
  
  a9bd6f0c
- H
  add fc_elementwise_layernorm_fuse_pass (#37771) · 0127e92d
  Committed by heliqi on Dec 10, 2021
```
* add fc_elementwise_layernorm_fuse_pass

* fix name conflictn

* rebuild CI

* fix Ran Programs=0 bug
```
  0127e92d
- L
  
  revert flags_benchmark (#38005) · 26c44a86
  Committed by Leo Chen on Dec 10, 2021
  
  26c44a86

PaddlePaddle / Paddle · synced successfully about 1 year ago
