Commits · bcf86e5ce7b18050ffc469b93b69da469862bfd5 · Crayon鑫 / Paddle

24 Dec, 2021 5 commits

add new API/OP: paddle.poisson (#38117) · bcf86e5c
Committed by zhouweiwei2014 on Dec 24, 2021
```
* add new API/OP:paddle.poisson

* fix comment
```
bcf86e5c

[Dy2stat]Fix error when calling sublayer's non-forward func in dy2stat (#37296) · 7339a124

Committed by 0x45f on Dec 24, 2021

* fix error when calling sublayer's non-forward func in dy2stat

* fix circular import using an inelegant way

* deal with parameters

* remove param_guard in __call__

* remove comment

* fix error when jit.load

* rename block var

* remove wrong code

* add unit test

7339a124

[Dy2Stat]Consider InputSpec.name to calculate Cachekey hash id (#38273) · 8e6d5d2b
Committed by Aurelius84 on Dec 24, 2021
```
* Consider InputSpec.name to calculate Cachekey hash id

* fix function
```
8e6d5d2b

add conv+hard_sigmoid and conv+hard_swish fuse pass ut (#37553) · a858326a

Committed by baoachun on Dec 24, 2021

* add conv+hard_sigmoid fuse pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_hard_sigmoid_mkldnn_fuse_pass ut

* update conv+hard_sigmoid and conv+hard_swish fuse pass ut

* update ut

* update ut

a858326a

Support test imperative basic in eager (#38313) · d48f7c89

Committed by Jiabin Yang on Dec 24, 2021

* Rearranged Eager AutoCodeGen directory structure

* Removed USE_OP in Eager AutoCodeGen

* Enabled generation for Operators without Grad/Inputs/Outputs

* Resolved operators without input

* Fixed merge conflicts

* Enabled Eager AutoCodeGen for 10+ more operators

* Refactored Eager AutoCodeGen with more organized helper objects

* Enabled Eager AutoCodeGen for operators with multiple OpBases

* Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument

* Handled Dispensable Inputs/Outputs in Eager AutoCodeGen

* Adjusted function generation/call between Python-C API & Dygraph API

* Synchronized auto-generated Python-C API with Dygraph Forward Functions

* support more eager tensor api

* fix merge compile error

* fix compile error and fit develop code

* support pure CPU

* fix some logic error in eager_mode

* support _varbase_creator in eager mode

* Added safe_initialized interface to EagerTensor for use in processing dispensable inputs

* for eager mode

* refine

* support multiple constructor for eager tensor

* add place related code

* polish code

* specific randint with dtype of int64

* Support pure cpu test

* eager logic

* refine test in pure cpu

* eager logic

* eager logic

* eager logic, test=develop

* skip core.eager when in inference, test=develop

* refine, test=develop

* refine, test=develop

* call RetainGrad after run forward kernel, test=develop

* refine, test=develop

* support dygraph util, meta, guard test

* support inference test

* refine test and fix initializer failed
Co-authored-by: jim19930609 <jim19930609@gmail.com>
Co-authored-by: Wang Huan <wanghuan29@baidu.com>

d48f7c89

23 Dec, 2021 10 commits

move distribution.py into distribution package and split into different file... · a3e6f18c
Committed by Xiaoxu Chen on Dec 23, 2021
```
move distribution.py into distribution package and split into different file for better scalability (#38047)
```
a3e6f18c

add control/status API (#37885) · 21b7ed3e

Committed by wuhuanzhou on Dec 23, 2021

* add control/status API, test=develop

* fix import error, test=develop

* add is_grad_enabled unittest, test=develop

* add code comment for example code and API, test=develop

* add checking for type, test=develop

* add api description, test=develop

* fix docs index_en, test=document_fix

* fix doc of is_floating_point, test=document_fix

21b7ed3e

Add erfinv API (#38295) · 6b59b58c

Committed by wuhuanzhou on Dec 23, 2021

* add erfinv API, test=develop

* fix gradient accuracy error, test=develop

* fix cuda compilation error on Windows, test=develop

* fix M_2_SQRTPI undeclared identifier on Windows, test=develop

6b59b58c

【PTen】Add empty and empty_like kernel in pten (#38334) · 4221cd33
Committed by zyfncg on Dec 23, 2021
```
* add empty and empty_like kernel in pten

* add empty dev_api
```
4221cd33

add mkldnn conv_elementwise_add_mkldnn_fuse_pass ut (#37612) · f88065d3

Committed by baoachun on Dec 23, 2021

* add mkldnn conv_elementwise_add_mkldnn_fuse_pass ut

* update mkldnn conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update conv_elementwise_add_mkldnn_fuse_pass ut

* restrict conv2d data_format in conv_elementwise_add_mkldnn_fuse_pass

* update conv_elementwise_add_mkldnn_fuse_pass OpCompat

* update conv_elementwise_add_mkldnn_fuse_pass ut

* update ut

f88065d3

Fixed corner case in fill_constant (#38284) · 4e4d58b3
Committed by Siming Dai on Dec 23, 2021

4e4d58b3

add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector (#38020) · 0eb03ed7
Committed by zhouweiwei2014 on Dec 23, 2021
```
* add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector

* fix comment
```
0eb03ed7

Add unittest for flatten2_matmul squeeze2_matmul reshape2_matmul pass (#37644) · aa059885

Committed by heliqi on Dec 23, 2021

* add flatten2_matmul squeeze2_matmul reshape2_matmul test case

* modify skip func to ignore_pass_case func

* rebuild CI

* add test_xx_matmul_fuse_pass timeout

* add test_map_xx_pass timeout

* add max_duration of test cast

* add trt skip

* add timeout

* del commented code

aa059885

remove unitest for auto_searcher (#38370) · ebbd3564
Committed by JZ-LIANG on Dec 23, 2021

ebbd3564

Revise input_data1 shape to same as input_data2 shape for non-broadcast cases (#38206) · f50768e8
Committed by zlsh80826 on Dec 23, 2021

f50768e8

22 Dec, 2021 10 commits
- del mkldnn options of baseline (#38349) · 4c5ea4ca
  Committed by heliqi on Dec 22, 2021
  
  4c5ea4ca
- Fix multi tensor momentum regular bug (#38344) · bf6d65fc
  Committed by zhangbo9674 on Dec 22, 2021
```
* fix merged_momentum regular bug

* fix bug
```
  bf6d65fc
- [PTen] Add cmake function for kernels (#38311) · e6310dbd
  Committed by Chen Weihang on Dec 22, 2021
```
* add pten kernel cmake

* add pten kernel cmake function

* fix compile error

* add enforce include for full kernel

* fix compile failed

* change cuda to gpu

* fix cmake function error
```
  e6310dbd
- Replaced core.ops with _C_ops (#38337) · 242ef2b9
  Committed by Zhanlue Yang on Dec 22, 2021
  
  242ef2b9
- add mkldnn reshape_transpose_matmul fuse pass ut and op version check (#37468) · 274b135b
  Committed by baoachun on Dec 22, 2021
```
* add mkldnn reshape_transpose_matmul fuse pass ut and op version check

* update reshape_transpose_matmul_mkldnn_fuse_pass ut

* update ut
```
  274b135b
- update mkldnn batch_norm_activation fuse pass ut (#37402) · 3d7e737c
  Committed by baoachun on Dec 22, 2021
```
* update mkldnn batch_norm_activation fuse pass ut

* update ut

* update mkldnn batch_norm_act_fuse_pass ut

* update batch_norm_act_fuse_pass ut

* update ut
```
  3d7e737c
- fix clip extra when QAT export model (#38323) · 142ea171
  Committed by Guanghua Yu on Dec 22, 2021
  
  142ea171
- fix prelu weight shape for NHWC of static mode (#38310) · 0a79499c
  Committed by Guoxia Wang on Dec 22, 2021
  
  0a79499c
- Add nearest_interp/v2 int8 and uint8 support (#37985) · 56e2a6a6
  Committed by joanna.wozna.intel on Dec 22, 2021
  
  56e2a6a6
- Rename full infer_meta (#38332) · abb07f35
  Committed by zyfncg on Dec 22, 2021
```
* rename full infer_meta

* fix merge problem
```
  abb07f35
21 Dec, 2021 11 commits
- Fix inplace problem of setitem (#38298) · da61df5c
  Committed by zyfncg on Dec 21, 2021
```
* add inplace_map for trace_op in pybind

* fix inplace problem of setitem

* refactor the param format  of trace_op
Co-authored-by: pangyoki <pangyoki@126.com>
```
  da61df5c
- update seqconv_eltadd_relu_fuse_pass ut (#37907) · 4e578c2b
  Committed by baoachun on Dec 21, 2021
```
* update seqconv_eltadd_relu_fuse_pass ut

* update ut

* update ut

* update ut
```
  4e578c2b
- update squared_mat_sub_fuse_pass ut (#37838) · aadc8674
  Committed by baoachun on Dec 21, 2021
```
* update squared_mat_sub_fuse_pass ut

* update ut

* update ut
```
  aadc8674
- [fleet_executor] Python side fleet executor and task node (#38290) · a4afb97a
  Committed by Yuang Liu on Dec 21, 2021
  
  a4afb97a
- fix recompute no grad warning (#38293) · 2005b98b
  Committed by Guoxia Wang on Dec 21, 2021
  
  2005b98b
- add seqpool_cvm_concat_fuse_pass ut (#37902) · 06cf314a
  Committed by baoachun on Dec 21, 2021
```
* add seqpool_cvm_concat_fuse_pass ut

* rename ut name
```
  06cf314a
- Support FP16 mean (#38289) · 643a268e
  Committed by sneaxiy on Dec 21, 2021
```
* mean first version

* fix scalar mean

* add fp16 dtype for api
```
  643a268e
- Fix test_conv_eltwiseadd_bn_fuse_pass timeout bug (#38302) · c197d73b
  Committed by yeliang2258 on Dec 21, 2021
```
* fix timeout bug

* update
```
  c197d73b
- update repeated_fc_relu_fuse_pass ut (#37845) · a896d1ce
  Committed by baoachun on Dec 21, 2021
```
* update repeated_fc_relu_fuse_pass ut

* update ut
```
  a896d1ce
- optimize performance of offload in dygraph sharding stage2 (#38064) · f74ebd8a
  Committed by Haohongxiang on Dec 21, 2021
```
* update

* fix bugs

* modify code style

* fix bugs of _get_global_group
```
  f74ebd8a
- PassAutoScan: baseline uses the same config as the test cases (#38252) · 61ef56a1
  Committed by heliqi on Dec 21, 2021
```
* add timeout

* add timeout

* PassAutoScan base_line use same config

* try run base_line

* fix dropout Mask of output attr error

* fix dropout Mask of output attr error
```
  61ef56a1
20 Dec, 2021 4 commits

add check pass conflict tools (#38276) · 0d12aa64
Committed by sneaxiy on Dec 20, 2021

0d12aa64

add mkldnn conv_transpose_bias fuse pass ut (#37508) · ac696941

Committed by baoachun on Dec 20, 2021

* add mkldnn conv_transpose_bias fuse pass ut

* update conv_transpose_bias_mkldnn_fuse_pass ut

* update conv_transpose_bias_mkldnn_fuse_pass ut

* update conv_transpose_bias_mkldnn_fuse_pass ut

* restrict conv2d data_format in conv_transpose_bias_mkldnn_fuse_pass

* update ut timeout setting

* update ut

ac696941

[pten]add pten conj kernel (#38247) · a2793e5e

Committed by chentianyu03 on Dec 20, 2021

* add pten conj kernel

* modify conj_kernel file path

* add defined cuda macro to cuda/conj_kernel.h

a2793e5e

Support FP16 for more ops (#38123) · 1f445bf3

Committed by sneaxiy on Dec 20, 2021

* support FP16 for more ops

* add amp list tests

* refine reduce_mean_grad

* fix OP benchmark ci

* fix fp16 reduce_mean

* update ut, but still have some problems

* remove mean/reduce_mean fp16 kernel

1f445bf3

Crayon鑫 / Paddle is in sync with the fork's source project