Commits · a112ce4260b51966beef01ee8ca43210ce280095 · Crayon鑫 / Paddle

27 Sep, 2021 · 4 commits

Added flatten and flatten2 BF16/FP32 FWD/BWD kernels (#35892) · e427a0f1

Committed by jakpiase on Sep 27, 2021

* refactored reshape multiop kernel and added flatten1/2 kernels

* added formatting for flatten tests

* CI fix

* disabled reshape_kernel ops after successful CI run

* minor fix

e427a0f1

Add functional autograd API: jacobian (#35917) · ec2f68e8

Committed by levi131 on Sep 27, 2021

* init functional jacobian api

* finish test with dtype float32

* add float64 test case

* polish code

* use atol=1e-5 with dtype float64

* fix for ci

* set timeout for test_jacobian

* polish API docstring

* modify docstring

ec2f68e8
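
For readers unfamiliar with what a Jacobian API returns, the following is a minimal sketch in plain Python. It is not the autograd-based implementation added in this commit: it estimates the matrix by central finite differences, and `numerical_jacobian`, `f`, and the `eps` parameter are hypothetical names chosen for illustration. A numerical estimate of this kind is the sort of reference value a tolerance-based test can compare against.

```python
def numerical_jacobian(func, x, eps=1e-6):
    """Estimate J[i][j] = d func(x)[i] / d x[j] by central differences.

    `func` maps a list of floats to a list of floats. This is an
    illustrative helper, not part of any Paddle API.
    """
    n_out = len(func(x))
    jac = [[0.0] * len(x) for _ in range(n_out)]
    for j in range(len(x)):
        # Perturb only the j-th input in both directions.
        x_plus = list(x)
        x_plus[j] += eps
        x_minus = list(x)
        x_minus[j] -= eps
        y_plus = func(x_plus)
        y_minus = func(x_minus)
        for i in range(n_out):
            jac[i][j] = (y_plus[i] - y_minus[i]) / (2.0 * eps)
    return jac


def f(v):
    """Toy function R^2 -> R^2 with the known Jacobian [[y, x], [1, 1]]."""
    x, y = v
    return [x * y, x + y]


J = numerical_jacobian(f, [2.0, 3.0])
for row in J:
    print([round(value, 6) for value in row])
```

At the point (2, 3) the analytic Jacobian of `f` is [[3, 2], [1, 1]], so the estimate should agree with it to within a small tolerance.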

Add roi pool (#35084) · 6d62769a

Committed by Wenyu on Sep 27, 2021

* add roi pool

* rename input as x

6d62769a

support saving model defined parameters without add scale_op (#36119) · 8db6d221

Committed by Haipeng Wang on Sep 27, 2021

* adding scale_op in the model save step is not necessary; just fix the prune method to support static graph and inplace op

* fix jit.save: no need to add scale_op to each output var anymore; fix prune_with_input so that it now supports inplace op

* temporarily disable test_trt_dynamic_shape.TRTDynamicShapeOutOfBound2Test

* allow user to export parameters defined in model

8db6d221

26 Sep, 2021 · 9 commits

[new api] add func/class API psroi_pool and UT (#35352) · e45d64ec

Committed by JYChen on Sep 26, 2021

* add func/class API psroi_pool and UT

* add UT in static mode

* Remove redundant type checks in static mode

* More detailed description for test_psroi_pool_op

* fix code format of UT

* fix en-doc

e45d64ec

Correct the misspelled part of the unit test (#36044) · 991ae3b6

Committed by LJQ❤️ on Sep 26, 2021

991ae3b6

update multi_dot exposure rules (#36018) · 52b45007

Committed by zhangkaihuo on Sep 26, 2021

52b45007

fix pinv api exposure rule (#36093) · c330c3d9

Committed by andyjpaddle on Sep 26, 2021

c330c3d9

set file_num in one shard (#35835) · 991dc67d

Committed by Thunderbrook on Sep 26, 2021

* set file_num in one shard

* format

991dc67d

modify adam to adamw in AdamW (#36028) · 49c8253f

Committed by zhangbo9674 on Sep 26, 2021

* adam to adamw in AdamW

* add lr_ratio in adamw

* refine logic bug in cpu adamw

* delete fix bug for cpu adamw

* delete fix bug for cpu adamw

49c8253f

CPU forward calculation replaces Eigen with Lapack; Modify linalg exposure rules (#35916) · 7ff226f0

Committed by crystal on Sep 26, 2021

7ff226f0

Fixed errors in the example code (#36041) · d70e45d9

Committed by wangzhuang01 on Sep 26, 2021

d70e45d9

add doc for two softmax fuse api, test=document_fix (#35943) · 97922557

Committed by Yuang Liu on Sep 26, 2021

97922557

24 Sep, 2021 · 12 commits

add gradient kernel of det op and slogdet op (#36013) · b91e8eec

Committed by jiangcheng on Sep 24, 2021

* add gradient kernel of det op and slogdet op

* fix CI APPROVAL problem

b91e8eec

Added elementwise_sub_mkldnn operator (#35662) · 787273ed

Committed by piotrekobiIntel on Sep 24, 2021

* Add elementwise_sub_mkldnn_op without grad

* Add test to static_mode_white_list

* Refactor code, change license years

* Remove invalid grad implementation

* Fix element_wise_sub_op test

* Fix CI Approval error

* Remove unnecessary EltwiseSubMKLDNNGradKernel class

* Fix CI Approval 2

* Fix CI Approval 3

* Fix CI Approval Attempt #4

* Fix CI Approve Attempt #5

* Fix CI Approval Attempt #6

* Fix CI Approval Attempt #7

* Change test names containing add to sub

* Fix old tests testing add instead of sub

* Copy grad implementation from elementwise_add_mkldnn

* CI test fix attempt

* Revert "CI test fix attempt"

This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.

* Fix CI attempt 2

* Fix elementwise_sub tests, temporary mkldnn broadcast test disable

* Add working implementation of elementwise_sub grad

* Fix build errors caused by pull

* Fix format error

* Fix format error 2

* Disable elementwise_sub_mkldnn test on GPU

* Apply fix for paddle.fluid import

* Revert changes of test_elementwise_sub and Fix mkldnn test

* Revert "Apply fix for paddle.fluid import"

This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.

* fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)

* Add changes suggested by reviewers

* Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI

* Remove check_dygraph=False to satisfy CI Approval

Co-authored-by: zhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>

787273ed

add update (#36017) · 1691dc7a

Committed by ShenLiang on Sep 24, 2021

1691dc7a

add pool2d convert test (#35923) · 82f255d0

Committed by JingZhuangzhuang on Sep 24, 2021

* add pool2d convert test

* modify error

* modify error

* modify error

* modify error

* modify error

* modify error

82f255d0

fix undefined var in test_batch_sampler. test=develop (#35924) · 4f42e5d7

Committed by Kaipeng Deng on Sep 24, 2021

4f42e5d7

concat api support empty tensor. (#35845) · eb28a36d

Committed by wuhuachaocoding on Sep 24, 2021

eb28a36d

fix pad tuple (#35985) · 0c0817cf

Committed by littletomatodonkey on Sep 24, 2021

* fix pad tuple

* fix format

0c0817cf

Add paddle.linalg.solve OP (#35715) · 8caf951c

Committed by Weilong Wu on Sep 24, 2021

* Add linalg.solve op, test=develop

* Fix a bug caused by accidental deletion

* updated description and fix a bug: missing a comma

* Add linalg.solve op, test=develop

* updated solve op backward logic

* updated solve op backward logic again

* Add linalg.solve Op, test=develop

* Updated and modified to fit CI requirements

* Fix a bug

* 1) Add more test cases; 2) Fix a wrong usage in reduces operation; 3) Remove redundant code

* Remove redundant comments

* 1) Removed redundant code; 2) Updated to enhance code robustness

* Removed redundant code

* Updated API documents

8caf951c

fix distributed ops combining problems (#35942) · 4c35f515

Committed by seemingwang on Sep 24, 2021

* graph engine demo

* upload unsaved changes

* fix dependency error

* fix shard_num problem

* py client

* remove lock and graph-type

* add load direct graph

* add load direct graph

* add load direct graph

* batch random_sample

* batch_sample_k

* fix num_nodes size

* batch brpc

* batch brpc

* add test

* add test

* add load_nodes; change add_node function

* change sample return type to pair

* resolve conflict

* resolved conflict

* resolved conflict

* separate server and client

* merge pair type

* fix

* resolved conflict

* fixed segmentation fault; high-level VLOG for load edges and load nodes

* random_sample return 0

* rm useless loop

* test:load edge

* fix ret -1

* test: rm sample

* rm sample

* random_sample return future

* random_sample return int

* test fake node

* fixed here

* memory leak

* remove test code

* fix return problem

* add common_graph_table

* random sample node & test & change data structure from linkedList to vector

* add common_graph_table

* sample with srand

* add node_types

* optimize nodes sample

* recover test

* random sample

* destruct weighted sampler

* GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* pybind sample nodes api

* pull nodes with step

* fixed pull_graph_list bug; add test for pull_graph_list by step

* add graph table;name

* add graph table;name

* add pybind

* add pybind

* add FeatureNode

* add FeatureNode

* add FeatureNode Serialize

* add FeatureNode Serialize

* get_feat_node

* avoid local rpc

* fix get_node_feat

* fix get_node_feat

* remove log

* get_node_feat return py:bytes

* merge develop with graph_engine

* fix threadpool.h head

* fix

* fix typo

* resolve conflict

* fix conflict

* recover lost content

* fix pybind of FeatureNode

* recover cmake

* recover tools

* resolve conflict

* resolve linking problem

* code style

* change test_server port

* fix code problems

* remove shard_num config

* remove redundant threads

* optimize start server

* remove logs

* fix code problems by reviewers' suggestions

* move graph files into a folder

* code style change

* remove graph operations from base table

* optimize get_feat function of graph engine

* fix long long count problem

* remove redundant graph files

* remove unused shell

* recover dropout_op_pass.h

* fix potential stack overflow when request number is too large & node add & node clear & node remove

* when sample k is larger than neighbor num, return directly

* using random seed generator of paddle to speed up

* fix bug of random sample k

* fix code style

* fix code style

* add remove graph to fleet_py.cc

* fix blocking_queue problem

* fix style

* fix

* recover capacity check

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* fix distributed op combining problems

* optimize

* remove logs

Co-authored-by: Huang Zhengjie <270018958@qq.com>
Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
Co-authored-by: suweiyue <suweiyue@baidu.com>
Co-authored-by: luobin06 <luobin06@baidu.com>
Co-authored-by: liweibin02 <liweibin02@baidu.com>
Co-authored-by: tangwei12 <tangwei12@baidu.com>

4c35f515

add emb_eltwise_layernorm trt converter test case (#36027) · 0bbaf9bd

Committed by baoachun on Sep 24, 2021

0bbaf9bd

add multihead_matmul trt converter test case (#36023) · fcaa64b3

Committed by baoachun on Sep 24, 2021

* add multihead_matmul trt converter test case

* move attribute check to op_teller

fcaa64b3

add the shape check for the matmul (#35791) · 8e19d1ba

Committed by wawltor on Sep 24, 2021

* add the shape check for the matmul

* remove the test case for the linear

8e19d1ba

23 Sep, 2021 · 1 commit

add argmax and iou_similarity for kunlun (#35836) · 7bf84e2d

Committed by TTerror on Sep 23, 2021

* add argmax and iou_similarity for kunlun

* add argmax and iou_similarity for kunlun

* add argmax and iou_similarity for kunlun

7bf84e2d

22 Sep, 2021 · 11 commits

fix adamw DeprecationWarning (#35869) · f67a50bd

Committed by zhaoyingli on Sep 22, 2021

f67a50bd

[AMP]split minimize and add unscale_ for GradScaler (#35825) · bf6f0e54

Committed by zhangbo9674 on Sep 22, 2021

* split minimize() to step() + update()

* add unscale and step for grad_scaler

* add unittest

* refine code in minimize

* delete step in loss_scaler

* fix example bug

* refine comment

* refine unittest

* add unittest

bf6f0e54
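
The split of `minimize()` into `step()` + `update()`, with a separate `unscale_`, can be pictured with a toy dynamic loss scaler. This is a sketch under stated assumptions, not Paddle's `GradScaler`: the class `ToyGradScaler`, its constructor parameters, and the list-of-floats "parameters" are all hypothetical and only show why the three phases are useful as separate calls.

```python
import math


class ToyGradScaler:
    """Hypothetical dynamic loss scaler; illustrative only."""

    def __init__(self, init_scale=1024.0, incr_ratio=2.0,
                 decr_ratio=0.5, incr_every_n_steps=2):
        self.scale_value = init_scale
        self.incr_ratio = incr_ratio
        self.decr_ratio = decr_ratio
        self.incr_every_n_steps = incr_every_n_steps
        self._good_steps = 0
        self._found_inf = False
        self._unscaled = False

    def scale(self, loss):
        # Multiply the loss so small gradients survive low precision.
        return loss * self.scale_value

    def unscale_(self, grads):
        # Divide gradients in place and record any overflow.
        if self._unscaled:
            return
        for i, g in enumerate(grads):
            grads[i] = g / self.scale_value
        self._found_inf = any(not math.isfinite(g) for g in grads)
        self._unscaled = True

    def step(self, params, grads, lr):
        # Apply a plain SGD update unless an overflow was detected.
        self.unscale_(grads)
        if self._found_inf:
            return False
        for i, g in enumerate(grads):
            params[i] -= lr * g
        return True

    def update(self):
        # Shrink the scale after an overflow, grow it after a run of
        # clean steps, then reset the per-iteration flags.
        if self._found_inf:
            self.scale_value *= self.decr_ratio
            self._good_steps = 0
        else:
            self._good_steps += 1
            if self._good_steps >= self.incr_every_n_steps:
                self.scale_value *= self.incr_ratio
                self._good_steps = 0
        self._found_inf = False
        self._unscaled = False


scaler = ToyGradScaler(init_scale=8.0)
params = [1.0]

# Pretend backward ran on the scaled loss, so the gradient is scaled too.
grads = [scaler.scale(0.5)]
applied = scaler.step(params, grads, lr=0.1)
scaler.update()

# An overflowing gradient: the step is skipped and the scale is halved.
overflow_grads = [float("inf")]
skipped = not scaler.step(params, overflow_grads, lr=0.1)
scaler.update()
print(applied, skipped, params, scaler.scale_value)
```

Keeping `unscale_` separate lets a caller inspect or clip the true gradients between the backward pass and the parameter update, which a single fused `minimize()` call does not allow.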

[NPU] add randperm_op_npu (#35763) · 4f0c3278

Committed by ronnywang on Sep 22, 2021

* add randperm_op_npu

* fix test_set_value_op_npu

4f0c3278

op:transpose_op supports bool type (#35886) · 0c6ee945

Committed by TeslaZhao on Sep 22, 2021

* Pass compat of conv_transpose_bias_mkldnn_fuse_pass

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of transpose op, about accessing memory out of bounds of the perm param

* op:transpose_op supports bool type

0c6ee945

Det & Slogdet (#34992) · 9ce45ddd

Committed by huangxu96 on Sep 22, 2021

Add new API: paddle.linalg.det & paddle.linalg.slogdet

API Alias: paddle.det & paddle.slogdet

9ce45ddd
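
As background on how the two quantities relate, here is a minimal pure-Python sketch. It does not reproduce the Paddle kernels; the functions `slogdet` and `det` below are illustrative stand-ins that use Gaussian elimination with partial pivoting. The point is that `det = sign * exp(log|det|)`, and that the sign/log form avoids overflow for large matrices. Returning `(0.0, -inf)` for a singular matrix is a convention assumed here, the same one `numpy.linalg.slogdet` uses.

```python
import math


def slogdet(matrix):
    """Return (sign, log|det|) of a square matrix given as nested lists."""
    a = [[float(value) for value in row] for row in matrix]
    n = len(a)
    sign = 1.0
    log_abs = 0.0
    for k in range(n):
        # Partial pivoting: pick the largest entry in column k.
        pivot = max(range(k, n), key=lambda r: abs(a[r][k]))
        if a[pivot][k] == 0.0:
            return 0.0, float("-inf")  # singular matrix
        if pivot != k:
            a[k], a[pivot] = a[pivot], a[k]
            sign = -sign  # a row swap flips the determinant's sign
        if a[k][k] < 0.0:
            sign = -sign
        log_abs += math.log(abs(a[k][k]))
        # Eliminate the entries below the pivot.
        for r in range(k + 1, n):
            factor = a[r][k] / a[k][k]
            for c in range(k, n):
                a[r][c] -= factor * a[k][c]
    return sign, log_abs


def det(matrix):
    """Recover the determinant from the sign/log representation."""
    sign, log_abs = slogdet(matrix)
    return sign * math.exp(log_abs)


sign, log_abs = slogdet([[1, 2], [3, 4]])
print(sign, log_abs, det([[1, 2], [3, 4]]))
```

For the 2x2 matrix above the determinant is -2, so the sign is -1 and the log term is `log(2)`.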

fix conv2d convert test (#35627) · 1238115e

Committed by JingZhuangzhuang on Sep 21, 2021

* support nnadapter and ascend310

* modify code

* add anchor_generator convert test

* add gelu convert test

* add conv2d convert test

* modify anchor_operator convert test

* modify conv2d test

* modify conv2d convert test

* modify conv2d convert test

* modify conv2d convert test

* modify conv2d test

* fix WITH_PYTHON compile error

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

Co-authored-by: xiaoxiaohehe001 <hiteezsf@163.com>
Co-authored-by: jiweibo <jiweibo@baidu.com>

1238115e

Add quant2 int8 lstm model test (#35887) · be4d0026

Committed by joanna.wozna.intel on Sep 22, 2021

be4d0026

fix feed for new executor (#35803) · 4c2a06df

Committed by wanghuancoder on Sep 21, 2021

* fix feed, test=develop

* delete one test case, test=develop

4c2a06df

disable tests for fft on windows with gpu (#35872) · 5af6081a

Committed by Feiyu Chan on Sep 22, 2021

5af6081a

fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862) · 12ab017e

Committed by zhangbo9674 on Sep 22, 2021

12ab017e

add dilation check for conv (#35838) · 77134300

Committed by wangguanzhong on Sep 22, 2021

77134300

21 Sep, 2021 · 2 commits

support fp16 (#35888) · 087c23a9

Committed by Guoxia Wang on Sep 21, 2021

087c23a9

Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (#35510) · 799f3861

Committed by Adam Osewski on Sep 20, 2021

* Create stateful OneDNNAXPYHandler object.

This makes it possible to call it multiple times without recreating the
oneDNN primitives every time.

* Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.

* OneDNN SGD kernel.

* Update call to use new OneDNNAXPYHandler object api.

* Setup seed in proper place.

* Enable OneDNN kernel only for single case.

* For dense param and sparse grad.

* Small refactor.

* Enable oneDNN by op attr or by cmd line flag.

* Use int64_t type for number of elements.

* Support dense param and grad from OneDNN kernel.

* Enable SGD OneDNN kernel when use MP BF16 optimizer.

* Force non-copyable/movable OneDNNAXPYHandler.

* Reuse OneDNNAXPYHandler for sparse tensors in SUM op.

* Fix SFINAE rules.

* Remove recording event inside AXPY.

* Get rid of internal primitive caching.

* Stop using the PP cache mechanism to store mem and primitive obj.

* Handler obj stores and reuses needed desc & prim

* Do not derive from MKLDNNHandlerT

799f3861

19 Sep, 2021 · 1 commit

add hard_sigmoid trt converter test cases (#35876) · 9f88d327

Committed by baoachun on Sep 19, 2021

9f88d327
