提交 · 8db6d221772d95fe96181d199b1458b3707e0cfd · Crayon鑫 / Paddle

27 9月, 2021 1 次提交

support saving model defined parameters without add scale_op (#36119) · 8db6d221

由 Haipeng Wang 提交于 9月 27, 2021

* add scale_op in model save step is not necessary, just fix the prune method to support static graph and inplace op

* fix jit.save, no need to add scale_op to each outputvar anymore.
fix prune_with_input, now it supports inplace op

* temporarily disable test_trt_dynamic_shape.TRTDynamicShapeOutOfBound2Test

* allow user to export parameters defined in model

8db6d221

26 9月, 2021 9 次提交
- J
  [new api] add func/class API psroi_pool and UT (#35352) · e45d64ec
  由 JYChen 提交于 9月 26, 2021
```
* add func/class API psroi_pool and UT

* add UT in static mode

* Remove redundant type checks in static mode

* More detailed description for test_psroi_pool_op

* fix code format of UT

* fix en-doc
```
  e45d64ec
- L
  
  Correct the misspelled part of the unit test (#36044) · 991ae3b6
  由 LJQ❤️ 提交于 9月 26, 2021
  
  991ae3b6
- Z
  
  update multi_dot exposure rules (#36018) · 52b45007
  由 zhangkaihuo 提交于 9月 26, 2021
  
  52b45007
- A
  
  fix pinv api explosure rule (#36093) · c330c3d9
  由 andyjpaddle 提交于 9月 26, 2021
  
  c330c3d9
- T
  set file_num in one shard (#35835) · 991dc67d
  由 Thunderbrook 提交于 9月 26, 2021
```
* set file_num in one shard

* format
```
  991dc67d
- Z
  modify adam to adamw in AdamW (#36028) · 49c8253f
  由 zhangbo9674 提交于 9月 26, 2021
```
* adam to adamw in AdamW

* add lr_ratio in adamw

* refine logic bug in cpu adamw

* delete fix bug for cpu adamw

* delete fix bug for cpu adamw
```
  49c8253f
- C
  
  CPU forward calculation replaces Eigen with Lapack;Modify linalg exposure rules (#35916) · 7ff226f0
  由 crystal 提交于 9月 26, 2021
  
  7ff226f0
- W
  
  修改了示例代码错误 (#36041) · d70e45d9
  由 wangzhuang01 提交于 9月 26, 2021
  
  d70e45d9
- Y
  
  add doc for two softmax fuse api, test=document_fix (#35943) · 97922557
  由 Yuang Liu 提交于 9月 26, 2021
  
  97922557
24 9月, 2021 12 次提交

J
add gradient kernel of det op and slogdet op (#36013) · b91e8eec
由 jiangcheng 提交于 9月 24, 2021
```
* add gradient kernel of det op and slogdet op

* fix CI APPROVAL problem
```
b91e8eec

Added elementwise_sub_mkldnn operator (#35662) · 787273ed

由 piotrekobiIntel 提交于 9月 24, 2021

* Add elementwise_sub_mkldnn_op without grad

* Add test to static_mode_white_list

* Refactor code, change license years

* Remove invalid grad implementation

* Fix element_wise_sub_op test

* Fix CI Approval error

* Remove unnecessary EltwiseSubMKLDNNGradKernel class

* Fix CI Approval 2

* Fix CI Approval 3

* Fix CI Approval Attempt #4

* Fix CI Approve Attempt #5

* Fix CI Approval Attempt #6

* Fix CI Approval Attemt #7

* Change test names containing add to sub

* Fix old tests testing add instead of sub

* Copy grad implementation from elementwise_add_mkldnn

* CI test fix attempt

* Revert "CI test fix attempt"

This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.

* Fix CI attempt 2

* Fix elementwise_sub tests, temporary mkldnn broadcast test disable

* Add working implementation of elementwise_sub grad

* Fix build errors caused by pull

* Fix format error

* Fix format error 2

* Disable elementwise_sub_mkldnn test on GPU

* Apply fix for paddle.fluid import

* Revert changes of test_elementwise_sub and Fix mkldnn test

* Revert "Apply fix for paddle.fluid import"

This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.

* fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)

* Add changes suggested by reviewers

* Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI

* Remove check_dygraph=False to satisify CI Approval
Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>

787273ed

S

add update (#36017) · 1691dc7a
由 ShenLiang 提交于 9月 24, 2021

1691dc7a

add pool2d convert test (#35923) · 82f255d0

由 JingZhuangzhuang 提交于 9月 24, 2021

* add pool2d convert test

* modify error

* modify error

* modify error

* modify error

* modify error

* modify error

82f255d0

K

fix undefined var in test_batch_sampler. test=develop (#35924) · 4f42e5d7
由 Kaipeng Deng 提交于 9月 24, 2021

4f42e5d7
W

concat api support empty tensor. (#35845) · eb28a36d
由 wuhuachaocoding 提交于 9月 24, 2021

eb28a36d
fix pad tuple (#35985) · 0c0817cf
由 littletomatodonkey 提交于 9月 24, 2021
```
* fix pad tuple

* fix format
```
0c0817cf

Add paddle.linalg.solve OP (#35715) · 8caf951c

由 Weilong Wu 提交于 9月 24, 2021

* Add linalg.solve op, test=develop

* Fix a bug caused by accidental deletion

* updated description and fix a bug: missing a comma

* Add linalg.solve op, test=develop

* updated solve op backward logic

* updated solve op backward logic again

* Add linalg.solve Op, test=develop

* Updated and modified to fit CI requirements

* Fix a bug

* 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code

* Remove redundant comments

* 1)Removed redundant code; 2)Updated to enhance code robustness

* Removed redundant code

* Updated API documents

8caf951c

fix distributed ops combining problems (#35942) · 4c35f515

由 seemingwang 提交于 9月 24, 2021

* graph engine demo

* upload unsaved changes

* fix dependency error

* fix shard_num problem

* py client

* remove lock and graph-type

* add load direct graph

* add load direct graph

* add load direct graph

* batch random_sample

* batch_sample_k

* fix num_nodes size

* batch brpc

* batch brpc

* add test

* add test

* add load_nodes; change add_node function

* change sample return type to pair

* resolve conflict

* resolved conflict

* resolved conflict

* separate server and client

* merge pair type

* fix

* resolved conflict

* fixed segment fault; high-level VLOG for load edges and load nodes

* random_sample return 0

* rm useless loop

* test:load edge

* fix ret -1

* test: rm sample

* rm sample

* random_sample return future

* random_sample return int

* test fake node

* fixed here

* memory leak

* remove test code

* fix return problem

* add common_graph_table

* random sample node &test & change data-structure from linkedList to vector

* add common_graph_table

* sample with srand

* add node_types

* optimize nodes sample

* recover test

* random sample

* destruct weighted sampler

* GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* WeightedGraphEdgeBlob to GraphEdgeBlob

* pybind sample nodes api

* pull nodes with step

* fixed pull_graph_list bug; add test for pull_graph_list by step

* add graph table;name

* add graph table;name

* add pybind

* add pybind

* add FeatureNode

* add FeatureNode

* add FeatureNode Serialize

* add FeatureNode Serialize

* get_feat_node

* avoid local rpc

* fix get_node_feat

* fix get_node_feat

* remove log

* get_node_feat return  py:bytes

* merge develop with graph_engine

* fix threadpool.h head

* fix

* fix typo

* resolve conflict

* fix conflict

* recover lost content

* fix pybind of FeatureNode

* recover cmake

* recover tools

* resolve conflict

* resolve linking problem

* code style

* change test_server port

* fix code problems

* remove shard_num config

* remove redundent threads

* optimize start server

* remove logs

* fix code problems by reviewers' suggestions

* move graph files into a folder

* code style change

* remove graph operations from base table

* optimize get_feat function of graph engine

* fix long long count problem

* remove redandunt graph files

* remove unused shell

* recover dropout_op_pass.h

* fix potential stack overflow when request number is too large & node add & node clear & node remove

* when sample k is larger than neigbor num, return directly

* using random seed generator of paddle to speed up

* fix bug of random sample k

* fix code style

* fix code style

* add remove graph to fleet_py.cc

* fix blocking_queue problem

* fix style

* fix

* recover capacity check

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* add remove graph node; add set_feature

* fix distributed op combining problems

* optimize

* remove logs
Co-authored-by: NHuang Zhengjie <270018958@qq.com>
Co-authored-by: NWeiyue Su <weiyue.su@gmail.com>
Co-authored-by: Nsuweiyue <suweiyue@baidu.com>
Co-authored-by: Nluobin06 <luobin06@baidu.com>
Co-authored-by: Nliweibin02 <liweibin02@baidu.com>
Co-authored-by: Ntangwei12 <tangwei12@baidu.com>

4c35f515

B

add emb_eltwise_layernorm trt converter test case (#36027) · 0bbaf9bd
由 baoachun 提交于 9月 24, 2021

0bbaf9bd
B
add multihead_matmul trt converter test case (#36023) · fcaa64b3
由 baoachun 提交于 9月 24, 2021
```
* add multihead_matmul trt converter test case

* move attribute check to op_teller
```
fcaa64b3
W
add the shape check for the matmul (#35791) · 8e19d1ba
由 wawltor 提交于 9月 24, 2021
```
* add the shape check for the matmul

* remove the test case for the linear
```
8e19d1ba

23 9月, 2021 1 次提交

add argmax and iou_similarity for kunlun (#35836) · 7bf84e2d

由 TTerror 提交于 9月 23, 2021

* add argmax and iou_similarity for kunlun

* add argmax and iou_similarity for kunlun

* add argmax and iou_similarity for kunlun

7bf84e2d

22 9月, 2021 11 次提交

Z

fix adamw DeprecationWarining (#35869) · f67a50bd
由 zhaoyingli 提交于 9月 22, 2021

f67a50bd

[AMP]split minimize and add unscale_ for GradScaler (#35825) · bf6f0e54

由 zhangbo9674 提交于 9月 22, 2021

* split minimize() to step() + update()

* add unscale and step for grad_scaler

* add unittest

* refine code in minimize

* delete step in loss_scaler

* fix example bug

* refine comment

* refine unittest

* add unittest

bf6f0e54

R
[NPU] add randperm_op_npu (#35763) · 4f0c3278
由 ronnywang 提交于 9月 22, 2021
```
* add randperm_op_npu

* fix test_set_value_op_npu
```
4f0c3278

op:transpose_op supports bool type (#35886) · 0c6ee945

由 TeslaZhao 提交于 9月 22, 2021

* Pass compat of conv_transpose_bias_mkldnn_fuse_pass

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of transpose op, about accessing memory out of bounds of the perm param

* op:transpose_op supports bool type

0c6ee945

Det &Slogdet (#34992) · 9ce45ddd

由 huangxu96 提交于 9月 22, 2021

Add new API : paddle.linalg.det & paddle.linalg.slogdet

API Alias：paddle.det& paddle.slogdet

9ce45ddd

fix conv2d convert test (#35627) · 1238115e

由 JingZhuangzhuang 提交于 9月 21, 2021

* support nnadapter and ascend310

* modify code

* add anchor_generator convert test

* add gelu convert test

* add conv2d convert test

* modify anchor_operator convert test

* modify conv2d test

* modify con2d convert test

* modify conv2d convert test

* modify conv2d convert test

* modify conv2d test

* fix WITH_PYTHON compile error

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file

* modify test file
Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>
Co-authored-by: Njiweibo <jiweibo@baidu.com>

1238115e

J

Add quant2 int8 lstm model test (#35887) · be4d0026
由 joanna.wozna.intel 提交于 9月 22, 2021

be4d0026
W
fix feed for new executor (#35803) · 4c2a06df
由 wanghuancoder 提交于 9月 21, 2021
```
* fix feed, test=develop

* delete one test case, test=develop
```
4c2a06df
F

disable tests for fft on windows with gpu (#35872) · 5af6081a
由 Feiyu Chan 提交于 9月 22, 2021

5af6081a
Z

fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862) · 12ab017e
由 zhangbo9674 提交于 9月 22, 2021

12ab017e
W

add dilation check for conv (#35838) · 77134300
由 wangguanzhong 提交于 9月 22, 2021

77134300

21 9月, 2021 2 次提交

G

support fp16 (#35888) · 087c23a9
由 Guoxia Wang 提交于 9月 21, 2021

087c23a9

Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (#35510) · 799f3861

由 Adam Osewski 提交于 9月 20, 2021

* Create stateful OneDNNAXPYHandler object.

This makes it possible to call it multiple times without recreating the
oneDNN primitives every time.

* Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.

* OneDNN SGD kernel.

* Update call to use new OneDNNAXPYHandler object api.

* Setup seed in proper place.

* Enable OneDNN kernel only for single case.

* For dense param and sparse grad.

* Small refactor.

* Enable oneDNN by op attr or by cmd line flag.

* Use int64_t type for number of elements.

* Support dense param and grad from OneDNN kernel.

* Enable SGD OneDNN kernel when use MP BF16 optimizer.

* Force non-copyable/movable OneDNNAXPYHandler.

* Reuse OneDNNAXPYHandler for spare tensors in SUM op.

* Fix SFINAE rules.

* Remove recording event inside AXPY.

* Get rid of internal primitive caching.

* Stop use PP cache mechanims to store mem and primitive obj.
* Handler obj store and reuse needed desc & prim

* Do not derive from MKLDNNHandlerT

799f3861

19 9月, 2021 1 次提交
- B
  
  add hard_sigmoid trt converter test cases (#35876) · 9f88d327
  由 baoachun 提交于 9月 19, 2021
  
  9f88d327
18 9月, 2021 3 次提交
- Z
  
  increase test_imperative_auto_mixed_precision timePROPERTIES TIMEOUT (#35863) · e7617512
  由 zhangbo9674 提交于 9月 18, 2021
  
  e7617512
- W
  
  [hybird] fix pipeline section program Parameter (#35847) · 67c63639
  由 WangXi 提交于 9月 18, 2021
  
  67c63639
- H
  Basic PR on Cost Model (#35774) · 5ba9fe6e
  由 Huihuang Zheng 提交于 9月 18, 2021
```
Add basic Cost Model, it uses executor to run program and profile it to get op time.

This is an early basic version, we will add more functions in the future.
```
  5ba9fe6e

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致