提交 · 2497f4392fe60f4c72e9b7ff5de9b8b6117aacac · PaddlePaddle / Paddle

18 2月, 2021 2 次提交
- W
  
  Handle missing symlink method on Windows (#31006) · 2497f439
  由 Wojciech Uss 提交于 2月 17, 2021
  
  2497f439
- A
  [CustomOp] Check Compiler ABI compatibility (#30869) · 5653c3a4
  由 Aurelius84 提交于 2月 18, 2021
```
* support setup.py to compile custom op

* move file into paddle.utils.cpp_extension

* support python setup.py install

* refine code style

* Enrich code and add unittest
```
  5653c3a4
11 2月, 2021 1 次提交
- H
  
  fix lrn bug in reshape size, test=develop (#30968) · 20e300e2
  由 huangjun12 提交于 2月 11, 2021
  
  20e300e2
10 2月, 2021 2 次提交

W

delay timeout of unnittest 'test_static_save_load'. (#30975) · 8ab29f4b
由 WeiXin 提交于 2月 10, 2021

8ab29f4b

New custom operator extension mechanism (#30690) · f649442d

由 Chen Weihang 提交于 2月 09, 2021

* initial commit: simple demo

* polish copyright format

* add grap op simple demo

* adapt uncertain number of argument

* change trait marco name

* add place & dtype support for add kernel

* add dispath and infershape func

* poish code & add notes

* add dynamic_loader dep for paddle_framework

* add new custom op test dir

* polish impl details

* add unittest for new custom op

* fix failed unittest

* Costum op (#1)

* fix compile error

* wrap framework tensor with LoDTensor

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* add CustomTensor default constructor

* add size() for CustomTensor

* make size const for CustomTensor

* refactor place related api to circle the concept

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* make place const

* make Tensor copy

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* remove additional head of framework

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* add gpu test

* merge latest cwh code in

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* Remove ShareData from user && Change CustomTensor to Tensor && Support more data type (#2)

* fix compile error

* wrap framework tensor with LoDTensor

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* add CustomTensor default constructor

* add size() for CustomTensor

* make size const for CustomTensor

* refactor place related api to circle the concept

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* make place const

* make Tensor copy

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* remove additional head of framework

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* add gpu test

* merge latest cwh code in

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* hid share data from and to

* rename CustomTensor to Tensor

* refactor register design & add test

* change op_funtion to op_meta_info

* split op meta info into .h and .cc

* move get methods into friend class

* move OpMetaInfoHelper into framework space

* move CustomTensorUtils into framework space

* change pybind api name

* move PD C API into op meta info

* add register custom op api

* remove inference cmake change

* refactor copy to api && change Reshape to lowercase && support more dtype && add more test (#3)

* fix compile error

* wrap framework tensor with LoDTensor

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* add CustomTensor default constructor

* add size() for CustomTensor

* make size const for CustomTensor

* refactor place related api to circle the concept

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* make place const

* make Tensor copy

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* remove additional head of framework

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* add gpu test

* merge latest cwh code in

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* hid share data from and to

* rename CustomTensor to Tensor

* support multi dtype

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* fix copy to error

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* polish detail & error message

* polish test details

* Add cast api && Change copy related api to copy_to && add more test (#4)

* fix compile error

* wrap framework tensor with LoDTensor

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* add CustomTensor default constructor

* add size() for CustomTensor

* make size const for CustomTensor

* refactor place related api to circle the concept

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* fix compile error

* make place const

* make Tensor copy

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* debug CustomTensor core

* remove additional head of framework

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* use back to shared ptr for custom tensor

* add gpu test

* merge latest cwh code in

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* adjust ut code of custom op

* hid share data from and to

* rename CustomTensor to Tensor

* support multi dtype

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* remove lod, make reshape lowercase, add copy test and refactor copy api

* fix copy to error

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add more test

* add type cast

* add cast and make copy to api

* add cast and make copy to api

* add cast and make copy to api

* add cast and make copy to api

* merge cwh code

* merge cwh code

* merge cwh code

* merge cwh code

* merge cwh code

* add more error log

* add more error log

* polish code

* used for test

* remove test comment

* remove test comment

* fix uint8 type error

* fix lost uint8 type error

* add test for coverage

* polish details by reviewer comments

* add prefix for DISABLE_COPY_AND_ASSIGN
Co-authored-by: NJiabin Yang <360788950@qq.com>

f649442d

09 2月, 2021 2 次提交
- C
  support label with float input of cross_entropy, test=develop (#30929) · f5ca2db2
  由 chajchaj 提交于 2月 09, 2021
```
* support label with float input of cross_entropy, test=develop

* fix code style in nn/functional/loss.py, test=develop
```
  f5ca2db2
- C
  
  try to fix reader and signal test failed (#30960) · 010f2caa
  由 Chen Weihang 提交于 2月 08, 2021
  
  010f2caa
08 2月, 2021 2 次提交
- L
  
  [Static setitem] Support index is ellipsis for setitem in static mode (#30836) · 12c15beb
  由 liym27 提交于 2月 08, 2021
  
  12c15beb
- L
  
  [kunlun]fix sync in multi kunlun xpu dygraph training. (#30943) · 87197f8c
  由 liuyuhui 提交于 2月 08, 2021
  
  87197f8c
07 2月, 2021 1 次提交
- W
  fix a bug of Sequential::__getitem__ (#30899) · 823f499a
  由 wanghuancoder 提交于 2月 07, 2021
```
* fix a bug of Sequential::__getitem__, test=develop

* add testcase, test=develop
```
  823f499a
06 2月, 2021 1 次提交
- J
  
  [oneDNN] Added basic changes for elementwise_add_grad bf16 (#30925) · 9e527d99
  由 Jacek Czaja 提交于 2月 06, 2021
  
  9e527d99
05 2月, 2021 4 次提交
- L
  
  [Kunlun] add gen_bkcl_id_op, support multi XPU cards training using multiprocess (#30858) · 4a8b8b45
  由 liuyuhui 提交于 2月 05, 2021
  
  4a8b8b45
- W
  
  let LayerList could add [None], test=develop (#30911) · 90d92111
  由 wanghuancoder 提交于 2月 05, 2021
  
  90d92111
- T
  
  dyngraph (#30892) · 24873f4f
  由 taixiurong 提交于 2月 05, 2021
  
  24873f4f
- Z
  Use correct master weights in AdamW. (#30895) · 71acde9a
  由 Zhen Wang 提交于 2月 05, 2021
```
* Use correct master weights in AdamW.

* Just modify the master weight.

* Update for CI Coverage.
```
  71acde9a
04 2月, 2021 2 次提交
- J
  
  [oneDNN]Extended adaptive pooling support for oneDNN pool kernel (#30757) · abfa8226
  由 Jacek Czaja 提交于 2月 04, 2021
  
  abfa8226
- Z
  
  improve performance of momentum (#30881) · e97905c5
  由 Zhang Ting 提交于 2月 04, 2021
  
  e97905c5
03 2月, 2021 10 次提交
- C
  
  add clip_by_norm on kunlun, *test=kunlun (#30862) · ac2e2e6b
  由 cucuzg 提交于 2月 03, 2021
  
  ac2e2e6b
- K
  
  remove numpy array check in single-process dataloader. test=develop (#30861) · 30242717
  由 Kaipeng Deng 提交于 2月 03, 2021
  
  30242717
- W
  fix the broadcast for the large second input (#30818) · b7560a59
  由 wawltor 提交于 2月 03, 2021
```
fix the broadcast for the large second input 
```
  b7560a59
- J
  
  Implement cuda kernel for index_sample. (#30380) · 6e1e036a
  由 JamesLim 提交于 2月 03, 2021
  
  6e1e036a
- A
  
  Call new cudnn batch norm API regardless of data type and data layout (#30157) · 666efc23
  由 AshburnLee 提交于 2月 03, 2021
  
  666efc23
- 石
  support xpu with analysis predictor, test=develop (#30832) · 2ac4143b
  由石晓伟提交于 2月 03, 2021
```
* support xpu inference with analysis predictor, test=develop

* merge the cmake of the xpu toolchain, test=develop

* add c-apis, test=develop

* fix a bug in extern_xpu, test=develop
```
  2ac4143b
- J
  Update paddle.static.Print with paddle2.0 api (#30846) · 05d2b7a3
  由 joejiong 提交于 2月 03, 2021
```
As the title
```
  05d2b7a3
- A
  [CustomOp] Support install as Package and Add load interface (#30798) · e49d0746
  由 Aurelius84 提交于 2月 03, 2021
```
* support setup.py to compile custom op

* move file into paddle.utils.cpp_extension

* support python setup.py install

* refine code style

* Enrich code and add unittest

* Polish code and api doc

* fix cpp_extension not include in package

* fix relative import

* fix os.makedirs exist_ok param compatibility PY2

* add compile flags in test_jit_load
```
  e49d0746
- A
  
  Layer normalization fuse pass. (#30721) · 4f066e31
  由 Adam Osewski 提交于 2月 03, 2021
  
  4f066e31
- W
  
  【kunlun】dygraph supports multi xpu card training (#30671) · b1026f64
  由 WangXi 提交于 2月 03, 2021
  
  b1026f64
02 2月, 2021 2 次提交
- L
  Fix unittest random failed of test_datasets (#30804) · 3a3ff75c
  由 LielinJiang 提交于 2月 02, 2021
```
* fix test_datasets unittest
```
  3a3ff75c
- S
  fix trt plugin clone and initialize bugs in TRT7.1+ (#30709) · b9094509
  由 Shang Zhizhou 提交于 2月 02, 2021
```
* fix trt plugin clone and initialize bugs

* fix unit test error

* enable trt in ci py3

* update unittest timeout
```
  b9094509
01 2月, 2021 3 次提交
- S
  
  fix unittest random error (#30808) · 200ee33d
  由 Shang Zhizhou 提交于 2月 01, 2021
  
  200ee33d
- X
  Optimize the encoder of Transformer. (#30439) · db870872
  由 xiemoyuan 提交于 2月 01, 2021
```
* Add cache for Transformer encoder.

* Bug fixed.

* add unittests for transformer encoder.
```
  db870872
- W
  
  Fleet distributed strategy support pure fp16 (#30754) · 31ed9c9e
  由 WangXi 提交于 2月 01, 2021
  
  31ed9c9e
29 1月, 2021 2 次提交
- A
  
  【CustomOp】support setup.py to compile custom op (#30753) · 2c974cc3
  由 Aurelius84 提交于 1月 29, 2021
  
  2c974cc3
- J
  
  fix paddle.static.acc and auc sample code bug, test=document_fix (#30715) · 65a9744c
  由 Jiaqi Liu 提交于 1月 29, 2021
  
  65a9744c
28 1月, 2021 3 次提交
- W
  
  A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723) · fc002405
  由 Wojciech Uss 提交于 1月 28, 2021
  
  fc002405
- T
  
  add readme in whl package (#30726) · a12b6bb9
  由 tianshuo78520a 提交于 1月 28, 2021
  
  a12b6bb9
- W
  
  Split unittest. (#30727) · 3491acfb
  由 WeiXin 提交于 1月 28, 2021
  
  3491acfb
27 1月, 2021 3 次提交

update gather_tree doc (#30693) · a87d78f1

由 liu zhengxi 提交于 1月 27, 2021

* update gather_tree doc, test=document_fix

* update sample code, test=document_fix

* remove tensor type, test=document_fix

a87d78f1

L
upgrade gather_tree to core.ops (#30697) · fef3654b
由 liu zhengxi 提交于 1月 27, 2021
```
* upgrade gather_tree to core.ops

* update gather_tree unittests
```
fef3654b

REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719) · f8da5536

由 jakpiase 提交于 1月 27, 2021

* added external reorder to profiler

* resolved conflict

* added enable_static

* initial version of lstm, not working yet

* added lstm to operators.cmake

* added vanilla lstm mkldnn op

* added peephole weights integration

* minor changes

* added formatting

* added fusion_lstm_mkldnn to static_whitelist

* added formatting

* removed comment

* moved use_peepholes attribute inside is_cached block

* reverted wrong changes

* minor formatting change

* minor changes

* changed stream handling

* minor change

* added datatype to GetExpectedKernelType()

* added reading stream from TLS

f8da5536

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功