提交 · 7ef04da6c1036cda80dc5bf85da41b3940b5d72c · Crayon鑫 / Paddle

06 9月, 2021 7 次提交

Add fusion_lstm INT8 PTQ (#35334) · 7ef04da6

由 joanna.wozna.intel 提交于 9月 06, 2021

* Add fusion_lstm INT8 PTQ

* Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test

* Change mkldnn_cache_capacity

7ef04da6

W
Add grad grad for AvgPool2D (#35388) · 97798f9a
由 Wei Shengyu 提交于 9月 06, 2021
```
* add pool2d grad grad

* dbg

* add unittest

* update format

* add more unittests

* dbg
```
97798f9a

add kernel, stride check (#35106) · 13bbb6b6

由 Double_V 提交于 9月 06, 2021

* add kernel, stride check

* add unitest for param out of range

* delete max limit check

13bbb6b6

[NPU]add depthwise_conv_npu_grad op (#35374) · 4bea0ff1

由 heliqi 提交于 9月 06, 2021

* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add depthwise_conv_npu_grad op

* add NHWC test case

4bea0ff1

W
support numpy dtype and polish code of list index. (#35404) · 60c5adaa
由 WeiXin 提交于 9月 06, 2021
```
* support numpy dtype and polish code of list index.

* polish code.
```
60c5adaa

replase pass with error exception (#35367) · 5675042d

由 Feng Xing 提交于 9月 06, 2021

This PR adds error exception in fused transformer python interface.
The function body are not implemented (will be implemented later).
Following zhiqiu's comment in previous PR-35206 (merged already), it is better to raise an exception instead of using "pass".

5675042d

W

update trt ut. (#35458) · 18934c53
由 Wilber 提交于 9月 06, 2021

18934c53

05 9月, 2021 1 次提交
- F
  [WIP] paddle.where api add broadcast, when x_shape == y_shape, and x_shape != cond_shape (#35092) · ffc3d364
  由 furnace 提交于 9月 05, 2021
```
* where op add broadcast, when x_shape == y_shape, and x_shape != cond_shape

* add static api tests, and delete debug codes
```
  ffc3d364
04 9月, 2021 1 次提交
- W
  
  update inference trt ut framework (#35418) · e8772486
  由 Wilber 提交于 9月 04, 2021
  
  e8772486
03 9月, 2021 12 次提交
- A
  
  disable test_standalone_executor temporarily (#35436) · e8a88164
  由 Aurelius84 提交于 9月 03, 2021
  
  e8a88164
- L
  support lodtensorarray for send/recv (#35279) · b6adfd97
  由 lilong12 提交于 9月 03, 2021
```
* support lodtensorarray
```
  b6adfd97
- Z
  [NPU] Add huber_loss op (#34826) · 4e67cd17
  由 zhulei 提交于 9月 03, 2021
```
* [NPU] Add huber_loss op

* [NPU] Add huber_loss op

* [NPU] Add huber_loss p[

* [NPU] Add huber_loss
```
  4e67cd17
- Q
  [NPU] add int64_t kernels for YoloV3, test=develop (#35045) · f014e301
  由 Qi Li 提交于 9月 03, 2021
```
* [NPU] add int64 kernels, test=develop

* update ci scripts to be able to trun WITH_ASCEND_INT64 on, test=develop
```
  f014e301
- J
  Add AsExtra for transpose, lstm, gru (#35317) · f13dcfb1
  由 Jack Zhou 提交于 9月 03, 2021
```
* Add AsExtra for transpose

* add AsExtra for lstm op

* add AsExtra for gru
```
  f13dcfb1
- F
  [iscan] bugfix: DLTP-33615 / DLTP-33953 / DLTP-33968 / DLTP-34166 (#35383) · b333dac0
  由 Fan Zhang 提交于 9月 03, 2021
```
* [iscan] bugfix

* test_standalone_executor modify
```
  b333dac0
- H
  [NPU]add conv2d_transpose npu op (#35232) · a9dfebb9
  由 heliqi 提交于 9月 03, 2021
```
* add conv2d_transpose npu op

* CopyRight 2020 to 2021

* add fp32

* delete repeat test case

* delete repeat test case

* fix paddle.NPUPlace
```
  a9dfebb9
- X
  
  fix a quantization bug (#35407) · 07126112
  由 XGZhang 提交于 9月 03, 2021
  
  07126112
- D
  
  fix flatten infershape (#35321) · ccd42db7
  由 danleifeng 提交于 9月 03, 2021
  
  ccd42db7
- 0
  
  [Dy2Stat]Modify dy2stat error message in runtime and format error message (#35365) · a6cc567f
  由 0x45f 提交于 9月 03, 2021
  
  a6cc567f
- W
  [NPU] Add elementwise_pow_grad npu op (#35278) · e913796c
  由 WJJ1995 提交于 9月 03, 2021
```
* add elementwise_pow_grad_npu

* fixed bug for CI

* deal with comments

* fixed bug for CI

* deal with comments
```
  e913796c
- add log_softmax_op_npu (#35006) · ba6a312d
  由沉潜的鱼儿提交于 9月 03, 2021
```
* add log_softmax_op_npu

* log_softmax_op_v1

* import test_log_softmax_grad
```
  ba6a312d
02 9月, 2021 7 次提交

[NPU] Support npu kernel for gather_nd op (#34800) · bb633965

由 JingZhuangzhuang 提交于 9月 02, 2021

* [NPU] Support npu kernel for gather_ng op

* [NPU] Support npu kernel for gather_nd op

* [NPU] Support npu kernel for gather_nd and gather_nd_grad op

* update py format error.

* modify gather_nd_op_npu

* modify gather_nd 910 test

* modify gather_nd 910 test
Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>

bb633965

Add SVD Op and it's GPU and CPU kernel (#34953) · 7e5fb462

由 xiongkun 提交于 9月 02, 2021

* Add SVD Op and it's GPU and CPU kernel

* Remove CUDAPlace in test_svd_op, make the test available in CPU package

* modfity the file

* fix windows bug/ fix ROCM / fix test timeout

* for pass the CIs

* improve error report

* for code review

* some modification to test_svd_op

* change python code style

* expose the svd interface for document

7e5fb462

Z
[NPU] Add label_smooth_op (#34828) · e57a88b3
由 zhulei 提交于 9月 02, 2021
```
* [NPU] Add label_smooth_op

* [NPU] Add label_smooth_op
```
e57a88b3
Y

[hybrid] [npu] fit npu nan/inf check (#35171) · 67ed7e12
由 Yuang Liu 提交于 9月 02, 2021

67ed7e12
W

fix static error in summary (#35303) · b28cc734
由 wangna11BD 提交于 9月 02, 2021

b28cc734

[Auto Parallel] Logical Partition & Dist Op (#35117) · a622b701

由 JZ-LIANG 提交于 9月 02, 2021

* support shard reader

* support shard reader

* add parallel mode

* update process mesh

* add method to compute comm_group

* implement dist_embedding forward func

* implement dist matmul forward func

* implement dist reshape forward func

* add transpiler framework

* add transpiler forward

* implement transpiler forward

* implement transpiler backward & update

* add process

* add unitest

* chmod

* chmod

* chmod

* update unitest

* add unitest for gpt

* remove unused print

* rename transpiler --> partitioner

* rename transpiler --> partitioner

* chmod

* chmod

* bug fixed

* remove amp function

* update case for dp mode

* update case for dp mode

a622b701

B

[npu] add update_loss_scaling npu min value (#35270) · 280d7421
由 Baibaifan 提交于 9月 02, 2021

280d7421

01 9月, 2021 12 次提交
- J
  Added slice BF16/FP32 FWD/BWD kernels (#34332) · 070cab11
  由 jakpiase 提交于 9月 01, 2021
```
* aded slice FWD FP32

* added tests for slice FWD FP32

* added slice bwd

* added bf16 tests

* CI fix

* CI fix

* added reason to skip_if

* minor change

* temporary fix for failing test

* temporary fix

* changes after review

* CI rerun
```
  070cab11
- T
  [HeterPs] merge dense && data norm && g2sum (#35029) · a647b80a
  由 Thunderbrook 提交于 9月 01, 2021
```
* merge dense

* log level

* tensor copy sync

* format
```
  a647b80a
- S
  [HybridParallel]Support finetinue model for PipelineParallel (#35287) · 264ff9ef
  由 ShenLiang 提交于 9月 01, 2021
```
* add cache for send_recv

* add eval_batch for pipeline

* add eval batch for pipelineparallel

* add style code
```
  264ff9ef
- B
  add strided_slice_grad op for npu (#35204) · 7743cdf2
  由 baoachun 提交于 9月 01, 2021
```
* add strided_slice_grad op for npu
```
  7743cdf2
- L
  support setting linewidth when printing tensor (#35175) · 5fa7d9ce
  由 Leo Chen 提交于 9月 01, 2021
```
* support setting linewith when printing tensor

* fix ut

* refine code

* update comments

* use small precision since windows/linux has different ramdom value

* fix typo

* adjust parameter order for consistency
```
  5fa7d9ce
- L
  add input and output description docs for vision transform (#34926) · 4f54891c
  由 LielinJiang 提交于 9月 01, 2021
```
* add input and output docs for vision transform
```
  4f54891c
- J
  
  bugfix for mp accuracy (#35326) · 7f17f9a0
  由 JZ-LIANG 提交于 9月 01, 2021
  
  7f17f9a0
- 0
  [Dy2stat]modify dy2stat error message in compile time (#35320) · b24f84c8
  由 0x45f 提交于 9月 01, 2021
```
* modify dy2stat error message in compile time

* fix variable name
```
  b24f84c8
- W
  fix bug:When axes in paddle.slice is a tuple, an error occurs. (#35267) · b53887fd
  由 WeiXin 提交于 9月 01, 2021
```
* fix bug:When axes in paddle.sile is a tuple, an error occurs.

* polish code.
```
  b53887fd
- Q
  support KL label smooth (#35177) · 7ca28bb6
  由 QingshuChen 提交于 9月 01, 2021
```
* support KL label smooth

* update UT for KL label_smooth
```
  7ca28bb6
- C
  
  add support ops for quantization (#35312) · 5baccfdd
  由 cc 提交于 9月 01, 2021
  
  5baccfdd
- R
  
  [NPU]shard index op for npu (#35281) · 5c27c2c0
  由 Roc 提交于 9月 01, 2021
  
  5c27c2c0

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致