提交 · 6840cf558abc55931f2118021529e92e2b722091 · BaiXuePrincess / Paddle

22 10月, 2021 1 次提交

Fix a bug in ReadData, ReadDataBc and ReadDataReduce when NX != 1 (#36373) (#36616) · 6840cf55

由 niuliling123 提交于 10月 22, 2021

* Fix a bug in ReadData, ReadDataBc and ReadDataReduce when NX != 1
* Update the implement of reduceAnyKernel according to kernel primitive api

6840cf55

21 10月, 2021 2 次提交
- N
  [Cherry-pick] Add functor_primitives.h for kernel primitive api (#36418) · 30909889
  由 niuliling123 提交于 10月 21, 2021
```
* Add functor_primitives.h for kernel primtive api
```
  30909889
- improve replicate pad error information (#36531) · a201a691
  由 littletomatodonkey 提交于 10月 21, 2021
```
* fix replicate pad when input size is 0

* add unit test
```
  a201a691
20 10月, 2021 1 次提交
- W
  
  [cherry-pick] Inference add type check in copy_from_cpu (#36552) · b5404f09
  由 Wilber 提交于 10月 20, 2021
  
  b5404f09
19 10月, 2021 3 次提交

[cherry-pick]Add sparse attention cherrypick (#36447) · 36edb0e1

由 Liu-xiandong 提交于 10月 19, 2021

The code of this PR can only support CUDA 11.2. Currently, CI does not have GPU with CUDA 11.2 , and all tests will be skipped automatically.

The new OP is paddle._C_ops.sparse_attention. Regarding the work of the python API, it will be resolved in a follow-up PR.

The code of this PR lacks tests on dynamic graphs and static graphs, and will be added in subsequent PRs.

36edb0e1

W

cherry-pick 36424 inference support bert when exists matmul_v2 (#36500) · d974dbd1
由 Wilber 提交于 10月 19, 2021

d974dbd1

Add operators for async read & async write (#36333) (#36501) · d65f8af8

由 Siming Dai 提交于 10月 19, 2021

* fix async_read bug

* change index place to cpu

* add tensor size judge

* add async_read & async_write test

* fix bug in async_write

* fix mac py3 ci

* fix bug for cpu version paddle

* fix windows ci bug

* change input argument error type

* change const_cast to mutable_data

* add async_write out-of-bound check and consumate error hint

* fix a small bug for dst_tensor

* add docs and refine codes

* refine docs

* notest,test=windows_ci

* fix windows ci

* fix require

* fix code-block

* add core.is_compiled_with_cuda()

d65f8af8

15 10月, 2021 1 次提交

[cherry-pick]Verify the correctness of graph rewrited by GeneratePass (#36453) · cc449652

由 wuhuanzhou 提交于 10月 15, 2021

* [WIP]Verify the correctness of graph rewrited by GeneratePass, test=develop

* add delete subgraph and unittest, test=develop

* check simple pass, test=develop

* fix coverage, test=develop

* limit with input_spec via Paddle API, test=develop

cc449652

13 10月, 2021 1 次提交
- J
  
  fix for matmul_v2 6D x 2D (#36379) · ce6a27d9
  由 jakpiase 提交于 10月 13, 2021
  
  ce6a27d9
12 10月, 2021 2 次提交
- A
  Fix stop_gradient in RunProgramOp (#36339) (#36353) · a6868c91
  由 Aurelius84 提交于 10月 12, 2021
```
* Fix stop_gradient in RunProgramOp

* fix reference
```
  a6868c91
- W
  
  fix yolo precision issue(#36365) · 10eebfa0
  由 wenbin 提交于 10月 12, 2021
  
  10eebfa0
11 10月, 2021 2 次提交
- S
  
  dlpack fix (#35817) (#36177) · 31a5829a
  由 Siming Dai 提交于 10月 11, 2021
  
  31a5829a
- W
  [cherry-pick]C++ support register pass via PassDesc (#36302) · 21c65f66
  由 wuhuanzhou 提交于 10月 11, 2021
```
(cherry picked from PR #36095)

PR主要功能：支持C++开发注册GeneratePass，简化针对fusion等子图优化场景开发方式。
```
  21c65f66
30 9月, 2021 2 次提交
- G
  
  fix bug of reduce_sum when src_dtype != dst_dtype and reduce_num == 1 (#36123) (#36193) · e8efba57
  由 Guoxia Wang 提交于 9月 30, 2021
  
  e8efba57
- G
  
  support fp16 (#35888) (#36191) · 87cc8d48
  由 Guoxia Wang 提交于 9月 30, 2021
  
  87cc8d48
29 9月, 2021 1 次提交

add API paddle.linalg.eig (#35674) (#36188) · 4e2daa9a

由 Lijunhui 提交于 9月 29, 2021

向PaddlePaddle中的线性代数库添加eig算子，该算子计算一般方阵的特征分解。
cherry-pick 自#35674.

4e2daa9a

28 9月, 2021 1 次提交
- R
  [cherry-pick] [ROCM] bugfix for bilinear_interp_v2_grad (#36160) #36161 · c576169b
  由 ronnywang 提交于 9月 28, 2021
```
ATT, cherry-pick #36160
```
  c576169b
27 9月, 2021 9 次提交
- Y
  Add paddle.device.cuda.get_device_properties (#35875) · cea0bc26
  由 Yanxing Shi 提交于 9月 27, 2021
```
* Initial Commit

* fix py2 error

* fix wrong words and doc

* test=document_fix

* fix _gpuDeviceProperties
```
  cea0bc26
- J
  cherry-pick #36021 fix unique/unstack zero tensor (#36163) · 749bc240
  由 Jiawei Wang 提交于 9月 27, 2021
```
* fix unique unstack dim 0

* fix unique_op format
```
  749bc240
- J
  
  bugfix reshape -1 (#36143) · 45b7627b
  由 JZ-LIANG 提交于 9月 27, 2021
  
  45b7627b
- W
  
  fix windows ut precession error (#36124) · b171aaba
  由 Wilber 提交于 9月 27, 2021
  
  b171aaba
- R
  [ROCM] fixbug for arg_min_max (#36113) · 40a29186
  由 ronnywang 提交于 9月 27, 2021
```
ATT, cherry-pick #36098
```
  40a29186
- J
  [Cherry-pick] Add new func/class API psroi_pool and UT (#36111) · 81557da6
  由 JYChen 提交于 9月 27, 2021
```
cherry-pick from #35352

Add new detection api paddle.vision.ops.psroi_pool and paddle.vision.ops.PSRoIPool
```
  81557da6
- S
  [cherry-pick] fix third_party cache bugs (#36048) · 68911342
  由 Sing_chan 提交于 9月 27, 2021
```
cherry-pick #35858、#35895
```
  68911342
- Z
  [cherry pick] Modify adam to adamw in Optimizer AdamW (#36028) (#36103) · 2de7a7f5
  由 zhangbo9674 提交于 9月 27, 2021
```
The AdamW optimizer modify the op from adamw to adam in pr35521, this is a inappropriate modify. Modify adam to adamw in AdamW.
```
  2de7a7f5
- Y
  [cherry-pick]Support fixed seed in Python for test (#36065) (#36094) · c3a0eaab
  由 YuanRisheng 提交于 9月 27, 2021
```
When users use gumbel_softmax, they can use paddle.seed() in python for fixed seed.
```
  c3a0eaab
26 9月, 2021 5 次提交
- C
  [cherry-pick]CPU forward calculation replaces Eigen with Lapack (#35916) (#36091) · effb70f4
  由 crystal 提交于 9月 26, 2021
```
cherry-pick #35916，CPU前向计算将Eigen替换为Lapack，修改linalg暴露规则
```
  effb70f4
- H
  [cherry-pick] Add Det and Slogdet API to Release 2.2 (#36083) · ba2a1bb4
  由 Huihuang Zheng 提交于 9月 26, 2021
```
This PR added det and slogdet API to release/2.2
It is cherry-pick from #34992 and #36013
```
  ba2a1bb4
- N
  [cherry-pick] Add function comments and instructions to the Primitive API #36024 · 05621f7f
  由 niuliling123 提交于 9月 26, 2021
```
[cherry-pick] Add function comments and instructions to the Primitive API
```
  05621f7f
- W
  [Cherry-Pick]Add paddle.linalg.solve OP (#35715) (#36056) · 6b4f2fbf
  由 Weilong Wu 提交于 9月 26, 2021
```
This PR supports linalg.solve calculation for linear algorithm module of Paddle. One may call paddle.linalg.solve to use it.
```
  6b4f2fbf
- R
  [NPU] add randperm_op_npu (#35763) (#36026) · df81915a
  由 ronnywang 提交于 9月 26, 2021
```
* add randperm_op_npu

* fix test_set_value_op_npu
```
  df81915a
25 9月, 2021 1 次提交
- B
  
  temporarily fix the performance drop of recurrent op (#36053) · 33fbdafa
  由 baoachun 提交于 9月 25, 2021
  
  33fbdafa
24 9月, 2021 5 次提交
- F
  [cherry-pick] Replace Eigen with Lapack library for eigvals OP kernel (#35909) (#36038) · e9c04149
  由 From00 提交于 9月 24, 2021
```
This PR implements the kernel of "eigvals" OP with the Lapack library, which has a better performance than the previous Eigen library.
```
  e9c04149
- H
  Basic PR on Cost Model (#35774) (#35915) · efcd108d
  由 Huihuang Zheng 提交于 9月 24, 2021
```
Add basic Cost Model, it uses executor to run program and profile it to get op time.

This is an early basic version, we will add more functions in the future.
```
  efcd108d
- W
  [cherry-pick] inference fix trt problem (#35939) · ae78940a
  由 Wilber 提交于 9月 24, 2021
```
* update xpu version
```
  ae78940a
- L
  [cherry-pick] fix cusparse compile bug in windows CUDA11.2, test=release/2.2 (#36015) · 0e19aeb9
  由 Liu-xiandong 提交于 9月 24, 2021
```
解决Windows中CUDA11.2编译出错的问题。
cherry-pick #35941
```
  0e19aeb9
- J
  
  add pool2d convert test (#35925) · 063fca8e
  由 JingZhuangzhuang 提交于 9月 23, 2021
  
  063fca8e
23 9月, 2021 3 次提交
- C
  [cherry-pick] FixEighOP; Unified MatrixEighFunctor function (#35812) (#35919) · 4629401e
  由 crystal 提交于 9月 23, 2021
```
cherry-pick #35812，修复Eigh OP
```
  4629401e
- W
  
  add dilation check for conv (#35894) · 91f25ee3
  由 wangguanzhong 提交于 9月 23, 2021
  
  91f25ee3
- T
  op:transpose_op supports bool type (#35886) (#35926) · 95c100c1
  由 TeslaZhao 提交于 9月 23, 2021
```
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass

* Fix a bug of strided_slice op, about the axes parameter access memory out of bounds

* Fix a bug of transpose op, about accessing memory out of bounds of the perm param

* op:transpose_op supports bool type
```
  95c100c1

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致