提交 · 2bc72a06be7a2df79b2324bc97ea6eb5f3c847b3 · 机器未来 / Paddle

03 4月, 2022 3 次提交

add maximum limit for grid of index_select (#41127) · af8d2482

由 FlyingQianMM 提交于 4月 03, 2022

* limit grid dim for index select

* mv LimitGridDim into gpu_launch_config.h

* fix conflicts

* fix conflicts

* fix code style

* set block to 256

* fix grid setting

* set dtype of block_dim to unsigned int

af8d2482

Z
Add randperm and range yaml (#41265) · fd1ecfc5
由 zyfncg 提交于 4月 03, 2022
```
* add randperm and range yaml

* add eager test for randperm
```
fd1ecfc5

Add some yaml config (#41053) · e4914734

由 From00 提交于 4月 03, 2022

* Add yaml config

* Add yaml for flatten_contiguous_range_op

* Remove h_sigmoid yaml

* Fix CI errors

* Fix code format

* Fix flatten OP errors

* Fix conflicts

* Fix CI errors

* Remove flatten_contiguous_range OP

* Remove redundant code

* Fix typos

e4914734

02 4月, 2022 11 次提交

C
[Phi] Fix no pinned transform (#41300) · 78200976
由 Chen Weihang 提交于 4月 02, 2022
```
* fix no pinned trans

* fix cond error
```
78200976

Add graph apis (#40809) · b0398c8e

由 Siming Dai 提交于 4月 02, 2022

* Add graph_reindex API

* add graph_sample_neighbors api

* Add buffer

* delete VLOG

* delete thrust::copy for output

* add ShareDataWith

* delete graph_reindex hashtable output

* add graph_reindex dispensable

* add reindex unittest, move memset to cuda kernel, change api

* fix conflict

* add reindex buffer for gpu version note

* fix conflicts for op_func_generator

* Add fisher_yates sampling, add dispensable, change infermeta

* add dtype for edge_id

* fix rocm ci and static check ci

* add unittest

* fix unittest

* fix unittest

* fix bug

b0398c8e

X
[Yaml] add yaml for 5 ops [ elementwise_pow, expm1, floor_divide, logsumexp, mish ] (#41288) · 36f97cdc
由 xiongkun 提交于 4月 02, 2022
```
* add yaml for ele_max ele_min

* add yaml for: mish / logexpsum / expm1 / elemenwise_pow / elementwise_floordiv
```
36f97cdc

[phi] Move clip op to phi (#40602) · c0658045

由 wuyefeilin 提交于 4月 02, 2022

* move clip op to phi

* fix as review

* update hierarchical_sigmoid_kernel.cc

* update selected_rows

* update clip_kernel.cu

* fix as review

c0658045

L
enable new-executor on windows to test it (#41301) · e59a693e
由 Leo Chen 提交于 4月 02, 2022
```
* enable new-executor on windows to test it

* add message

* fix ut
```
e59a693e

[DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad() (#41198) · afadb8c5

由 Zhanlue Yang 提交于 4月 02, 2022

* [Refactor] refactored eager_gen.py PR #2

* [DoubleGrad PR #1] Decoupled code generation logics for Dygraph ForwardFunctions and GradNodes

* Fixed minor issue

* Adjusted logics of GenerateNodeCreationCodes and GenerateForwardDefinition

* Fixed issues

* Supported higher-order grad node generation

* [DoubleGrad PR #4] Supported higher-order GradNode generation

* [DoubleGrad #4] Bug Fixes to Double Grad Node Generation

* Fixed yaml typo

* Fixed yaml typo

* fixed minor issues

* [DoubleGrad PR #5] Enabled gradient computations for grad_tensors passed to paddle.grad()

* Fixed minor issue

* Fixed CI-Inference issue

* Fixed CI-inference issues

afadb8c5

Z

Sparse conv and pool support indices as template (#41137) · 5d3fd4fe
由 zhangkaihuo 提交于 4月 02, 2022

5d3fd4fe
N

Fix a bug when reduceHigherDim in HIP (#41273) · 7dd4a9fe
由 niuliling123 提交于 4月 02, 2022

7dd4a9fe
Z
Limit the condition of entering optimized kernel (#41296) · 3b686b18
由 Zhang Zheng 提交于 4月 02, 2022
```
Co-authored-by: Nroot <root@yq01-sys-hic-k8s-v100-box-a225-0186.yq01.baidu.com>
```
3b686b18

[Yaml] transfer around 22 ops yaml file and pass the final state OpTest. (#41024) · 16bfcd18

由 xiongkun 提交于 4月 02, 2022

* 1. add the python api grad 2. add final and intermediate state vlog 3. change the python_api error logic

* add python api or close the check_eager=True

* fix the compatibility

16bfcd18

Z

Fix sparse conv and verify sparse conv backward (#40961) · ad0c106c
由 zhangkaihuo 提交于 4月 02, 2022

ad0c106c

01 4月, 2022 9 次提交

H

update (#41245) · 99029dc9
由 hong 提交于 4月 01, 2022

99029dc9

Add nll_loss yaml (#41126) · 8e032db8

由 zyfncg 提交于 4月 01, 2022

* add nll_loss yaml

* fix nll loss

* fix nll loss bug

* fix bug

* fix bug

* fix infrt problem
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

8e032db8

[Eager] Support pinned (#41035) · f3270fc8

由 wanghuancoder 提交于 4月 01, 2022

* support pinned, test=develop

* support async_write, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine,test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

f3270fc8

[Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe

由 Chen Weihang 提交于 4月 01, 2022

* add cross_entropy_with_softmax phi kernel

* remove softmax_with_cross_entropy kernel

* add softmax_with_cross_entropy grad kernel

* remove original op kernel

* refine cross entropy impl

* fix pointer error

* revert kernel cu change

* fix xpu failed

* fix cinn failed

* fix npu failed

* add forward sig

* add check_nan_inf for pt kernel

* remove repeat cmake item

* fix unittest error

e6ec98fe

[Phi]Interploatd kernels into phi (#40855) · d65a7a46

由 chentianyu03 提交于 4月 01, 2022

* add interploate cpu kernel

* fix nullptr bug

* add interpolate gpu kernel

* fix unit test error

* remove raw kernels

* add cuda kernel impl

* add infermeta

* recover accidentally deleted kernels in interpolate op

* fix grad x_grad name error

* remove interpolate_v2_op.h

* rm unused codes

* fix xpu build error

* fix build error

* fix namespace error

* add register header for nup

* fix infermeta error

* modify by review

* add the missing args in test_trt_convert_nearest_interp_v2

d65a7a46

L
[KP] fix bug in activation xpu kp kernel (#41219) · 705776ca
由 Liu-xiandong 提交于 4月 01, 2022
```
* fix bug in activation xpu kp kernel

* delete useless comment
```
705776ca
Z

Add Sparse Op: copy_sparse_coo and copy_sparse_csr (#41193) · 3a29e4f8
由 zhangkaihuo 提交于 4月 01, 2022

3a29e4f8

[Phi] Add shape and strided_slice yaml & Adapt eager mode (#41131) · 9b6a02d4

由 Chen Weihang 提交于 4月 01, 2022

* add several yaml

* polish strided slice kernel & add yaml

* reorder yaml

* add several yaml

* revert yaml config change

* resolve conflict

* Update test_strided_slice_op.py

9b6a02d4

Add basic yaml backward (#40751) · 98303291

由 hong 提交于 4月 01, 2022

* fix error; test=develop

* update

* close some yaml

* fix backward attrite error; test=develop

* add div test

* polish code; test=develop

* update

* update

* fix bug

* update bitwise code; test=develop

* update

* update

* fix some bug

* update

* revert cmakelist

* fix optional bug;

* fix bug

* fix bug;

* add backward test

* open bn

* update

* update

* revert eager_gen

* polish code

* fix topk error

* update

* update

* fix bug;

* move label smooth, nll loss

* revert topk

* fix topk label smooth bug;

* remove batch_norm

* remove topk

* change flip infer meta

* fix flip bug

* update yaml

* close abs

* fix histogram bug

* fix histogram bug

* add abs

* fix histogram kernel

* remove expand

98303291

31 3月, 2022 7 次提交
- C
  
  fix conflict (#40851) · 74894cd7
  由 csy0225 提交于 3月 31, 2022
  
  74894cd7
- Z
  [Phi] Rename ScalarArray to IntArray (#40975) · e559fe41
  由 zyfncg 提交于 3月 31, 2022
```
* rename scalar_array to int_array

* update cmake

* fix conflict

* remove useless log
```
  e559fe41
- W
  [phi] move yolov3_loss to phi (#40944) · fb93bd5c
  由 wuyefeilin 提交于 3月 31, 2022
```
* mv yolov3_loss op to phi

* fix as review

* update operator.h
```
  fb93bd5c
- Z
  
  Implement AutotuneCache class for Kernel AutoTune (#41169) · 7dfd3846
  由 Zhang Ting 提交于 3月 31, 2022
  
  7dfd3846
- Z
  
  Opt the compilation of sparse kernel (#41086) · b9da48da
  由 zhangkaihuo 提交于 3月 31, 2022
  
  b9da48da
- L
  add_autotune_kernel_tool (#40658) · 7c5dca9f
  由 limingshu 提交于 3月 31, 2022
```
* for 1st time interface combine.

* modification with kernel factory

* first auto_tune version.

* first version.

* basic version

* add warm up step.

* a debug version.

* optimize the functionality of class auto_tuner.

* add some quotes for optimized auto_tuner class.

* add some quotes for optimized auto_tuner class.

* add namespace.

* modification according to the advices

* replace fluid header with phi header.

* replace fluid header with phi header.
```
  7c5dca9f
- A
  
  move inplace_version_counter_ location (#41146) · a09058b2
  由 Aganlengzi 提交于 3月 31, 2022
  
  a09058b2
30 3月, 2022 10 次提交
- Z
  [Phi] Move Rnn Op from fluid to phi (#41007) · 66cf8b08
  由 zyfncg 提交于 3月 30, 2022
```
* move rnn kernel to phi

* move infershape of rnn to phi

* fix HIP bug

* rename function

* fix HIP bug

* fix hip bug
```
  66cf8b08
- H
  [Op] Fix uncontrolled randomness of index_select op (#41078) · 8f7c02f2
  由 Haohongxiang 提交于 3月 30, 2022
```
* fix uncontrolled randomness of op

* fix bugs
```
  8f7c02f2
- C
  Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi... · eef46770
  由 Chen Weihang 提交于 3月 30, 2022
```
Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065)" (#41110)

This reverts commit 3a6f1135.
```
  eef46770
- X
  
  Fix unsqueeze op get wrong output shape in compile time infer shape. (#41097) · 3d39f5c7
  由 xiongkun 提交于 3月 30, 2022
  
  3d39f5c7
- C
  Revert "Revert "[Phi] trans logsumexp op (#40790)" (#41068)" (#41109) · ee8eeb45
  由 Chen Weihang 提交于 3月 30, 2022
```
This reverts commit 054fc997.
```
  ee8eeb45
- H
  Revert "Revert "Move some activation to phi (#40727)" (#41056)" (#41095) · 91bb52cd
  由 hong 提交于 3月 30, 2022
```
This reverts commit 05f3d48e.
```
  91bb52cd
- P
  
  add _reset_grad_inplace_version (#41101) · cb8afc24
  由 pangyoki 提交于 3月 30, 2022
  
  cb8afc24
- Y
  
  move elementwise_mul selected rows input (#41042) · 13f1641d
  由 YuanRisheng 提交于 3月 30, 2022
  
  13f1641d
- Z
  Optimize the perf of top_k when k is too large (#40941) · 45078d9f
  由 Zhang Zheng 提交于 3月 30, 2022
```
* Optimize the perf of top_k when k is too large

* fix rcom compile

* fix

* only compile in cuda

* fix log info
```
  45078d9f
- P
  suppor inplace in tensor_method_setitem (#40915) · 7170c687
  由 pangyoki 提交于 3月 30, 2022
```
* suppor inplace in tensor_method_setitem

* delete bump_inplace_version

* optimize inplace unittest

* fix

* fix setitem bug

* update eager_generator

* optimize inplace unittest

* little change
```
  7170c687

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致