Commits · 300b1009b5053f53515c4c705ac46f52f8dda945 · 机器未来 / Paddle

10 Oct, 2022 · 12 commits

[CodeStyle][F401] remove unused import in unittests/test_[u-z] (#46706) · 300b1009
Committed by Nyakku Shigure on Oct 10, 2022

300b1009

[Auto Parallel] Fix bugs caused by the inconsistent outputs of Engine API (#46633) · 0ce5554c

Committed by Yulong Ao on Oct 10, 2022

* [Auto Parallel] Unify the logger and outputs of Engine API

* [Auto Parallel] Fix the bugs of to_static

* [Auto Parallel] Adjust the test_to_static.py

0ce5554c

rm fp16 dtype_check (#46739) · 21612be7
Committed by Allen Guo on Oct 10, 2022

21612be7

[Paddle-TRT] support new quant format from slim (#46022) · 7987a905
Committed by zhoutianzi666 on Oct 10, 2022

7987a905

make fused_multi_transformer support dynamically set the cache_kvs' shape and... · 9ea279a4

Committed by carryyu on Oct 10, 2022

make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches. (#46777)

* make fused_multi_transformer support dynamically set the cache_kvs' shape and support input prefix_caches.

9ea279a4

[Hackathon No.10] Add LogNormal API (#46426) · af6d80fb

Committed by MayYouBeProsperous on Oct 10, 2022

* add LogNormal API

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* fix bug

* add comment

* fix bug

* fix docs

* fix bug

* fix bug

* fix bug

* add test

* add test

* change the args type of Normal sample

* fix bug

* fix bug

* fix bug

* fix bug

* add test

* add test

* format

* add comment

* add comment

* add comment

* add comment

* format code

* fix bug

* fix bug

* fix bug

* add comment

* remove name parameter for LogNormal

* organize imports

af6d80fb
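
For readers unfamiliar with the distribution this commit adds, here is a minimal standard-library sketch of what a log-normal distribution computes. It assumes the usual definition (the exponential of a normal variable with mean `loc` and standard deviation `scale`). The class name `LogNormalSketch` and its methods are hypothetical illustrations, not Paddle's `LogNormal` API, whose signatures are not shown in this log.

```python
import math
import random


class LogNormalSketch:
    """Illustrative log-normal distribution; NOT Paddle's LogNormal class."""

    def __init__(self, loc: float, scale: float) -> None:
        if scale <= 0:
            raise ValueError("scale must be positive")
        self.loc = loc      # mean of the underlying normal variable
        self.scale = scale  # standard deviation of the underlying normal variable

    @property
    def mean(self) -> float:
        # E[X] = exp(loc + scale^2 / 2)
        return math.exp(self.loc + self.scale ** 2 / 2)

    @property
    def variance(self) -> float:
        # Var[X] = (exp(scale^2) - 1) * exp(2 * loc + scale^2)
        s2 = self.scale ** 2
        return (math.exp(s2) - 1) * math.exp(2 * self.loc + s2)

    def sample(self, rng: random.Random) -> float:
        # Draw from the underlying normal, then exponentiate.
        return math.exp(rng.gauss(self.loc, self.scale))

    def log_prob(self, x: float) -> float:
        # Log-density; the support is x > 0.
        if x <= 0:
            return -math.inf
        z = (math.log(x) - self.loc) / self.scale
        return -math.log(x * self.scale) - 0.5 * math.log(2 * math.pi) - 0.5 * z * z


dist = LogNormalSketch(loc=0.0, scale=1.0)
draws = [dist.sample(random.Random(seed)) for seed in range(5)]
```

Sampling by exponentiating a normal draw is one straightforward approach; whether the Paddle implementation does the same is an assumption here.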

[docs] add ipustrategy Hyperlink (#46422) · 60d5a912

Committed by gouzil on Oct 10, 2022

* [docs] add ipustrategy Hyperlink

* fix ipu_shard_guard docs; test=document_fix

* [docs] add set_ipu_shard note

* [docs] fix hyperlink

* update framework.py

* fix mlu_places docs; test=document_fix

* fix put_along_axis docs; test=document_fix

* fix flake8 W293 error, test=document_fix

* fix typo in typing, test=document_fix

Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: Nyakku Shigure <sigure.qaq@gmail.com>

60d5a912

[PHI] transpose2_grad op migration (#46139) · e3407a80

Committed by Paulina Gacek on Oct 10, 2022

* op migrated, Copy(OneDNNContext, ...) added

* mutable_data & op registration in fluid removed

* refactoring

* OneDNNGetDataType to uppercase

* missing cpu check added, handler moved to .h file

* name changed to transpose_grad

* Copy changed back to TensorCopy

* Resizing corrected, Copy(OneDNNContext) removed

e3407a80

fix:gather op (#46779) · 45b93325

Committed by feng_shuai on Oct 10, 2022

* fix:gather op

* add ut

45b93325

[Dy2St]Fix Regex DeprecationWarning in PY3 (#46800) · 140f3b24
Committed by Aurelius84 on Oct 10, 2022

140f3b24
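
The commit title does not say which warning was fixed. One common source of regex-related warnings in Python 3 is an invalid escape sequence such as `\d` inside a non-raw string literal, and the sketch below assumes that case purely for illustration. The helper `emits_escape_warning` is hypothetical and is not taken from the Paddle code base.

```python
import re
import warnings


def emits_escape_warning(source: str) -> bool:
    """Return True if compiling `source` triggers an invalid-escape warning."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        compile(source, "<example>", "exec")
    # Older Python 3 releases report DeprecationWarning; newer ones report SyntaxWarning.
    return any(
        issubclass(w.category, (DeprecationWarning, SyntaxWarning)) for w in caught
    )


bad_source = "pattern = '\\d+'"    # source text is: pattern = '\d+'
good_source = "pattern = r'\\d+'"  # source text is: pattern = r'\d+'

# A raw string keeps the backslash for the regex engine and avoids the warning.
numbers = re.findall(r"\d+", "PR 46800")
```

Using raw strings for patterns is the remedy generally recommended for this class of warning.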

[Hackathon No.36] Optimize the computational performance of the lerp_grad op on GPU (#45946) · ef61df30
Committed by Rayman on Oct 10, 2022

ef61df30

[Hackathon No.56&38] Add float16 data type support and forward-pass acceleration for the deformable_conv_v1 operator (#46111) · 5e0614a1

Committed by Rayman on Oct 10, 2022

support fp16 for deformable conv

5e0614a1

09 Oct, 2022 · 4 commits
- [MLU]fix unittest of sync_bn (#46797) · 218c0129
  Committed by qipengh on Oct 09, 2022

  218c0129
- [dygraph sharding stage 2] sharding broadcast overlap (#46656) · d8b4ca92
  Committed by Yuang Liu on Oct 09, 2022

  d8b4ca92
- [Sparse] Add a batch_norm kernel (#46359) · 888223b7
  Committed by zhangkaihuo on Oct 09, 2022

  888223b7
- Enable hard_swish_grad unit test (#46621) · ff0171e4
  Committed by Sławomir Siwek on Oct 09, 2022

  * enable hard_swish_grad unit test
  * remove unused argument

  ff0171e4
08 Oct, 2022 · 3 commits
- [Auto Parallel]Update comp cost and completion for gpt auto search (#46387) · 9edf8502
  Committed by caozhou on Oct 08, 2022

  * update comp cost and completion for gpt auto search
  * add unittest

  9edf8502
- [MLU] add fluid MLUOps prior_box (#46585) · ff37e48e
  Committed by cifar10 on Oct 08, 2022

  ff37e48e
- [Paddle Inference] add lookup_table op_convert, add lookup_table plugin (#46613) · 2a9c590b
  Committed by Wangzheee on Oct 08, 2022

  * add lookup_table op_convert, add lookup_table plugin

  2a9c590b
30 Sep, 2022 · 5 commits
- Support both use_calc_stream and sync_op in allgather API (#46295) · ecae7b31
  Committed by Wen Sun on Sep 30, 2022

  ecae7b31
- [MLU] fix phi::Tensor compile error of mlu. (#46649) · 2e231402
  Committed by Chenxiao Niu on Sep 30, 2022

  2e231402
- [MLU] add_fluid_mluop_yolo_box (#46573) · 832b0a15
  Committed by 光明和真理 on Sep 30, 2022

  832b0a15
- [Hackathon No.21] Add the paddle.incubate.sparse.transpose sparse API to Paddle (#45849) · 2b879a69
  Committed by 六个骨头 on Sep 30, 2022

  2b879a69
- support pure bfloat16 for more ops (#46364) · b7b231a6
  Committed by sneaxiy on Sep 30, 2022

  * support pure bfloat16
  * support bf16 linear
  * update PR to pass CI
  * tiny fix where_grad_kernel.cu
  * add bfloat16 to selu_grad to pass CI
  * fix selu grad compilation error

  b7b231a6
29 Sep, 2022 · 8 commits

[Hackathon No.18] Add frexp API to Paddle (#46401) · 1e2af54c

Committed by Zheng_Bicheng on Sep 29, 2022

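* The previous PR merged in a large amount of erroneous code; resubmitting

* The previous PR merged in a large amount of erroneous code; resubmitting

* Fix formatting issues

* Revert to the original format

* Modify as requested

* Fix the format as requested

* Fix issues in the comments

* Update formatting

* Test automatic formatting

* Fix the English comments

* fix docs build error

* pre-commit

* for docs build

* for docs build

* Fix the bug of incorrect mantissa computation

* Fix the case where the exponent was wrongly assumed to possibly be negative, which increased the amount of computation

Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>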

1e2af54c
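
The bullets above mention mantissa and exponent handling. As a reminder of what a frexp decomposition produces, here is a small sketch using Python's standard `math.frexp`, which returns a mantissa `m` and an integer exponent `e` such that `x == m * 2**e`, with `0.5 <= abs(m) < 1` for nonzero `x`. It illustrates the standard-library convention only. Whether `paddle.frexp` follows exactly the same convention is an assumption not confirmed by this log, and the helper name `frexp_list` is hypothetical.

```python
import math


def frexp_list(values):
    """Split each float into (mantissa, exponent) with value == mantissa * 2**exponent."""
    return [math.frexp(v) for v in values]


pairs = frexp_list([8.0, -3.0, 0.0, 0.15625])

# math.ldexp is the inverse operation: it rebuilds mantissa * 2**exponent.
rebuilt = [math.ldexp(m, e) for m, e in pairs]
```

Note that the exponent can be negative for inputs with magnitude below 0.5, as with `0.15625` here.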

Add index_select, index_select_grad, reduce_min kernel and their unittests for... · 9a1855ff

Committed by Leo Guo on Sep 29, 2022

Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)

9a1855ff

[CodeStyle][F401] remove unused imports in unittests/collective (#46615) · 0ef7a02f

Committed by Nyakku Shigure on Sep 29, 2022

* [CodeStyle][F401] remove unused import in unittests/collective

* empty commit, test=document_fix

* empty commit

0ef7a02f

Remove calibration file path when deploy quantize model (#46283) · d71f1b3f

Committed by yeliang2258 on Sep 29, 2022

* remove calibration file path

* remove useless code

d71f1b3f

[CodeStyle][F401] remove unused import in unittests/{asp,autograd,interpreter} (#46376) · f6039929
Committed by Nyakku Shigure on Sep 29, 2022

f6039929

fix uniform_rand_kernel FP16 support in dygraph mode (#46212) · ccab0e2a
Committed by 傅剑寒 on Sep 29, 2022

ccab0e2a

[CustomDevice] add to_static, amp ut (#46536) · acf785b6

Committed by ronnywang on Sep 29, 2022

* [CustomDevice] add to_static, amp ut

* update

* fix failed ut

* update

acf785b6

[Eager, Performance optimization] support mod / matmul ( % and @ operator) to... · 7d7444cc

Committed by Weilong Wu on Sep 29, 2022

[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565)

* [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer

* fix mod logic

* support matmul math operator

* rm LOG(warning), use VLOG(6)

* fix conflicts mistake

7d7444cc

28 Sep, 2022 · 8 commits

[AutoParallel] fix process_mesh (#46583) · 7a7826b7
Committed by zhaoyingli on Sep 28, 2022

7a7826b7

[AutoParallel] fix dist_split (#46505) · e87f65c3

Committed by zhaoyingli on Sep 28, 2022

* [AutoParallel] fix dist_split

* add unittest

* update cmakelist

e87f65c3

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

Committed by Chen Weihang on Sep 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

[New AD] Fix p_norm n=1 issue (#46514) · 3fc4fa29

Committed by Jiabin Yang on Sep 28, 2022

* fix p_norm n=1 issue

* fix p norm test error

3fc4fa29

[dygraph sharding] Overlap the reduce and the caculation for sharding stage 2. (#46495) · 9c01eaed
Committed by Yuang Liu on Sep 28, 2022

9c01eaed

[Eager, Performance optimization] support less_than & less_equal( < & <=... · 7d238139

Committed by Weilong Wu on Sep 28, 2022

[Eager, Performance optimization] support less_than & less_equal( < & <= operator) to sink to Cpp layer (#46542)

7d238139

Replacing set_format with set_mem_desc in FC onednn kernel (#46372) · 844d9855

Committed by Jacek Czaja on Sep 28, 2022

* added fc int8 tests

* CI fix

* added skipping UTs for GPUs

* fixes for CI

* added support for residual connections inside fc

* fix for quant int8 bias

* - lint

Co-authored-by: jakpiase <jakpia21@gmail.com>

844d9855

[NPU] add gpu kernel for transfer layout (#46307) · 526d963e

Committed by kangguangli on Sep 28, 2022

* add gpu kernel for transfer layout

* comment error throw

* fix: flag setting in testcase; add condition check for raising error

* fix typo

* fix: add error type for PADDLE_THROW

* remove kernel fallback in data_transfer.cc

* remove useless variable definition

526d963e

机器未来 / Paddle · In sync with the fork's source project
