提交 · a072fca8229b26042fe24bff42989533e1d2050a · PaddlePaddle / Paddle

18 5月, 2022 1 次提交

matmul and matmul_v2 refactor (#42732) · 570d0322

由 Sławomir Siwek 提交于 5月 18, 2022

* matmul refactor

* remove UT which only check ENFORCE output

* code format

* improve memory usage

570d0322

09 5月, 2022 1 次提交

[Ready to merge] oneDNN NHWC matmul & elementwise kernels fixes (#42506) · bf481550

由 Jacek Czaja 提交于 5月 09, 2022

* - fix to crash

- more fixes

- added diagnostic

- matmul output fixes.

- compilation fix

- stop rotating too small shapes

* - Added enabling of matmul_V2 onednn test

bf481550

20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

18 2月, 2022 1 次提交
- F
  [Pten] blas and lapck migration (#39587) · 8c7ee8c2
  由 Feiyu Chan 提交于 2月 18, 2022
```
* move blas related files
* move lapack related files
```
  8c7ee8c2
15 2月, 2022 1 次提交

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

由 Aurelius84 提交于 2月 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

7e7e9404

11 2月, 2022 1 次提交
- F
  [Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
  由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
  d25a7f9e
27 12月, 2021 1 次提交
- B
  
  update mkldnn matmul_transpose_reshape fuse pass ut (#38467) · 9cfdae91
  由 baoachun 提交于 12月 27, 2021
  
  9cfdae91
29 11月, 2021 1 次提交
- P
  
  Add third batch of deprecated mkldnn namespace name changes (#37558) · 1ba81500
  由 piotrekobiIntel 提交于 11月 29, 2021
  
  1ba81500
27 10月, 2021 1 次提交
- B
  add matmul_v2 to v1 CPU pass and fix matmul dim error (#36731) · d5245a35
  由 baoachun 提交于 10月 27, 2021
```
* fix matmul dim error

* fix wrong dim check in matmul
```
  d5245a35
15 9月, 2021 1 次提交
- J
  
  added fix for matmul and support for 6 rank tensor (#35740) · e80acff3
  由 jakpiase 提交于 9月 15, 2021
  
  e80acff3
06 9月, 2021 1 次提交
- W
  Add the extra flag for the some ops (#35442) · 49797d85
  由 wawltor 提交于 9月 06, 2021
```
* Add the extra flag for the some ops

* fix the compile problem in matmul extra
```
  49797d85
18 8月, 2021 1 次提交
- W
  
  add the safe check for the some ops (#34978) · 12bf046b
  由 wawltor 提交于 8月 18, 2021
  
  12bf046b
12 7月, 2021 1 次提交
- W
  
  Support finetuning the model saved on the mac platform on the Linux platform (#34027) · 4d259b91
  由 WeiXin 提交于 7月 12, 2021
  
  4d259b91
22 5月, 2021 1 次提交

Added oneDNN matmul grad BF16/FP32 kernel (#32968) · e2a3a6f7

由 jakpiase 提交于 5月 22, 2021

* added support for most matmul cases

* added more functionality

* full functionality of matmul op, fp32 only

* added bf16 tests and functionality

* added formatting

* changes after review

* minor change

* added reviewers suggestions

e2a3a6f7

25 3月, 2021 1 次提交
- C
  Polish two error messages (#31852) · 27f2d8df
  由 Chen Weihang 提交于 3月 25, 2021
```
* polish two error messages

* polish details
```
  27f2d8df
02 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid operators for rocm (part8), test=develop (#31309) · 59940cb3
  由 Qi Li 提交于 3月 02, 2021
  
  59940cb3
25 1月, 2021 1 次提交

More precise mkldnn kernel rules in GetExpectedKernelType (#29840) · 5bf25d1e

由 arlesniak 提交于 1月 25, 2021

* More precise mkldnn kernel choice in GetExpectedKernelType

* Fixes after review

* Refresh develop for CI

* CI experiment

* get back from CI exper

5bf25d1e

30 12月, 2020 1 次提交
- W
  add the support the op version check for matmul, test=op_version (#30011) · cc2f9462
  由 wawltor 提交于 12月 30, 2020
```
* add the support the op version check for matmul, test=op_version
```
  cc2f9462
04 12月, 2020 1 次提交

Support type promote for basic math ops (quantum required) (#29265) · 9ad800eb

由 Chen Weihang 提交于 12月 04, 2020

* basic impl of type promote

* add comment & another testcase

* fix complex bugs & support python op promote type

* fix failed unittests & polish code

* add unittest for coverage

* change to only promote complex type

* polish code details

* polish several comments

9ad800eb

27 11月, 2020 1 次提交
- A
  
  Fixes mkldnn dygraph learning rate scheduler crashes (#28988) · bc902044
  由 arlesniak 提交于 11月 27, 2020
  
  bc902044
10 10月, 2020 1 次提交

add double grad op for matmul (#27776) · ad99e638

由 wangxinxin08 提交于 10月 10, 2020

* add matmul doublegrad op

* fix compile errors

* modify code according to review

* delete float16

ad99e638

08 8月, 2020 1 次提交

Change use_quantizer attribute name and data type (#25838) · 734cf1c3

由 joanna.wozna.intel 提交于 8月 08, 2020

* Change use_quantizer attribute name and data type

* Fix problem with setting attribute

* Add changes due to review

* Small change in function

* Restore use_quantizer attr for compatibility

734cf1c3

28 4月, 2020 1 次提交
- S
  
  added reshape transpose matmul fuse pass (#23754) · e1a7a880
  由 Sylwester Fraczek 提交于 4月 28, 2020
  
  e1a7a880
24 4月, 2020 1 次提交
- A
  
  added fusing matmul-transpose-reshape pass (#23866) · d31a174f
  由 arlesniak 提交于 4月 24, 2020
  
  d31a174f
17 4月, 2020 1 次提交
- Z
  OP error message enhancement of l2_normalize, matmul, mean, etc · 361c6ccc
  由 Zhong Hui 提交于 4月 17, 2020
```
* fix error message of l2_normalize, matmul, mean, etc. 
* add the test case for those ops
```
  361c6ccc
11 4月, 2020 1 次提交

[DNNL][INT8][FP32] MatMul (#23395) · a63bcf9a

由 Michał Gallus 提交于 4月 11, 2020

* Initial FP32 DNNL MatMul Implementation

* Implement int8 DNNL MatMul

* Unify in-kernel-naming, clean UTs

* MatmuL: Introduce op caching

* Final adjustments

test=develop

* Remove dy_graph disablement

test=develop

* Change dnnl header name to new one

test=develop

* Contrain multi head check to prevent fails

test=develop

* Resolve dnnl header problems on MAC CI

* Variable namings to kernel and skip_grad_ci added

test=develop

* Prevent MAC CI from failing

* Prevent windows build from failing

test=develop

* Modify UTs to conform to the rules

* Modify MatMul aux functions namings

test=develop

a63bcf9a

04 4月, 2020 1 次提交

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

11 3月, 2020 1 次提交
- W
  Speed up the matmul op, use the gemm replace the batch gemm (#22926) · f154d586
  由 wawltor 提交于 3月 11, 2020
```
In the op of gemm, we use the gemm to replace batch gemm, speed up the matmul op 
```
  f154d586
09 3月, 2020 1 次提交

Imperative tracer refactoring (#22457) · d33c4343

由 Zeng Jinle 提交于 3月 09, 2020

* refine grad maker, test=develop

* refactor tracer stage 1, test=develop

* merge develop to solve conflict third times, test=develop

d33c4343

25 12月, 2019 1 次提交
- H
  
  fix matmul error message; test=develop (#21885) · 30d000f8
  由 hong 提交于 12月 25, 2019
  
  30d000f8
31 10月, 2019 1 次提交

GradMaker for dygraph (#19706) · 8c4573a3

由 hong 提交于 10月 31, 2019

* refactor dygraph,test=develop

* fix failed unittest,test=develop

* polish code,test=develop

* check windows ci error,test=develop
try to fix windows ci error by np.allclose,test=develop

* polish vlog and profiler, test=develop

* try to fix preceding ops order,test=develop

* test transformer in windows ci, test=develop

* use python c-api to speed up tracer.trace,test=develop

* test=develop, fix docker with paddle nccl problem

* test=develop, add ut for debug string and gradient_accumulator

* test=develop, add tests for layer/gradient_accumulator/prepared_op

* test=develop, fix complie error for test_prepared_op

* test=develop, add more ut for dygraph

* test=develop, create API.spec for dygraph api change

* optimize grad maker; test=develop

* optimize grad maker

* test

* grad make optim; test=develop

* fix unittest bugs; test=develop

* add dygraph grad op maker and split_op

* grad op maker refactor; test=develop

* add dygraph grad maker; test=develop

* fix op deformable_conv_v1_op bug; test=develop

* fix deformable_conv prroi pool bugs;

* fix new op grad op maker bug; test=develop

* fix split by ref bug; test=develop

* fix dygraph auto prune bug; test=develop

* fix test_trace bug; test=develop

* fix fused emb seq pool bug; test=develop

* remove useless code in op_desc file; test=develop

* remove useless code, StrVarBaseNode; test=develop

* fix review issues; test=develop

* fix rank_loss grad maker; test=develop

* remove flag in VarBase; test=develop

* fix distributed_notify_op compile bug ; test=develop

* fix reshape op double grad; test=develop

* fix expand as op; test=develop

* add impertive type_defs.h for demo_train; test=develop

* fix inference lib cmake; test=develop

* fix inference lib; test=develop

* fix infernce_lib; test=develop

* fix inference cmake; test=develop

* fix inference lib; test=develop

* fix inference lib; test=develop

* remove condition dygraph grad maker, modify local name; test=develop

* fix split grad maker bug; test=develop

* fix pyramid_op bug; test=develop

* change travis time out limit; test=develop

* restore travis; test=develop

* change timeout limit; test=develop

8c4573a3

23 10月, 2019 1 次提交

石

update the infer shape of matmul, test=develop (#20717) · 37cd4354

由石晓伟提交于 10月 23, 2019

* update the infer shape of matmul,  test=release/1.6

* add unittests of matmul, test=release/1.6

* change func names, test=develop

37cd4354

15 10月, 2019 1 次提交

石

Optimize error message of mean_op and matmul_op (#20413) · a4753f3a

由石晓伟提交于 10月 15, 2019

* add data type check, test=develop

* polish error messages, test=develop

* polish error messages, test=develop

* Remove support for the CPU architecture matmul, test=develop

* fix syntax bug, test=develop

a4753f3a

25 9月, 2019 1 次提交

add support of matmul with multiple head even different width and height (#19708) · c670058a

由 Bob Zhu 提交于 9月 25, 2019

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* refactor the code of matmul with multiple head even different width and height

test=develop

c670058a

24 7月, 2019 1 次提交

Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60

由 Bob Zhu 提交于 7月 24, 2019

* extend matmul op to support multiple head multiplication

With the support of multiple head, the multiplication of two big matrixes is
split into multiplication of several (head_number) small matrixes. e.g. if
Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
[6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].

220eef60

21 3月, 2019 1 次提交
- P
  
  fix matmul shape check; test=develop · 0e402989
  由 phlrain 提交于 3月 21, 2019
  
  0e402989
18 9月, 2018 1 次提交
- S
  
  modification · 0718113a
  由 sneaxiy 提交于 9月 18, 2018
  
  0718113a
17 9月, 2018 1 次提交
- S
  
  tiny change to save memory · abf9832c
  由 sneaxiy 提交于 9月 17, 2018
  
  abf9832c
10 5月, 2018 1 次提交
- Y
  
  matmul support float16/double · 27197290
  由 yuyang18 提交于 5月 10, 2018
  
  27197290

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功