提交 · 5c66338f4e9678d1a1254c6f1adb5d124a15512c · PaddlePaddle / Paddle

18 2月, 2022 17 次提交
- X
  [pten] trans diagonal kernel into pten (#39575) · 5c66338f
  由 xiongkun 提交于 2月 18, 2022
```
* trans diagonal kernel into pten

* fix by code review
```
  5c66338f
- Z
  [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
  由 zhangbo9674 提交于 2月 18, 2022
```
* support dtype param for auto_cast

* add amp_dtype for tracer

* add unsupported bf16 list

* support bf16 amp for O2

* refine python interface for bfloat16

* refine code

* refine code

* refine unittest

* refine code

* refine code

* add bf16 o1

* refine code by comment

* add gradient accumulator

* add recompute
```
  7d6d3848
- R
  
  [CustomDevice]Improved custom device initialization (#39634) · 7e4ed848
  由 ronnywang 提交于 2月 18, 2022
  
  7e4ed848
- R
  
  [CustomRuntime] add pten::Backend support (#39606) · d6d0820e
  由 ronnywang 提交于 2月 18, 2022
  
  d6d0820e
- A
  [IPU] Update IpuStrategy (#39644) · 46161679
  由 Allen Guo 提交于 2月 18, 2022
```
* Update IpuStrategy

* fix ci

* rerun ci
```
  46161679
- B
  
  refactor the forward implementation of shape npu op (#39613) · e674af23
  由 baoachun 提交于 2月 18, 2022
  
  e674af23
- W
  Infrt registers pten kernels (#39588) · dc39eb18
  由 Wilber 提交于 2月 18, 2022
```
* the mlir representation of pten, test=develop

* fixes an error, test=develop

* infrt registers pten kernels
Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
```
  dc39eb18
- Z
  [Pten] Support inplace and intermediate in C++ API (#39651) · 638aab6e
  由 zyfncg 提交于 2月 18, 2022
```
* support inplace and intermediate in yaml

* add cmake for dygraph_api
```
  638aab6e
- T
  
  dropout support Seed, fix elementwise_add_grad bug, test=kunlun (#39656) · 70b9f2ac
  由 taixiurong 提交于 2月 18, 2022
  
  70b9f2ac
- C
  [pten]add T, remove default value of DataType in DeviceContext::Alloc (#39620) · 8363406a
  由 chentianyu03 提交于 2月 18, 2022
```
* add T to Alloc and remove default value of DataType in DeviceContext::Alloc

* add dtype
```
  8363406a
- S
  add tool: print kernel signaturs (#39670) · 03b875a8
  由 Shang Zhizhou 提交于 2月 18, 2022
```
* add tool: print kernel signaturs

* fix windows compile
```
  03b875a8
- W
  
  fix compile error in jetson (#39669) · c8c24460
  由 Wilber 提交于 2月 18, 2022
  
  c8c24460
- Q
  [MLU]add matmul and matmul_v2 op (#39539) · 229ec32a
  由 qipengh 提交于 2月 18, 2022
```
* [MLU]add matmul and matmul_v2 op

* [MLU] fix data_type and del matmul

* [MLU] fix compile error

* [MLU] fix ci_check error
```
  229ec32a
- J
  
  add flatten op for mlu (#39530) · 4c5cec5c
  由 joeqiao12 提交于 2月 18, 2022
  
  4c5cec5c
- Z
  [MLU]add sync stream ops and broadcast pytest (#39518) · d2bd05b9
  由 zn 提交于 2月 18, 2022
```
* [MLU]add sync stream ops and broadcast pytest

* [MLU]fix broadcast pytest to add data type
```
  d2bd05b9
- J
  [Bug Fix]Fix gradient accumulator (#39577) · a7cbd3ef
  由 Jiabin Yang 提交于 2月 18, 2022
```
* merge legacy to fluid

* Remove legacy code

* Remove legacy code

* Remove DataType test

* Using Tensor directly instead of using EagerTensor

* support gradient_accumulation

* make test_imperative_lod_tensor_to_selected_rows longer

* make test_imperative_lod_tensor_to_selected_rows longer

* refine code

* Rename all EagerTensor to Tensor

* Rename some EagerTensor to Tensor

* rename EagerTensor to EagerVariable

* add more test

* fix different device gradient_accmulator bug

* merge develop

* remove useless tests
```
  a7cbd3ef
- W
  [Eager] Support GradientHook before running separate GradNode (#39638) · adf4b98f
  由 Weilong Wu 提交于 2月 18, 2022
```
* [Eager] Support GradientHook before running seperate GradNode

* Fix CI issue

* Fix CI issue
```
  adf4b98f
17 2月, 2022 19 次提交
- L
  avoid custom kernel deps on pten_function_api (#39661) · cbce0e60
  由 Leo Chen 提交于 2月 17, 2022
```
* pten matmul cuda kernel support bf16

* avoid custom kernel deps on pten_function_api

* Revert "pten matmul cuda kernel support bf16"

This reverts commit 5d520845b9a189375677276efb673235ed8e5ee0.

* refine code

* fix compile

* fix test_split_api
```
  cbce0e60
- L
  [pten] move bernoulli kernel to pten (#39590) · f86073c4
  由 Leo Chen 提交于 2月 17, 2022
```
* move bernoulli kernel to pten

* follow comments
```
  f86073c4
- L
  [new-exec] refactor code of interpretercore gc (#39617) · c3135426
  由 Leo Chen 提交于 2月 17, 2022
```
* relocate code of interpretercore gc
```
  c3135426
- S
  [bugfix] to concat input squash (#39593) · f29da150
  由 Sylwester Fraczek 提交于 2月 17, 2022
```
* fix and add more tests

* remove unwanted changes

* check only concat and elementwise

* move check to a function

* add todo comment

* Revert "fix ptq fc attr name fuse_activation->activation_type"

This reverts commit ffd023353a5e9b0fd15e81b9e9f9fe1794035017.
```
  f29da150
- J
  
  add reshape2 op for mlu (#39562) · 2d2f11d1
  由 joeqiao12 提交于 2月 17, 2022
  
  2d2f11d1
- Z
  
  fix selected_rows bug in C++ API (#39658) · b72d4cb4
  由 zyfncg 提交于 2月 17, 2022
  
  b72d4cb4
- T
  save the name lists of variables of a cinn subgraph as its attributes (#39622) · a1ad003c
  由 TeFeng Chen 提交于 2月 17, 2022
```
* save the name lists of the input,internal and output variables of a subgraph as its attribute

* fix compile error
```
  a1ad003c
- S
  move trunc to pten (#39543) · 4501abd6
  由 Sing_chan 提交于 2月 17, 2022
```
* move trunc to pten

* modify according to YuanRisheng's comment
```
  4501abd6
- C
  [PTen] Clean useless header in pten core (#39560) · c05cd7ed
  由 Chen Weihang 提交于 2月 17, 2022
```
* clean useless header in pten core

* fix compiled failed

* fix cmake target

* fix typo

* resolve conflict
```
  c05cd7ed
- H
  add softplus op for kunlun2. test=kunlun (#39555) · 9f99b591
  由 houj04 提交于 2月 17, 2022
```
* add softplus op for kunlun2. test=kunlun

* add softplus op for kunlun2. test=kunlun

* fix code style. test=kunlun

* fix code style. test=kunlun

* add more test cases. test=kunlun
```
  9f99b591
- W
  adaptive pool2d pass fix (#39600) · c1c5c1fc
  由 wenbin 提交于 2月 17, 2022
```
* first commit

* teller fix

* bug fix

* enable for pool2d only

* fix global_pooling issue

* pooling_type

* fix test
```
  c1c5c1fc
- Z
  [Pten] Remove register of matmul_v2 kernel (#39542) · db43b541
  由 zyfncg 提交于 2月 17, 2022
```
* remove register of matmul_v2 kernel

* delete matmul_v2 grad register in fluid
```
  db43b541
- 石
  
  change classes to pten, test=develop (#39643) · 8f2d14ad
  由石晓伟提交于 2月 17, 2022
  
  8f2d14ad
- H
  refine data loader api in infrt (#39580) · 1035d21f
  由 huzhiqiang 提交于 2月 17, 2022
```
* update generate_pd_op_dialect_from_paddle_op_maker.py

* update mlir tensor load interface

* refine

* fix bug

* fix

* refine

* fix

* 3

* fix

* codestyle
Co-authored-by: weishengying <1343838695@qq.com>
```
  1035d21f
- C
  
  move trace infer shape (#39517) · 1c9b2483
  由 Chen Weihang 提交于 2月 17, 2022
  
  1c9b2483
- C
  
  support set fp32 input for fp16 kernel (#39625) · 5fb9cf60
  由 Chen Weihang 提交于 2月 17, 2022
  
  5fb9cf60
- C
  [PTen] Remove fluid device context deps (#39604) · d63ece1f
  由 Chen Weihang 提交于 2月 17, 2022
```
* remove fluid device context deps

* fix compile failde
```
  d63ece1f
- B
  update inference ut to support nhwc format (#39551) · b4d3597a
  由 baoachun 提交于 2月 17, 2022
```
* update inference ut to support nhwc format

* update ut and pass OpCompat

* update ut

* update ut
```
  b4d3597a
- N
  
  Modified distribution kernel with Kernel Primitive API (#39563) · 1354652b
  由 niuliling123 提交于 2月 17, 2022
  
  1354652b
16 2月, 2022 4 次提交
- F
  
  [MLU] fix TensorAdd for mlu (#39523) · 24b8f63e
  由 fwenguang 提交于 2月 16, 2022
  
  24b8f63e
- Y
  
  [fleet exe] Update comm init for dist model (#39603) · 7d53a288
  由 Yuang Liu 提交于 2月 16, 2022
  
  7d53a288
- T
  
  optimize prior_box for kunlun, *test=kunlun (#39477) · e254e7c6
  由 TTerror 提交于 2月 16, 2022
  
  e254e7c6
- F
  
  [MLU] support adative pooling (#39500) · f138371c
  由 fwenguang 提交于 2月 16, 2022
  
  f138371c

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功