提交 · 2f3b393d151afd6e82cb88095cb2079a9be65f61 · BaiXuePrincess / Paddle

31 8月, 2021 5 次提交

New whl release strategy with pruned nv_fatbin (#35239) · 2f3b393d

由 Zhanlue Yang 提交于 8月 31, 2021

[Background]
Expansion in code size can be irreversible in the long run, leading to huge release packages which
not only hampers user experience but also exceeds a hard limit of pypi.

In such, NV_FATBIN section takes up 86% of the compiled dylib size, owing to the vast number of GPU
arches supported.

This PR aims to prune this NV_FATBIN.

[Solution]
In the new release strategy, two types of whl packages will be involved:

Cubin PIP package:
PIP package maintains a smaller window for GPU arches support, containing
sm_60, sm_70, sm_75, sm_80 cubins, covering Pascal - Ampere arches

JIT release package:
This is a backup for Cubin PIP package, containing compute_35, compute_50, compute_60,
compute_70, compute_75, compute_80, with best performance and GPU arches coverage.

However, it takes around 10 min to install due to the JIT compilation.

[How to use]
The new release strategy is disabled by default.
To compile for Cubin PIP package, add this to cmake: -DCUBIN_RELEASE_PIP
To compile for JIT release package, add this to cmake: -DJIT_RELEASE_WHL

2f3b393d

W

update infer trt ut. (#35261) · 96e7d903
由 Wilber 提交于 8月 31, 2021

96e7d903
X

support fuse layers for ptq (#35015) · ef536250
由 XGZhang 提交于 8月 31, 2021

ef536250
A

NPU add elementwise_mod (#35245) · 561841d2
由 Aganlengzi 提交于 8月 31, 2021

561841d2
A

NPU add fill_zeros_like kernel (#35246) · aaaa9965
由 Aganlengzi 提交于 8月 31, 2021

aaaa9965

30 8月, 2021 3 次提交
- X
  [Paddle Inference-TRT]Adding six op unittest codes of TRT INT8 (#35130) · 39565147
  由 xiaoxiaohehe001 提交于 8月 30, 2021
```
* add_op_unittest
```
  39565147
- Z
  [NPU] Add log_loss op (#35010) · b94d7ff3
  由 zhulei 提交于 8月 30, 2021
```
* [NPU] Add log_loss op

* [NPU] Add log_loss op

* [NPU] Add log_loss op
```
  b94d7ff3
- X
  Set value (#34886) · 37d281c9
  由 xiongkun 提交于 8月 30, 2021
```
* tmp

* Tile - Assign - Crop

* Finish the set value npu kernel and test case in npu

* improve the error message

* Modify according to zhangliujie

* code review
```
  37d281c9
29 8月, 2021 1 次提交
- G
  
  test=document_fix (#35221) · 31cd1065
  由 Guoxia Wang 提交于 8月 29, 2021
  
  31cd1065
27 8月, 2021 31 次提交
- G
  
  test=document_fix (#35222) · 5dcff7c8
  由 Guoxia Wang 提交于 8月 27, 2021
  
  5dcff7c8
- J
  
  add uniform_ op and UT (#33934) · be29b8ee
  由 JYChen 提交于 8月 27, 2021
  
  be29b8ee
- X
  Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0
  由 xiaoting 提交于 8月 27, 2021
```
* add maxunppol2d op, test=develop

* fix typo, test=develop

* fix unpool unitest, test=develop

* fix unpool code-example, test=develop

* fix for unpool_op_unittest,test=develop

* fix example code, test=develop

* add noqa:F401, test=develop

* fix converage, test=develop

* fix unitest for unpool, test=develop

* rename unpool2d to unpool, test=develop

* rename unpool2d to unpool, test=develop
```
  ceee71a0
- G
  sparse_momentum_op is used to save w@GRAD memory for gather_op (#34942) · 234ce932
  由 Guoxia Wang 提交于 8月 27, 2021
```
* sparse_momentum_op is used to save w@GRAD memory for gather_op when gather from a large parameter
```
  234ce932
- W
  
  [hybrid] Fix row parallel linear bias (#35186) · 1533d7e2
  由 WangXi 提交于 8月 27, 2021
  
  1533d7e2
- 王
  
  fix the crash when input variable is bool type, test=develop (#35176) · ad522483
  由王明冬提交于 8月 27, 2021
  
  ad522483
- H
  
  Update test_cross_entropy_loss.py · e838cacf
  由 HydrogenSulfate 提交于 8月 26, 2021
  
  e838cacf
- H
  
  Update loss.py · cf6e543b
  由 HydrogenSulfate 提交于 8月 18, 2021
  
  cf6e543b
- H
  
  Update loss.py · 11e9d4e3
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  11e9d4e3
- H
  
  Update loss.py · 0c2d6bcb
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  0c2d6bcb
- H
  
  Update loss.py · 52804cd8
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  52804cd8
- H
  
  Update loss.py · 00467688
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  00467688
- H
  
  Update test_cross_entropy_loss.py · ee070fbd
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  ee070fbd
- H
  
  Update test_cross_entropy_loss.py · 7afd7f3d
  由 HydrogenSulfate 提交于 8月 16, 2021
  
  7afd7f3d
- H
  
  Update test_cross_entropy_loss.py · f6dc4b6b
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  f6dc4b6b
- H
  
  Update loss.py · f2df33e3
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  f2df33e3
- H
  
  Update test_cross_entropy_loss.py · 0bf32484
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  0bf32484
- H
  
  Update test_cross_entropy_loss.py · 23cc2142
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  23cc2142
- H
  
  Update test_cross_entropy_loss.py · d1a11056
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  d1a11056
- H
  
  Update loss.py · dd0140bd
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  dd0140bd
- H
  
  Update loss.py · 3ca813e6
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  3ca813e6
- H
  
  Update loss.py · b9f665d8
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  b9f665d8
- H
  
  Update loss.py · de972c50
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  de972c50
- H
  
  Update test_cross_entropy_loss.py · c61027e8
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  c61027e8
- H
  
  Update loss.py · b4a3f21c
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  b4a3f21c
- H
  
  Update loss.py · fa4805b4
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  fa4805b4
- H
  
  Update loss.py · 39e81532
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  39e81532
- H
  
  Update loss.py · 4b96ae62
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  4b96ae62
- H
  
  Update loss.py · 61b3a94c
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  61b3a94c
- H
  
  Update loss.py · 05eaa9bc
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  05eaa9bc
- H
  
  Update loss.py · a2327374
  由 HydrogenSulfate 提交于 8月 15, 2021
  
  a2327374

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致