提交 · f556549427a7d5319b29d3d895ee45dc99ae449d · 机器未来 / Paddle

11 10月, 2022 1 次提交
- C
  
  speedup ChannelClipAndQuantDequantKernelQuantAxis1 kernel (#46471) (#46551) · f5565494
  由 ceci3 提交于 10月 11, 2022
  
  f5565494
01 8月, 2022 1 次提交

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

24 6月, 2022 1 次提交
- G
  
  fix quantization clip and round Attribute (#43764) · 491b87b4
  由 Guanghua Yu 提交于 6月 24, 2022
  
  491b87b4
21 6月, 2022 1 次提交
- G
  
  Update quantization round and clip calculation rules (#42695) · 75144f13
  由 Guanghua Yu 提交于 6月 21, 2022
  
  75144f13
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
31 5月, 2022 1 次提交
- L
  Fix the underflow of fp16 fake quantize operators (#43088) · 0ae8a2d6
  由 Leo Chen 提交于 5月 31, 2022
```
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
```
  0ae8a2d6
19 4月, 2022 1 次提交
- L
  
  Add float16 to fake quantize/dequantize OP (#40664) · 5d422287
  由 Leo Chen 提交于 4月 19, 2022
  
  5d422287
08 4月, 2022 1 次提交
- W
  
  Fix fake quant cuda kernel (#41305) · 330582e2
  由 whs 提交于 4月 08, 2022
  
  330582e2
05 4月, 2022 1 次提交
- G
  
  add new format of quantization (#41041) · b72a7ebb
  由 Guanghua Yu 提交于 4月 05, 2022
  
  b72a7ebb
23 3月, 2022 1 次提交
- W
  
  Fix quant and dequant cuda kernels when quant_axis==1 (#40772) · 8991e9ae
  由 whs 提交于 3月 23, 2022
  
  8991e9ae
17 3月, 2022 1 次提交

Improve the performance of fake quantize OP (#40491) · 827b6a0e

由 Leo Chen 提交于 3月 17, 2022

* Move the computation of moving average scale to device

* Use register to save local maximum in a thread

827b6a0e

20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

17 1月, 2022 1 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

03 12月, 2021 1 次提交
- R
  refine structure for cuda and rocm (#37202) · a6d2fddb
  由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
  a6d2fddb
27 10月, 2021 1 次提交
- W
  
  Fix inverse in fake quant (#36762) · 542ba214
  由 whs 提交于 10月 27, 2021
  
  542ba214
21 6月, 2021 1 次提交
- C
  Combine amp and qat (#33484) · f88af205
  由 cc 提交于 6月 21, 2021
```
* Combine amp and qat
* add unit test
```
  f88af205
26 3月, 2021 1 次提交

[dygraph qat] Use layer to calculate output scale (#31861) · b47478ef

由 cc 提交于 3月 26, 2021

* Use layer to calculate output scale
* add backward for moving_average_abs_max_scale and save output scales to op's attr

b47478ef

03 3月, 2021 1 次提交
- Q
  
  [ROCM] update fluid operators for rocm (part7), test=develop (#31307) · 3b9db171
  由 Qi Li 提交于 3月 03, 2021
  
  3b9db171
17 11月, 2020 1 次提交
- C
  
  Fix fake_quant error when cout > 1024, test=develop (#28603) · 65aac811
  由 cc 提交于 11月 17, 2020
  
  65aac811
21 9月, 2020 1 次提交

Quant op dev (#25932) · 02606d45

由 huangxu96 提交于 9月 21, 2020

* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.

* Finished channel-wise quantize strategy in imperative quantization.

* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
Add Cuda code of ChannelWiseQuantDequantMaxAbsOp

* Add quant_axis for channel_wise quant.

* fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.

* Added some assert infomation and fixed some coding style mistakes.

02606d45

19 8月, 2020 1 次提交

[Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639) · 3f816bc8

由 cc 提交于 8月 19, 2020

* Conv2d_transpose and mul support channnelwise quantization, test=develop
* Skip collecting out threshold for output tensor of which the type is not fp32 or fp64, test=develop
* Fix error in test_user_defined_quantization, test=develop
* Add depthwise_conv_bn_fuse, test=develop
* Add conv_transpose_bn_fuse_pass for post_training_quant, test=develop

3f816bc8

09 7月, 2020 1 次提交
- Z
  
  add the c++ part of Imperative QAT. test=develop (#25446) · bb45af02
  由 Zhen Wang 提交于 7月 09, 2020
  
  bb45af02
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

19 3月, 2020 1 次提交

Fix div zero in fake quantize op (#22966) · 915b892a

由 Liufang Sang 提交于 3月 18, 2020

* fix div zero test=develop

* fix div zero test=develop

* add hostdevice function test=develop

* add eps when is zero test=develop

915b892a

27 5月, 2019 1 次提交

Code clean of Allocator (#17602) · 4aa931dd

由 Zeng Jinle 提交于 5月 27, 2019

* Revert "Revert "Fix allocator bug""

This reverts commit 174d0d0b.

* Revert "fix travis ci"

This reverts commit 5656fa9f.

test=develop

* add inlined_vector.h, test=develop

* add inlined_vector_test,test=develop

* clean code of allocator,test=develop

* delete zero_size_allocator.h,test=develop

* fix failed unittest,test=develop

4aa931dd

21 5月, 2019 1 次提交
- Z
  add quant_dequant_moving_avg_max_abs op (#17480) · ff7f911b
  由 Zhaolong Xing 提交于 5月 21, 2019
```
* add quant_dequant_moving_avg_max_abs op
test=develop

* add more note for quantdequant op
test=develop
```
  ff7f911b
07 5月, 2019 1 次提交

Quant output scale (#17215) · a914d9b1

由 Zhen Wang 提交于 5月 07, 2019

* Add MovingAverageAbsMaxScale operator which is only used for calculating the quantization scale.

* test=develop

* change the output into inplace. test=develop

* Revert "test=develop"

This reverts commit 696cf626.

* Revert "change the output into inplace. test=develop"

This reverts commit a19acd20.

* test=develop.

* update the MovingAverageAbsMaxScaleOp test. test=develop

a914d9b1

13 4月, 2019 1 次提交
- Z
  
  fix the hang bugs of memory copying. test=develop · d988a24a
  由 Zhen Wang 提交于 4月 13, 2019
  
  d988a24a
21 3月, 2019 1 次提交
- Z
  
  rewrite the cuda kernels of channel_wise_quant_op and channe_wise_dequant_op. test=develop · 8965819f
  由 Zhen Wang 提交于 3月 21, 2019
  
  8965819f
15 3月, 2019 1 次提交
- add moving average absmax op and fix bug (#15155) · 81b4fad8
  由视言提交于 3月 15, 2019
```
* Add moving average absmax op in quantilize-aware training.
```
  81b4fad8
04 3月, 2019 1 次提交
- Z
  
  add channel wise quantize op. · 545247d7
  由 Zhen Wang 提交于 3月 04, 2019
  
  545247d7
04 9月, 2018 1 次提交
- M
  
  Fix fake_quantize_op · 8059445f
  由 minqiyang 提交于 9月 04, 2018
  
  8059445f
03 9月, 2018 1 次提交
- Q
  Improve and fix fake_quantize_op (#13092) · 9bd933d3
  由 qingqing01 提交于 9月 03, 2018
```
* Improve and fix fake_quantize_op.
```
  9bd933d3
30 8月, 2018 1 次提交
- D
  
  Improve and fix fake_quantize_op. · 251eb372
  由 Dang Qingqing 提交于 8月 30, 2018
  
  251eb372
28 8月, 2018 1 次提交
- D
  
  Refine fake_quantize_op. · bf85cded
  由 Dang Qingqing 提交于 8月 28, 2018
  
  bf85cded
11 7月, 2018 1 次提交

Add fake_quantize_op. (#11359) · 8e4b225f

由视言提交于 7月 11, 2018

* Add a fake_quantize_op, which quantize an input tensor to a tensor with lower bits.

8e4b225f

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致