- 31 May, 2022 (1 commit)
Committed by Sławomir Siwek
* remove attrs from base op
* fix typos
* remove brelu
* undo removing code related to matmul
* remove whitespaces
* undo changes in matmul
* remove empty line
- 25 May, 2022 (1 commit)
Committed by Ming-Xu Huang
* Dynamic graph support to Automatic SParsity.
  1. Added dynamic support to ASP module (paddle.fluid.contrib.sparsity).
  2. Added ASP related unit-tests for the above changes.
  3. Put ASP module under paddle.static for now, waiting for APIs confirmation from Paddle.
* Modified documents of functions to have correct examples.
* Update in_dygraph_mode to paddle.in_dynamic_mode()
* Modified documents of functions and added comments
* Minor changes.
* Fix example errors in asp API.
* Code Change for Review
  1. Added more examples in documents.
  2. Changed test_asp_pruning_static.
* Minor changes
* Update ASP function documents.
* Update ASP function documents.
* Reduce test case size of asp pruning due to CI time limit.
* Update time limitation to some asp UTs.
* Fix sample code errors.
* Fix sample code errors.
* Fix sample code errors.
* Update time limitation to parts of ASP UTs.
* Update UTs to fit with CI.
* Reduce problem size in python/paddle/fluid/tests/unittests/asp/test_fleet_with_asp_dynamic.py
* Added paddle.asp
* Fixed type casting error of OpRole.Optimize in new dygraph mode.
* Made set_excluded_layers be compatible with 2.2
* Fix example code of calculate_density.
* Update code examples.
* Move paddle.asp to paddle.incubate.asp
* Fixed an example error of calculate_density
- 12 May, 2022 (1 commit)
Committed by Shuangchi He
- 11 May, 2022 (1 commit)
Committed by Zuza Gawrysiak
* Add int8 scales gathering pass for convolution
* Fix typo
* Add unittest
* Add corrected unit test
* Change test name
* Remove enabling mkldnn in test
* Speed up test
* Change max examples
* Add functional test
* Change test name
* Add new test case
* Rename pass
- 10 May, 2022 (1 commit)
Committed by piotrekobi
* Readd conv_affine_channel fuse pass as mkldnn pass
* Fix formatting
* Add new test to parallel_UT_rule.py
* Fix Coverage and Windows CI issues
* Revert "Fix Coverage and Windows CI issues" This reverts commit f33459846385c9fd51c07f9f44e7ff283a652637.
* Fix CI errors
* Remove unnecessary conv_eltwise_add_affine_channel fuse pass
* Remove test from parallel_UT_rule.py
- 04 May, 2022 (3 commits)
Committed by Guanghua Yu
Committed by Guanghua Yu
Committed by Guanghua Yu
- 28 April, 2022 (1 commit)
Committed by sneaxiy
* add gradient merge for DistributedFusedLamb
* use master acc gradient
* fix CI ut
* polish
* remove math_function_impl.h change
* fix test_update_loss_scaling_op.py
* try to fix XPU/NPU CI
* add gm ut
- 26 April, 2022 (1 commit)
Committed by WangXi
- 15 April, 2022 (1 commit)
Committed by Allen Guo
* add mixed-precision support for ipu
* restore cast_model_to_fp16 api
* update UTs
- 07 April, 2022 (1 commit)
Committed by joanna.wozna.intel
- 05 April, 2022 (1 commit)
Committed by Guanghua Yu
- 01 April, 2022 (1 commit)
Committed by danleifeng
- 28 March, 2022 (3 commits)
Committed by danleifeng
* add fused_seqpool_cvm op;test=develop
Committed by Ligoml
* update docs dtype(core.VarDesc.VarType)
* fix code style, test=document_fix
Co-authored-by: Chen Long <1300851984@qq.com>
Committed by Guanghua Yu
* add adaround post-quant method
- 25 March, 2022 (1 commit)
Committed by Jiabin Yang
* refactor eager flags
* fix flags error when we switch from eager to dygraph
* fix ci problem
* fix ci
* fix ci
* merge develop and fix code style
* merge develop and fix code style
* fix op test error
* fix op test error
* fix op test error
* fix op test error
* fix op test error
* merge develop
- 24 March, 2022 (1 commit)
Committed by zhangbo9674
* approve amp for intermediate_dygraph
* add amp_utils for intermediate_dygraph
* add amp needcast check for mlu & npu
* test unittest
* add SetGradNode for set_stop_gradient && add checktensor for GradientHooks
* refine code
* refine unittest of imperative_amp for new dygraph
* inplace api skip amp
* add test_imperative_qat_amp for intermediate amp
* refine code
* refine test_amp ci strategy
* refine unittest code
* refine amp_utils code
* refine amp getpromotetype for some special op
* refine unittest code
- 16 March, 2022 (3 commits)
Committed by joanna.wozna.intel
* Modify save_quant_model.py to support different input and output filenames
* Correct wrong order of arguments
Committed by Ming-Xu Huang
Committed by qipengh
- 15 March, 2022 (1 commit)
Committed by Guanghua Yu
* add some op for full_quantization
- 11 March, 2022 (1 commit)
Committed by Guanghua Yu
- 04 March, 2022 (1 commit)
Committed by Jiabin Yang
- 03 March, 2022 (2 commits)
Committed by Baibaifan
Committed by Jiabin Yang
* eager, test=develop
* fix bug, test=develop
* eager, test=develop
* merge legacy to fluid
* eager, test=develop
* eager, test=develop
* Refactor TensorAdd func by template and remove gradient_accumulation in eager
* Remove needless target name
* eager, test=develop
* eager, test=develop
* Use overload instead of template
* Remove legacy code
* Remove legacy code
* selectedrows, test=develop
* Remove DataType test
* eager, test=develop
* eager, test=develop
* support gan, test=develop
* Using Tensor directly instead of using EagerTensor
* support gradient_accumulation
* make test_imperative_lod_tensor_to_selected_rows longer
* make test_imperative_lod_tensor_to_selected_rows longer
* refine code
* ptb, test=develop
* Rename all EagerTensor to Tensor
* Rename some EagerTensor to Tensor
* rename EagerTensor to EagerVariable
* eager, test=develop
* eager, test=develop
* eager, test=develop
* eager, test=develop
* add more test
* eager, test=develop
* Support copiable selected rows and merge develop
* save load, eager, test=develop
* save load, eager, test=develop
* refine, test=develop
* remove useless _set_value method
* refine, test=develop
* refine, test=develop
* revert static_runner, test=develop
* EagerTensor to Tensor, test=develop
* refine, test=develop
* refine, test=develop
* clear grad, test=develop
* merge, develop
* merge, develop
* merge, test=develop
* merge, test=develop
* Support quant and part of slice
* support legacy static save
* extend slim tests time
* remove imperative on inference
* remove imperative on inference
* merge develop
* fix typo
* fix typo
* split slice related code into 2 parts for imperative and eager
* split slice from inference
* split slice from inference
* fix test_tensor_register_hook
Co-authored-by: Wang Huan <wanghuan29@baidu.com>
Co-authored-by: Weilong Wu <veyron_wu@163.com>
Co-authored-by: wanghuancoder <wanghuancoder@163.com>
- 01 March, 2022 (2 commits)
Committed by joanna.wozna.intel
* Add mobilenetv3_large performance test
* Disable the BF16 test if the device does not support BF16 computations
* Change test timeout
Committed by wenbin
* remove
* pass
* more pass
- 19 February, 2022 (1 commit)
Committed by sneaxiy
* add DistributedFusedLamb op
* polish code
* fix compile error
* compatible with pten changes
* fix rocm compile error
* improve coverage
* update upstream/develop
* fix cast_with_ptr.h
* add FLAGS_distributed_lamb_divide_nranks_when_allreduce=1
* fix clip before allreduce
* add use_master_param_norm
* code polish
* fix bug
* fix ROCM ci
- 14 February, 2022 (1 commit)
Committed by Sławomir Siwek
* mish unit tests
* code format
* remove unused imports
* code format
* remove hard-coded shape values
* remove timeouts
* remove timeouts v2
* restore timeouts
- 09 February, 2022 (1 commit)
Committed by Wangzheee
* rebuild matmul pass: trt and gpu_cpu
* rebuild matmul pass: trt and gpu_cpu
* rebuild matmul pass: trt and gpu_cpu
* rebuild matmul pass: trt and gpu_cpu
- 07 February, 2022 (1 commit)
Committed by arlesniak
* amp list updated
* tests updated
* gray list updated
* amp list updated
* test updated
- 27 January, 2022 (1 commit)
Committed by joanna.wozna.intel
* Update pass in quant2_int8_mkldnn_pass
* Back to the previous scale_matmul order
* Change place of cpu_quantize_placement_pass
- 21 January, 2022 (1 commit)
Committed by ceci3
- 13 January, 2022 (1 commit)
Committed by jakpiase
* base changes for mul reimplementation
* empty commit
* tmp save
* full implementation of mul bf16/fp32 fwd bwd
* CI fix
* CI rerun
* changed unity build cmake to avoid gpu issues
* removed mul mkldnn from unity build
* added skipping tests if not cpu_bf16
* CI fix
* CI fix
* CI fix
- 12 January, 2022 (1 commit)
Committed by Sylwester Fraczek
* fix conv act int8 scale
* add unit test for conv+hard_swish
- 06 January, 2022 (1 commit)
Committed by minghaoBD
- 05 January, 2022 (2 commits)
Committed by Jiaqi Liu
* make post training quant API support dataloader
Committed by joanna.wozna.intel
* Quantize nearest_interp and nearest_interp_v2
* Check if avx_core supported
* Add depthwise_conv2d to supported quantization list