- 13 3月, 2023 15 次提交
-
-
由 mengziheng 提交于
* first test * add unsqueeze_op
-
由 wangxiaoning 提交于
* add fp16/bf16 * add grad bf16 * test name
-
由 JYChen 提交于
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
由 wenbin 提交于
* squeeze2_op * add ut * fix ut * fix static * modity ut
-
由 kangguangli 提交于
* find relevant testcase * remove with_data_parallel * trigger CI * do not apply ParameterServerGraphOptimizer * remove useless optimizer * remove with_data_parallel in test_dist_base * fix test_fleet_base_3 * only reserve changes for GraphExecutionOptimizer * fix bug * fix test_minst_dgc_nccl * fix typo * fix test_dist_mnist_gradient_merge * rm TestDistMnistNCCL2DGCMultiCards * fix optimizer conflicts * fix dist_mnist * fix test_dist_hapi * delete test_fleet_graph_execution_meta_optimizer & test_fleet_graph_executor * temporally not delete unittest * fix unittests * fix ci * recover prune in python/paddle/hapi/model.py
-
由 kangguangli 提交于
-
由 xysheng-baidu 提交于
* Add expand composite rule * reshape x when dim_in less than dim_out * add tile op for expand * remove rensor shape case when comp prim * enable cinn case * dim_out can't be 0 * update test case for prim type
-
由 zhoutianzi666 提交于
* use python to generate cutlass code * refine CommonConvKernelPart1, CommonConvKernelPart2 * remove useless code in generate_cutlass_code.sh * add more config in conv2d_residual * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2 * add group conv support in util.cu * remove .sh * refine name * make name goodgit status! * add fuse_alpha * make code easy to understand * mot fopen generate in py * use python script to generate conv2d,group=1 cutlass code * use const & * use const & && use python script to generate conv2d/group=1 code
-
由 kangguangli 提交于
* remove with_data_parallel in test_sync_batch_norm_op * fix debug code * polish code * polish code * polish code
-
由 jiangcheng 提交于
-
由 张春乔 提交于
* fix the only one circle import in call_transformer.py * move define of CONVERSION_OPTIONS from convert_call_func.py to program_translator.py * delete the self import of program_translator.py * fix import failed problem * define variable in utils.py * move is_builtin to utils.py * move is_builtin to utils.py * fix import errors * fix import errors * fix something * Update python/paddle/jit/dy2static/call_transformer.py Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * Update python/paddle/jit/dy2static/call_transformer.py --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
由 houj04 提交于
* [XPU] add increment op. * fix ci
-
由 RuohengMa 提交于
* [XPU] add reduce_min_grad XPU kernel * add unittest for reduce_min xpu op
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
- 12 3月, 2023 1 次提交
-
-
由 wanghuancoder 提交于
* delete old dygraph op test
-
- 10 3月, 2023 18 次提交
-
-
由 Nyakku Shigure 提交于
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable
-
由 Infinity_lee 提交于
-
由 tifa 提交于
* fix-debug-mode-exception * fix code style * use try-except solve
-
由 zqw_1997 提交于
-
由 kangguangli 提交于
* remove with_data_parallel * remove multidevice-optimizer-in-controlflow checks and fix ci
-
由 kangguangli 提交于
-
由 Charles-hit 提交于
-
由 sneaxiy 提交于
* add attn_bias.py * add Python interface * add license * add test_attn_bias.py * fix CPU test error * fix ci error
-
由 陈沧夜 提交于
-
由 Charles-hit 提交于
* support variable parameter in optest * add error msg for use tensor attr in static code gen * fix static code gen * fix prim op test * modify comment * fix op test * fix ci * remove code
-
由 wuhuachaocoding 提交于
-
由 Aurelius84 提交于
* [framework]support pass stop_gradient in static mode * fix control_flow op stop_gradient
-
由 kangguangli 提交于
* remove with_data_parallel in custom op test * finish TestCustomOpReluModelStaticMultiDevice * fix typo * add checks for relu output * fix ci * fix ci * fix compile checks * fix coverage ci
-
由 niuliling123 提交于
-
由 liuruyan 提交于
-
由 Zhang Zheng 提交于
* [AMP OP&Test] Modify the logic of comparing grad in bfloat16
-
由 Chitsing KUI 提交于
-
- 09 3月, 2023 6 次提交
-
-
由 yangjianfengo1 提交于
* AMP arange & Test * fix arange bfloat16 dtype * update for review * update for review2 * fix tile * update * fix ci * r * f * fix windows ci * update bfloat data * fix bloat16 input * add print * Update test_where_op.py * update kernel * del repeat * update review
-
由 yangjianfengo1 提交于
* where op test * update bfloat16 * fix * fix windows ci * update bfloat16 data * fix bloat16 x * reset * fix randint * add print * add delta * cancel print * code style * update revirew
-
由 zqw_1997 提交于
* move fluid.utils to paddle.utils.layers_utils * fix error * delete original fluid layers utils * remove import and old utils * remove more old utils import * change import path of fill_constant in the layers_utils.py * fix mistake * fix error * expose in __init__.py * for comment * when change the ref of func is_sequence, it should change to the root of is_sequence instead * for codecheck
-
由 GGBond8488 提交于
* add prim erf grad * add yaml config for prim erf grad * add math.h * add cmath * add math defines * use define math * use define math * define M_2_SQRTPI * M_2_SQRTPI math * try math.h * fix typro * remove pow in erf grad * use new optest * add fp16 fp32 test * remove fp16 test
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict
-
由 will-jl944 提交于
* add softplus double grad * use constant method
-