- 14 3月, 2023 1 次提交
-
-
由 cyber-pioneer 提交于
-
- 13 3月, 2023 39 次提交
-
-
由 kangguangli 提交于
-
由 lubiu 提交于
-
由 Aurelius84 提交于
-
由 TaoTao Li 提交于
* add all_gather and fix conflicts * fix code format * fix ut * fix broadcast ut
-
由 duanyanhui 提交于
-
由 heyanru 提交于
* refresh * compat * register * testop * fix * fix * fox * cast * cast * fix * type * fix * out * cast * fix * fix * fix * broad * broad * broad * fix * fix * fix * fix * fix * broad * broad * numel * fix * fix * fix * fix * cinn * fix * fix * fix * fix
-
由 ronnywang 提交于
* add UpdateWaitChain for process_group_custom * add UpdateWaitChain for process_group_custom
-
由 shentanyue 提交于
[Lite] Change the source code integration of Paddle Lite to the compilation library integration (#51405)
-
由 Mr.Juice 提交于
* fix docs 第七轮第一组 * fix docs 第七轮第一组
-
由 YuanRisheng 提交于
* remove transpose infershape * fix ci bugs * fix ci bugs * delete transpose infershape * fix ci bugs * fix ci bugs
-
由 mengziheng 提交于
* first test * add unsqueeze_op
-
由 iSerendipity 提交于
* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc
-
由 Zhenghai Zhang 提交于
* Add output defs for mode kernel * fix bug
-
由 wangxiaoning 提交于
* add fp16/bf16 * add grad bf16 * test name
-
由 Huang Jiyi 提交于
* platform::CUDAPinnedDeviceContext -> phi::GPUPinnedContext * replace platform::TraceEventCollector
-
由 iSerendipity 提交于
* remove fused_matmul from list * add infermeta for fused matmul
-
由 JYChen 提交于
-
由 Huang Jiyi 提交于
* add from_blob * fix test * fix test * fix codestyle * add gpu test * fix test * update * add comment * fix comment * update comment * fix CI bug * add thread_local * update * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug * fix cmake * fix CI-Py3 make * update * use api_reg * fix include * update * update * update * fix bug * fix bug * fix bug * fix bug
-
由 Sławomir Siwek 提交于
* mkldnn->onednn * fused softplus op + kernel * remove extra attributes * add missing handler * change var name
-
由 engineer1109 提交于
-
由 Sanbu 提交于
* Add output defs for conv3d_coo distribute_fpn_proposals kernel * fix
-
由 wenbin 提交于
* squeeze2_op * add ut * fix ut * fix static * modity ut
-
由 risemeup1 提交于
-
由 Charles-hit 提交于
* fix static prim api code gen * fix static prim api gen
-
由 risemeup1 提交于
-
由 kangguangli 提交于
* find relevant testcase * remove with_data_parallel * trigger CI * do not apply ParameterServerGraphOptimizer * remove useless optimizer * remove with_data_parallel in test_dist_base * fix test_fleet_base_3 * only reserve changes for GraphExecutionOptimizer * fix bug * fix test_minst_dgc_nccl * fix typo * fix test_dist_mnist_gradient_merge * rm TestDistMnistNCCL2DGCMultiCards * fix optimizer conflicts * fix dist_mnist * fix test_dist_hapi * delete test_fleet_graph_execution_meta_optimizer & test_fleet_graph_executor * temporally not delete unittest * fix unittests * fix ci * recover prune in python/paddle/hapi/model.py
-
由 kangguangli 提交于
-
由 xysheng-baidu 提交于
* Add expand composite rule * reshape x when dim_in less than dim_out * add tile op for expand * remove rensor shape case when comp prim * enable cinn case * dim_out can't be 0 * update test case for prim type
-
由 zhupengyang 提交于
-
由 junxiu777 提交于
* add register of KthvalueKernel add register of KthvalueKernel * Update kthvalue_kernel.cc * Update kthvalue_kernel.cu
-
由 zhoutianzi666 提交于
* use python to generate cutlass code * refine CommonConvKernelPart1, CommonConvKernelPart2 * remove useless code in generate_cutlass_code.sh * add more config in conv2d_residual * CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2 * add group conv support in util.cu * remove .sh * refine name * make name goodgit status! * add fuse_alpha * make code easy to understand * mot fopen generate in py * use python script to generate conv2d,group=1 cutlass code * use const & * use const & && use python script to generate conv2d/group=1 code
-
由 Little-chick 提交于
* Update interpreter_util.cc * Update auc_kernel.cc * Update auc_kernel.cu * Update auc_kernel.cc * Update auc_kernel.cu
-
由 kangguangli 提交于
* remove with_data_parallel in test_sync_batch_norm_op * fix debug code * polish code * polish code * polish code
-
由 jiangcheng 提交于
-
由 404988613 提交于
* Update interpreter_util.cc * Update interpreter_util.cc
-
由 张春乔 提交于
* fix the only one circle import in call_transformer.py * move define of CONVERSION_OPTIONS from convert_call_func.py to program_translator.py * delete the self import of program_translator.py * fix import failed problem * define variable in utils.py * move is_builtin to utils.py * move is_builtin to utils.py * fix import errors * fix import errors * fix something * Update python/paddle/jit/dy2static/call_transformer.py Co-authored-by: NAurelius84 <zhangliujie@baidu.com> * Update python/paddle/jit/dy2static/call_transformer.py --------- Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
由 houj04 提交于
* [XPU] add increment op. * fix ci
-
由 RuohengMa 提交于
* [XPU] add reduce_min_grad XPU kernel * add unittest for reduce_min xpu op
-
由 zhupengyang 提交于
-