- 06 Mar, 2023 5 commits
-
-
Committed by Meteor Liu
* implement floor_grad by primitive logic * implement floor_grad by primitive logic * Merge branch 'develop' into floor_grad
-
Committed by niuliling123
-
Committed by Huang Jiyi
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
Committed by Vvsmile
* fix a bug which is triggered by the lack of __class__.op_type * remove two "self.__class__.op_type = self.op_type"
-
Committed by Sławomir Siwek
* matmul refactored * fc * SetOutMemDescWithLogicalLayoutFusesSupport * matmul_v2 * alpha support * group repetitive funcs * matmul utils * execute matmul methods * restore registered kernel names * split header and impl files * remove double negatives * increase coverage * add onednn tests to ctest * remove fusion logic from base matmuls
-
- 03 Mar, 2023 7 commits
-
-
Committed by zxcd
* add sigmoid composite rule * add python api * fix code style. * add check_prim=True * add sigmoid fp16 unit test. * fix code style. * rm bf16 check_prim * fix code style.
-
Committed by kangguangli
* find relevant testcase * remove with_data_parallel in test_dist_base * fix bugs * fix test_minst_dgc_nccl * fix typo * fix test_dist_mnist_gradient_merge * rm TestDistMnistNCCL2DGCMultiCards * fix dist_mnist * polish code * fix different PaddleCloudRoleMaker
-
Committed by Haohongxiang
-
Committed by Weilong Wu
* support warnings for extra attrs and axis in non default value * support extra_attr warnings and add test case * fix dict name * polish * polish * polish
-
Committed by wangxiaoning
* comp gather_nd_grad * fix * test no cinn * fix * fix cinn
-
Committed by zhouweiwei2014
-
Committed by niuliling123
-
- 02 Mar, 2023 16 commits
-
-
Committed by Ruibiao Chen
* Check structured kernel for new executor static build * Update code * Ready for resnet50 * Move transfer_dtype to phi * Ready for transformer * Fix CI errors * Fix layer_norm InferMeta * Remove layer_norm infermeta fix
-
Committed by chenxiao120660
-
Committed by ahahahahahaha
-
Committed by xiaoxiaohehe001
* add_trt_tile * tile_trt
-
Committed by Charles-hit
* fix prim_op_test when python api outs is different with kernel sig * add elementwise op prim test * fix unit test * add bfloat16 for full in static prim api * empty-commit * close bf16 test * polish elementwise tests
-
Committed by qizhaoaoe
* fluid clean: remove parallel and parallel_helper api * fix: fix the import path. * fix DataParallel imports issue
-
Committed by Jiabin Yang
* fix attrs copy error * fix bert by fix slice error * fix op test
-
Committed by xiongkun
* [dy2static] bugfix: make stop_gradient a cache key 1. make stop_gradient cache key in dy2static. * fix ci errors * fix ci error * fix ci error * fix ci error
-
Committed by wangshengxiang
-
Committed by Leo Chen
* register fp16 and bf16 kernel for uniform_random * fix compile * support selected_rows * add ut * revert cpu * fp16 test skip cpu
-
Committed by wangzhen38
* [cinn] concat_grad * [cinn] concat_grad * [cinn] concat_grad build success * [Add PGLBOX] fix unittest * [Add PGLBOX] fix unittest * [Add PGLBOX] fix codestyle * [cinn] update by comments * [cinn] update by comment * [cinn] add axis check
-
Committed by LoneRanger
-
Committed by gaoziyuan
-
Committed by Roc
* add composite op hard swish * add test grad * update apis calling * update date range * add ut * turn off cinn for 0-d shape * skip cinn
-
Committed by jiangcheng
-
Committed by Vvsmile
-
- 01 Mar, 2023 12 commits
-
-
Committed by Chitsing KUI
* flash attn * seed * almost * softmax * fix workspace * add unittest; linux only * fix setup * fix datatype include * fix setup typo * fix def scope * new error api * use paddle fork * fix attr bug; complete ut * update flash hash * fix rng reset * fix offset * fix comments
-
Committed by HongyuJia
* Add comments for #50886 * [Tensor Operants & Prim-Relevant] Tensor supports logical operants * add prim dynamic unit test * add prim static unit test
-
Committed by zqw_1997
* tmp gather vjp * support gather * remove useless code * fix compiling error * fix ut * add eager test * add eager test * add seed * small change * fix cpu error * fix transpose op compat * remove tensor index case * fix prim_cinn * small commit * add cumsum prim backward * small commit * skip axis=None test case * fix op generate error * fix static test error * remove unused code * fix static test error * small commit * skip cpu float16 test case * skip eager cpu cumsum float16 test case * add eager and static UT * fix ut * add composite backward rule * fix error * fix type error and format error * add try cpu+float16 test * fix test bugs * remove test for cpu+float16 and make y[0] be the grad arg * add cinn test * fix UT * fix the wrong dim of v in test cases * change y[0] to y[1] for grad in UT * reshape flatten out * Disable cinn single test * use scatter_nd_add * modify the reshape part of topk_grad * delete useless build file * to make the syntax right * modify bug * try use of put_along_axis * remove cinn test * reformat todo * add silu composite rule * fix code style. * add cinn test * fix composite grad maker code gen * add prim in cumsum op test * remove old test * fix typo * pass the static test * fix typo * modify optest and delete old test files * remove normal test_top_k_op test * fix typo * pass axis=None test case * buffer comment * for debug * add silu fp16 unit test. * add static guard * remove forward prim test * remove same name axis * modify the test_top_v2_op.py to pass all local tests * delete the useless testcase * fix mistake * add more testcases to test dtype16 and dtype32 --------- Co-authored-by: JiabinYang <360788950@qq.com> Co-authored-by: GGBond8488 <857631483@qq.com> Co-authored-by: zxcd <228587199@qq.com> Co-authored-by: Charles-hit <wanghao107@baidu.com>
-
Committed by wangxiaoning
* remove transpiler * Revert "remove transpiler" This reverts commit 46044ccd52011d45d7026786d331f264a6a8f645. * Revert "Revert "remove transpiler"" This reverts commit 80ad0945401b5b5efebac4baee0ec50a793d4405. * codestyle * fix setup * fix * fix
-
Committed by Zhang Ting
-
Committed by yunyaoXYY
* Add unittest from shilong * Add kernel code from shilong * fix codestyle * add broadcast_shape test * fix unittest * fix unittests * fix unittest * add 0D grad support * add 0D grad support * add 0D grad support * fix 0D tensor * fix 0D * fix xpu 0D * fix expand kernel * fix xpu expand * Fix 0D kernel * fix 0D * fix 0D * fix 0D * fix 0D * fix XPU top_k * cancel the modify of xpu * add XPU 0D tensor * fix 0D
-
Committed by wawltor
-
Committed by chenxiao120660
* fix bug of logsumexp * fix bug for logsumexp * fix bug for logsumexp
-
Committed by Yichen Zhang
* implement composite full_like and simple unit test * implement op tests for composite full_like op * some modification as reviewers suggested add cinn op test to CMakeLists.txt fix code style * fix code style * modify input args of prim fill_any_like op * resolve conflicts * resolve conflicts * modify python api and unit tests as suggested * resolve conflicts * resolve conflicts * use framework.dtype to convert dtype in Op test
-
Committed by LoneRanger
-
Committed by niuliling123
-
Committed by duanyanhui
* add support of int64 add for xpu * add transpose support for int64 * add randperm kernel * fix randperm * add distribute_fpn_proposal kernel * fix comment * add reduce_sum_int32
-