- 03 1月, 2023 11 次提交
-
-
由 kangguangli 提交于
-
由 Guanghua Yu 提交于
-
由 zhoutianzi666 提交于
* Implement conv2d_fusion NHWC format using CUTLASS * Add unit testing for CUTLASS Conv in inference * Add experimental API for CUTLASS.
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
由 Yiqun Liu 提交于
* Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad. * Correct the axis when there is only 1 input in BroadcastKernel. * Add the calculate of output's shape.
-
由 zhaoyingli 提交于
* [Zero-Dim] reshape/reshape_/reverse 0D support * rm comment * change paddle.to_tensor to paddle.full * fix docs * update paddle.full
-
由 zhoutianzi666 提交于
-
由 Leo Chen 提交于
-
由 骑马小猫 提交于
-
由 Sanbu 提交于
-
由 Jianghai 提交于
* relu flops all * add annotations and tests * revision for codestyle
-
- 02 1月, 2023 1 次提交
-
-
由 Hulek 提交于
-
- 01 1月, 2023 1 次提交
-
-
由 gem5 提交于
-
- 31 12月, 2022 1 次提交
-
-
由 caozhou 提交于
-
- 30 12月, 2022 18 次提交
-
-
由 zhangbo9674 提交于
-
由 xiongkun 提交于
* bugfix: fix bugs in Indexable and support LayerDict * fix bugs.
-
由 wangxinxin08 提交于
* check weight shape of conv1d_transpose * add unittest case
-
由 zhangbo9674 提交于
* speedup getFNDAFile * add fnda_base for c++ ut cc file * fix bug * fix bug * fix bug * fix bug
-
由 HongyuJia 提交于
* add custom_cpu testcase * update test_custom_device_setup * update path to custom_runtime * fix cmd wait * test Linux only * setup once * integrate to one run_cmd * add pip install * change timeout * add debug string * add debug string * add debug string * use os.system and change module name * add runtime * add more debug message * continue debug * timestamp * fix testcase import bug * remove error message * set TIMEOUT property
-
由 zyfncg 提交于
* fix test_conv_bn_fuse_pass_cc * remove comment
-
由 Zhang Jun 提交于
* update conv to convNd * trigger ci
-
由 Leo Chen 提交于
-
由 risemeup1 提交于
* delete batch_norm * test * test * test * test * test * recover cmake_gen * debug
-
由 Roc 提交于
-
由 zyfncg 提交于
* support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * add extra output in op_compat * remove debug log
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
* clean custom_xpu testcase test_static_pe * use assert_allclose to solve precision error * adjust precision * flatten tensor * fix flatten
-
由 zhouzj 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 risemeup1 提交于
* fix_mac_build_problem * fix_mac_build_problem * fix_mac_build_problem
-
由 WangZhen 提交于
* Fix default GetExpectedKernelType for ops supported tensor attrs
-
由 姜永久 提交于
* rm legacy * clear in_legacy * fix tracer
-
- 29 12月, 2022 8 次提交
-
-
由 risemeup1 提交于
-
由 risemeup1 提交于
* fix_static_problem * test * fix_static_problem,test=document_fix
-
由 wangzhen38 提交于
* [fluid remove] rawconv
-
由 Aurelius84 提交于
* [D2SCinn]Support deliver skip_gc_vars into Graph * fix unittest * fix copy
-
由 Lin Manhui 提交于
-
由 ykkk2333 提交于
-
由 xu98bin 提交于
* auto parallel bf16
-
由 zmxdream 提交于
* fix load into memory * fix load into memory * fix code style
-