- 28 12月, 2022 4 次提交
-
-
由 Haohongxiang 提交于
-
由 Yuanle Liu 提交于
-
由 WangZhen 提交于
-
由 wanghuancoder 提交于
* delete old dygraph pylayer
-
- 27 12月, 2022 7 次提交
-
-
由 zhangyikun02 提交于
-
由 risemeup1 提交于
* fix run_setup problem * test
-
由 xiaoting 提交于
* fix fold for large bs * fix fold for large bs
-
由 zhaoyingli 提交于
* [AutoParallel] quantization pass support export * support subgraph * move_presist_var_to_global_block * update unittest * fix ci-coverage * fix codestyle * fix fake_dequantize_op * remove unused var * fix ci error and aprroval error * add unittest for fp16 in test_dequant_linear * replace mutable data * fix unittest in non-cuda-core * fix unittest Co-authored-by: Ncarryyu <569782149@qq.com> Co-authored-by: Nwufeisheng <wfs1997@163.com>
-
由 zhangbo9674 提交于
* cinn use interpretercore * fix bug * fix compile bug * fix scope bug * refine code * refine code by comment * refine code by comment
-
由 Ruibiao Chen 提交于
* Support priority scheduling for standalone executor * Add CPU test
-
- 26 12月, 2022 6 次提交
-
-
由 ykkk2333 提交于
* migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun * fix dlrm throughput problem, test=kunlun
-
由 Roc 提交于
* revert concat and change concat to stack * let stack kernel support int8, uint8 and bool type
-
由 Yulong Ao 提交于
* [Auto Parallel] Rename methods of ProcessMesh * [Auto Parallel] Impl the python process_mesh by the c++ one * [Auto Parallel] Add some minor modifications * [Auto Parallel] Rename some methods * [Auto Parallel] Remove unnecessary codes * [Auto Parallel] Add back some removed files * [Auto Parallel] Fix bugs * [Auto Parallel] Fix a bug * Update process_mesh.cc * [Auto Parallel] Fix a bug
-
由 Ruibiao Chen 提交于
-
由 Ruibiao Chen 提交于
* Memory search for stream analyzer * Shrink redundant waiters
-
由 gem5 提交于
* Enable identity_scale_op_clean_pass by default * IdentityScaleOpCleanPass assert_var_not_persistable
-
- 25 12月, 2022 1 次提交
-
-
由 risemeup1 提交于
* support cinn ci * modify .whl dir * support CINN * support cinn * support kunlun200 and kunlun200test * support cinn * fix tests in kunlun200 * fix bug that can't find libxpu.so.1 * test
-
- 23 12月, 2022 14 次提交
-
-
由 QingshuChen 提交于
-
由 lzy 提交于
-
由 Yuanle Liu 提交于
-
由 JZ-LIANG 提交于
* memcpy overlap * memcpy newexe
-
由 Zhang Jun 提交于
-
由 Charles-hit 提交于
* fix matmul double and triple grad * remove some comment * add matmul_double_grad unit test * fix matmul triple grad * fix dot triple grad and add unit test * modify codestyle * fix dot_grad * refactor dot triple grad * disable some unit test * fix unit test * fix unit test in double grad
-
由 HongyuJia 提交于
* fix custom double_grad backward=None * fix custom_relu.cu bug && polish testcase of double_grad * remove old dynamic graph test
-
由 haosicheng 提交于
-
由 Hui Zhang 提交于
* add warp transducer code
-
由 Wangzheee 提交于
* add ouutput(CLSInds) for fused_token_prune
-
由 MarDino 提交于
-
由 MarDino 提交于
* register half datatype * register roll grad fp16 kernel
-
由 Yuanle Liu 提交于
-
由 houj04 提交于
-
- 22 12月, 2022 8 次提交
-
-
由 Wangzheee 提交于
-
由 Wangzheee 提交于
* fix reshape, gather_nd
-
由 Weilong Wu 提交于
* [eager] use CPUAllocator directly * modify pstring sizeof 48 default * rm CPU test for NaiveBestFitAllocator * fix Mac ci compile errors * use UNUSED to state unused_obj * mv UNUSED statement to allocator_facade.cc * fix roi_align * fix yolov3 test case * recover original code * recover original code * fix trt roi_align test Co-authored-by: Njerrywgz <jerrywgz@126.com>
-
由 xiaoxiaohehe001 提交于
-
由 xiaoxiaohehe001 提交于
-
由 Ruibiao Chen 提交于
-
由 Zhang Zheng 提交于
* Optimize performance of batch_norm_bwd with NHWC layout and infer mode * fix
-
由 Zhang Zheng 提交于
-