- 01 Feb 2023, 11 commits
-
-
Committed by gouzil
* [Divide by 0 Error] add norm check
* [Divide by 0 Error] fix x AttributeError
* [Divide by 0 Error] norm check migrate to c++
-
Committed by limingshu
* A leap of try for cudaLaunchCooperativeKernel
* fix bugs
* Totally replace the lar cuda kernel
* Fix bugs
* fix code according to comments
* fix codes according to review comments
* adding some function overload
* relocate the power operation.
* add bf16 support for index select relevant ops
* revert bf16 type change.
* add changes for more op
* fix code writing bugs
-
Committed by zhoutianzi666
-
Committed by wangxiaoning
-
Committed by zqw_1997
-
Committed by zxcd
* add clip_grad_norm_ api.
* fix docs and some details according to the comments.
* fix code style.
* fix no_grad problem, and fix doc.
* fix code style.
* fix doc and remove type information
-
Committed by RedContritio
* add axis check for unbind
* add axis range check for unbind
* update unittest and axis validation for unbind
* add unittest invalid axis for unbind
* restore axis extract for unbind
-
Committed by RedContritio
-
Committed by LiYuRio
-
Committed by Aurelius84
* [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore
* fix baseline unittest config
* fix code style
-
Committed by limingshu
* profile reduce kernel for fp16 and reduceHigherdim
* use reinterpret_cast
* fix for CI on ROCm
* add Macro for ROCm
* ROCm CI config
* ROCm CI config
* unit test repair
* pull
* add common_funcs.h
* reduceType
* Update reduce_function.h
* not higher
* rename
* implement of matmul using cublasLt instead of cublas
* cublasLt bugfix
* Update matmul_kernel_impl.h
* Update matmul_kernel_impl_via_blasLt.h
* for-loop-algo
* PR comments changes
* add macro
* ci unused variable isCublasLt
* ci unused variable isCublasLt macro
* split matmul to autotune
* rewrite the split kernel with segmented_array
* rewrite the split kernel with segmented_array
* rewrite the split kernel with segmented_array
* add some method for cuda_graph
* fix bugs for rocm
* change for ci-error
* i dont know why ci-model-benchmark gives a shit error, so i recover codes with original one to see if original codes work.
* add some changes for passing mode_benchmark and coverage ci
* fix ci error
* fix ci-rocm error
* add some changes for header
---------
Co-authored-by: zhangbopd <1299246947@qq.com>
Co-authored-by: Bo Zhang <105368690+zhangbopd@users.noreply.github.com>
-
- 31 Jan 2023, 29 commits
-
-
Committed by RedContritio
-
Committed by wenbin
* gn_silu
* add ut
* set TIMEOUT
* correct comments
* comments
* disable windows ut
* rename parameter
-
Committed by 姜永久
* check dygraph on for op tests
* reset eigh and modify prelu&sign
* update eager_op_test
* lint
* add more ops
* fix reduce
* modify reduce test
* reset reduce_op
* modify matmul test
* revert prelu
-
Committed by 姜永久
* update elementwise ops tests
* add more ops
* modify sum&split
* lint
* rm check_dygraph
* revert pow
* modify add for cpu test
* revert reshape
* modify min
-
Committed by 姜永久
* more ops
* revert some ops
* reset some ops
-
Committed by wangshengxiang
-
Committed by wenbin
* disable integer
* disable integer
* add cast layer
-
Committed by Zhang Jun
-
Committed by niuliling123
-
Committed by Charles-hit
* polish static grad op maker gen
* fix some bugs
* fix static code gen
* solve conflict
* modify composite grad maker name
* integrate phi and fluid info in static code gen
* rename some composite maker
* modify static code gen format
-
Committed by Zhang Jun
-
Committed by PuQing
* add FP16 dtype for CastNumpy2Scalar
* fix throw message
* add test
* fix SyntaxWarning
* test skip for float16
* fix dtype mistakes
-
Committed by 张春乔
* fix div 0 error of NoamDecay
* add unittest
* Update lr.py
-
Committed by ronnywang
* [CustomDevice] add custom device api
* update
* update
* test=document_fix
* update
* update
* add examples
-
Committed by jameszhang
* [KUNLUN] rename test_pool_max_op.py
* update xpu toolchain
-
Committed by HongyuJia
-
Committed by RedContritio
* fix incorrect output shape of broadcast
* add unittest
-
Committed by Roc
-
Committed by zhangkaihuo
-
Committed by zhangbo9674
-
Committed by MarDino
-
Committed by 张春乔
* fix mod 0 error
* fix div 0 error in floormod
-
Committed by Yuanle Liu
-
Committed by zqw_1997
* remove fluid.ir.RegisterPassHelper PassDesc and RegisterPass
* proto import problems
* change import way of pass_desc_pb2
* change sys.path
* change the way of import framework_pb2
* add fluid_path directory from path.dirname
* fluid_path changed
-
Committed by wangxiaoning
-
Committed by 201716010711
-
Committed by xiaoting
* support 0d tensor for interpolate
* support 0d tensor for interpolate
* add xpu unittest for interp
* update unittest for interpolate
* fix coverage
* fix code style
* fix for coverage
* fix coverage
-
Committed by Leo Guo
-
Committed by TeFeng Chen
* support inplaced variable in cinn_launch
* fix error hint when compiling
* fix inplaced output variable of the subgraph
* skip CinnCompiler check
* using existed definition
* fix namespace reference error
* modify error message
* update cinn tag
* fix namespace
* skip enforce check
* fix unittest attribute throw
-