- 17 7月, 2023 6 次提交
-
-
由 winter-wang 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 kangguangli 提交于
-
由 Chen Weihang 提交于
-
- 15 7月, 2023 1 次提交
-
-
由 RedContritio 提交于
-
- 14 7月, 2023 18 次提交
-
-
由 zhangbo9674 提交于
* add code * fix bug * refine code * refine code * fix bug
-
由 caozhou 提交于
* distribute best cfg * adapt to multi args transmission * update metric extracting * fix bugs of prune and reading log * fix time default value * remove time record * adjust the order of searching dim * fix prune bugs * fix adding cfg bug * fix multi nodes bug * reset status * remove alarm and set logdir * deepcopy ctx * change alarm * fix restart bug * add exit * best no need alarm * add warmup time
-
由 Guo Sheng 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 Wang Xin 提交于
-
由 ronnywang 提交于
-
由 zhupengyang 提交于
-
由 ronnywang 提交于
-
由 Siming Dai 提交于
-
由 kangguangli 提交于
* add feed in op_compat.yaml * remove input mapping
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 zhangbo9674 提交于
* add inplace interface * support inplace * refine code * fix bug * fix bug * refien code * add file * add interface * refine code * refine code * add phi kernel instruction * refine code * add test * delete unuse code * add test * add test * add deps * delete unused code * fix bug * fix bug
-
由 hong19860320 提交于
-
由 Tian Zheng 提交于
* Update CUDNN Frontend API to v0.9.1 - Remove old patches - Remove workarounds that are no longer needed * Fix test_switch_autotune
-
由 hong 提交于
-
- 13 7月, 2023 15 次提交
-
-
由 Yuanle Liu 提交于
* copy dense_tensor.h to inference lib * update * update
-
由 Yuanle Liu 提交于
-
由 niuliling123 提交于
-
由 xiaoguoguo626807 提交于
-
由 freeliuzc 提交于
* add init value for CudaSwishFunctor * add new phi kernel fusedBiasActKernel
-
由 Yichen Zhang 提交于
-
由 Ruibiao Chen 提交于
* Support nvprof for auto parallel * Fix CI errors * Fix CI errors
-
由 Charles-hit 提交于
* [prim]support fp16 for instance_norm and instance_norm_grad * support fp16 and bfp16 dtype for instance_norm prim rules * fix new ir test --------- Co-authored-by: Ncxxly <chenxx_id@163.com>
-
由 lil-Xing 提交于
* add phi operator c_concat and ut * update create_var use * update copyright
-
由 hong 提交于
* new ir support builtin slice op * fix phi kernel adaptor bug
-
由 gouzil 提交于
* [tools] Add CI for assert allclose. * fix * fix \s * update * rm demo1 * add demo1 * fix * rm demo;test=document_fix
-
由 zhangyuqin1998 提交于
* Move compare_raw_kernel to legacy * fix * Update compare_kernel.cc * Move compare_raw_kernel to legacy
-
由 Zhang Zheng 提交于
* [CINN] Schedule error message optimization * format code style * add test * fix format * using CINN_THROW and using flags * optimize error msg * do not use abtract class of error hanlder * fix header
-
由 ronnywang 提交于
-
由 Leo Chen 提交于
* Support AMP program for onnx QAT API * Integrate QAT into distributed optimizer * Reduce the size of test data and increase time limit * Use logger and reduce time limit of unittests * Rename and move unittest into fleet test * Test qat_init API
-