- 30 Sep, 2022 5 commits
-
-
Committed by 光明和真理
-
Committed by HongyuJia
* opt GetExpectedKernelType code of conv_transpose_op * fix if error
-
Committed by HongyuJia
-
Committed by HongyuJia
-
Committed by sneaxiy
* support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * add bfloat16 to selu_grad to pass CI * fix selu grad compilation error
-
- 29 Sep, 2022 2 commits
-
-
Committed by carryyu
* fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40.
-
Committed by 光明和真理
Co-authored-by: liupeiyu <liupeiyu@cambricon.com>
-
- 28 Sep, 2022 5 commits
-
-
Committed by Chen Weihang
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
Committed by Jacek Czaja
* added fc int8 tests * CI fix * added skipping UTs for GPUs * fixes for CI * added support for residual connections inside fc * fix for quant int8 bias * - lint Co-authored-by: jakpiase <jakpia21@gmail.com>
-
Committed by limingshu
-
Committed by Sławomir Siwek
* Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
Committed by zyfncg
* fix clip_extra code in remove_training_info * revert rnn opmaker clear
-
- 27 Sep, 2022 4 commits
-
-
Committed by Chenxiao Niu
-
Committed by levi131
* init dropout * small format fix * fix pr comments * add value test
-
Committed by ceci3
-
Committed by zhangkaihuo
-
- 26 Sep, 2022 5 commits
-
-
Committed by Jacek Czaja
* - Conv grad changed for MD * - lint * - compilation fix * yet another lint
-
Committed by cifar10
-
Committed by zyfncg
-
Committed by Jiabin Yang
* support rsqrt_p * refine code and ut * add_prim_rsqrt * fix ut
-
Committed by zyfncg
-
- 23 Sep, 2022 5 commits
-
-
Committed by dongfangshenzhu
* add phi reduce_sum test=kunlun * add fhi reduce_sum test=kunlun * add fhi reduce_sum test=kunlun
-
Committed by Chenxiao Niu
-
Committed by YuanRisheng
-
Committed by zyfncg
-
Committed by Aurelius84
* [BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug * fix typo * fix typo
-
- 22 Sep, 2022 11 commits
-
-
Committed by Paulina Gacek
* Sum kernel migrated to phi * Static cast added, file name changed * OneDNNGetDataType to uppercase * refactoring * AddOneDNNHandler changed to SumOneDNNHandler
-
Committed by zyfncg
* clear extra attrs of look_up_table_v2 in opmaker * fix bug
-
Committed by Piotr Paturej
* Convert slice+grad oneDNN fluid kernels to PHI * Change mutable_data to Alloc * Refactor licences
-
Committed by 王明冬
-
Committed by Sławomir Siwek
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * gelu fwd * sort activations * gelu gradient * remove unused macros * merge conflicts * fix merge conflicts * remove extra contraint from gelu op
-
Committed by Leo Chen
-
Committed by Yuanle Liu
-
Committed by Hui Zhang
* fix sub sign reverse for mkldnn * refactor code as comment * remove useless * format code
-
Committed by Haohongxiang
* fix bugs of mp * fix bugs of mp * update * update * fix bug
-
Committed by carryyu
* Optimize topk's performance when k is small and input_width is large * Modify the blockdim setting logic * Update top_k_function_cuda.h
-
Committed by Chenxiao Niu
-
- 21 Sep, 2022 3 commits
-
-
Committed by ccrrong
* add fp16 support * update * update half * code format * fix unittest * fix rocm compile error * code format * code format * fix rocm compile error * fix rocm compile error
-
Committed by Piotr Paturej
-
Committed by RichardWooSJTU
-