- 30 9月, 2022 3 次提交
- 29 9月, 2022 2 次提交
-
-
由 carryyu 提交于
* fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40.
-
由 光明和真理 提交于
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
-
- 28 9月, 2022 5 次提交
-
-
由 Chen Weihang 提交于
* remove needless using tensor * remove needless using tensor * resolve conflict * replace tensor using * fix format error * revert needless changing * fix rocm and npu compile error * fix cinn compile error * fix format error * fix mkldnn format error * fix mkldnn format error * fix cinn compile error * fix cinn compile error * fix cinn compile error * resolve conflict
-
由 Jacek Czaja 提交于
* added fc int8 tests * CI fix * added skipping UTs for GPUs * fixes for CI * added support for residual connections inside fc * fix for quant int8 bias * - lint Co-authored-by: Njakpiase <jakpia21@gmail.com>
-
由 limingshu 提交于
-
由 Sławomir Siwek 提交于
* Relu6 * remove fluid handler * add individual kernel signature * coding style * replace bounded_relu with clip * whitespace * code style
-
由 zyfncg 提交于
* fix clip_extra code in remove_training_info * revert rnn opmaker clear
-
- 27 9月, 2022 4 次提交
-
-
由 Chenxiao Niu 提交于
-
由 levi131 提交于
* init dropout * small format fix * fix pr comments * add value test
-
由 ceci3 提交于
-
由 zhangkaihuo 提交于
-
- 26 9月, 2022 5 次提交
-
-
由 Jacek Czaja 提交于
* - Conv grad changed for MD * - lint * - compilation fix * yet another lint
-
由 cifar10 提交于
-
由 zyfncg 提交于
-
由 Jiabin Yang 提交于
* support rsqrt_p * refine code and ut * add_prim_rsqrt * fix ut
-
由 zyfncg 提交于
-
- 23 9月, 2022 5 次提交
-
-
由 dongfangshenzhu 提交于
* add phi reduce_sum test=kunlun * add fhi reduce_sum test=kunlun * add fhi reduce_sum test=kunlun
-
由 Chenxiao Niu 提交于
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
-
由 Aurelius84 提交于
* [BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug * fix typo * fix typo
-
- 22 9月, 2022 11 次提交
-
-
由 Paulina Gacek 提交于
* Sum kernel migrated to phi * Static cast added, file name changed * OneDNNGetDataType to uppercase * refactoring * AddOneDNNHandler changed to SumOneDNNHandler
-
由 zyfncg 提交于
* clear extra attrs of look_up_table_v2 in opmaker * fix bug
-
由 Piotr Paturej 提交于
* Convert slice+grad oneDNN fluid kernels to PHI * Change mutable_data to Alloc * Refactor licences
-
由 王明冬 提交于
-
由 Sławomir Siwek 提交于
* gaussian random * mkldnn to onednn renaming * fix merge conflicts * remove fluid code * onednn renaming * gelu fwd * sort activations * gelu gradient * remove unused macros * merge conflicts * fix merge conflicts * remove extra contraint from gelu op
-
由 Leo Chen 提交于
-
由 Yuanle Liu 提交于
-
由 Hui Zhang 提交于
* fix sub sign reverse for mkldnn * refactor code as comment * remove useless * format code
-
由 Haohongxiang 提交于
* fix bugs of mp * fix bugs of mp * update * update * fix bug
-
由 carryyu 提交于
* Optimize topk's performance when k is small and input_width is large * 修改blockdim设置逻辑 * Update top_k_function_cuda.h
-
由 Chenxiao Niu 提交于
-
- 21 9月, 2022 5 次提交
-
-
由 ccrrong 提交于
* add fp16 support * update * update half * code format * fix unittest * fix rocm compile error * code format * code format * fix rocm compile error * fix rocm compile error
-
由 Piotr Paturej 提交于
-
由 RichardWooSJTU 提交于
-
由 ykkk2333 提交于
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun * migrate add_n kernep to phi, test=kunlun
-
由 Piotr Paturej 提交于
[PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and bilinear_interp oneDNN kernels (#45863) * Migrate concat+grad, expand+grad, fill_constant, nearest_interp_v2 and bilinear_interp_v2 oneDNN kernels to PHI * Remove old namespace variable * Fix invalid out dims error * Add mutable_data method to concat output * Add check for -1 dim before computing out_dims * Capitalize oneDNNGetDataType function name * Change fill_constant kernel to correct PHI kernel * Attempt to fix dims error * Fix fill_constant (full) kernel
-