- 17 10月, 2022 1 次提交
-
-
由 Leo Guo 提交于
test=kunlun
-
- 14 10月, 2022 2 次提交
-
-
由 Rayman 提交于
-
由 Wang Bojun 提交于
* draft with debug print
-
- 13 10月, 2022 5 次提交
-
-
由 xiaohemaikoo 提交于
-
由 zhouweiwei2014 提交于
-
由 carryyu 提交于
-
由 Zhang Ting 提交于
* Revert "【Hackathon No.56&38】deformable_conv_v1 算子实现 float16 数据类型支持&前向运行加速 (#46111)"
-
由 Zhang Zheng 提交于
* Correct the logic and remove unnecessary template param * fix error throw * fix print format * fix ci
-
- 12 10月, 2022 5 次提交
-
-
由 Zhang Ting 提交于
This reverts commit 8a5f17e8.
-
由 zhangkaihuo 提交于
-
由 sneaxiy 提交于
* fix some ops for int64 range * update error message
-
由 zhouweiwei2014 提交于
* [Zero-Dim] support input 0D Tensor for unary api * fix CI
-
由 zhangkaihuo 提交于
-
- 11 10月, 2022 2 次提交
-
-
由 Feiyu Chan 提交于
-
由 Chen Weihang 提交于
* remove using lodtensor part1 * polish history code format
-
- 10 10月, 2022 4 次提交
-
-
由 Rayman 提交于
-
由 Paulina Gacek 提交于
* op migrated, Copy(OneDNNContext, ...) added * mutable_data & op registration in fluid removed * refactoring * OneDNNGetDataType to uppercase * missing cpu check added, handler moved to .h file * name changed to transpose_grad * Copy changed back to TensorCopy * Resizing corrected, Copy(OneDNNContext) removed
-
由 Rayman 提交于
-
由 Rayman 提交于
support fp16 for deformable conv
-
- 09 10月, 2022 4 次提交
-
-
由 zhangkaihuo 提交于
-
由 zhangkaihuo 提交于
-
由 Sławomir Siwek 提交于
-
由 Sławomir Siwek 提交于
* enable hard_swish_grad unit test * remove unused argument
-
- 03 10月, 2022 1 次提交
-
-
由 Jacek Czaja 提交于
* - some more MD changes * - lint * - compilation fixes * - compilation fixes * - lint * - fix
-
- 30 9月, 2022 9 次提交
-
-
由 Zhang Zheng 提交于
* Optimize performance of depthwise_conv_bwd of filter * op-benchmark * fix * op benchmark * merge bwd
-
由 Zhang Zheng 提交于
* Optimize performance of depthwise_conv_bwd * fix
-
由 ykkk2333 提交于
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun * migrate add_n kernep to phi, test=kunlun * fix bugs of tipc, test=kunlun
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 六个骨头 提交于
-
由 HongyuJia 提交于
-
由 HongyuJia 提交于
-
由 sneaxiy 提交于
* support pure bfloat16 * support bf16 linear * update PR to pass CI * tiny fix where_grad_kernel.cu * add bfloat16 to selu_grad to pass CI * fix selu grad compilation error
-
- 29 9月, 2022 7 次提交
-
-
由 carryyu 提交于
-
由 Zhang Zheng 提交于
* Move valid check from python to kernel * fix error throw * fix * invalid label check * fix * Revert "fix" This reverts commit 79fad6799cfa4b30423dbc84e67d7d843d22b84a. * Revert "invalid label check" This reverts commit 402a9707390ad5386b3222e85844b92d2e9b9fa4. * Revert "fix" This reverts commit 09ba3080ee0587447f875c19cdf060485f15ae3b. * Revert "fix error throw" This reverts commit a901bfcc2179d5c120ec29af766f392b122dab52. * Revert "Move valid check from python to kernel" This reverts commit baa03cc4ef82d8d45516c30dfb52bf5aead30748. * final fix * fix * fix
-
由 Leo Guo 提交于
Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)
-
由 carryyu 提交于
* fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40. * fix P40 topk: Make the optimized topk compatible with P40.
-
由 ming1753 提交于
-
由 傅剑寒 提交于
-
由 houj04 提交于
* [XPU] update xpu cmake to 0923. test=kunlun * [XPU] update xpu cmake to 0928. test=kunlun
-