- 10 3月, 2023 1 次提交
-
-
由 mayang002 提交于
* [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels * [xpu-ernie3] support fp16 for full_like/stack/where xpu kernels
-
- 07 3月, 2023 1 次提交
-
-
由 zhangyikun02 提交于
-
- 06 3月, 2023 1 次提交
-
-
由 Huang Jiyi 提交于
* move DeviceContextPool to phi * add EmplaceExternalContextFunc * update namespace * update cmake * fix bugs and create context_pool_impl.h * replace platform::is_xxx_place * fix bugs * update generator * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix enforce usage * Revert "fix enforce usage" This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27. * fix bugs * rm XPUDeviceContext and CustomDeviceContext * fix bugs * fix fix context init bug * fix bugs after merge * fix bugs * fix name * fix mutable_data * update and fix bugs * fix bugs * update * fix bugs * fix name * fix bugs * merge * fix bugs * create context_pool in phi/backends * create context_pool in phi/backends * fix bugs * fix xpu bugs * fix rocm bugs * fix bugs * fix bugs * fix bugs * fix xpu bugs * update * update * fix bugs * fix bugs
-
- 02 3月, 2023 1 次提交
-
-
由 haosicheng 提交于
-
- 01 3月, 2023 1 次提交
-
-
由 duanyanhui 提交于
* add support of int64 add for xpu * add transpose support for int64 * add randperm kernel * fix randperm * add distribute_fpn_proposal kernel * fix comment * add reduce_sum_int32
-
- 28 2月, 2023 2 次提交
-
-
由 zhupengyang 提交于
-
由 shentanyue 提交于
-
- 27 2月, 2023 2 次提交
-
-
由 houj04 提交于
* [XPU] add fp16 support for shape op. * [XPU] add fp16 support for lookup_table_v2 op. * update approval list: add qingshu's id.
-
由 wangshengxiang 提交于
* [XPU] bind op scatter_nd_add * [XPU] add more data type for op: clip, transpose2 & assign_value
-
- 24 2月, 2023 2 次提交
-
-
由 niuliling123 提交于
-
由 ronnywang 提交于
* [XPU] add expand_grad, isnan, meshgrid kernels * update
-
- 23 2月, 2023 2 次提交
-
-
由 csy0225 提交于
-
由 jameszhang 提交于
* kunlun support c_softmax_with_cross_entropy * fix grad calc error * replace mutable_data() and ShareDataWith() * update xdnn * update xpu toolchain to 20230215 * remove fluid from test file
-
- 22 2月, 2023 1 次提交
-
-
由 houj04 提交于
-
- 21 2月, 2023 2 次提交
-
-
由 QingshuChen 提交于
-
由 zhangyikun02 提交于
-
- 20 2月, 2023 2 次提交
- 17 2月, 2023 2 次提交
-
-
由 houj04 提交于
* [XPU] add fp16 support for cumsum and log. * [XPU] add fp16 support for cumsum and log.
-
由 zhupengyang 提交于
[XPU] add multi_encoder_xpu_slice_fuse_pass, generate_sequence_xpu_fuse_pass, generate_sequence_xpu kernel (#50570)
-
- 16 2月, 2023 4 次提交
-
-
由 shentanyue 提交于
* support xpu multi-card infer * add ut * clean code * clean code * fix * fix * fix * fix
-
由 houj04 提交于
* [XPU] update xccl to 1.0.8 * update xdnn. add uint8 for concat and split. * update xdnn to 20230215.
-
由 ronnywang 提交于
* [XPU] add group_norm kernel * update * add xpu sin, cos, randint, linspace kernels * update * update
-
由 zhupengyang 提交于
-
- 15 2月, 2023 2 次提交
-
-
由 zhangyikun02 提交于
-
由 QingshuChen 提交于
-
- 13 2月, 2023 1 次提交
-
-
由 ykkk2333 提交于
* add xpu adagrad and where_grad kernels, test=kunlun * add xpu pool3d kernels, test=kunlun
-
- 10 2月, 2023 3 次提交
-
-
由 Leo Guo 提交于
d_bias are nullptr. Modify the code style of full_kernel.cc. Add new data type for concat, elementwise_add, gather, scale, scatter ops. test=kunlun
-
由 zhupengyang 提交于
-
由 wangshengxiang 提交于
-
- 09 2月, 2023 2 次提交
-
-
由 Leo Guo 提交于
-
由 zhangyikun02 提交于
-
- 01 2月, 2023 1 次提交
-
-
由 zhangyikun02 提交于
-
- 31 1月, 2023 1 次提交
-
-
由 wangshengxiang 提交于
-
- 19 1月, 2023 1 次提交
-
-
由 jameszhang 提交于
* [KUNLUN] add op: maxpool_with_index * use DeviceContext::Alloc() instead of DenseTensor::mutable_data() * fix file format * solve clip unittest failure * minor fix * Revert "solve clip unittest failure" since the issue is fixed in #49535 This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b. * align with xdnn on the definition of mask in max_pool_with_index * minor
-
- 18 1月, 2023 3 次提交
-
-
由 RuohengMa 提交于
* add reduce_sum_int64 and reduce_sum_int8 xpu kernels * [PHI] add clip grad kernel with support type float32 and int32 * [PHI unittest] add clip_grad unit test * adapt code to clang-format * update xpu api output with clip_grad api * remove int8 support of reduce_sum xpu kernel since it can not pass unit tests * adapt license date, add code for XPUDataType convertion * add int8 support of reduce_sum * add reduce_sum unit tests for dtype int64, int8, and add more test cases * update license date * remove buggy bitwise and, or and xor xpu kernels, refine bitwise not xpu kernel * change license date
-
由 houj04 提交于
-
由 jameszhang 提交于
* revert to use default XPU stream for computing XPUContext now has a null stream by default. If you want to use a separate stream (e.g. in async collective communication), you should create a dedicated XPUContext and invoke its XPUContext::CreateStream() * minor
-
- 16 1月, 2023 1 次提交
-
-
由 QingshuChen 提交于
-
- 13 1月, 2023 1 次提交
-
-
由 jameszhang 提交于
* kunlun add support for c_concat and c_split * replace mutable_data() and ShareDataWith()
-