- 25 May, 2023 1 commit
-
-
Committed by zhangkaihuo
-
- 09 May, 2023 1 commit
-
-
Committed by RuohengMa
* bind sparse_coo_tensor, reduce_max/max_int32, range/arange_int32, equal_bool, scatter_grad_float32, nearest_interp_int64 kernels
* add more unit tests; modify compilation logic of xpu sparse kernels
-
- 03 Apr, 2023 1 commit
-
-
Committed by thunder95
-
- 14 Mar, 2023 1 commit
-
-
Committed by engineer1109
-
- 17 Oct, 2022 1 commit
-
-
Committed by OccupyMars2025
* add sparse reshape
* change the dtype in all test cases to int64
* just one test case
* modify comments
* Update test_sparse_reshape_op.py
* change the type of "shape" from vector<int64_t> to IntArray
* check whether sp_out.to_dense() is the cause of error
* print sp_out
* Update reshape_kernel.cc
* use numpy to generate the equal paddle tensor
* just check dense_tensor.numpy()
* check cpu and cuda versions
* Update test_sparse_reshape_op.py
* supply all test cases for cpu forward coo kernel
* test forward coo cuda kernel
* change configuration of cuda kernel
* keep only one test case
* test coo cpu kernel (forward and backward)
* row major or column major ???
* test cuda coo forward kernel
* complete declaration and registration
* Update __init__.py
* rebuild
* retrigger CI
* add cudaMalloc and cudaMemcpy in ReshapeCooKernel and change back to row major order in a cuda dense tensor
* modify minor error
* test only cpu coo forward kernel
* add all test cases for coo forward kernel (both cpu and gpu)
* test all forward kernels (coo, csr; cpu, gpu)
* add all test cases for all kinds of kernels
* just retrigger CI
* Update sparse_ops.yaml
* Update sparse_ops.yaml
* Update sparse_ops.yaml
* resolve conflicts
* Update sparse_ops.yaml
* don't specify tensor place
* new shape has -1 or 0 in it
* Update unary_grad_kernel.h
* correct lvalue error
* code style
* Update sparse_backward.yaml
* Update sparse_ops.yaml
* Update unary_kernel.h
* Update unary.py
* Update sparse_backward.yaml
* Update unary.py
* code style
* code style
* code style
* Update unary.py
* specify tensor place explicitly
* do not use numpy array
* use numpy array in unit test again
* modify example code in docstring
-
- 19 Sep, 2022 1 commit
-
-
Committed by zhangkaihuo
* sparse infer_meta
-
- 07 Sep, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 02 Sep, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 16 Aug, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 22 Jun, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 17 Jun, 2022 1 commit
-
-
Committed by zhangkaihuo
* fix to_sparse_csr
-
- 13 Jun, 2022 1 commit
-
-
Committed by zhangkaihuo
* use GpuMemcpy and GpuMemset
* sparse convert kernel support double dispatch by indices dtype
* cudaMemcpyKind->gpuMemcpyKind
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 19 May, 2022 1 commit
-
-
Committed by zyfncg
* remove shared_storage
* fix bug
* fix rnn bug
-
- 05 May, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 26 Apr, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 13 Apr, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 07 Apr, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 01 Apr, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 14 Mar, 2022 1 commit
-
-
Committed by zhangkaihuo
* move the common function to kernel/funcs/sparse/
* add namespace
* rm unused file
* move func
* reuse code
-
- 22 Feb, 2022 2 commits
-
-
Committed by xiongkun
* change Vector to std::vector and provide MixVector class as a helper wrapper class
* solve the multi-gpu hang problem
* remove the duplicate template instantiation
* Copy vector to cpu
* add CopyToCPU
* xxx
* final version: fix the problem of all reduce
* remove mixvector dependence
* fix
* merge
* fix code
* fix by CI
-
Committed by Chen Weihang
* unify register macro
* rename declare macro
* fix infrt error
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi
* rename namespace to phi
* rename infrt pten dir to phi
* resolve conflict
* rename pten to phi in cmake
* revert all infrt change
* change needed files
* fix infrt failed
* fix inference failed
-
- 09 Feb, 2022 3 commits
-
-
Committed by zyfncg
* adjust the Empty dev_api
* fix merge conflict
* fix sparse_utils_kernel
-
Committed by zhangkaihuo
* implement AllocateFrom
* dense_to_sparse_coo
* optimize unit testing; support rocm
* 1. delete fluid related header file 2. update the copyright
* fix hipMemcpy
* update dense_to_sparsecoo
* add namespace sparse
* sparse_csr_to_dense
* test to_sparse_coo: csr_to_coo
* fix writing error
* to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr
* fix check shape
* fix unit test
* to_dense: sparse_coo_to_dense, sparse_csr_to_dense
* replace CUDADeviceContext by GPUContext
-
Committed by zhangkaihuo
* implement AllocateFrom
* dense_to_sparse_coo
* optimize unit testing; support rocm
* 1. delete fluid related header file 2. update the copyright
* fix hipMemcpy
* update dense_to_sparsecoo
* add namespace sparse
* sparse_csr_to_dense
* test to_sparse_coo: csr_to_coo
* fix writing error
* to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr
* fix check shape
* fix unit test
* replace CUDADeviceContext by GPUContext
-
- 30 Jan, 2022 2 commits
-
-
Committed by zhangkaihuo
* dense_to_sparse_coo
* optimize unit testing; support rocm
* 1. delete fluid related header file 2. update the copyright
* fix hipMemcpy
* update dense_to_sparsecoo
* add namespace sparse
* sparse_csr_to_dense
* test to_sparse_coo: csr_to_coo
* fix writing error
-
Committed by zhangkaihuo
* dense_to_sparse_coo
* optimize unit testing; support rocm
* 1. delete fluid related header file 2. update the copyright
* fix hipMemcpy
* update dense_to_sparsecoo
* add namespace sparse
-