- 17 Feb, 2022 2 commits
-
-
Committed by Chen Weihang
* clean useless header in pten core * fix compiled failed * fix cmake target * fix typo * resolve conflict
-
Committed by Chen Weihang
* remove fluid device context deps * fix compile failed
-
- 15 Feb, 2022 1 commit
-
-
Committed by zyfncg
* add data_transform in pten api * support GetKernelTypeForVar * fix compile problem of bfloat16 * add scale_sr in api * support select_row in C++ api * merge code
-
- 14 Feb, 2022 1 commit
-
-
Committed by chentianyu03
* add split kernel * add split kernel signature * fix split bug * modify MakePtenScalarArrayFromVarList * modify MakePtenScalarArrayFromVarList * fix split windows register error * add test case for split kernel * replace raw split kernel with pten kernel * fix makeScalar/ScalarArray bug * remove debug log * remove int64_t type in buildPtcontext * update by code review * fix split dev test failed * change DenseTensorMeta to MetaTensor * change split api code from auto gen to manual * split cuda kernel support bfloat16 type * fix conflict * rm raw split kernel * merge develop branch * change to pten::errors
-
- 11 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* remove xxx_info include * fix namespace error * resolve conflict * skip xpu context in registry * fix macro error * resolve conflict * resolve conflict * revert xpu convert * remove trans to fluid place * remove useless headers
-
- 09 Feb, 2022 2 commits
-
-
Committed by zhangkaihuo
* implement AllocateFrom * dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse * sparse_csr_to_dense * test to_sparse_coo: csr_to_coo * fix writing error * to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr * fix check shape * fix unit test * to_dense: sparse_coo_to_dense, sparse_csr_to_dense * replace CUDADeviceContext by GPUContext
-
Committed by zhangkaihuo
* implement AllocateFrom * dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse * sparse_csr_to_dense * test to_sparse_coo: csr_to_coo * fix writing error * to_sparse_csr: dense_to_sparse_csr and sparse_coo_to_csr * fix check shape * fix unit test * replace CUDADeviceContext by GPUContext
-
- 06 Feb, 2022 1 commit
-
-
Committed by Wilber
-
- 04 Feb, 2022 1 commit
-
-
Committed by zyfncg
* add data_transform in pten api * support GetKernelTypeForVar * fix compile problem of bfloat16 * change error namespace * add complex type transform unittest * fix merge conflict
-
- 30 Jan, 2022 3 commits
-
-
Committed by zhangkaihuo
* dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse * sparse_csr_to_dense * test to_sparse_coo: csr_to_coo * fix writing error
-
Committed by Chen Weihang
* change unary infermeta * change other infermeta * change all infermeta format * resolve conflict * fix test failed * resolve reshape conflict * fix compile failed * adapt auto api gen * fix reshape failed * fix concat failed * resolve conflict
-
Committed by zhangkaihuo
* dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse
-
- 29 Jan, 2022 2 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
* open header for custom kernel * add core utils * tidy core code * tidy header * tidy include * tidy namespace * resolve conflict * fix unittest and coverage * remove platform using * resolve conflict * resolve conflict * fix digamma namespace error * fix xpu full kernel error * fix xpu full kernel error * polish details * add place for lib storage
-
- 28 Jan, 2022 1 commit
-
-
Committed by YuanRisheng
* refactor scale kernel that its input is selected_rows * complement upload file
-
- 24 Jan, 2022 1 commit
-
-
Committed by 石晓伟
* updates callers, test=develop * updates tensor, test=develop * fixes errors, test=develop * remove some dtypes, test=develop * fix errors in the base storage modification, test=develop * fixes a bug, test=develop * fixes the bugs in push the whole, test=develop * updates, test=develop * update * update, test=develop * fixes the mac-py3 CI, test=develop * remove the storage impl, test=develop * updates some codes, test=develop * update, test=develop * updates pten allocation, test=develop
-
- 21 Jan, 2022 3 commits
-
-
Committed by chentianyu03
* fix test concat dev api build failed * fix conflict * fix conflict
-
Committed by chentianyu03
-
Committed by Aurelius84
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
- 20 Jan, 2022 1 commit
-
-
Committed by zyfncg
* remove MakePtenTensor in BuildKernelContext * fix a bug caused by storage * remove WriteBackOutput in dynamic and static mode * fix compile error of std::max * fix compile error of std::max * fix date_type bug * fix memory alloc bug * add some debug info * fix compile problem * fix problem of data_type check * comment out some unreached code
-
- 15 Jan, 2022 1 commit
-
-
Committed by 石晓伟
-
- 13 Jan, 2022 1 commit
-
-
Committed by chentianyu03
* move dot_dev api into dot_kernel.h * add infermeta header * modify to dotkernel in dot_op.h * move conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
- 11 Jan, 2022 1 commit
-
-
Committed by Wilber
* add pten::Place data structure. * update ci problem * fix ci problem * update
-
- 31 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
* unify data layout * fix test_transfer_layout error
-
- 26 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
* add register general kernel macro * move copy kernel impl * revert needless change * polish details * fix xpu compile failed * fix xpu compile failed * polish format
-
Committed by Zhanlue Yang
* Replaced pten::LoD with paddle::framework::LoD * Overrided CPUVector with CUDAVector * Refactored paddle::framework::Vector
-
- 24 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
-
- 23 Dec, 2021 1 commit
-
-
Committed by zyfncg
* add empty and empty_like kernel in pten * add empty dev_api
-
- 21 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
* rename cuda to gpu * revert CMake change * resolve conflict * rename other cuda to gpu * polish details
-
Committed by Chen Weihang
* remove eigen and blas dir * fix declare error
-
- 20 Dec, 2021 1 commit
-
-
Committed by chentianyu03
* add pten conj kernel * modify conj_kernel file path * add defined cuda macro to cuda/conj_kernel.h
-
- 17 Dec, 2021 1 commit
-
-
Committed by chentianyu03
* modify sum mean args * add GetExpectedPtenKernelArgs for reduce_op * modify kernel args number * modify kernel args number
-
- 16 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
* add register_ctx_kernel and move scale kernel * polish details by reviewer comment * fix xpu compile failed * fix cmake error
-
- 15 Dec, 2021 2 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-
- 13 Dec, 2021 1 commit
-
-
Committed by zyfncg
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-
- 09 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
* refine the kernel register impl * fix cmake and symbol error * remove overload macro * polish details
-
- 03 Dec, 2021 1 commit
-
-
Committed by zyfncg
* add code-gen file * add api-gen in cmake * adjust the code format * temp save the code * add genen-api module into pten * merge the develop code * fix merge conflict * fix code conflict with develop * support reduce_mean/sum * fix the CI requirement * fix requirement problem of CI * remove original api code * fix bug caused by removing original api
-
- 30 Nov, 2021 2 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
-