- 18 10月, 2022 1 次提交
-
-
由 zhouweiwei2014 提交于
新增sparse.is_same_shape、sparse.reshape、sparse.transpose 三个API
-
- 26 8月, 2022 1 次提交
-
-
由 kangguangli 提交于
* remove fluid kernel and activate phi kernel * fix parameter error * transfer mkldnn part * modify header file path * fix compile error * transfer special case * fix lod setting and special case for layout setting * add testcase and refine code
-
- 25 8月, 2022 1 次提交
-
-
由 kangguangli 提交于
* transfer memcpy_d2h from fluid to phi * refine arg check and add comment * fix cannot fallback to phi kernel * fix gpu_context host alloc when tensor size = 0 * add kernel for std::vector<DenseTensor> args * fix bugs in MemcpyD2HMultiIOKernel * remove useless header file * polish format * fix typo * add testcase for cudapinned place * refine check condition in test * polish error message * polish error message * remove header in fluid directory * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation * fix code style check
-
- 23 6月, 2022 1 次提交
-
-
由 Matsumoto Ruko 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 29 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 18 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
sparse maxpool; kernel_registry support sparse tensor
-
- 04 3月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
move cpu_vec.h to phi/kernels/funcs.
-
- 28 2月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
* fix incorrect dims settings * sparse conv3d * fix out dims * test performance * test large shape success * opt scatter, double performance * test float16 * remove profiling code * remove pten * opt code lines * correct boundary judgment * only cpu * test ci * test ci * remove the including paddle/fluid header; extract the conmmon function * opt code lines * use DenseTensor::data() instead of mutable_data * return rulebook for backward * specify layout * rename:conv -> sparse_conv3d
-
- 24 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 17 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* clean useless header in pten core * fix compiled failed * fix cmake target * fix typo * resolve conflict
-
- 14 2月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add split kernel * add split kernel signature * fix split bug * modify MakePtenScalarArrayFromVarList * modify MakePtenScalarArrayFromVarList * fix split windows register error * add test case for split kernel * replace raw split kernel with pten kernel * fix makeScalar/ScalarArray bug * remove debug log * remove int64_t type in buildPtcontext * update by code review * fix split dev test failed * change DenseTensorMeta to MetaTensor * change split api code from auto gen to manual * split cuda kernel support bfloat16 type * fix conflict * rm raw split kernel * merge develop branch * change to pten::errors
-
- 30 1月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
* dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse
-
- 21 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* refactor matmul directory in pten * fix merge conflict
-
- 23 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add empty and empty_like kernel in pten * add empty dev_api
-
- 20 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add pten conj kernel * modify conj_kernel file path * add defined cuda macro to cuda/conj_kernel.h
-
- 29 11月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unsed log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * revcover meam cuda to original implement
-
- 22 11月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add cast kernel * add cast cuda kernel * add cast kernel * make cast kernel output dtype undefined * get cast dtype from vardesc * move cast to manipulation and add test case * add castinfershape * avoid reinitilaze variable * InitializeVariable support datatype * merge develop branch * fix merge bug * revert modify initializeVariable * revert modify on InitializeVariable * revert modify on InitializeVariable * mutable support reset dtype * enable make pten tensor from variable when def_arg.type is undefined * fix build pten ctx start_idx error * copy pten out tensor to variable * merge develop branch * fix non pten kernel cast failed * add reset allocation place for remake tensor * fix inplace realloc error * add mutable on pten kernles and remove unused cast files * rename function names * fix output type error * fix conflict with develop branch * set data type to variable with pten's dtype * fix test_cast_api type mismatch * densorTensro mutable_data support 0 bytes value * fix the inplace bug of reshape kernel * fix pten.backend != variable.place when moving storage, palce mismatch bug * fix conflict with develop branch * Fix bug of paddle::experimental::MovesStorage * fix ReMakePtenDenseTensor place mismatch bug * Revert "fix ReMakePtenDenseTensor place mismatch bug" This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a. * fix ReMakePtenDenseTensor place mismatch bug * reverts the set_lod interface, test=develop * modify by the review options * modify error message * add & for const input arguments * add reference in params * elementwise_sub add mutable_data * fix ResetHolderWithType check size bug * add dependence pten_tensor to test_cast_api object * remove unused code to pass ci coverage Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 16 11月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* reshape kernel refactor * fix compile bugs when run ci * support xpu for reshape * fix bugs when run unittest in kunlun ci * fix compile bugs when run kunlun * perfect code according to suggestion * add api and unit test for reshape
-
- 12 11月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* elementwise_add kernel refactor * fix compile bugs in elementwise_add refactor * fix compile bugs when run in npu/xpu * fix bugs when run unit test * fix bugs when run ci-windows * modify code as recommended * code format adjust * fix bugs when run ci * fix compile bug when run in ci-windwos
-
- 05 11月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* organize pten unitests * fix detail errors
-