- 28 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* Optimize the performanece of sum api (#42231) * optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log * Add move construct for KernelSignature (#42253) * add move construct for KernelSignature * add noexcept * fix cherry-pick problem
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* Remove std::type_index in AttributeArdDef (#42122) * polish some impl * add lost attr type * polish details * fix error type * polish in name lists * add double attr * adapt infrt attr parse * add attr type test (#42263) * opt attr eaque perf (#42272)
-
- 26 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* Add paddle::variant and replace paddle::any (#42139) * add variant and replace any * split attribute * Optimize dygraph GetExpectedKernelType perf (#42154) * opt dygraph scheduling * revert part impl * fix variant compile error (#42203) * replace any by variant in infermeta (#42181)
-
- 25 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimiaze performance of PreparePhiData (#42093) * Dygraph performance optimization (v2) (#42103) * optimiaze performance of PreparePhiData * dygraph performance optimization * optimize performance of dygraph (#42137)
-
- 21 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* [Phi] Support setting size of vector<Tensor> for out in yaml (#41576) * support setting vector out size in yaml * support setting size of vector<tensor> for out in yaml * resolve conflict Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
-
- 20 4月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency * deal with conflict * fix bugs when run unit test * fix unit test bugs
-
- 18 4月, 2022 2 次提交
-
-
由 chentianyu03 提交于
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
由 Chen Weihang 提交于
* [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647) * remove old custom op placetype * replace dist placetype using * add with gpu macro * fix mutable_data error * fix set value error * add comment * remove all is initialized using (#41766) * remove inner_place using (#41768) * polish tensor depreacted method warning (#41807) * [CustomOp] Fix PlaceType related compat error (#41826) * fix place type related compat error * fix test failed * remove dll decl * revert place type change * add dll decl * resolve conflict
-
- 04 4月, 2022 1 次提交
-
-
由 From00 提交于
* Add yaml for flatten_contiguous_range OP * update * Fix typos Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 02 4月, 2022 2 次提交
-
-
由 zhangkaihuo 提交于
-
由 zhangkaihuo 提交于
-
- 01 4月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 zhangkaihuo 提交于
-
- 31 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
由 zhangkaihuo 提交于
-
- 29 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 28 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run converage
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 24 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support intermediate for saprse api * close intermediate in yaml * fix dygraph_api dep for eager
-
- 22 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* replace Backend by Place in C++ API * fix left code * fix test_to_api bug
-
- 19 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
* call sparse api from python
-
- 18 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
sparse maxpool; kernel_registry support sparse tensor
-
- 17 3月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* move assign kernel init commit * change vec<tensor> to vec<tensor*> * support tensor array * support api declare * fix test_list failed * fix npu and xpu failed * fix infrt failed * remove assign array size in operator * move assign sr header into sr dir * add infermeta for assign * test op success * fix test_list failed * fix kunlun failed * add set host allocator in tests * support tensor array in arg ctx * open set layout in share_meta * fix meta tensor layout error * fix test failed
-
由 YuanRisheng 提交于
-
- 16 3月, 2022 2 次提交
-
-
由 zhangkaihuo 提交于
restructure conv
-
由 chentianyu03 提交于
* move reduce kernels into one file * rename reduce_prod to prod * move reduce sum/mean from math_kernel into reduce_kernel * rm comment
-
- 14 3月, 2022 2 次提交
- 12 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* move allclose kernel * remove allclose op kernel * fix coverage failed
-
- 11 3月, 2022 2 次提交
-
-
由 zhangkaihuo 提交于
submanifold convolution
-
由 Chen Weihang 提交于
* remove needless deps in unittests * add gpu marco * fix other unittests * fix kernel name error * fix test_prepare_op * fix failed dygraph unittests * fix gpu failed tests * fix cinn test failed * fix cinn test failed * fix dropout tests
-
- 09 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* save code * fix bug of set_value * add coverage test
-
由 zhangkaihuo 提交于
Sparse conv3d backward(gpu)
-
- 08 3月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [custom kernel]Upgade support for multi libs * upgrade phi_custom_kernel deps
-
- 07 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove storage deps of empty * remove invalid empty method * remove error empty using * fix test_sparse_utils_dev_api * revert some sparse change * add memset for conv grad * resolve conflict * resolve conflict * resolve conflict
-
- 04 3月, 2022 2 次提交
-
-
由 Feiyu Chan 提交于
move cpu_vec.h to phi/kernels/funcs.
-
由 Chen Weihang 提交于
* change input vec tensor to pointer * update input between * fix format error * resolve conflict * resolve conflict
-
- 03 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* suppport sparse api in yaml * support auto-gen code of sparse api * do some refactor * add unittest test_sparse_conv_api * add unitest file Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
-
由 zhangkaihuo 提交于
* sparse conv3d: gpu code
-
- 02 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
Sparse Conv3d backward code
-