- 16 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * int_array supports constructed by gpu tensor * add some test * polish code * adjust full api * add unittest * add unittest Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
- 11 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * remove node pruning
-
- 07 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* the copy type of data transform for gpu_pinned to gpu change from syna to async * refactor code
-
- 14 4月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency * deal with conflict * fix bugs when run unit test * fix unit test bugs
-
- 13 4月, 2022 3 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit fe214af2.
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency
-
由 Chen Weihang 提交于
* remove old custom op placetype * replace dist placetype using * add with gpu macro * fix mutable_data error * fix set value error * add comment
-
- 06 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add real and imag yaml * add roi_align and roi_pool yaml * add qr yaml * add psroi_pool yaml * fix bug * fix param bug of psroi_pool * fix infrt problem * fix merge bug
-
- 04 4月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add concat_grad kernel * fix error * remove comment code * fix outs nullptr error * change to phi header * add concat_grad declare for standalone_executor_test * add concat_grad yaml * add concat api * fix test concat op error * fix test concat op error
-
- 30 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor code auto-gene for no_need_buffer * fix some bug * delete test code
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 24 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* support intermediate for saprse api * close intermediate in yaml * fix dygraph_api dep for eager
-
由 zhangkaihuo 提交于
-
- 21 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add phi device context pool * change year * fix compile error * fix operator = error * refine init impl * polish details * refine init impl
-
- 17 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* fix copy_ problem by doing it with phi copy * improve test coverage * refactor copy with sr kernel
-
- 09 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support code auto-gene for sparse backward api * fix bug of intermediate api and name of return var
-
- 08 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 03 3月, 2022 1 次提交
-
-
由 zyfncg 提交于
* suppport sparse api in yaml * support auto-gen code of sparse api * do some refactor * add unittest test_sparse_conv_api * add unitest file Co-authored-by: Nzkh2016 <zhangkaihuo@baidu.com>
-
- 26 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* Support custom implement for C++ API * rename api_invoke_impl to api_custom_impl * remove manual_api * delete mutable_data in copy_to api * fix problem of copy_to * add unittest for infer_meta_fn_factory * fix split cofig in yaml * fix split cofig in yaml * modify sum api yaml * add copy_to wrapped infermeta * rollback copy impl
-
- 24 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 23 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove fill_any_like kernel in fluid and fix data transform bug * support scalar in infershpe * recover infershape in fill_and_like
-
- 21 2月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [PluggableDevice]custom kernel to pten core structs * mod extension.h for custom op * compatible python for CI * support custom context * refactor to pten * fix windows and ut
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 18 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support inplace and intermediate in yaml * add cmake for dygraph_api
-
- 17 2月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* pten matmul cuda kernel support bf16 * avoid custom kernel deps on pten_function_api * Revert "pten matmul cuda kernel support bf16" This reverts commit 5d520845b9a189375677276efb673235ed8e5ee0. * refine code * fix compile * fix test_split_api
-
- 13 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* generate wrapped_infer_meta * add test for wrapped_infer_meta * Update test_meta_fn_utils.cc * change the dir of generated file Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NChen Weihang <chenwhpro@163.com>
-
- 10 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor C++ API code-gen * fix windows problem of C++ API
-
- 04 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add data_transform in pten api * support GetKernelTypeForVar * fix complie problem of bfloat16 * change error namespace * add complex type transform unittest * fix merge conflict
-
- 30 1月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
* dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse
-
- 29 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 27 1月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [Demo] custom kernel based on pten kernel * merge and npu custom work well * del comments * delete other code * fix CUDAContext * fix not found small_vector.h * support NPU * fix NPUContext * fix DeviceContext support * add UT * fix call * add UT * fix * fix for comments and ut * add MACRO control * fix multi input output * support env CUSTOM_DEVICE_ROOT * deal with special cases * fix for Windows * try coverage with test_custom_kernel_dot.py * fix test_custom_kernel_dot * fix test_custom_kernel_dot * fix merge * fix merge * fix CI * update * merge and fix * remove WITH_CUSTOM_KERNEL * fix merge * merge and fix * fix ut * fix ut for mac * add more UT * add more UT * fix
-
- 25 1月, 2022 2 次提交
- 24 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add config of backward-api auto-gene * fix compile bug * remove wrong header * rename grad_api to backward_api * modify .gitignore
-
- 16 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* unify device context entrance * move all_context include to header * polish cmake relay for device_context * fix npu compile failed * fix npu compile failed * revert part of change
-
- 10 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
-
- 07 12月, 2021 2 次提交
- 03 12月, 2021 2 次提交
-
-
由 zyfncg 提交于
-
由 zyfncg 提交于
* add code-gen file * add api-gen in cmake * adjust the code format * temp save the code * add genen-api module into pten * merge the develop code * fix merge conflict * fix code conflict with develop * support reduce_mean/sum * fix the CI requirement * fix requirement problem of CI * remove original api code * fix bug caused by removing original api
-