- 23 6月, 2022 2 次提交
-
-
由 Ruibiao Chen 提交于
* Remove unnecessary including for pstring.h * Fix typos
-
由 Matsumoto Ruko 提交于
-
- 13 6月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
-
- 10 6月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all phi kernels to 2(host/device) static libraries directly * fix calling kernel_declare * fix compile * fix cpu compile * fix rocm compile * fix xpu compile * fix xpu kp compile * fix inference compile
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 31 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add embedding yaml * fix infermeta bug * fix bug of selected_rows infer_meta * fix selected_rows * add unittest
-
- 30 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* make data transform inplace when tensor is on gpupinned in new dygraph * fix unittest
-
- 23 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove storage * add glog include * add glog include * add glog include
-
- 19 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* remove shared_storage * fix bug * fix rnn bug
-
- 16 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * int_array supports constructed by gpu tensor * add some test * polish code * adjust full api * add unittest * add unittest Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
- 12 5月, 2022 1 次提交
-
-
由 tiancaishaonvjituizi 提交于
-
- 11 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* change the output format of C++ backward api * fix merge conflict * fix sparse api code auto-gen * fix eager_gen bug * fix bug of output is null * fix bug of conv2d_grad_impl * fix optional grad * fix bug of eager-gen double_grad * fix bug * fix multiply_double_grad bug * remove node pruning
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify utils naming style * polish details
-
- 26 4月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
-
由 zyfncg 提交于
* optimize the performanece of sum api * optimize IsDenseTensorInput * remove debug log
-
- 25 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 24 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add variant and replace any * split attribute
-
- 23 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 22 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* optimiaze performance of PreparePhiData * dygraph performance optimization
-
- 19 4月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
[Phi]Separate AddKernel/DivideKernel/SubtractKernel/MultiplyKernel from ElementwiseKernel(Part1) (#41806) * seperate add/div/sub/mul from elementwise * delete code * fix compile bugs * deal with conflict * fix bugs when compile * fix windows unit test bug * fix ci converage bugs
-
- 18 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* create tensor by empty in custom op * fix some bug
-
- 17 4月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* split phi and fluid infermeta context * resolve conflict * fix type error * optimize scheduling perf * spec small vector size * replace all grad var name * fix test failed * move init defalut signature * polish details * polish details * fix no init bug * init sig for tests * add init sig for infer * fix infrt error * fix infrt failed * fix kunlun error * fix infrt failed
-
由 Chen Weihang 提交于
* fix place type related compat error * fix test failed * remove dll decl * revert place type change * add dll decl
-
- 15 4月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
- 14 4月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency * deal with conflict * fix bugs when run unit test * fix unit test bugs
-
- 13 4月, 2022 3 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit fe214af2.
-
由 YuanRisheng 提交于
* support construct scalar using non-cpu tensor * fix bugs when run unittest * fix compile bugs * fix bugs when run ci * fix compile bugs * fix bugs when move copy * perfect unit test * perfect unittest * update according to comment * add target dependency
-
由 Chen Weihang 提交于
* remove old custom op placetype * replace dist placetype using * add with gpu macro * fix mutable_data error * fix set value error * add comment
-
- 12 4月, 2022 1 次提交
-
-
由 Lijunhui 提交于
* init commit no push * collect comile errors * bitwise UT * fix compile problem * cancel comments * restore miss deletion * fix compilation * fix UT * NO stash in multiple branch at the same times * fix error * combine .cu from gpu and kps * replace gpu by kps * fix by Chen-weihang * Revert "Fix kps compile error in Junhui logic compare bitwise" * fix backend test * rm comments Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 04 4月, 2022 1 次提交
-
-
由 From00 提交于
* Add yaml for flatten_contiguous_range OP * update * Fix typos Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 02 4月, 2022 2 次提交
-
-
由 zhangkaihuo 提交于
-
由 zhangkaihuo 提交于
-
- 01 4月, 2022 2 次提交
-
-
由 wanghuancoder 提交于
* support pinned, test=develop * support async_write, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine,test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop
-
由 zhangkaihuo 提交于
-
- 31 3月, 2022 2 次提交
-
-
由 zyfncg 提交于
* rename scalar_array to int_array * update cmake * fix conflict * remove useless log
-
由 zhangkaihuo 提交于
-
- 29 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
-
- 28 3月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run converage
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-