- 12 12月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
- 01 11月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
- 19 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* move sum * fix ci bugs * fix ci bugs * fix set_lod bugs * fix infershape bugs * fix ci bugs * fix ci unittest bug * fix ci bugs * perfect code * update code according comment * add unittest * fix ci bugs
-
- 06 9月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add tensor array * fix ci bugs * fix ci bugs * fix ci bugs * fix ci bugs * update by comment * update code
-
- 30 8月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add runtime config in phi * add runtime attr for op desc and op * fix no proto error * adjust opdesc set_attr impl * try to remove conv_op extra attrs * add init runtime attr map * change extra header path * fix runtime_attr * fix trace_op * fix bug of pass * fix merge conflict * fix dygraph attrs * fix bug of pass * fix dygraph bug * fix unittest module * delete extra attr default * fix dropout kernel * polish code * fix extra output of instance_norm * fix merge confilct * fix op_desc bug * add extra attr in yaml for conv3d_transpose * don't remove extra input and output * fix save_inference_model * fix bug of batch_norm * revert some change * polish log * polish code * add code comment Co-authored-by: NChen Weihang <chenweihang@baidu.com>
-
- 02 8月, 2022 1 次提交
-
-
由 Weilong Wu 提交于
* polish and rename, pt* -> phi* * fix code format
-
- 14 7月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* adapt mkldnn kernel in PHI * fix ci compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * fix compile bugs * delete comment * fix compile bugs in windows-inference * delete code for converage * modify code by review * modify code by review * add todo * fix compile bugs * fix compile bugs * fix compile bugs * fix unittest bugsx
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 27 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
- 26 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* polish some impl * add lost attr type * polish details * fix error type * polish in name lists * add double attr * adapt infrt attr parse
-
- 24 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* add variant and replace any * split attribute
-
- 01 4月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add interploate cpu kernel * fix nullptr bug * add interpolate gpu kernel * fix unit test error * remove raw kernels * add cuda kernel impl * add infermeta * recover accidentally deleted kernels in interpolate op * fix grad x_grad name error * remove interpolate_v2_op.h * rm unused codes * fix xpu build error * fix build error * fix namespace error * add register header for nup * fix infermeta error * modify by review * add the missing args in test_trt_convert_nearest_interp_v2
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 18 3月, 2022 1 次提交
-
-
由 zhangkaihuo 提交于
sparse maxpool; kernel_registry support sparse tensor
-
- 08 3月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [custom kernel]Upgade support for multi libs * upgrade phi_custom_kernel deps
-
- 06 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* replace prefix pt by pd * replace added kernel * revert util change * pd kernel to phi * resolve conflict * resolve conflict
-
- 04 3月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* change input vec tensor to pointer * update input between * fix format error * resolve conflict * resolve conflict
-
- 02 3月, 2022 1 次提交
-
-
由 hong 提交于
* move sgd to phi; test=develop * update * add sgd kernel; test=develop
-
- 28 2月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
-
- 22 2月, 2022 2 次提交
-
-
由 Aganlengzi 提交于
* [custom kernel]Delete useless * change RegType enum names * mod notes * merge * update
-
由 Chen Weihang 提交于
* unify register macro * rename declare macro * fix infrt error
-
- 21 2月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [PluggableDevice]custom kernel to pten core structs * mod extension.h for custom op * compatible python for CI * support custom context * refactor to pten * fix windows and ut
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 17 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 15 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix single dtype reg errror * fix windows failed
-
- 11 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* ermove xxx_info include * fix namespace error * resolve conflict * skip xpu context in registry * fix macro error * resolve conflict * resolve conflict * revert xpu convert * remove trans to fluid place * remove useless headers
-
- 09 2月, 2022 1 次提交
-
-
由 hong 提交于
* update basic infrastructure; support string, suport vecotr<int>, add tensor args type index; test=develop * remove useless code; test=develop * fix bug; test=develop * polish code; test=develop
-
- 30 1月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* upgrade _get_all_register_op_kernels * add ut * support xpu/npu * fix device id * enhance TransToFluidPlace * fix compile
-
- 29 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* open header for custom kernel * add core utils * tidy core code * tify header * tidy include * tidy namespace * resolve conflit * fix unittest and coverage * remove platform using * resolve conflict * resolve conflict * fix digamma namespace error * fix xpu full kernel error * fix xpu full kernel error * polish details * add place for lib storage
-
- 28 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* refactor scale kernel that its input is selected_rows * complement upload file
-
- 25 1月, 2022 1 次提交
-
-
由 xiongkun 提交于
* transfer: string tinyformat errors and part of enforce into pten * remove comment * fix by code review * assert is not compile in -DNDEBUG * add string as dependences of paddle_inference
-
- 22 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
-
- 13 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename register marco * fix error changing * fix format error
-
- 11 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor matmul directory in pten * fix merge conflict * add dot_grad kernel * add dot_grad kernel in pten * add matmul_grad kernel * update the code * delete useless code in fluid * fix some bug of running matmul grad kernel * fix merge conflict * refactor some code * refactor code
-
- 24 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-
- 21 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* rename cuda to gpu * revert CMake change * resolve conflit * rename other cuda to gpu * poish details
-
- 16 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* add register_ctx_kernel and move scale kernel * polish details by reviewer comment * fix xpu compile failed * fix cmake error
-
- 14 12月, 2021 2 次提交
-
-
由 Chen Weihang 提交于
* polish register marco * resolve compile failed * revert needless change * revert eager related change * revert eager related change * change register marco name * polish deetails
-
由 YuanRisheng 提交于
-
- 13 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-