- 31 3月, 2022 5 次提交
-
-
由 wuyefeilin 提交于
* mv yolov3_loss op to phi * fix as review * update operator.h
-
由 Zhang Ting 提交于
-
由 zhangkaihuo 提交于
-
由 limingshu 提交于
* for 1st time interface combine. * modification with kernel factory * first auto_tune version. * first version. * basic version * add warm up step. * a debug version. * optimize the functionality of class auto_tuner. * add some quotes for optimized auto_tuner class. * add some quotes for optimized auto_tuner class. * add namespace. * modification according to the advices * replace fluid header with phi header. * replace fluid header with phi header.
-
由 Aganlengzi 提交于
-
- 30 3月, 2022 14 次提交
-
-
由 zyfncg 提交于
* move rnn kernel to phi * move infershape of rnn to phi * fix HIP bug * rename function * fix HIP bug * fix hip bug
-
由 Haohongxiang 提交于
* fix uncontrolled randomness of op * fix bugs
-
由 Chen Weihang 提交于
Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065)" (#41110) This reverts commit 3a6f1135.
-
由 xiongkun 提交于
-
由 Chen Weihang 提交于
This reverts commit 054fc997.
-
由 pangyoki 提交于
-
由 YuanRisheng 提交于
-
由 Zhang Zheng 提交于
* Optimize the perf of top_k when k is too large * fix rcom compile * fix * only compile in cuda * fix log info
-
由 pangyoki 提交于
* suppor inplace in tensor_method_setitem * delete bump_inplace_version * optimize inplace unittest * fix * fix setitem bug * update eager_generator * optimize inplace unittest * little change
-
由 zyfncg 提交于
* refactor code auto-gene for no_need_buffer * fix some bug * delete test code
-
由 chentianyu03 提交于
* fix pad3d infermeta bug * add check for construct ScalarArray
-
由 pangyoki 提交于
* support view strategy in eager_final state * perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run converage * fix inplace bug in final_state eager_gen * fix python_c_gen * support view strategy for final state * fix order of out and xshape in reshape * fix Coverage_CI unittest timeout error * support reshape view * fix reshape_sig * fix yml and api_base Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
由 wawltor 提交于
* fix the argosrt cpu * add the test case for the paddle.argsort
-
- 29 3月, 2022 7 次提交
-
-
由 zhangkaihuo 提交于
-
由 tianshuo78520a 提交于
This reverts commit 9c0eaada.
-
由 tianshuo78520a 提交于
This reverts commit b532315d.
-
由 tianshuo78520a 提交于
This reverts commit e77a947e.
-
由 津 提交于
* [Phi] trans logsumexp op * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * add sig * fix sig bugs * fix sig bugs * fix xpu bugs * fix review bugs * test=develop
-
由 wuyefeilin 提交于
* mv floordiv to phi * mv elementwise_pow to phi * fix as review
-
由 Chen Weihang 提交于
-
- 28 3月, 2022 10 次提交
-
-
由 hong 提交于
* move momentum, rmsprop to phi; test=develop * update * update * update * update * udpate; test=develop * fix xpu npu bugs; test=develop * fix npu bug; test=develop * fix windows compile error; test=develop * fix windows compile error; test=develop * polish code; test=develop * fix conflict; test=develop * add meshgrid; * update * polish code * polish code; * fix bug * format; remove useless code * fix npu bug * fix bug
-
由 hong 提交于
* update * add forward case * update * update; test=develop * add some grad kernel; test=develop * move gpu kernel; test=develop * update * update; * update test; * fix selected rows bug; * add mix vector include ; * add mixed vector depen; test=develop * add logit grad signature; * polish code * fix bug; * add namespace for abs * revert code * not move softsign * revmove duplate register; * fix softsign bug * polish code * format * format * fix bug * remove cmake dep * add square sqrt selected rows support * update * remove clip norm * add standalone executor sqrt dep * standalone exec denp sqrt * remove sqrt op in cmkaelist * open some case
-
由 Leo Chen 提交于
-
由 Yang 提交于
-
由 zyfncg 提交于
-
由 chentianyu03 提交于
* remove in_dtype, out_dtype in redcue grad * set the dtype and layout in noneedbufferInputs func
-
由 Chen Weihang 提交于
* fix assign kernel bug * fix xpu kernel select error * add cudn pinned place * fix copy error * fix infrt error
-
由 0x45f 提交于
* moving OP * move forward * move grad and infershape * code format * format code * fix code * fix code * fix CMakerLists.txt * fix comments * Refine CMakeLists for rocm ci
-
由 Chen Weihang 提交于
* move assign value op kernel * remove original op * fix failed unittests * remove infershape
-
由 YuanRisheng 提交于
* perfect reshape kernel * fix bugs of sig * add unittest for reshape_sig * fix bugs when run converage
-
- 27 3月, 2022 3 次提交
-
-
由 Li Min 提交于
-
由 hong 提交于
* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 26 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-