- 10 Jun, 2022 1 commit
-
-
Committed by Leo Chen
* make all phi kernels to 2(host/device) static libraries directly * fix calling kernel_declare * fix compile * fix cpu compile * fix rocm compile * fix xpu compile * fix xpu kp compile * fix inference compile
-
- 04 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 25 Apr, 2022 1 commit
-
-
Committed by zhouweiwei2014
* merge all phi lib to several big static lib * merge all phi lib to several big static lib
-
- 15 Apr, 2022 2 commits
-
-
Committed by chentianyu03
* split reduce_kernel * rm reduce_kernel in cmake * split reduce_grad kernels * fix cmake build error * format code * fix standalone_executor_test error
-
Committed by limingshu
* change cudnn helper for auto-tune * Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm. * Fix the bug in calculating and printing current step cache hit rate. * Improve the autotune cache and fix unittest. * Change the key from AlgorithmType to int64_t. * Fix unittest for cpu-only env. * change ChooseAlgoByWorkspace for heuristic mode Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
-
- 10 Apr, 2022 1 commit
-
-
Committed by Chen Weihang
-
- 01 Apr, 2022 1 commit
-
-
Committed by Chen Weihang
* add cross_entropy_with_softmax phi kernel * remove softmax_with_cross_entropy kernel * add softmax_with_cross_entropy grad kernel * remove original op kernel * refine cross entropy impl * fix pointer error * revert kernel cu change * fix xpu failed * fix cinn failed * fix npu failed * add forward sig * add check_nan_inf for pt kernel * remove repeat cmake item * fix unittest error
-
- 30 Mar, 2022 1 commit
-
-
Committed by zyfncg
* move rnn kernel to phi * move infershape of rnn to phi * fix HIP bug * rename function * fix HIP bug * fix hip bug
-
- 28 Mar, 2022 2 commits
-
-
Committed by hong
* move momentum, rmsprop to phi; test=develop * update * update * update * update * update; test=develop * fix xpu npu bugs; test=develop * fix npu bug; test=develop * fix windows compile error; test=develop * fix windows compile error; test=develop * polish code; test=develop * fix conflict; test=develop * add meshgrid; * update * polish code * polish code; * fix bug * format; remove useless code * fix npu bug * fix bug
-
Committed by 0x45f
* moving OP * move forward * move grad and infershape * code format * format code * fix code * fix code * fix CMakeLists.txt * fix comments * Refine CMakeLists for rocm ci
-
- 27 Mar, 2022 1 commit
-
-
Committed by Jack Zhou
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update according to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize deserialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unittest * remove empty line * Remove some headers to decrease the size of executable file
-
- 25 Mar, 2022 1 commit
-
-
Committed by Aurelius84
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
- 24 Mar, 2022 1 commit
-
-
Committed by Aurelius84
* [phi] Split selected_rows CMake compilation * move file back * move file back
-
- 23 Mar, 2022 2 commits
-
-
Committed by zyfncg
* move deformable_conv_grad to phi * move infershape of deformable_conv to phi * adjust some code format * move deformable_conv_v1 to phi
-
Committed by Zhang Ting
* add kernel profiler * add gpu timer tool * remove warmup * fix rocm compilation error
-
- 18 Mar, 2022 2 commits
-
-
Committed by Zhang Zheng
* first commit * fix compile error * support std::vector<std::string> * fix * fix op support on GPU by chenweihang * pass test * infershape * add set_dtype * fix order * fix * unify the impl of dt and sr * fix
-
Committed by chentianyu03
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
- 17 Mar, 2022 1 commit
-
-
Committed by YuanRisheng
-
- 15 Mar, 2022 1 commit
-
-
Committed by Chen Weihang
* add determinant phi kernel * remove original determinant op kernel * add determinant grad phi kernel * fix determinant test failed * remove original determinant grad op kernel
-
- 14 Mar, 2022 1 commit
-
-
Committed by From00
* Move Pool OPs to phi * Fix CI error * Fix conflicts
-
- 10 Mar, 2022 2 commits
-
-
Committed by Liu-xiandong
* [phi] move matrix_power op * MatrixInverse fluid -> phi * modify the CMake to fix compile bug * delete useless comment * mutable memory -> phi Alloc * modify the include file * modify the include file * fix bug in CI compiler
-
Committed by Zhong Hui
* move segment_pool to phi. * mark summed ids as optional tensor. * fix as reviews.
-
- 09 Mar, 2022 1 commit
-
-
Committed by crystal
* migrate eigh to phi * optimize code * modify code according to comment * conflict resolution
-
- 08 Mar, 2022 2 commits
-
-
Committed by Yang
* [Phi] move put_along_axis/take_along_axis/maxout * use phi::Copy
-
Committed by Aganlengzi
* [custom kernel] Upgrade support for multi libs * upgrade phi_custom_kernel deps
-
- 07 Mar, 2022 1 commit
-
-
Committed by Chen Weihang
* remove storage deps of empty * remove invalid empty method * remove error empty using * fix test_sparse_utils_dev_api * revert some sparse change * add memset for conv grad * resolve conflict * resolve conflict * resolve conflict
-
- 06 Mar, 2022 1 commit
-
-
Committed by zhouweiwei2014
* Migrate triangular_solve op into phi * fix CI * move MatrixReduceSum to phi funcs * move MatrixReduceSum to phi funcs * fix comment * fix CI
-
- 04 Mar, 2022 1 commit
-
-
Committed by hong
* move conv to pten * move conv to pten; test=develop * fix bug; * add conv cudnn impl; test=develop * update * update operator; test=develop * fix bug; test=develop * move operator and prepared_operator to develop; test=develop * resolve conflict; test=develop * remove useless code;test=develop * add dependency ; test=develop * fix bug; * add sig.cc ; test=develop * fix use_op error; test=develop * fix bug; test=develop * fix bug; test=develop * add conv3d register; test=develop * fix star gan and conv_nn_grad test failed; test=develop * add header; test=develop * manually recover to develop; * resolve conflict; test=develop * remove useless code * fix bug; * remove conv2d_cudnn; test=develop * fix bugs; test=develop * fix cpu rocm compile bugs; test=develop * fix blas error; test=develop * fix compile bug; test=develop * fix windows compile error; test=develop * fix windows error; test=develop * resolve conflict; test=develop
-
- 26 Feb, 2022 2 commits
-
-
Committed by From00
* Move GumbelSoftmax OP to phi * platform::errors -> phi::errors; GumbelSoftmaxGradInferMeta -> backend.h/cc * Use axis util in kernel impl * Remove namespace platform::errors * Use GetCPUEngine in Device Context
-
Committed by zyfncg
* Support custom implement for C++ API * rename api_invoke_impl to api_custom_impl * remove manual_api * delete mutable_data in copy_to api * fix problem of copy_to * add unittest for infer_meta_fn_factory * fix split config in yaml * fix split config in yaml * modify sum api yaml * add copy_to wrapped infermeta * rollback copy impl
-
- 25 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* support cudnn kernel moving * polish cmake rules * add unittest for coverage * remove orig kernel * remove softmax cudnn kernel * fix softmax test failed * fix npu func error * resolve conflict * rename gpu dnn kernels * fix name rule error * fix compile error * update fp16 namespace
-
- 24 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 23 Feb, 2022 1 commit
-
-
Committed by Leo Chen
* move unbind to phi * revert infer shape * add header file * move concat_and_split to phi
-
- 22 Feb, 2022 1 commit
-
-
Committed by Aurelius84
* [Phi] Migrate unfold_op into phi * fix im2col CPUContext template instantial * fix unfold_op.h header include problem * fix unittest * fix PT->PD
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 17 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* clean useless header in pten core * fix compiled failed * fix cmake target * fix typo * resolve conflict
-
- 30 Jan, 2022 1 commit
-
-
Committed by zhangkaihuo
* dense_to_sparse_coo * optimize unit testing; support rocm * 1. delete fluid related header file 2. update the copyright * fix hipMemcpy * update dense_to_sparsecoo * add namespace sparse
-
- 26 Jan, 2022 1 commit
-
-
Committed by Leo Chen
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
- 24 Jan, 2022 1 commit
-
-
Committed by YuanRisheng
[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) * refactor elementwise add grad * fix compile bugs * fix unit test bugs * fix file conflicts * fix bugs when buildPtenContext
-
- 22 Jan, 2022 1 commit
-
-
Committed by Chen Weihang
-