- 13 Mar, 2023 1 commit
-
-
Committed by zhoutianzi666
* use python to generate cutlass code
* refine CommonConvKernelPart1, CommonConvKernelPart2
* remove useless code in generate_cutlass_code.sh
* add more config in conv2d_residual
* CommonCutlassConvKernelPart1 and CommonCutlassConvKernelPart2
* add group conv support in util.cu
* remove .sh
* refine name
* make name goodgit status!
* add fuse_alpha
* make code easy to understand
* mot fopen generate in py
* use python script to generate conv2d,group=1 cutlass code
* use const &
* use const & && use python script to generate conv2d/group=1 code
-
- 06 Mar, 2023 1 commit
-
-
Committed by Huang Jiyi
* move DeviceContextPool to phi
* add EmplaceExternalContextFunc
* update namespace
* update cmake
* fix bugs and create context_pool_impl.h
* replace platform::is_xxx_place
* fix bugs
* update generator
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix bugs
* fix enforce usage
* Revert "fix enforce usage". This reverts commit 5f521f08a69713cee506e64a00ec6d9fba709e27.
* fix bugs
* rm XPUDeviceContext and CustomDeviceContext
* fix bugs
* fix context init bug
* fix bugs after merge
* fix bugs
* fix name
* fix mutable_data
* update and fix bugs
* fix bugs
* update
* fix bugs
* fix name
* fix bugs
* merge
* fix bugs
* create context_pool in phi/backends
* create context_pool in phi/backends
* fix bugs
* fix xpu bugs
* fix rocm bugs
* fix bugs
* fix bugs
* fix bugs
* fix xpu bugs
* update
* update
* fix bugs
* fix bugs
-
- 02 Mar, 2023 1 commit
-
-
Committed by zyfncg
* split generated_op.cc into 4 src files
* fix bug
* fix compile on windows
-
- 23 Feb, 2023 1 commit
-
-
Committed by Huang Jiyi
* move fluid generator to phi
* move fluid generator to phi
* update .gitignore
* fix bugs
* fix cannot find "glog/logging.h" in "generator.h"
* fix bugs
-
- 22 Feb, 2023 1 commit
-
-
Committed by Jiabin Yang
* tmp gather vjp
* support gather
* remove useless code
* fix compiling error
* fix ut
* add eager test
* add eager test
* add seed
* fix cpu error
* fix transpose op compat
* remove tensor index case
* fix prim_cinn
* fix ut
-
- 20 Feb, 2023 1 commit
-
-
Committed by HongyuJia
* polish tensor operants implementation
* change year, 2021->2023
* autogen tensor.h and tensor_api.cc
* polish CMakeLists logic
* cancel tensor.h auto-gen
* clean useless parameter
* delete tensor_api.cc
-
- 18 Feb, 2023 1 commit
-
-
Committed by HongyuJia
* autogen phi tensor operants
* autogen operants base
* autogen operants_manager
* polish code style
* add cmake output
* fix cmake dependency error
* fix cmake file path error
-
- 14 Feb, 2023 1 commit
-
-
Committed by Aurelius84
* [Dy2St]Enhance @not_to_static API
* del breakpoint()
-
- 16 Dec, 2022 1 commit
-
-
Committed by hong
* change staticRNN to while
* update code
* fix rnn bug
* update
* fix _find_op_path_ bugs in append_backward.
* polish code
* revert op proto
* update
* update while
* format
* revert test while loop op
* fix create array
* fix windows error
* fix bug
* update
* fix array write bug

Co-authored-by: xiongkun <xiongkun03@baidu.com>
-
- 12 Dec, 2022 2 commits
-
-
Committed by Aurelius84
* [Setup] Ignore @PADDLE_BINARY_DIR@ files
* test=document_fix
-
Committed by zyfncg
-
- 28 Nov, 2022 1 commit
-
-
Committed by Aurelius84
* [Fluid Clean] Migrate program_translate.py/jit.py into paddle.jit dir
-
- 09 Nov, 2022 1 commit
-
-
Committed by Chen Weihang
* move fluid op generator into fluid
* remove parsed op
* resolve sig undef error
* append python interp find logic
* remove dup code
-
- 18 Oct, 2022 1 commit
-
-
Committed by zyfncg
* support generating code of opmaker for backward op invoke forward op
* support code-gen of opmaker for sparse op
* refine logic of choose phi kernel
* fix compile bug
* fix code_gen bug
* fix bug
* fix kernel signature code-gen
* fix compile bug of VarType
* fix compile bug of VarType
* fix test_sparse_conv_op
* fix test_sparse_norm_op
-
- 27 Sep, 2022 1 commit
-
-
Committed by xiaoxiaohehe001
-
- 26 Sep, 2022 1 commit
-
-
Committed by Yuanle Liu
-
- 30 Aug, 2022 1 commit
-
-
Committed by zyfncg
* add runtime config in phi
* add runtime attr for op desc and op
* fix no proto error
* adjust opdesc set_attr impl
* try to remove conv_op extra attrs
* add init runtime attr map
* change extra header path
* fix runtime_attr
* fix trace_op
* fix bug of pass
* fix merge conflict
* fix dygraph attrs
* fix bug of pass
* fix dygraph bug
* fix unittest module
* delete extra attr default
* fix dropout kernel
* polish code
* fix extra output of instance_norm
* fix merge conflict
* fix op_desc bug
* add extra attr in yaml for conv3d_transpose
* don't remove extra input and output
* fix save_inference_model
* fix bug of batch_norm
* revert some change
* polish log
* polish code
* add code comment

Co-authored-by: Chen Weihang <chenweihang@baidu.com>
-
- 26 Aug, 2022 1 commit
-
-
Committed by wanghuancoder
-
- 15 Aug, 2022 1 commit
-
-
Committed by Wilber
* convert_fp16 support multi block
* update
* update
-
- 12 Aug, 2022 1 commit
-
-
Committed by hong
-
- 12 Jul, 2022 1 commit
-
-
Committed by wanghuancoder
* split code gen for eager fluid_generated
-
- 06 Jul, 2022 1 commit
-
-
Committed by zyfncg
-
- 01 Jul, 2022 1 commit
-
-
Committed by Chen Weihang
* move yaml to phi
* fix undef var error
-
- 20 May, 2022 1 commit
-
-
Committed by Feiyu Chan
-
- 27 Mar, 2022 1 commit
-
-
Committed by Jack Zhou
* add string tensor and case convert kernels
* Add strings empty kernel; Reorganize the structure of case convert kernel
* Add string infermeta
* Update mutable_data of string tensor
* rename kernel name
* add string copy tmp
* Fix strings copy device bug
* add utf8 gpu converter
* add string tensor c++ api
* Remove mutable_data of string tensor
* update string tensor interface
* remove charcases_flag.h
* remove some fluid headers
* Add make_ddim
* __HIPCC__ -> PADDLE_WITH_HIP
* remove fluid headers
* fix cpu compile
* remove std::hash
* Fix cudaMalloc
* Remove strings/impl directory
* Fix infrt/get_phi_kernel_info.py; Add custom_kernels deps
* Add empty kernel test
* Remove some comments
* Modify lower/upper api encoding type: string->bool
* STRING->PSTRING; Add CreateInferLikeMeta
* Add code gen for C++ String API
* remove strings_api_utils.h
* Add ignore file (strings_api.h, strings_api.cc)
* update strings gen script
* change args order of case convert kernels
* Add comments for pstring, StringTensor
* cpstring_internal.h -> cpstring_impl.h
* Update according to comments:
  1. Remove fluid headers
  2. paddle::platform::errors -> phi::errors
  3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()'
  4. Use camel code style
* Remove all singletons in strings kernels
* fix rocm compile
* Fix py3 compile
* Fix c++ coverage
* Multiple changes:
  1. Add pstring proto type
  2. Add StringTensor debug info
  3. Rename case_convert_kernel to strings_lower_upper
  4. Remove serialize deserialize strings kernel
* DataLayout::PSTRING -> DataLayout::PSTRING_UNION
* Register pstring data type
* Fix strings api gen
* Fix dense tensor register pstring dtype
* Fix error messages
* remove line
* add pstring unittest
* remove test string api unittest
* remove empty line
* Remove some headers to decrease the size of executable file
-
- 17 Mar, 2022 1 commit
-
-
Committed by 王明冬
-
- 09 Mar, 2022 1 commit
-
-
Committed by huzhiqiang
-
- 08 Mar, 2022 1 commit
-
-
Committed by zyfncg
-
- 03 Mar, 2022 1 commit
-
-
Committed by zyfncg
* support sparse api in yaml
* support auto-gen code of sparse api
* do some refactor
* add unittest test_sparse_conv_api
* add unittest file

Co-authored-by: zkh2016 <zhangkaihuo@baidu.com>
-
- 02 Mar, 2022 1 commit
-
-
Committed by huzhiqiang
-
- 28 Feb, 2022 1 commit
-
-
Committed by tianshuo78520a
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi
* rename namespace to phi
* rename infrt pten dir to phi
* resolve conflict
* rename pten to phi in cmake
* revert all infrt change
* change needed files
* fix infrt failed
* fix inference failed
-
- 18 Feb, 2022 2 commits
-
-
Committed by Wilber
* the mlir representation of pten, test=develop
* fixes an error, test=develop
* infrt registers pten kernels

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
Committed by zyfncg
* support inplace and intermediate in yaml
* add cmake for dygraph_api
-
- 15 Feb, 2022 1 commit
-
-
Committed by hong
* move histogram to pten; test=develop
* fix format error; test=develop
* fix histogram kernel format; test=develop
-
- 13 Feb, 2022 1 commit
-
-
Committed by zyfncg
* generate wrapped_infer_meta
* add test for wrapped_infer_meta
* Update test_meta_fn_utils.cc
* change the dir of generated file

Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: Chen Weihang <chenwhpro@163.com>
-
- 09 Feb, 2022 1 commit
-
-
Committed by huzhiqiang
-
- 04 Feb, 2022 1 commit
-
-
Committed by zyfncg
* add data_transform in pten api
* support GetKernelTypeForVar
* fix compile problem of bfloat16
* change error namespace
* add complex type transform unittest
* fix merge conflict
-
- 30 Jan, 2022 1 commit
-
-
Committed by Chen Weihang
-
- 27 Jan, 2022 1 commit
-
-
Committed by Yan Chunwei
-