- 28 3月, 2022 5 次提交
-
-
由 Leo Chen 提交于
-
由 chentianyu03 提交于
* remove in_dtype, out_dtype in redcue grad * set the dtype and layout in noneedbufferInputs func
-
由 Chen Weihang 提交于
* fix assign kernel bug * fix xpu kernel select error * add cudn pinned place * fix copy error * fix infrt error
-
由 0x45f 提交于
* moving OP * move forward * move grad and infershape * code format * format code * fix code * fix code * fix CMakerLists.txt * fix comments * Refine CMakeLists for rocm ci
-
由 Chen Weihang 提交于
* move assign value op kernel * remove original op * fix failed unittests * remove infershape
-
- 27 3月, 2022 3 次提交
-
-
由 Li Min 提交于
-
由 hong 提交于
* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 26 3月, 2022 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 25 3月, 2022 8 次提交
-
-
由 Zhang Ting 提交于
-
由 YuanRisheng 提交于
-
由 Aurelius84 提交于
* [Phi] Migrate strided_slice into Phi * [Phi] Migrate strided_slice into Phi * fix compilation problem
-
由 zhangbo9674 提交于
* add cast_grad phi kernel * refie unittest * refien unittest * refine unittest * refine include header path * refien xpu cast unittest * refine code
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
由 FlyingQianMM 提交于
* add maximum limit for grid of reduce, elementwise and gather * add {} after if
-
由 Chen Weihang 提交于
* move part sum op kernel * remove deprecated names
-
由 FlyingQianMM 提交于
-
- 24 3月, 2022 6 次提交
-
-
由 Chen Weihang 提交于
* add mean phi kernel * remove original mean kernel * add alias name
-
由 zhiboniu 提交于
-
由 caozhou 提交于
* migrate infershape * fix tril_triu infershape error * fix qr_op infershape * add parse qr mode func * move order
-
由 Aurelius84 提交于
* [phi] Split selected_rows CMake compilation * move file back * move file back
-
由 Chen Weihang 提交于
* add mul phi kernel * remove mul op kernel * remove original mul grad op * fix cinn test * fix dygraph test failed
-
由 niuliling123 提交于
-
- 23 3月, 2022 10 次提交
-
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* move deformable_conv_grad to phi * move infershape of deformable_conv to phi * adjust some code format * move deformable_conv_v1 to phi
-
由 YuanRisheng 提交于
* remove flatten infermeta * fix bugs when run inference ci * fix bugs when run inference ci * fix bugs when run ci * support infrt * inplace infershape code'
-
由 KP 提交于
* Add stft_op. * Add stft_grad_op. * Add stft_op unittest. * [DLTP-45176] Add complex compatibility in static mode for stft api. * [DLTP-45176] Add complex compatibility in static mode for stft api. * Add doc. * Update unitests of stft op. * Update spectral helper. * fix coding style.
-
由 zhouweiwei2014 提交于
-
由 niuliling123 提交于
-
由 xiongkun 提交于
* transfer unsqueeze to phi * fix conflict * add squeeze * add infershape * fix xpu and npu error
-
由 YuanRisheng 提交于
* move activation * fix bugs when run ce
-
由 Chen Weihang 提交于
* add full_batch_size_like phi kernel * remove fill constant bs like * update year
-
由 Zhang Ting 提交于
* add kernel profiler * add gpu timer tool * remove warmup * fix rocm complilation error
-
- 22 3月, 2022 6 次提交
-
-
由 hong 提交于
* move mutable_data to context alloc * move mutable_data to context alloc * remvoe duplicate code
-
由 Chen Weihang 提交于
-
由 hong 提交于
* move embeding to phi; * update sig; test=develop * move reset impl to phi; test=develop * remove old register; test=develop * fix cpu bf16 bug; test=develop * fix lookup speed error * polish code * fix paddle throw type
-
由 Chen Weihang 提交于
* add reverse phi kernel * add reverse infermeta * remove original reverse op kernl & infershape
-
由 Siming Dai 提交于
* add out_size shape for graph_send_recv * fix bug in register kernel: no const int& support * add out_size in infermeta * change unittest * fix unittest * fix out_size default value * fix doc * delete arg mapping * add sig * move -1 to 0 * move -1 to 0
-
由 chentianyu03 提交于
* modify out and out_grad order in reduce_grad_kernel * delete unsed boolReduceKernel * fix conflict
-
- 21 3月, 2022 1 次提交
-
-
由 0x45f 提交于
* Refine to_tensor for eager mode * support gpu_pinned * refine code * support gpu_pinned copy_to * fix layer.__setattr__ * support to_tensor for gpu_pinned * fix unit test * refine gpu_pinned * restore the original code * add is_gup_pinned() and refine eager.Tensor._copy_to()
-