- 08 8月, 2022 1 次提交
-
-
由 xiaoxiaohehe001 提交于
-
- 22 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
Only cherry pick format tool(clang-format, yapf, cmake-format) upgrade to release/2.3, lint tool such as cpplint will not move, because we are not going to fix cpplint error in release/2.3 pre_commit.sh also is moved to release/2.3 so that both PR-CI-pre-commit and PR-CI-pre-commit-23 can works. pre install clang-format to avoid repeat installation due to pre-commit's multi-thread running.
-
- 21 6月, 2022 1 次提交
-
-
由 chalsliu 提交于
* Update CUDA and TensorRT version for CI * disable ut * Update TensorRT for CUDA 10.2
-
- 14 6月, 2022 1 次提交
-
-
由 Shang Zhizhou 提交于
-
- 10 5月, 2022 1 次提交
-
-
由 fwenguang 提交于
* [MLU] add mlu new profiler (#41138) * [MLU] add mlu new profiler * fix format * [MLU] support add callback to stream (#41831) * [MLU] add gather mlu kernel (#41969) * [MLU] add mlu activation kernels (#41751)
-
- 09 5月, 2022 1 次提交
-
-
由 Allen Guo 提交于
add class NameScopeHelper for adding namescope info 添加更多 种类优化器状态的映射 为 IpuStrategy 添加 compilation_progress_logger option 用于输出 编译进度 部分代码清理和杂项优化
-
- 27 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* Remove std::type_index in AttributeArdDef (#42122) * polish some impl * add lost attr type * polish details * fix error type * polish in name lists * add double attr * adapt infrt attr parse * add attr type test (#42263) * opt attr eaque perf (#42272)
-
- 14 4月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Add yaml for matrix rank op (#41466) * modify matrix_rank * add matrix_rank shape * add matrix_rank shape * Add yaml for matrix_rank OP * Add UT Co-authored-by: Nzhoujianqian <15205085056@163.com> * Add yaml for eye OP (#41476) * [cherry-pick] Add yaml config for matrix_rank, eye, deformable_conv and deformable_conv_v1 OPs * Add yaml for deformable_conv and deformable_conv_v1 OPs * Add UT * Add to skipped_phi_api list for infrt Co-authored-by: Nzhoujianqian <15205085056@163.com>
-
- 13 4月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* Revert "[Phi] Migrate Adam and AdamW into Phi (#40351)" This reverts commit 56cd3407. * add infermeta
-
- 11 4月, 2022 1 次提交
-
-
由 hong 提交于
-
- 06 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add real and imag yaml * add roi_align and roi_pool yaml * add qr yaml * add psroi_pool yaml * fix bug * fix param bug of psroi_pool * fix infrt problem * fix merge bug
-
- 05 4月, 2022 2 次提交
-
-
由 zyfncg 提交于
* add roi_align yaml * fix bug
-
由 RichardWooSJTU 提交于
* add nms op and batched_nms api
-
- 04 4月, 2022 3 次提交
-
-
由 hong 提交于
* update epxand and sigmoid with cross entropy * skip expand as infrt check * fix sigmoid cross entropy bug * remove no grad set white list * remove no grad set * fix bug * fix sigmoid error * fix bug
-
由 hong 提交于
* add dropout slice yaml * remove useless code * fix infer shape error * skip infrt compile for dropout
-
由 From00 提交于
* Add yaml for flatten_contiguous_range OP * update * Fix typos Co-authored-by: NShixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 02 4月, 2022 2 次提交
-
-
由 huzhiqiang 提交于
* code * code
-
由 Xiaoxu Chen 提交于
Enhance vjp/jvp/Jacobian/Hessian API for supporting dynamic, static graph and batched, unbatched mode (#40692) * modify vjp/jvp for both dynamic and static graph * enforce jacobian class for supporting first/last batch * add unittest for jvp, jacobian withlast batch, jacobian with first batch * fix the incorrect shape when multi-index Jacobian * enforce Hessian class for supporting dynamic graph * add Hessian class unittest * bugfix, jvp double_backward_trick zeros_like return stop_gradient=True in static graph * add API beta warnnings * add white_list for cuda11.x ci windows. * optimize some code snippets and documments * set unittest timeout to 100 seconds * move vjp,jvp,Jacobian,Hessian to incubate * fix vjp,vjp import path of sample code * fix code style error of augtograd/__init__ file
-
- 01 4月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add nll_loss yaml * fix nll loss * fix nll loss bug * fix bug * fix bug * fix infrt problem Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 31 3月, 2022 1 次提交
-
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * .
-
- 30 3月, 2022 2 次提交
-
-
由 王明冬 提交于
-
由 huzhiqiang 提交于
-
- 29 3月, 2022 1 次提交
-
-
由 王明冬 提交于
-
- 28 3月, 2022 2 次提交
-
-
由 王明冬 提交于
-
由 Chen Weihang 提交于
* fix assign kernel bug * fix xpu kernel select error * add cudn pinned place * fix copy error * fix infrt error
-
- 27 3月, 2022 1 次提交
-
-
由 Jack Zhou 提交于
* add string tensor and case convert kernels * Add strings empty kernel; Reorganize the structure of case convert kernel * Add string infermeta * Update mutable_data of string tensor * rename kernel name * add string copy tmp * Fix strings copy device bug * add utf8 gpu converter * add string tensor c++ api * Remove mutable_data of string tensor * update string tensor interface * remove charcases_flag.h * remove some fluid headers * Add make_ddim * __HIPCC__ -> PADDLE_WITH_HIP * remove fluid headers * fix cpu compile * remove std::hash * Fix cudaMalloc * Remove strings/impl directory * Fix infrt/get_phi_kernel_info.py;Add custom_kernels deps * Add empty kernel test * Remove some comments * Modify lower/upper api encoding type: string->bool * STRING->PSTRING; Add CreateInferLikeMeta * Add code gen for C++ String API * remove strings_api_utils.h * Add ignore file (strings_api.h, strings_api.cc) * update strings gen script * change args order of case convert kernels * Add comments for pstring, StringTensor * cpstring_internal.h -> cpstring_impl.h * Update accordding to comments: 1. Remove fluid headers 2. paddle::platform::errors -> phi::errors 3. Use 'place.GetType() == phi::AllocationType::GPU' instead of 'paddle::platform::is_cpu_space()' 4. Use camel code style * Remove all singletons in strings kernels * fix rocm compile * Fix py3 compile * Fix c++ coverage * 1. Add pstring proto type 2. Add StringTensor debug info 3. Rename case_convert_kernel to strings_lower_upper 4. Remove serialize derialize strings kernel * DataLayout::PSTRING -> DataLayout::PSTRING_UNION * Register pstring data type * Fix strings api gen * Fix dense tensor register pstring dtype * Fix error messages * remove line * add pstring unittest * remove test string api unitest * remove empty line * Remove some headers to decrease the size of executable file
-
- 25 3月, 2022 3 次提交
-
-
由 Wilber 提交于
* update register every make. * fix * update
-
由 Aurelius84 提交于
* [Phi] Migrate Adam and Adamw into Phi * fix compile error and unittest ok * fix compile error and unittest ok * fix undefined reference to fLI::FLAGS * test depend on operator * fix cmake * fix xpu compile * fix infrt * fix amp_type_traits * fix amp_type_traits * modify according reviewer * modify according reviewer * fix dtype float16 * fix typo * fix Cmake * fix code style
-
由 王明冬 提交于
-
- 24 3月, 2022 4 次提交
-
-
由 zhangbo9674 提交于
* approve amp for intermediate_dygraph * add amp_utils for intermediate_dygraph * add amp needcast check for mlu & npu * test unittest * add SetGradNode for set_stop_gradient && add checktensor for GradientHooks * refine code * refien unittest of imperative_amp for new dygraph * inplace api skip amp * add test_imperative_qat_amp for intermediate amp * refine code * refine test_amp ci strategy * refine unittest code * refine amp_utils code * refine amp getpromotetype for some special op * refine unittest code
-
由 Ren Wei (任卫) 提交于
-
由 huzhiqiang 提交于
-
由 huzhiqiang 提交于
-
- 23 3月, 2022 2 次提交
-
-
由 王明冬 提交于
-
由 Zhanlue Yang 提交于
* Removed redundant use of declarations.h * Fixed minor bug
-
- 21 3月, 2022 1 次提交
-
-
由 石晓伟 提交于
-
- 18 3月, 2022 4 次提交
-
-
由 huzhiqiang 提交于
-
由 Sing_chan 提交于
-
由 Wilber 提交于
* support register with attr * add infrt_with_gpu macor
-
由 王明冬 提交于
* [infrt] rename pd dialect from mlir to infrt. test=develop * [infrt] fix the kernel signature generator bug.
-