- 02 Nov, 2022 3 commits
-
-
Committed by YuanRisheng
* Standardise batch norm * standardize conv3d and depthwise_conv2d * fix ci bugs
-
Committed by zhouweiwei2014
-
Committed by zyfncg
* support generating static code of high order grad op by yaml * polish code
-
- 01 Nov, 2022 9 commits
-
-
Committed by Leo Chen
* refine comments,test=kunlun * link xpu lib, test=kunlun * add sleep for test, test=kunlun * merge develop, fix compile, test=kunlun * remove debug code, test=kunlun * add dependency to avoid potential concurrency error, test=kunlun
-
Committed by HongyuJia
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
Committed by HongyuJia
-
Committed by limingshu
* first commit * transpose_kernel_optimization * first accomplishment of transpose op * second commit * refine code logics of transpose_kernel * refine transpose kernel * first commit * fix DtoD copy bugs for hip * refine code according to the PR advice * change dim to int64_t type. * fix some type error
-
Committed by YuanRisheng
* standard_api * add hardtanh
-
Committed by feng_shuai
-
Committed by Wang Xin
-
Committed by zyfncg
* gene static graph code for ceil, expm1 op * gene static graph code for some activation op * fix bug * revert doc of silu and logsigmoid
-
Committed by Chen Weihang
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix macro error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: YuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
-
- 31 Oct, 2022 6 commits
-
-
Committed by YuanRisheng
* standard api * fix ci bugs * fix ci bugs * fix ce bugs
-
Committed by feng_shuai
* optimize: vit 384 * fix:bug * fix:bug * fix:support rocm compile * refactor:name * fix:support rocm * fix:__HIP_NO_HALF_CONVERSIONS__ * optimize: delete scalar * fix:rocm can't support * fix:ernie error
-
Committed by kangguangli
* replace executor in conditional_block_op.run with standalone_executor * add block_id as the argument of standalone executor's method run; add print for program * fix scope bug about conditional block op * fix bug: unnecessary return of fetch value * fix typo * fix: quantization will set variable persistable, and these variables must exist in global scope * add interpretercore cache for conditional block op but not activate in default * fix bug: local scope reuse for conditional block op * reset scope when conditional block op runs * fix typo * fix typo and code style * add build scope for conditional block op * add skip for transfer_layout kernel * refine code * fix reset_scope * fix reset_scope * refine code * refine code * refine code 1. remove flag use in conditional_block_op 2. pass execution_config to BuildOpFuncList instead of individual parameter * refine code * remove the use of FLAGS_control_flow_use_new_executor_cache * change FLAGS_control_flow_use_new_executor to false
-
Committed by Chenxiao Niu
-
Committed by Nyakku Shigure
* fix typo `Fasle`/`Flase` -> `False` * fix typo `Ture` -> `True`
-
Committed by Wang Xin
-
- 28 Oct, 2022 3 commits
- 27 Oct, 2022 4 commits
-
-
Committed by Aurelius84
* add predictor_engine * add predictor_engine * fix zero shape * fix lodTensor * fix unittest * fix code style * update CmakeList
-
Committed by Leo Chen
* make all cpp tests dynamic linked to libpaddle.so * add comments * keep old cc_test for some tests * fix some ut * make some ut use cc_test_old * fix typos and fit for win32 * fix lib path * fix some tests * skip lite test * fit for rocm * fit for cinn * fit for mac * fit for win32 * skip inference ut * skip windows * fix coverage
-
Committed by Shijie
-
Committed by HongyuJia
-
- 26 Oct, 2022 7 commits
-
-
Committed by Sławomir Siwek
* fc/matmuls + scale fuse pass * remove double-extension * add unit tests * comments from review * codestyle * add pass to int8 list * new codestyle * attr name typo
-
Committed by HongyuJia
* remove prior_box mkldnn hard code * add header file * simplify PD_VISIT_TYPE * decouple dependency between prior_box and density_prior_box * fix pragma omp parallel error * bypass #pragma omp_parallel_for error * polish code * remove visit_type headerfile * polish codestyle * polish codestyle * try fix CI error * add testcase, datatype=float64 * reset test_prior_box testcase * add datacheck to DenseTensor * update template name * call prior_box with macro expand
-
Committed by HongyuJia
-
Committed by Wang Xin
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
-
Committed by zyfncg
* fix inference performance problem caused by selecting cudnn kernel for softmax * recover use_cudnn in opmaker of softmax
-
Committed by Chen Weihang
* remove using lodtensor part2 * resolve code format error * resolve conflict * resolve conflict * replace added framework tensor
-
Committed by sneaxiy
* fix fused_attention fused_feedforward * fix ci * fix ci * fix ci PADDLE_GET_CONST * fix ci ut
-
- 25 Oct, 2022 6 commits
-
-
Committed by jakpiase
* return proper state * fix for dims * fix
-
Committed by HongyuJia
-
Committed by HongyuJia
* opt cudnn hardcode of sequence_softmax * fix grad datatype
-
Committed by HongyuJia
* use dnn_fallback flag to delete mkldnn hardcode * polish code style * fix protected error * fix const error * fix reduce_op fallback * fix pool_op fallback * add Set function of dnn_fallback_
-
Committed by zhouweiwei2014
-
Committed by HongyuJia
-
- 24 Oct, 2022 2 commits
-
-
Committed by zyfncg
* support selected_rows kernel for multiply in dygraph * delete useless code of slice in fluid * fix compile bug * move slice_array from fluid to phi * fix strided_slice_op_npu
-
Committed by zyfncg
* gene static graph code for exp * refactor the doc of exp * fix bug * fix bug * update doc of exp * fix sparse op
-