- 01 11月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
由 Yuang Liu 提交于
-
由 umiswing 提交于
-
- 31 10月, 2022 18 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix ci bugs * fix ci bugs * fix ce bugs
-
由 xiongkun 提交于
* add unittest for einsum-v2-trace and diagonal * repeat labels. * einsum support repeated labels. * forward is ok for diagonal and undiagonalized. TODO: check backward is ok by our theorem. * backward is ok! * fix by PR suggestions. * fix ci error * fix ci error * fix ci warning
-
由 Guanghua Yu 提交于
-
由 wanghuancoder 提交于
* fix predictor memory write overflow
-
由 feng_shuai 提交于
* feat: add int8 support for vit * test:add test
-
由 ronnywang 提交于
* [CustomDevice] GetCCLComm add custom device support * update * update * update
-
由 feng_shuai 提交于
* optimize: vit 384 * fix:bug * fix:bug * fix:supoort rocm complie * refactor:name * fix:support rocm * fix:__HIP_NO_HALF_CONVERSIONS__ * optimize: delete scalar * fix:rocm can't support * fix:ernie error
-
由 Yulong Ao 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * [Auto Parallel] Add the missiong import
-
由 kangguangli 提交于
* replace executor in conditional_block_op.run with standalone_executor * add block_id as the argument of standalone executor's method run; add print for program * fix scope bug about conditional block op * fix bug: unnecessary return of fetch value * fix typo * fix: quantization will set variable persistable, and these variables must exist in global scope * add interpretercore cache for conditional block op but not activate in default * fix bug: local scope reuse for conditional block op * reset scope when conditional block op runs * fix typo * fix typo and code style * add build scope for conditional block op * add skip for transfer_layout kernel * refind code * fix reset_scope * fix reset_scope * refine code * refine code * refine code 1. remove flag use in conditional_block_op 2. pass execution_config to BuildOpFuncList instead of individual parameter * refine code * remove the use of FLAGS_control_flow_use_new_executor_cache * change FLAGS_control_flow_use_new_executor to false
-
由 Chenxiao Niu 提交于
-
由 Nyakku Shigure 提交于
* fix typo `Fasle`/`Flase` -> `Flase` * fix typo `Ture` -> `True`
-
由 zhouweiwei2014 提交于
-
由 zhangbo9674 提交于
* fix python module not found bug * delete unused cast,test=allcases
-
由 YangZhou 提交于
* rm kaiser window in audio window function * rm paddle audio utils which is redundant * rm kaiser in test_audio_functions.py
-
由 Wang Xin 提交于
-
由 risemeup1 提交于
-
由 risemeup1 提交于
-
由 risemeup1 提交于
-
- 30 10月, 2022 1 次提交
-
-
由 Roc 提交于
* maping from dist name scope to single name scope * update * fix gen cmake * support runtype is '' when using test_runner.py * Revert "fix gen cmake" This reverts commit d7a653d33aeacb8bb4a13957c9961ed9f626a18f. * update gen-ut-cmakelist; test=document_fix * revert code; test=document_fix
-
- 29 10月, 2022 1 次提交
-
-
由 Roc 提交于
-
- 28 10月, 2022 12 次提交
-
-
由 sneaxiy 提交于
* add fused_allreduce_gradients_with_group * add scale * fix ci
-
由 zyfncg 提交于
-
由 Haohongxiang 提交于
-
由 zhaoyingli 提交于
* fix engine build method * fix import * update engine cost * update raise error * update cmakelist * revert optimizer * revert optimizer * fix unittest * fix unittest Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
由 YangZhou 提交于
* fix window security error * format
-
由 LiYuRio 提交于
-
由 Aurelius84 提交于
* [JITLayer]Enable OneDNN on CPU and Fix zero shape * remove VLOG
-
由 Haohongxiang 提交于
* fix no sync bugs * update * update task chain fix: update wait chain feat: add `GetDeviceContext` for gloo * fix oom * fix dev * update * update Co-authored-by: NLiYuRio <liyuruijx@163.com> Co-authored-by: NForFishes <2282912238@qq.com>
-
由 zyfncg 提交于
* generate static graph code for some activation op * fix example code of cosh
-
由 Guanghua Yu 提交于
-
由 Wang Xin 提交于
-
- 27 10月, 2022 5 次提交
-
-
由 Aurelius84 提交于
* add predictor_engine * add predictor_engine * fix zero shape * fix lodTensor * fix unittest * fix code style * update CmakeList
-
由 Yuanle Liu 提交于
-
由 Guanghua Yu 提交于
-
由 WangZhen 提交于
Fix abnormal growth of memory in train mode and no_grad for Dy2St
-
由 zyfncg 提交于
-