- 03 Nov, 2022 6 commits
-
-
Committed by zhouweiwei2014
-
Committed by sneaxiy
-
Committed by Sławomir Siwek
* add extra attr property set
* add type_info for all context
* add onednn context to all context
* fix context compile error
* simplify conv kernel args
* pass runtime attr into dev_ctx
* fix macro error
* clear conv_grad_kernel extra args
* merge conv_grad_grad into conv_grad
* clear conv2d_grad_grad extra attrs
* remove redundant imports
* migrate softmax
* clear yaml and eager extra attr
* fix conv1d error
* change to thread local
* fix npu compile failed
* try to fix windows compile failed
* add conv2d onednn phi kernel
* fix ci bugs (#36)
* fix compile bugs (#38)
* fix extra input transform bug (#39)
* support dynamic created attr (#40)
* reset extra info gen code
* rm conv_grad_grad kernel
* reimpl pass attr adapting
* add int attr support
* remove vector inputnames creating
* merge dev
* fix map at error
* adjust attribute
* adapt funcs to PHI
Co-authored-by: Chen Weihang <chenweihang@baidu.com>
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
-
Committed by zhangkaihuo
-
Committed by zhouweiwei2014
-
Committed by YuanRisheng
-
- 02 Nov, 2022 5 commits
-
-
Committed by zhangbo9674
* fix ci bug
* test
-
Committed by Tao Luo
-
Committed by YuanRisheng
* Standardise batch norm
* standardize conv3d and depwise_conv2d
* fix ci bugs
-
Committed by zhouweiwei2014
-
Committed by houj04
* [XPU] add int64 support for slice and subtract. test=kunlun
* try to fix xpu compile. test=kunlun
* try to fix xpu compile. test=kunlun
* try to fix xpu compile. test=kunlun
* remove unnecessary modification. test=kunlun
-
- 01 Nov, 2022 7 commits
-
-
Committed by Siming Dai
-
Committed by limingshu
* first commit
* transpose_kernel_optimization
* first accomplishment of transpose op
* second commit
* refine code logic of transpose_kernel
* refine transpose kernel
* first commit
* fix DtoD copy bugs for hip
* refine code according to the PR advice
* change dim to int64_t type.
* fix some type error
-
Committed by YuanRisheng
* standard_api
* add hardtanh
-
Committed by xiongkun
* Einsum Support Complex
* code fix
* add unittest for complex grad with einsum
* set rtol=1e-4
* fix
-
Committed by Wang Xin
-
Committed by Chen Weihang
* add extra attr property set
* add type_info for all context
* add onednn context to all context
* fix context compile error
* simplify conv kernel args
* pass runtime attr into dev_ctx
* fix macro error
* clear conv_grad_kernel extra args
* merge conv_grad_grad into conv_grad
* clear conv2d_grad_grad extra attrs
* clear yaml and eager extra attr
* fix conv1d error
* change to thread local
* fix npu compile failed
* try to fix windows compile failed
* add conv2d onednn phi kernel
* fix ci bugs (#36)
* fix compile bugs (#38)
* fix extra input transform bug (#39)
* support dynamic created attr (#40)
* reset extra info gen code
* rm conv_grad_grad kernel
* reimpl pass attr adapting
* add int attr support
* remove vector inputnames creating
* fix map at error
* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
  Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
* remove useless extra attrs
* replace mkldnn_engine by onednn_engine
Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Sławomir Siwek <slawomir.siwek@intel.com>
-
Committed by umiswing
-
- 31 Oct, 2022 6 commits
-
-
Committed by YuanRisheng
* standard api
* fix ci bugs
* fix ci bugs
* fix ce bugs
-
Committed by xiongkun
* add unittest for einsum-v2-trace and diagonal
* repeat labels.
* einsum support repeated labels.
* forward is ok for diagonal and undiagonalized. TODO: check backward is ok by our theorem.
* backward is ok!
* fix by PR suggestions.
* fix ci error
* fix ci error
* fix ci warning
-
Committed by ronnywang
* [CustomDevice] GetCCLComm add custom device support
* update
* update
* update
-
Committed by kangguangli
* replace executor in conditional_block_op.run with standalone_executor
* add block_id as the argument of standalone executor's method run; add print for program
* fix scope bug about conditional block op
* fix bug: unnecessary return of fetch value
* fix typo
* fix: quantization will set variable persistable, and these variables must exist in global scope
* add interpretercore cache for conditional block op but do not activate it by default
* fix bug: local scope reuse for conditional block op
* reset scope when conditional block op runs
* fix typo
* fix typo and code style
* add build scope for conditional block op
* add skip for transfer_layout kernel
* refine code
* fix reset_scope
* fix reset_scope
* refine code
* refine code
* refine code
  1. remove flag use in conditional_block_op
  2. pass execution_config to BuildOpFuncList instead of individual parameter
* refine code
* remove the use of FLAGS_control_flow_use_new_executor_cache
* change FLAGS_control_flow_use_new_executor to false
-
Committed by zhouweiwei2014
-
Committed by Wang Xin
-
- 28 Oct, 2022 1 commit
-
-
Committed by zyfncg
-
- 27 Oct, 2022 2 commits
-
-
Committed by Jacek Czaja
* - halfway transforming transpose grad
  - Fixes
  - buildable
* - lint
* rerunning the process
-
Committed by Bo Zhang
* fix reduce_any kernel data race on sharedMem
* use bit operation instead of div & mod
* unbranch
* modified according to PR comments
-
- 26 Oct, 2022 3 commits
-
-
Committed by Lin Manhui
* Fix paddle.pow() bugs
* Add unittest cases
* Fix ut cases
* Add ut cases on multiple devices
-
Committed by zhangkaihuo
-
Committed by Wang Xin
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
-
- 25 Oct, 2022 2 commits
-
-
Committed by jakpiase
-
Committed by zhouweiwei2014
-
- 24 Oct, 2022 4 commits
-
-
Committed by zyfncg
* support selected_rows kernel for multiply in dygraph
* delete useless code of slice in fluid
* fix compile bug
* move slice_array from fluid to phi
* fix strided_slice_op_npu
-
Committed by Yiqun Liu
-
Committed by Zhang Ting
-
Committed by Yiqun Liu
-
- 21 Oct, 2022 1 commit
-
-
Committed by zhangyikun02
-
- 20 Oct, 2022 2 commits
-
-
Committed by JingZhuangzhuang
* Add infer prune function
* Update phi.cmake
* Update operators.cmake
* add fusion op
-
Committed by thunder95
-
- 19 Oct, 2022 1 commit
-
-
Committed by Chen Weihang
-