- 02 11月, 2022 6 次提交
-
-
由 Yiqun Liu 提交于
Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. (#47095) * Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. * Add a FLAGS to control whether abort when meets inf/nan and polish codes. * Fix unittest. * Change the computing of mean.
-
由 ShenLiang 提交于
* add unbalanced data * fix utest
-
由 zyfncg 提交于
* support generating static code of high order grad op by yaml * polish code
-
由 houj04 提交于
* [XPU] add int64 support for slice and subtract. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * remove unnecessary modification. test=kunlun
-
由 zhangkaihuo 提交于
-
由 Tian Zheng 提交于
* Add build option for CUDNN Frontend API * Fix review comments * Change namespace for cudnn_frontend.h
-
- 01 11月, 2022 28 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][E711][E712] update flake8 config * empty commit, test=document_fix
-
由 Nyakku Shigure 提交于
* [CodeStyle][E711] use `is`/`is not` for comparison with `None` * `self.assertTrue($A is None)` -> `self.assertIsNone($A)` * `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)` * `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)` * `self.assertEqual($A, None)` -> `self.assertIsNone($A)` * `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`
-
由 Leo Chen 提交于
* refine comments,test=kunlun * link xpu lib, test=kunlun * add sleep for test, test=kunlun * merge develop, fix compile, test=kunlun * remove debug code, test=kunlun * add dependency to avoid potential concurrency error, test=kunlun
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
由 Yuanle Liu 提交于
-
由 HongyuJia 提交于
-
由 Siming Dai 提交于
-
由 Haohongxiang 提交于
-
由 limingshu 提交于
* first commit * transpose_kernel_optimization * first complishment of transpose op * second commit * refine code logics of tranpose_kernel * refine transpose kernel * first commit * fix DtoD copy bugs for hip * refine code according to the PR advice * change dim to int64_t type. * fix some type error
-
由 YuanRisheng 提交于
* standard_api * add hardtanh
-
由 shentanyue 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` * revert changes in fluid * revert unrelated file * revert changes in norm * revert changes in auto_parallel_amp * fix norm and auto_parallel_amp * revert a typo fix due to fixed at #47477
-
由 Ruibiao Chen 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * Support custom stream for standalone executor Co-authored-by: NYulong Ao <aoyulong@baidu.com>
-
由 xiongkun 提交于
* Einsum Support Complex * code fix * add unittest for complex grad with einsum * set rtol=1e-4 * fix
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `six` package (part2) * six.ensure_str * remove unused `import six` * remove six from BUILTIN_LIKELY_MODULES * remove six in example code * remove some decode * try to fix example code * fix MockEtcdClient get/get_prefix returns data type * fix MockEtcdClient get_prefix returns data * fix MockEtcdClient get returns data * remove `six` in pypi and conda requirements * fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type * refine MockEtcdClient
-
由 risemeup1 提交于
* 修改.gitigore文件,把ljd_sh文件忽略掉 * 取消ljd_sh文件忽略 * filter npu xpu mlu ipu file * filter xpu npu mlu ipu files * filter npu xpu mlu ipu file
-
由 Kaipeng Deng 提交于
* fix memory copy in prepare_data. test=develop
-
由 shentanyue 提交于
-
由 sneaxiy 提交于
-
由 Aurelius84 提交于
* [Tools]Add autoflake pre-commit hook to remove unused-imports/var * add more args test=document_fix
-
由 Aurelius84 提交于
-
由 feng_shuai 提交于
-
由 Vigi Zhang 提交于
-
由 Wang Xin 提交于
-
由 zyfncg 提交于
* gene static graph code for ceil, expm1 op * gene static graph code for some activation op * fix bug * revert doc of silu and logsigmoid
-
由 Chen Weihang 提交于
* add extra attr property set * add type_info for all context * add onednn context to all context * fix context compile error * simplify conv kernel args * pass runtime attr into dev_ctx * fix marco error * clear conv_grad_kernel extra args * merge conv_grad_grad into conv_grad * clear conv2d_grad_grad extra attrs * clear yaml and eager extra attr * fix conv1d error * change to thread local * fix npu compile failed * try to fix windows compile failed * add conv2d onednn phi kernel * fix ci bugs (#36) * fix compile bugs (#38) * fix extra input transform bug (#39) * support dynamic created attr (#40) * reset extra info gen code * rm conv_grad_grad kernel * reimpl pass attr adapting * add int attr support * remove vector inputnames creating * fix map at error * Update paddle/phi/kernels/onednn/conv_grad_kernel.cc Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> * remove useless extra attrs * replace mkldnn_engine by onednn_engine Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
-
由 Yuang Liu 提交于
-
由 umiswing 提交于
-
- 31 10月, 2022 6 次提交
-
-
由 YuanRisheng 提交于
* standard api * fix ci bugs * fix ci bugs * fix ce bugs
-
由 xiongkun 提交于
* add unittest for einsum-v2-trace and diagonal * repeat labels. * einsum support repeated labels. * forward is ok for diagonal and undiagonalized. TODO: check backward is ok by our theorem. * backward is ok! * fix by PR suggestions. * fix ci error * fix ci error * fix ci warning
-
由 Guanghua Yu 提交于
-
由 wanghuancoder 提交于
* fix predictor memory write overflow
-
由 feng_shuai 提交于
* feat: add int8 support for vit * test:add test
-
由 ronnywang 提交于
* [CustomDevice] GetCCLComm add custom device support * update * update * update
-