- 02 11月, 2022 24 次提交
-
-
由 JYChen 提交于
-
由 Zhang Jun 提交于
* add bilinear OutSize
-
由 Leo Chen 提交于
-
由 zhangbo9674 提交于
* fix ci bug * test
-
由 Zhang Zheng 提交于
* Support NHWC layout in GroupNorm * fix cteset
-
由 tianshuo78520a 提交于
* add cuda117 dockerfile; test=cuda117 * notest;test=cuda117 * test=cuda117 * test=document_fix
-
由 丁一 提交于
-
由 Haohongxiang 提交于
-
由 Ruibiao Chen 提交于
* Dispath computation OPs before communication in standalone executor * Update code * Fix CI errors
-
由 Tao Luo 提交于
-
由 Chen Weihang 提交于
* add phi core file into ci list, test=document_fix * remove repated file, test=document_fix
-
由 Chen Weihang 提交于
* add storage properties for npu * fix compile failed * fix api name mismatch * polish design
-
由 YuanRisheng 提交于
* Standardise batch norm * standardize conv3d and depwise_conv2d * fix ci bugs
-
由 zhouweiwei2014 提交于
-
由 Leo Chen 提交于
-
由 Kevin吴嘉文 提交于
-
由 risemeup1 提交于
* 修改.gitigore文件,把ljd_sh文件忽略掉 * 修复改动单测文件没有触发精准测试的问题 * 取消改动.gitignore * 修复改动单测没有出发精准测试的问题 * 修改变量名含义更加容易理解,test=coverage
-
由 Yiqun Liu 提交于
Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. (#47095) * Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. * Add a FLAGS to control whether abort when meets inf/nan and polish codes. * Fix unittest. * Change the computing of mean.
-
由 ShenLiang 提交于
* add unbalanced data * fix utest
-
由 zyfncg 提交于
* support generating static code of high order grad op by yaml * polish code
-
由 houj04 提交于
* [XPU] add int64 support for slice and subtract. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * try to fix xpu compile. test=kunlun * remove unnecessary modification. test=kunlun
-
由 zhangkaihuo 提交于
-
由 Tian Zheng 提交于
* Add build option for CUDNN Frontend API * Fix review comments * Change namespace for cudnn_frontend.h
-
- 01 11月, 2022 16 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][E711][E712] update flake8 config * empty commit, test=document_fix
-
由 Nyakku Shigure 提交于
* [CodeStyle][E711] use `is`/`is not` for comparison with `None` * `self.assertTrue($A is None)` -> `self.assertIsNone($A)` * `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)` * `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)` * `self.assertEqual($A, None)` -> `self.assertIsNone($A)` * `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`
-
由 Leo Chen 提交于
* refine comments,test=kunlun * link xpu lib, test=kunlun * add sleep for test, test=kunlun * merge develop, fix compile, test=kunlun * remove debug code, test=kunlun * add dependency to avoid potential concurrency error, test=kunlun
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-
由 Yuanle Liu 提交于
-
由 HongyuJia 提交于
-
由 Siming Dai 提交于
-
由 Haohongxiang 提交于
-
由 limingshu 提交于
* first commit * transpose_kernel_optimization * first complishment of transpose op * second commit * refine code logics of tranpose_kernel * refine transpose kernel * first commit * fix DtoD copy bugs for hip * refine code according to the PR advice * change dim to int64_t type. * fix some type error
-
由 YuanRisheng 提交于
* standard_api * add hardtanh
-
由 shentanyue 提交于
-
由 Nyakku Shigure 提交于
* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` * revert changes in fluid * revert unrelated file * revert changes in norm * revert changes in auto_parallel_amp * fix norm and auto_parallel_amp * revert a typo fix due to fixed at #47477
-
由 Ruibiao Chen 提交于
* [Auto Parallel] Improve the c++ dist attr * [Auto Parallel] Modify test_program.py * Support custom stream for standalone executor Co-authored-by: NYulong Ao <aoyulong@baidu.com>
-
由 xiongkun 提交于
* Einsum Support Complex * code fix * add unittest for complex grad with einsum * set rtol=1e-4 * fix
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `six` package (part2) * six.ensure_str * remove unused `import six` * remove six from BUILTIN_LIKELY_MODULES * remove six in example code * remove some decode * try to fix example code * fix MockEtcdClient get/get_prefix returns data type * fix MockEtcdClient get_prefix returns data * fix MockEtcdClient get returns data * remove `six` in pypi and conda requirements * fix MockEtcdClient add_watch_callback/add_watch_prefix_callback returns data type * refine MockEtcdClient
-
由 risemeup1 提交于
* 修改.gitigore文件,把ljd_sh文件忽略掉 * 取消ljd_sh文件忽略 * filter npu xpu mlu ipu file * filter xpu npu mlu ipu files * filter npu xpu mlu ipu file
-