- 03 12月, 2021 6 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
由 Leo Chen 提交于
* use sync h2d copy * use stream safe allocator in memcpy_h2d * remove wait * add guard
-
由 WangXi 提交于
-
由 Zhanlue Yang 提交于
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen
-
由 Sing_chan 提交于
* add trt_mobilenet demo test * only deliver trt_root in run.sh
-
由 wanghuancoder 提交于
* refine a test case, test=develop * publish python c api for eager, test=develop * revert modify about test_allclose_layer.py, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * delete numpy includes, use pybind11 numpy.h, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * suport eager error msg, and add grad test case, test=develop * refine, test=develop * refine, test=develop
-
- 02 12月, 2021 11 次提交
-
-
由 Thunderbrook 提交于
-
由 YuanRisheng 提交于
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * refactor logic that transform variable to densetensor * update func name
-
由 Yuang Liu 提交于
-
由 seemingwang 提交于
-
由 LiYuRio 提交于
-
由 Leo Chen 提交于
-
由 wuhuanzhou 提交于
-
由 furnace 提交于
* [NPU] add int64 support for scatter op * [NPU] delete debug codes * [NPU] optimize codes
-
由 WangXi 提交于
-
由 wenbin 提交于
* first commit * more uts * file name duplicated * timeout * Update CMakeLists.txt change TIMEOUT from 120 to 240 * Update CMakeLists.txt more time * Update CMakeLists.txt timeout * Update CMakeLists.txt 60s
-
由 Zhanlue Yang 提交于
* Handled dispensable tensors in AutoCodeGen for Eager Dygraph * Enabled Eager Dygraph AutoCodeGen for 500+ existing ops
-
- 01 12月, 2021 11 次提交
-
-
由 sneaxiy 提交于
* fix inplace addto pass * update * fix ut * improve ci coverage * fix musl ci compile error
-
由 Leo Chen 提交于
-
由 Jiabin Yang 提交于
* optimizer __call__ to make dygraph faster * fix return type * remove cpp Layer
-
由 Yuang Liu 提交于
-
由 taixiurong 提交于
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostEventTracer * update HostEventTracer * fix c++17 * update * update * update * update * fix bug Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 TTerror 提交于
* add prior_box for kunlun * update * update CMakeLists
-
由 Jason 提交于
* fix fc_fuse * modify cmake notest,test=windows_ci * retrigger all the ci
-
由 Zhanlue Yang 提交于
-
由 Feiyu Chan 提交于
* add angle_op
-
由 Huihuang Zheng 提交于
Modify ShareTensorWithCinnBuffer by callback to save memory
-
- 30 11月, 2021 12 次提交
-
-
由 WangXi 提交于
-
由 Sylwester Fraczek 提交于
* refactoring matmul hierarchy * review fix * review fix * review_FIX-part2
-
由 Sławomir Siwek 提交于
* Add new unittests * Replace I with O channel for filter groups * Undo changes affecting other operators * Fix oneDNN namespace typo * Fix code format error
-
由 zhangbo9674 提交于
* add regularation and Nesterov for mergerd_momentum * refine unittest for use_nesterov attr * refine op check * refine code * fix bug * refine code of regularization_flag * delete useless code
-
由 Zhanlue Yang 提交于
* Refactored eager legacy namespace * Fixed namespace issues
-
由 Zhanlue Yang 提交于
* Enabled performance benchmark tests for Eager Dygraph * Protected CUDA tests with macro * Fixed dependency issues for windows-ci
-
由 zhaocaibei123 提交于
-
由 Chen Weihang 提交于
-
由 Yuang Liu 提交于
-
由 Guoxia Wang 提交于
* support data_format='NHWC' for prelu channel mode
-
由 Yang 提交于
-
由 LiYuRio 提交于
-