- 01 12月, 2021 14 次提交
-
-
由 Jiabin Yang 提交于
* optimizer __call__ to make dygraph faster * fix return type * remove cpp Layer
-
由 Yuang Liu 提交于
-
由 taixiurong 提交于
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * update HostEventTracer * update HostEventTracer * fix c++17 * update * update * update * update * fix bug Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 TTerror 提交于
* add prior_box for kunlun * update * update CMakeLists
-
由 Jason 提交于
* fix fc_fuse * modify cmake notest,test=windows_ci * retrigger all the ci
-
由 Zhanlue Yang 提交于
-
由 Sing_chan 提交于
-
由 Jiabin Yang 提交于
* optimizer __call__ to make dygraph faster * fix return type
-
由 Feiyu Chan 提交于
* add angle_op
-
由 tianshuo78520a 提交于
Delete pylint for ci
-
由 Guanghua Yu 提交于
-
由 Tao Luo 提交于
-
由 Huihuang Zheng 提交于
Modify ShareTensorWithCinnBuffer by callback to save memory
-
- 30 11月, 2021 20 次提交
-
-
由 WangXi 提交于
-
由 Sylwester Fraczek 提交于
-
由 Sylwester Fraczek 提交于
* refactoring matmul hierarchy * review fix * review fix * review_FIX-part2
-
由 chentianyu03 提交于
-
由 Sławomir Siwek 提交于
* Add new unittests * Replace I with O channel for filter groups * Undo changes affecting other operators * Fix oneDNN namespace typo * Fix code format error
-
由 zhangbo9674 提交于
* add regularation and Nesterov for mergerd_momentum * refine unittest for use_nesterov attr * refine op check * refine code * fix bug * refine code of regularization_flag * delete useless code
-
由 xiayanming 提交于
* [Auto Parallel] elastic support auto parallel re-launch * [Auto Parallel] elastic support auto parallel re-launch * fix ci issue * fix ci issue * fix rank mapping unittest * fix rank mapping unittest * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue * fix ci issue
-
由 Zhanlue Yang 提交于
* Refactored eager legacy namespace * Fixed namespace issues
-
由 Zhanlue Yang 提交于
* Enabled performance benchmark tests for Eager Dygraph * Protected CUDA tests with macro * Fixed dependency issues for windows-ci
-
由 zhaocaibei123 提交于
-
由 andyjpaddle 提交于
* add diff op, test=develop * rm some notes, test=develop * update diff doc * update sample code * fix diff api params and example code, test=develop
-
由 Chen Weihang 提交于
-
由 Sing_chan 提交于
* modify for wincheck-inference case * modify according to zhouwei's comment * open with_trt and block failed unittests in windows * test
-
由 Yuang Liu 提交于
-
由 Guoxia Wang 提交于
* support data_format='NHWC' for prelu channel mode
-
由 Yang 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the unified cluster representation * [Auto Parallel] Add the graph class for physical mapping * [Auto Parallel] Add the simple physical mapper * Set the timeout of the mapper * Merge the upstream develop unittests cmake files * Fix a bug of the process group * Remove mapper unittest from platforms which is not GPU * Move the instantiation of process group after resharding * Add the local id for devices * Update the rank mapping format * Add some comments * Remove the related files about mapping * Update the unittest for auto mapping * Remove unused rank_mapping unittest * Improve the unittest coverage * Improve the unittest coverage
-
由 LiYuRio 提交于
-
由 Chen Weihang 提交于
-
由 xiongkun 提交于
* add scope_guard * 1. fix control flow cases 2. fix calc_gradient
-
- 29 11月, 2021 6 次提交
-
-
由 Zhanlue Yang 提交于
-
由 taixiurong 提交于
-
由 chentianyu03 提交于
* add pten reduce kernel * add reduce_sum kernel * update attribute args and order * make out dtype undefined * fix empty input error * merge develop branch * rename sum as reduce function * rename sum as reduce function * fix reducekernelImpl args error * add reduce cuda kernel * modify dims type to const & * remove unsed log * fix reduce_all out eigen function error * remove unused codes * add the missing sum api define and testcase * merge develop branch * fix sum test axis value error * replace pten mean kernel with reduce_mean * revcover meam cuda to original implement
-
由 TTerror 提交于
* add expand_v2/expand_as_v2 for kunlun * update expand_as_v2 * update expand_as_v2 * support float16/bool * update xpu.cmake
-
由 wanghuancoder 提交于
-
由 piotrekobiIntel 提交于
-