- 25 2月, 2021 1 次提交
-
-
由 Leo Chen 提交于
refactor npu device manager (#31154)
-
- 23 2月, 2021 1 次提交
-
-
由 liym27 提交于
* [NPU] Support executor with NPU * Fix code according to reviews * Fix code * Add unittest for sub op npu
-
- 20 2月, 2021 1 次提交
-
-
由 gongweibao 提交于
Fix reshape on GE graph
-
- 18 2月, 2021 1 次提交
-
-
由 xiayanming 提交于
support parsing ascend rank table file
-
- 08 2月, 2021 1 次提交
-
-
由 gongweibao 提交于
Destroy session first.
-
- 01 2月, 2021 2 次提交
-
-
由 gongweibao 提交于
Add paddle ascend distribution training supported
-
由 OleNet 提交于
Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797) Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug
-
- 29 1月, 2021 2 次提交
-
-
由 dingsiyu 提交于
Merge ascend_optimizer and ascend_parser.
-
由 gongweibao 提交于
code style
-
- 25 1月, 2021 1 次提交
-
-
由 Void Main 提交于
[Feature] Build parser to support distributed training
-
- 22 1月, 2021 2 次提交
-
-
由 gongweibao 提交于
cleanup test_ascend_group.py
-
由 gongweibao 提交于
Add startup bash files of test_ascend_group
-
- 21 1月, 2021 4 次提交
-
-
由 gongweibao 提交于
Add Hccl program group
-
由 gongweibao 提交于
Pass device_ids info from launch to trainer
-
由 Void Main 提交于
Build praser for Hcom* operators
-
由 gongweibao 提交于
Add distribution supported
-
- 15 1月, 2021 3 次提交
- 14 1月, 2021 6 次提交
-
-
由 taixiurong 提交于
-
由 Zhou Wei 提交于
-
由 Jiaqi Liu 提交于
* add auc into 'all' list * alias acc, expose to users * update sample code
-
由 Chengmo 提交于
Co-authored-by: NseiriosPlus <tangwei12@baidu.com>
-
由 123malin 提交于
* test=develop, add distributed_infer
-
由 Chen Weihang 提交于
-
- 13 1月, 2021 11 次提交
-
-
由 WeiXin 提交于
-
由 Huihuang Zheng 提交于
As the title
-
由 cc 提交于
* skip quantizing ops in cpu inference, test=develop
-
由 Bai Yifan 提交于
-
由 huangxu96 提交于
-
由 Huihuang Zheng 提交于
As the title
-
由 Leo Chen 提交于
Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338) * set expected place in child thread for dataloader * set device id when set tensor from numpy * revert tensor_py change * add compile guard * fix ci * fix bug
-
由 QingshuChen 提交于
* optimize memcpy perf for kunlun * remove useless unitest for kunlun mean * minor
-
由 cnn 提交于
-
由 huangxu96 提交于
* Implemented AddQuantDequantPass in imperative quantization. * Supported LeakyReLU Quantization * For meeting coverage rate. * Changed the file name of test of AddQuantDequant * Implemented more Quantized NoWeightLayers. * Fix the loss cannot align problem between static and dynamic model quantization, add swish as supported quantized layer in imperative quantization. * remove noweight_list * support 2.0 API such as Pool2D and ReLu
-
由 ShenLiang 提交于
-
- 12 1月, 2021 4 次提交
-
-
由 JZ-LIANG 提交于
-
由 lidanqing 提交于
-
由 Wojciech Uss 提交于
* upgrade oneDNN version to 2.0 master branch * - Added workarounds for new lib onednn change * fix regex Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
-
由 tangwei12 提交于
* add sparse embedding & load vars for 2.0 Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b * fix hdfs gloo Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6 * fix gloo hdfs Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e * move loadvar/sparse embedding from incubute to static Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0
-