- 13 7月, 2020 5 次提交
-
-
由 tangwei12 提交于
* test_dist_fleet_ctr disable, test=develop
-
由 liym27 提交于
[while grad]Support pruning op in find_op_path about while sub-block when appending backward (#25330) Prune OPs which are not related with loss in while sub-block when constructing backward OP path.
-
由 yaoxuefeng 提交于
-
由 Huihuang Zheng 提交于
Add Similarity Net as unit test. During the unit test, we found three problems: 1. The run_program_op has memory optimization error when running dy2stat net multiple times. 2. The support for SelectedRows can cause problem in dy2stat. 3. The return grammar has problem. This PR fixes the 1. problem but modify codes for the 2. 3. problems to make PR smaller. I will fix those two problems in the next PR(s)
-
由 yaoxuefeng 提交于
-
- 12 7月, 2020 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 11 7月, 2020 2 次提交
-
-
由 Zhen Wang 提交于
* Add the imperative quantization aware training. * This is the python part of Imperative QAT. test=develop
-
由 Chen Weihang 提交于
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 7月, 2020 6 次提交
-
-
由 wangchaochaohu 提交于
-
由 zlsh80826 提交于
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-
由 Chen Weihang 提交于
* polish pe exception process logic, test=develop * fix unittest, test=develop * add unittests, test=develop
-
由 Zhou Wei 提交于
* fix optimizer.state_dict and LRScheduler.state_dict to save/load dygraph,test=develop * fix optimizer.state_dict and LRScheduler.state_dict to save/load dygraph,test=develop * Add a judgment that state_dict/set_dict is used incorrectly,test=develop * fix some doc error,test=develop * fix current_step_lr for _LearningRateEpochDecay,test=develop * remove some unsed code to improve coverage,test=develop * remove some unsed code to improve coverage,test=develop
-
由 Jeng Bai-Cheng 提交于
Use vector instruction (LDG.128) to improve qkv transpose. It provides 1.4X speedup at same GPU base frequency. test=develop
-
由 zhupengyang 提交于
-
- 09 7月, 2020 9 次提交
-
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
remove useless property
-
由 Zhou Wei 提交于
add new API:LambdaDecay
-
由 tianshuo78520a 提交于
-
由 Leo Chen 提交于
* attempt to resolve tls problem, test=develop * add glibc version check, test=develop * fix regex, test=develop * refine get_libc_ver, test=develop * refine get_libc_ver, test=develop
-
由 Jacek Czaja 提交于
test=develop
-
由 Chen Weihang 提交于
-
由 Zhen Wang 提交于
-
由 wangchaochaohu 提交于
-
- 08 7月, 2020 12 次提交
-
-
由 hong 提交于
* fix optimizer parameter is a iterator; test=develop * fix parameter list None bug; test=develop * use is not None; test=develop * change list to iterable; test=develop
-
由 Leo Chen 提交于
* refine as_lodtensor, test=develop * fix test, test=develop * add unittest, test=develop * handle nested_list, test=develop * handle nested_list, test=develop
-
由 Pei Yang 提交于
-
由 Jacek Czaja 提交于
-
由 GaoWei8 提交于
* fix concat shape error test=develop
-
由 jzhang533 提交于
test=develop test=document_fix
-
由 Dong Daxiang 提交于
test=develop
-
由 Kaipeng Deng 提交于
* fix test_multiprocess_dataloader_exception failed on CPU only version. test=develop
-
由 Leo Chen 提交于
* clean __str__ of VarBase and ParamBase, test=develop * clean to_string, test=develop * update unittest, test=develop
-
由 WangXi 提交于
-
由 ceci3 提交于
* fix bn & in in dy, test=develop * update instance_norm,test=develop * fix bugs,test=develop * add more case in unittest,test=develop * fix,test=develop * fix,test=develop
-
- 07 7月, 2020 5 次提交
-
-
由 Aurelius84 提交于
-
由 lilong12 提交于
* add device_worker for pipeline, test=develop
-
由 zhangchunle 提交于
-
由 hong 提交于
* cat bad alloc exception; test=develop * add unitest; test=develop * move bad alloc catch to the first place; test=develop * polish error message; test=develop * polish error message; test=develop * add mutex header; test=develop
-
由 Huihuang Zheng 提交于
As the title
-