- 14 Jul, 2020 1 commit
-
-
Committed by Pei Yang
* add api to clear intermediate tensors in analysis predictor. test=develop * add python api. test=develop
-
- 13 Jul, 2020 11 commits
-
-
Committed by zhangchunle
-
Committed by YUNSHEN XIE
-
Committed by leesusu
-
Committed by zhangchunle
-
Committed by Kaipeng Deng
* make default_collate_fn visible. test=develop
-
Committed by Zhen Wang
-
Committed by tangwei12
* test_dist_fleet_ctr disable, test=develop
-
Committed by liym27
[while grad] Support pruning op in find_op_path about while sub-block when appending backward (#25330) Prune OPs that are unrelated to the loss in the while sub-block when constructing the backward OP path.
-
Committed by yaoxuefeng
-
Committed by Huihuang Zheng
Add Similarity Net as unit test. During the unit test, we found three problems: 1. The run_program_op has a memory optimization error when running a dy2stat net multiple times. 2. The support for SelectedRows can cause problems in dy2stat. 3. The return grammar has a problem. This PR fixes problem 1 and only modifies the code for problems 2 and 3, to keep the PR smaller. I will fix those two problems in the next PR(s).
-
Committed by yaoxuefeng
-
- 12 Jul, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 11 Jul, 2020 2 commits
-
-
Committed by Zhen Wang
* Add the imperative quantization aware training. * This is the python part of Imperative QAT. test=develop
-
Committed by Chen Weihang
* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop * replace old macro & for condition, test=develop * polish details, test=develop
-
- 10 Jul, 2020 6 commits
-
-
Committed by wangchaochaohu
-
Committed by zlsh80826
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-
Committed by Chen Weihang
* polish pe exception process logic, test=develop * fix unittest, test=develop * add unittests, test=develop
-
Committed by Zhou Wei
* fix optimizer.state_dict and LRScheduler.state_dict to save/load dygraph,test=develop * fix optimizer.state_dict and LRScheduler.state_dict to save/load dygraph,test=develop * Add a judgment that state_dict/set_dict is used incorrectly,test=develop * fix some doc error,test=develop * fix current_step_lr for _LearningRateEpochDecay,test=develop * remove some unused code to improve coverage,test=develop * remove some unused code to improve coverage,test=develop
-
Committed by Jeng Bai-Cheng
Use vector instruction (LDG.128) to improve qkv transpose. It provides a 1.4X speedup at the same GPU base frequency. test=develop
-
Committed by zhupengyang
-
- 09 Jul, 2020 9 commits
-
-
Committed by Chen Weihang
-
Committed by Chen Weihang
remove useless property
-
Committed by Zhou Wei
add new API:LambdaDecay
-
Committed by tianshuo78520a
-
Committed by Leo Chen
* attempt to resolve tls problem, test=develop * add glibc version check, test=develop * fix regex, test=develop * refine get_libc_ver, test=develop * refine get_libc_ver, test=develop
-
Committed by Jacek Czaja
test=develop
-
Committed by Chen Weihang
-
Committed by Zhen Wang
-
Committed by wangchaochaohu
-
- 08 Jul, 2020 10 commits
-
-
Committed by hong
* fix optimizer parameter is an iterator; test=develop * fix parameter list None bug; test=develop * use is not None; test=develop * change list to iterable; test=develop
-
Committed by Leo Chen
* refine as_lodtensor, test=develop * fix test, test=develop * add unittest, test=develop * handle nested_list, test=develop * handle nested_list, test=develop
-
Committed by Pei Yang
-
Committed by Jacek Czaja
-
Committed by GaoWei8
* fix concat shape error test=develop
-
Committed by jzhang533
test=develop test=document_fix
-
Committed by Dong Daxiang
test=develop
-
Committed by Kaipeng Deng
* fix test_multiprocess_dataloader_exception failed on CPU only version. test=develop
-
Committed by Leo Chen
* clean __str__ of VarBase and ParamBase, test=develop * clean to_string, test=develop * update unittest, test=develop
-