- 21 Sep, 2020 6 commits
-
-
Committed by ceci3
* add double grad compute for batch norm, test=develop * fix unittest, test=develop * remove unused tensor, test=develop * add format, test=develop * update, test=develop
-
Committed by Shang Zhizhou
* fix bug in seqconv_eltadd_relu_fuse_pass: output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unittest error
-
Committed by Leo Chen
* support using add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when several sum ops are inserted at the same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
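The idea of accumulating gradients with add rather than sum, named in the commit above, can be illustrated with a minimal sketch. This is not the framework's implementation: the function names and the use of plain Python lists are assumptions made for illustration. It contrasts an n-ary sum, which needs every partial gradient available at once, with a running in-place add into a single buffer.

```python
def accumulate_with_sum(grads):
    """Baseline: one n-ary sum over all partial gradients.

    Every partial gradient must be kept until the sum runs.
    """
    return [sum(column) for column in zip(*grads)]


def accumulate_with_add(grads):
    """Alternative: add each partial gradient into one buffer in place.

    Hypothetical helper; each partial gradient can be dropped
    as soon as it has been added to the buffer.
    """
    buffer = list(grads[0])  # copy so the caller's data is not modified
    for grad in grads[1:]:
        for i, value in enumerate(grad):
            buffer[i] += value
    return buffer


partial_grads = [[1.0, 2.0], [0.5, 0.5], [3.0, -1.0]]
print(accumulate_with_sum(partial_grads))
print(accumulate_with_add(partial_grads))
```

Both functions return the same totals; the difference lies in how many intermediate gradients have to stay alive at the same time.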
-
Committed by LutaoChu
Fix bug: shapes of Topk outputs are wrong when the parameter k is a Tensor
-
Committed by Wilber
-
Committed by huangxu96
* Finished ChannelWiseQuantDequantAbsMaxOp and passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added CUDA code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * Fixed a bug in unittests that did not trigger the axis = 1 case and so could not meet the coverage rate requirement. * Added some assert information and fixed some coding style mistakes.
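For readers unfamiliar with channel-wise abs-max quantize-dequantize and the role of `quant_axis`, the following is a rough sketch of the general technique on a 2-D nested list. It is an assumption-based illustration, not the operator added in this commit: the function name, the `bit_length` parameter, and the rounding behaviour are hypothetical, and the real kernel may differ.

```python
def channel_wise_quant_dequant_abs_max(x, bit_length=8, quant_axis=0):
    """Simulate quantization on a 2-D list, one scale per channel.

    quant_axis=0 treats each row as a channel, quant_axis=1 each column.
    Returns (dequantized values, per-channel scales).
    """
    if quant_axis not in (0, 1):
        raise ValueError("quant_axis must be 0 or 1")
    bound = (1 << (bit_length - 1)) - 1  # 127 for 8 bits
    rows, cols = len(x), len(x[0])
    if quant_axis == 0:
        scales = [max(abs(v) for v in row) for row in x]
    else:
        scales = [max(abs(x[r][c]) for r in range(rows)) for c in range(cols)]

    out = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            scale = scales[r] if quant_axis == 0 else scales[c]
            if scale == 0:
                out_row.append(0.0)  # all-zero channel stays zero
            else:
                q = round(x[r][c] / scale * bound)   # quantize to an integer level
                out_row.append(q * scale / bound)    # map back to float
        out.append(out_row)
    return out, scales


weights = [[1.0, -2.0], [0.5, 4.0]]
print(channel_wise_quant_dequant_abs_max(weights, quant_axis=0))
print(channel_wise_quant_dequant_abs_max(weights, quant_axis=1))
```

Calling it with both axis values shows why a test that only covers `quant_axis=0` leaves the other branch unexercised.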
-
- 20 Sep, 2020 1 commit
-
-
Committed by tangwei12
* fix fleet util and gloo * fix worker endpoints * fix * fix UT * fix gloo * fix gloo * update gloo * update gloo * update gloo * update gloo * update gloo * fix gloo wrapper for hdfs * add file gloo and UT * fix UT * fix UT * fix UT * hide public method of RoleMaker * fix UT * GPU fleetrun support gloo * parameterserver fleetrun support gloo * add UT * add UT * fix UT * fix get server endpoint * fix get server endpoint * fix UT * hide public method of rolemaker * hide public method of rolemaker * hide public method of rolemaker * Update test_fleet_rolemaker_new.py * hide public method of rolemaker * hide public method of rolemaker
-
- 18 Sep, 2020 6 commits
-
-
Committed by tangwei12
* fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint
-
Committed by liu zhengxi
-
Committed by Pei Yang
* pass compatibility * add compatibility registry * add unittests for different padding * add assert * drop errmsg
-
Committed by haozech
-
Committed by chajchaj
-
Committed by Zhen Wang
-
- 17 Sep, 2020 12 commits
-
-
Committed by liym27
* 1. Add env value to log to stdout; 2. Add logger name * Optimize log messages in dygraph-to-static * Replace logging.warn and warnings.warn with logging_utils.warn
-
Committed by Shang Zhizhou
-
Committed by furnace
and optimize the C++ implementation of the empty op per the PR#26659 reviews, and add bool for shape op.
-
Committed by 123malin
* test=develop, util documents
-
Committed by Jack Zhou
enhance reduce op so that it can reduce tensors of arbitrary rank
-
Committed by YUNSHEN XIE
-
Committed by YUNSHEN XIE
-
Committed by ShenLiang
* fix floordiv
-
Committed by guofei
* Refine the unittest to support py38 test=develop
-
Committed by Zhou Wei
-
Committed by Huihuang Zheng
Change ProgramTranslator.enable_declarative to ProgramTranslator.enable_to_static to meet API 2.0
-
Committed by wanghuancoder
* add op_function_generator.exe retry on Windows, test=develop * modify test_imperative_using_non_zero_gpu to use one GPU instead of two, test=develop
-
- 16 Sep, 2020 13 commits
-
-
Committed by Leo Chen
* remove objgraph * remove graphviz * fix ut
-
Committed by gongweibao
-
Committed by ShenLiang
* add adaptivelsgd * TODO: fix the code to avoid the conflict.
-
Committed by Yibing Liu
-
Committed by Chen Weihang
* support load state_dict from save_params/persistables * remove failed unittest * add load eof check & unittest * remove eof check
-
Committed by yaoxuefeng
-
Committed by ShenLiang
* fix error message
-
Committed by Chen Weihang
* add input_spec & output_spec for translated_layer * update error message
-
Committed by littletomatodonkey
-
Committed by YUNSHEN XIE
-
Committed by Zhen Wang
-
Committed by danleifeng
* fix port conflicts when launching multiple nodes in paddlecloud; test=develop * add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud; test=develop
-
Committed by chalsliu
-
- 15 Sep, 2020 2 commits
-
-
Committed by Shang Zhizhou
* optimize slice TRT plugin. This patch removes an unnecessary barrier for the data transfer of the needed offset, so the data transfer can overlap with GPU kernel execution. This patch also fixes the incorrect name of the slice plugin, replacing "layernorm" with "slice". test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op converter dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
Committed by Wilber
-