- 22 9月, 2020 1 次提交
-
-
由 Pei Yang 提交于
-
- 21 9月, 2020 10 次提交
-
-
由 furnace 提交于
-
由 LutaoChu 提交于
* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
-
由 ceci3 提交于
* add double grad compute for batch norm,test=develop * fix unittest, test=develop * remove unuse tensor,test=develop * add format,test=develop * update, test=develop
-
由 Shang Zhizhou 提交于
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unitest error
-
由 Leo Chen 提交于
* support use add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when sereral sum ops inserts at same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
由 LutaoChu 提交于
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
-
由 Wilber 提交于
-
由 huangxu96 提交于
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.
-
由 Leo Chen 提交于
* refine error msg in var_handle.h, test=develop * refine all_reduce_op_handle * fix some error msg * refine variable_visitor * refine threaded_ssa_graph_executor * refine inplace related files * refine executor related files * refine fetch_op_handle.cc * fix bug * follow comments
-
由 MRXLT 提交于
* fix adam * rmsprop support double
-
- 18 9月, 2020 7 次提交
-
-
由 tangwei12 提交于
* fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint
-
由 Pei Yang 提交于
-
由 yaoxuefeng 提交于
-
由 Pei Yang 提交于
* pass compatibility * add compatibility registry * add unittests for different padding * add assert * drop errmsg
-
由 haozech 提交于
-
由 GaoWei8 提交于
* fix cudnn dyload error
-
-
- 17 9月, 2020 8 次提交
-
-
由 HappyAngel 提交于
* fix op print error info problem. test=develop * fix build error * fix format * fix error msg info * fix format
-
由 Shang Zhizhou 提交于
-
由 furnace 提交于
and optimize the c++ implmentation of empty op as PR#26659 reviews, and add bool for shape op.
-
由 Yi Liu 提交于
paddle/fluid/operators/distributed_ops OP报错信息优化
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
由 lilong12 提交于
-
由 ShenLiang 提交于
* fix floordiv
-
由 Zhou Wei 提交于
-
- 16 9月, 2020 10 次提交
-
-
由 ShenLiang 提交于
* add adaptivelsgd * Todo fix the code to avoid the conflict.
-
由 Jack Zhou 提交于
Error description optimize for the math dir
-
由 Zhou Wei 提交于
fix judge cache file of inference api more accurate
-
由 Jacek Czaja 提交于
test=develop
-
由 ShenLiang 提交于
* fix error message
-
由 Chen Weihang 提交于
* polish framework error message part 7 * fix typo * polish by reviewes comment
-
由 wawltor 提交于
update the error message check for the some ops
-
由 wawltor 提交于
add the error message check for the some operator
-
由 Wilber 提交于
-
由 Leo Chen 提交于
-
- 15 9月, 2020 4 次提交
-
-
由 Shang Zhizhou 提交于
* optimize slice TRT plugin This patch removes unnecessary barrier for data transfer of needed offset, so data transfer can be overlap with GPU kernel execution. This patch also fixes incorrect name of slice plugin. That is, replaces "layernorm" with "slice" test=develop * add serialize/deserialize to slice plugin * add static shape slice trt plugin * fix slice trt op convertor dynamic shape bug * fix format by clang-format * fix pylint format error * fix problems commented by peiyang Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
由 Wilber 提交于
-
由 石晓伟 提交于
-
由 Chen Weihang 提交于
* polish framework error msg part 6 * polish lossed item * fix failed unittest * polish by reviewer comments
-