- 24 9月, 2020 1 次提交
-
-
由 Zhong Hui 提交于
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
-
- 23 9月, 2020 6 次提交
-
-
由 Shang Zhizhou 提交于
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
由 LutaoChu 提交于
The calculation result of Diag_v2 Op under large size input is wrong
-
由 tangwei12 提交于
* rename communicator meet->BatchesCounter * fix parame recv for sparse * geo sparse init from pserver * optimize init from pserver * add large scale optimizer fuse(SGD/ADAM) * rectification init_worker and exe.run startup program
-
由 Chen Weihang 提交于
* polish no onwer op error message * fix unittest failed * polish details based reviewer comment
-
由 Zhang Ting 提交于
* add fused_bn_add_relu op
-
由 wangchaochaohu 提交于
-
- 22 9月, 2020 4 次提交
-
-
由 123malin 提交于
* test=develop, update error message
-
由 wangchaochaohu 提交于
-
由 石晓伟 提交于
-
由 Zhong Hui 提交于
Add the cpu version of segment sum mean max min op
-
- 21 9月, 2020 8 次提交
-
-
由 furnace 提交于
-
由 LutaoChu 提交于
* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
-
由 ceci3 提交于
* add double grad compute for batch norm,test=develop * fix unittest, test=develop * remove unuse tensor,test=develop * add format,test=develop * update, test=develop
-
由 Shang Zhizhou 提交于
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unitest error
-
由 Leo Chen 提交于
* support use add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when sereral sum ops inserts at same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
由 LutaoChu 提交于
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
-
由 huangxu96 提交于
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.
-
由 MRXLT 提交于
* fix adam * rmsprop support double
-
- 18 9月, 2020 2 次提交
-
-
由 GaoWei8 提交于
* fix cudnn dyload error
-
-
- 17 9月, 2020 5 次提交
-
-
由 HappyAngel 提交于
* fix op print error info problem. test=develop * fix build error * fix format * fix error msg info * fix format
-
由 furnace 提交于
and optimize the c++ implmentation of empty op as PR#26659 reviews, and add bool for shape op.
-
由 Yi Liu 提交于
paddle/fluid/operators/distributed_ops OP报错信息优化
-
由 Jack Zhou 提交于
enhance reduce op which can reduce tensor with arbitrary rank
-
由 ShenLiang 提交于
* fix floordiv
-
- 16 9月, 2020 5 次提交
-
-
由 Jack Zhou 提交于
Error description optimize for the math dir
-
由 Jacek Czaja 提交于
test=develop
-
由 ShenLiang 提交于
* fix error message
-
由 wawltor 提交于
update the error message check for the some ops
-
由 wawltor 提交于
add the error message check for the some operator
-
- 15 9月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* replace sequence length attr to input
-
- 14 9月, 2020 7 次提交
-
-
由 joanna.wozna.intel 提交于
-
由 lilong12 提交于
* improve err report, test=develop
-
由 Zhong Hui 提交于
Enhance the error messages for files in operators/math
-
由 Pei Yang 提交于
-
由 Zhen Wang 提交于
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240) * update amp_check_finite_and_scale_op for static_amp. * use amp_check_finite_and_scale in static graph amp. * update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op). * add update_loss_scaling op in cpp. * add update_loss_scaling_op unit test. * update the doc of the check_finite_and_unscale op * Update the process of gradients updating skipping if the gradients have infinite values. * update the way to zero grads. * update test_update_loss_scaling_op.py * add log info when find infinite grads. * add the unit test for UpdateLossScaling Layer.
-
由 Adam 提交于
* Add int8 GRU kernel with UTs * Lint fixes * More lint fixes
-
由 Jack Zhou 提交于
Error description optimize for math dir
-
- 13 9月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-