- 27 Sep, 2020 4 commits
-
-
Committed by QingshuChen
* support elementwise add, activation, matmul on Baidu Kunlun * test=kunlun * minor * test=kunlun * reconstruct the xpu directory * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun * minor * test=kunlun
-
Committed by Jack Zhou
register log double grad kernel for cpu and cuda
-
Committed by Chengmo
* fix sgd/momentum/dpsgd/rmsprop error message
-
Committed by Chengmo
* fix test_dist_fleet_heter_ctr & performance update
-
- 26 Sep, 2020 2 commits
-
-
Committed by Zhong Hui
fix cpplint error for the atomic max/min
-
Committed by joanna.wozna.intel
-
- 25 Sep, 2020 6 commits
- 24 Sep, 2020 7 commits
-
-
Committed by ruri
-
Committed by Kaipeng Deng
* fix error message in conv/conv_transpose. test=develop
-
Committed by ruri
* refine err msg in shuffle channel op
-
Committed by Aurelius84
-
Committed by wanghuancoder
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
Committed by wangchaochaohu
* refine the error message for batch size like
-
Committed by Zhong Hui
Add GPU kernels for segment ops; support sum, max, min, mean
-
- 23 Sep, 2020 6 commits
-
-
Committed by Shang Zhizhou
* [bug fix]: Memory increases after adapting the cudnn version to 8 * [bug fix]: cudnnGetConvolutionForwardAlgorithm not defined
-
Committed by LutaoChu
The calculation result of Diag_v2 Op under large size input is wrong
-
Committed by tangwei12
* rename communicator meet->BatchesCounter * fix param recv for sparse * geo sparse init from pserver * optimize init from pserver * add large scale optimizer fuse(SGD/ADAM) * rectify init_worker and exe.run startup program
-
Committed by Chen Weihang
* polish no owner op error message * fix failing unittest * polish details based on reviewer comments
-
Committed by Zhang Ting
* add fused_bn_add_relu op
-
Committed by wangchaochaohu
-
- 22 Sep, 2020 4 commits
-
-
Committed by 123malin
* test=develop, update error message
-
Committed by wangchaochaohu
-
Committed by 石晓伟
-
Committed by Zhong Hui
Add the CPU version of the segment sum, mean, max, and min ops
-
- 21 Sep, 2020 8 commits
-
-
Committed by furnace
-
Committed by LutaoChu
* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
-
Committed by ceci3
* add double grad compute for batch norm, test=develop * fix unittest, test=develop * remove unused tensor, test=develop * add format, test=develop * update, test=develop
-
Committed by Shang Zhizhou
* fix bug in seqconv_eltadd_relu_fuse_pass: output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unit test error
-
Committed by Leo Chen
* support using add instead of sum for gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when several sum ops are inserted at the same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
Committed by LutaoChu
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
-
Committed by huangxu96
* Finished ChannelWiseQuantDequantAbsMaxOp and passed unit tests. * Finished channel-wise quantize strategy in imperative quantization. * Added CUDA code of ChannelWiseQuantDequantMaxAbsOp. * Add quant_axis for channel_wise quant. * Fixed a bug in the unit tests, which did not trigger the axis = 1 case and could not meet the coverage rate requirement. * Added some assert information and fixed some coding style mistakes.
-
Committed by MRXLT
* fix adam * rmsprop support double
-
- 18 Sep, 2020 2 commits
-
-
Committed by GaoWei8
* fix cudnn dyload error
-
-
- 17 Sep, 2020 1 commit
-
-
Committed by HappyAngel
* fix op print error info problem. test=develop * fix build error * fix format * fix error msg info * fix format
-