- 24 9月, 2020 8 次提交
-
-
由 Kaipeng Deng 提交于
* fix error message in conv/conv_transpose. test=develop
-
由 Wilber 提交于
-
由 ruri 提交于
* refine err msg in shuffle channel op
-
由 Aurelius84 提交于
-
由 Shibo Tao 提交于
* fix tensorrt 6 build error. test=develop * fix. test=develop * bug fix * test=develop
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
由 wangchaochaohu 提交于
* refine the error message for bath size like
-
由 Zhong Hui 提交于
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
-
- 23 9月, 2020 12 次提交
-
-
由 YUNSHEN XIE 提交于
-
由 Shang Zhizhou 提交于
* [bug fix]:Memory increases after adapting the cudnn version to 8 * [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
-
由 Zhou Wei 提交于
* Makes the Bind Method more intelligent * Makes the Bind Method more intelligent * fix unittest * fix unittest * fix conflict
-
由 LutaoChu 提交于
The calculation result of Diag_v2 Op under large size input is wrong
-
由 tangwei12 提交于
* rename communicator meet->BatchesCounter * fix parame recv for sparse * geo sparse init from pserver * optimize init from pserver * add large scale optimizer fuse(SGD/ADAM) * rectification init_worker and exe.run startup program
-
由 Qi Li 提交于
-
由 Zhou Wei 提交于
-
由 Chen Weihang 提交于
* polish no onwer op error message * fix unittest failed * polish details based reviewer comment
-
由 Zhang Ting 提交于
* add fused_bn_add_relu op
-
由 Wilber 提交于
-
由 Chen Weihang 提交于
* polish some lost error msg * add some math file to white list * polish detail based reviewer commnet
-
由 wangchaochaohu 提交于
-
- 22 9月, 2020 7 次提交
- 21 9月, 2020 10 次提交
-
-
由 furnace 提交于
-
由 LutaoChu 提交于
* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
-
由 ceci3 提交于
* add double grad compute for batch norm,test=develop * fix unittest, test=develop * remove unuse tensor,test=develop * add format,test=develop * update, test=develop
-
由 Shang Zhizhou 提交于
* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0 * fix seqconv_eltadd_relu_fuse_pass unitest error
-
由 Leo Chen 提交于
* support use add instead of sum to do gradient accumulation * add inplace addto pass * add grad_add op and inplace addto pass * remove debug code * code refine * fix bug when sereral sum ops inserts at same op_idx * fix Flags type * add addto attribute for conv3d * fix ut * code clean * fix type
-
由 LutaoChu 提交于
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
-
由 Wilber 提交于
-
由 huangxu96 提交于
* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests. * Finished channel-wise quantize strategy in imperative quantization. * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP Add Cuda code of ChannelWiseQuantDequantMaxAbsOp * Add quant_axis for channel_wise quant. * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement. * Added some assert infomation and fixed some coding style mistakes.
-
由 Leo Chen 提交于
* refine error msg in var_handle.h, test=develop * refine all_reduce_op_handle * fix some error msg * refine variable_visitor * refine threaded_ssa_graph_executor * refine inplace related files * refine executor related files * refine fetch_op_handle.cc * fix bug * follow comments
-
由 MRXLT 提交于
* fix adam * rmsprop support double
-
- 18 9月, 2020 3 次提交
-
-
由 tangwei12 提交于
* fix worker endpoints * fix gloo wrapper for hdfs * GPU fleetrun support gloo * parameterserver fleetrun support gloo * fix get server endpoint
-
由 Pei Yang 提交于
-
由 yaoxuefeng 提交于
-