- 07 Jan, 2020 8 commits
-
-
Committed by Yibing Liu
* Fix the global_step & continuous applying error in EMA test=develop
* Fix for step 0 & add unit test, test=develop
-
Committed by zhupengyang
-
Committed by liym27
-
Committed by Chengmo
* fix grad clip, clip op belongs to Backward op when running in Parameter Server mode.
-
Committed by Feiyu Chan
* add erf op and python interface.
* add fp16 support for erf op.
* add unit tests for erf op and its python interface.
-
Committed by WangXi
-
Committed by Chen Weihang
-
Committed by zhupengyang
enhanced ops: conv2d, conv3d
elementwise_pow: change to a reasonable shape
-
- 06 Jan, 2020 17 commits
-
-
Committed by silingtong123
-
Committed by Double_V
* support elu activation double grad, test=develop
* delete the code commit in .cc, test=develop
* fix relu test unpass, test=develop
* add elu double grad kernel and unit test
* add calculate dX in elu double grad functor, test=develop
* update the commit code, test=develop
-
Committed by Pei Yang
* add gelu plugin
* align trt bert with gpu
* add support for fused fc with relu
* add unittest for bert trt
-
Committed by Jacek Czaja
-
Committed by Huihuang Zheng
-
Committed by Huihuang Zheng
-
Committed by Zeng Jinle
-
Committed by songyouwei
* add dygraph LayerList
* add unittest test=develop
* add newline test=develop
* typo fix test=develop
-
Committed by Adam
-
Committed by Qinghe JING
* fix dim=[] bug in reduce ops
-
Committed by zhongpu
-
Committed by silingtong123
-
Committed by Zeng Jinle
-
Committed by chengjuntao
-
Committed by zhupengyang
enhanced ops: unsqueeze, squeeze2, strided_slice, unsqueeze, unsqueeze2, var_conv_2d, spectral_norm, slice, match_matrix_tensor, nce, pad, pad_constant_like, filter_by_instag
-
Committed by zhupengyang
affine_channel, affine_grid, bilinear_interp, bilinear_tensor_product, clip, crop, crop_tensor, cumsum, data_norm, expand, expand_as
-
Committed by 123malin
* add distributed_strategy
-
- 05 Jan, 2020 2 commits
-
-
Committed by Jacek Czaja
-
Committed by Adam
-
- 04 Jan, 2020 7 commits
-
-
Committed by zhupengyang
flatten, flatten2, gather, gather_nd, group_norm, gru, gru_unit, reduce_max, reduce_min, reduce_mean, reduce_sum, reshape, scatter_nd_add, sequence_expand, sequence_expand_as, sequence_pad
-
Committed by liym27
* append optimize op in the grad block of current block if current block is in control flow. test=develop
* add conditional grad op when optimizer used in control flow. test=develop
* add comment and modify typo. test=develop
* fix append_backward to support control flow. test=develop
* add test. test=develop
* fix copy_var_to_parent_block and conditional_block_grad. test=develop
* fix bug: revert to append conditional_block_grad vars to sub grad block. test=develop
* fix bug: revert to assign var to parent block even if var already is in parent block
* fix bug: consider outputs is empty. test=develop
* move _rename_grad_ out. test=develop
* modify code according to reviews from Huihuang. test=develop
* modify code according to reviews from Jinle. test=develop
-
Committed by Qinghe JING
-
Committed by ShenLiang
-
Committed by Kaipeng Deng
-
Committed by Leo Zhao
-
Committed by Adam
-
- 03 Jan, 2020 6 commits
-
-
Committed by SunAhong1993
* register int/int64_t/float16 in pow/square kernel, test=develop
* add abs/square/exp type, test=develop
-
Committed by Leo Chen
* fix test_conv2d_ngraph for grad diff, test=develop
* register NoNeedBufferVarsInference for max_pool_grad_op, test=develop
* refine error message, test=develop
* fix numpy, test=develop
* disable test conv2d_ngraph_op, test=develop
Co-authored-by: Zhang Ting <709968123@qq.com>
-
Committed by Liufang Sang
* add shape description in accuracy api doc test=develop
* fix details test=develop
* fix details test=develop
-
Committed by Yiqun Liu
* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc. test=develop
* Call CUDA driver api to launch the kernel compiled by nvrtc. test=develop
* Disable for mac and windows. test=develop
* Refine the codes to support manually specified num_threads and workload_per_thread. test=develop
* Refine the CUDA kernel to support large dims. test=develop
* Add DeviceCodePool to manage all device codes.
* Add the first implementation fusion_group op.
* Add unit-test for fusion_group op.
* Add the check of result.
* Add the check of nvrtc in unit-test. test=develop
* Add comment to explain the inputs, outputs and features of fusion_group op. test=develop
* Disable fusion_group op for mac and windows. test=develop
* Make the compiling of device code return status instead of hanging up. test=develop
* Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
* Unify fusion_group_op's input and output names. test=develop
* Add the check of CUDA driver library in unittest. test=develop
* Refine the calling of PADDLE_ENFORCE. test=develop
-
Committed by Michał Gallus
-
Committed by Yibing Liu
* Fix example in log_loss doc, test=develop
* Use none for unknown dim, test=develop, test=document_fix
-