- 17 9月, 2019 1 次提交
-
-
由 翟飞跃 提交于
* Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop * tmp add fused_emb_seq layer * Add the support of padding_idx attribute. test=develop * add padding_idx support test=develop * implement grad refer lego test=develop
-
- 16 9月, 2019 4 次提交
-
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop * Enhance fc_fuse_pass to enable fusing relu. * Allow print the shapes of var_desc in graph. test=develop * Enhance fc_fuse_pass_tester. * Remove the use of PADDLE_ENFORCE. test=develop * Correct the number of ops after fusing. test=develop * Fix a typo. test=develop * Set activation_type to null when there is no relu in fc. test=develop * Refine fc_fuse_pass's codes. * Enable the set of shape for tensor. * Refine repeated_fc_relu_pass and add unittest. test=develop
-
由 zhongpu 提交于
* add kernel for squeeze_op, test=develop * delete comment, test=develop
-
由 zhongpu 提交于
* add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * add kernel for unstack_op, test=develop * adjust the code format, test=develop * modify some comment, test=develop
-
由 Kaipeng Deng 提交于
-
- 14 9月, 2019 1 次提交
-
-
由 Adam 提交于
test=develop
-
- 12 9月, 2019 2 次提交
-
-
由 Aurelius84 提交于
* add one_hot_v2_op to remove last_dims==1 test=develop * add api unittest code for CI_Coverage test=develop * improve CI_Coverage rate by adding test_with_depth test=develop
-
由 Jacek Czaja 提交于
test=develop - fix to BWD test=develop
-
- 11 9月, 2019 7 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
由 Zeng Jinle 提交于
* make leaky relu inplacable, test=develop * force add unittests to pass coverage, test=develop
-
由 Zeng Jinle 提交于
-
由 Jacek Czaja 提交于
test=develop - Cosmetic fixes test=develop
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop
-
由 Aurelius84 提交于
* Remove constraint that last dimension is forced to be 1 in huber_loss test=develop * add y[rank-1] == 1 when x_rank=y_rank test=develop * modify into contain_unknown_dim test=develop
-
由 Tao Luo 提交于
remove unused accuracy-diff warpctc-cudnn implementation test=develop
-
- 10 9月, 2019 4 次提交
-
-
由 Adam 提交于
* MKLDNN handler cleanup * MKLDNN handler cleanup test=develop
-
由 Zeng Jinle 提交于
-
由 wangguanzhong 提交于
* merge_empty_lod_tensor, test=develop * fix multiclass_nms, test=develop * refine API.spec, test=develop * add unittest case for fetch, test=develop * add lod tensor test, test=develop * return index for multiclass_nms, test=develop * add api for multiclass_nms2 * update API.spc, test=develop * refine api doc, test=develop * fix test_detection.py, test=develop * polish code, test=develop * add more unittest case, test=develop
-
由 yaoxuefeng 提交于
* fix instag op * fix instag bug: Some tiny logical error, occurring when ins_tag (2nd input) is multiple. test=develop
-
- 09 9月, 2019 2 次提交
-
-
由 zhongpu 提交于
* add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop
-
由 Tao Luo 提交于
* paddle::framework::vectorize() templatization test=develop * update pybind/imperative.cc test=develop * revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc test=develop
-
- 08 9月, 2019 1 次提交
-
-
由 hutuxian 提交于
fix cmakelist deps: remove unnecessary deps and add proper op deps
-
- 07 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove -Wmaybe-uninitialized warning test=develop * remove uninitialized op_handle_ in scale_loss_grad_op_handle.cc test=develop
-
- 06 9月, 2019 4 次提交
-
-
由 wangchaochaohu 提交于
-
由 chengduo 提交于
test=develop
-
由 wangchaochaohu 提交于
* test=develop codegen for fused elementwise operation * fix test=develop
-
由 Chen Weihang 提交于
* delete useless raw variables in Conv2D, test=develop * adjust the vars number in test_graph_wrapper to pass unittest, test=develop
-
- 05 9月, 2019 6 次提交
-
-
由 123malin 提交于
* test=develop, communicator merge add => merge average
-
由 tangwei12 提交于
-
由 Tao Luo 提交于
* remove assert.h * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE test=develop * fix tensorrt paddle_enforce test=develop
-
由 Leo Chen 提交于
-
由 Zeng Jinle 提交于
* enable inplace for affine_channel op, dropout op, test=develop * remove dropout inplace for ngraph fails, test=develop
-
由 Tao Luo 提交于
test=develop
-
- 04 9月, 2019 7 次提交
-
-
由 danleifeng 提交于
elementwise broadcast function enhancement
-
由 Chen Weihang 提交于
Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605) * add extra error message hint in optimizer ops * polish format & delete useless change, test=develop * extract init judue from shape compare, test=develop
-
由 zhongpu 提交于
* add kernel for flatten_op, test=develop * add kernel for flatten_op, test=develop * fix the license and remove redundant code, test=develop
-
由 Tao Luo 提交于
test=develop
-
由 ShenLiang 提交于
* add scatter_nd op, test=document_preview test=develop * fixed the document, test=document_preview test=develop * modify the notes, test=document_preview test=develop * remove the ShareDataWith, test=develop
-
由 wawltor 提交于
* test=develop Fix the scatter op bug when use the add mode, and support the int64 data type of scatter_op Index(#18804). * test=develop Remove the PADDLE_ENFORCE and use PADDLE_ENFORCE_EQ * test=develop Remove the fix bug of scatter_add, and just add the support of int64 in scatter_add * test=develop Add the test case for scatter op, the test case just for index int64
-
由 Adam 提交于
test=develop
-