- 30 Aug, 2019 6 commits
-
-
Committed by Jacek Czaja
- Refactor step 1 - Compilation fix - Yet another compilation fix - Even more compilation fix - Lint fixes test=develop - Removed deprecated PADDLE_ENFORCE occurrence test=develop - Candidate fix to BN forward - Lint fixes test=develop - Refactoring in data_layout_transform - compilation fix - Another compilation fix - Step further into darkness - Yet another compilation fix - Yet another compilation fix - missing header - compilation fix - Added MKLDNN -> Paddle conversion in fetch op test=develop - Compilation fix test=develop - Lint test=develop - Mul fix - Fix to MKLDNN MUL op and Elementwise MUL UT test=develop - Workaround for different weights with groups representation Paddle vs MKL-DNN. test=develop - Candidate fix for 5D convolution with groups - Refactor of fix for conv3d and conv2d in fetch op test=develop - Compilation fix - Still same compilation fix - Compilation fix - Compilation fix - Reverted refactoring of fixes - Adapted test_conv2d_int8_mkldnn so it expects data in NCHW format not NHWC test=develop - minor fix in UT test=develop - Lint fixes test=develop
-
Committed by GaoWei8
* Modify the dropout op to multi-thread test=develop * define parallel test=develop
-
Committed by Huihuang Zheng
test=develop
-
Committed by Liufang Sang
-
Committed by Aurelius84
* add topk_avg_pooling * refine api doc and modify api.spec test=develop
-
Committed by Tao Luo
test=develop
-
- 29 Aug, 2019 4 commits
-
-
Committed by lidanqing
test=develop
-
Committed by Zeng Jinle
-
Committed by Zeng Jinle
-
Committed by chengduo
* increase num_iteration_per_drop_scope test=develop * Fix bug of while_op test=develop * fix bug of whileOp test=develop
-
- 28 Aug, 2019 3 commits
-
-
Committed by Double_V
Fix row_conv_op so that it supports both LoDTensor and Tensor input, test=develop (#19412) Support Tensor input for row_conv_op
-
Committed by tangwei12
* fix correctness of the communicator * fix a bug in send thread when sending var context is empty, test=develop * add lookup_table_prefetch_op and prefetch optimization, test=develop * remove GPU support for remote prefetch * force word2vec to run on CPU, test=develop * force the dist remote lookup table test to run on CPU, test=develop
-
Committed by baojun
* update for multiple threading test=develop * remove PADDLE_ENFORCE test=develop
-
- 27 Aug, 2019 3 commits
-
-
Committed by Yi Liu
* supports multiple NCCL communicators preserved in NCCLCommContext test=develop * add ut for c_comm_init_all operator and fix cuda resource release problem test=develop
-
Committed by Huihuang Zheng
-
Committed by vincentXiyu
* support tensor input with padding for warpctc op * merge with develop * test=develop * modified python API examples test=develop * nn.py is modified for code coverage test=develop * update documents info about warpctc op in API.spec test=develop * add test_warpctc_with_padding in test_layers test=develop * add warning log for cuda_version back to warpctc_op.cc * modify API.spec for warpctc op test=develop * modify API.spec * update warpctc test to new CompiledProgram API test=develop * modify code examples for warpctc op test=develop * modify API.spec for warpctc op test=develop * modify API.spec for warpctc op test=develop
-
- 26 Aug, 2019 2 commits
-
-
Committed by Huihuang Zheng
-
Committed by tangwei12
* fix sync mode hang in transpiler * remove sync mode in send/recv * replace PADDLE_ENFORCE with PADDLE_ENFORCE_NE
-
- 22 Aug, 2019 3 commits
-
-
Committed by 翟飞跃
* Implement the operator with sparse matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implementation on non-Linux platforms. test=develop * optimize bp with mkl sparse matrix test=develop
-
Committed by Leo Chen
* add pybind interface to get all inplace ops, test=develop * enhance OpTest to check the consistency of operators with and without inplace, test=develop * handle corner cases in op_test, test=develop * support outputs without tensor holder_, like XShape in reshape_op, test=develop * fix bug: some ops have a GradOpMaker but actually no grad_op in OpInfoMap, test=develop * use reshape_grad instead of reshape in FlattenGradOp, test=develop * fix erroneous debug dims info for variables like XShape, test=develop * change computational order in sum_op to reduce computation differences when using inplace, test=develop * add inplace_atol to check group_norm, and skip inplace_grad for mkldnn, test=develop * follow sneaxiy's comments, test=develop * remove unused DefaultGradOpDescMaker in mkldnn op, test=develop
-
Committed by Aurelius84
* add diag init in Uniform_random op test=develop * modify api.spec test=develop * fix unform_batch_size_like maker test=develop * add diag_num and diag_step assert check test=develop
-
- 21 Aug, 2019 2 commits
-
-
Committed by Adam
* Add generalized Conv+Activation MKLDNN fuse pass creation Part2 test=develop * Undefined behaviour of GetAttrIfExists<> FIX test=develop
-
Committed by wangguanzhong
-
- 20 Aug, 2019 3 commits
-
-
Committed by zhaoyuchen2018
For small cases, using a 1D block is better than a 2D block. Refer to this issue: #19275
-
Committed by Yihua Xu
* Implement the operator with sparse matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implementation on non-Linux platforms. test=develop * Ignore the deprecated status for windows test=develop
-
Committed by wangchaochaohu
* cuda optimization for dropout * remove tmp swp file * fix compile error test=develop * test=develop optimize the cuda implementation of dropout op * remove unused code test=develop * remove tmp file test=develop
-
- 19 Aug, 2019 6 commits
-
-
Committed by Zhaolong Xing
* fix mask rcnn bug: 1. affine channel fuse (diff) 2. condition block op (memory leak) 3. merge lod tensor op (diff) 4. memory optim (diff) test=develop * fix ci about PADDLE_ENFORCE fix merge lod infer op ut test=develop
-
Committed by qingqing01
-
Committed by Aurelius84
* add match_matrix_tensor op test=develop * fix ignore unittest if with_mkl=off test=develop * clean code and rm is_test param test=develop * modify API.spec test=develop * rm useless code in search_compute.h test=develop * modify api.spec test=develop * modify default_grad.spec test=develop * Add API test code test=develop * clean code in search_compute.h * modify PADDLE_ENFORCE and clean search_compute.h test=develop * fix code style test=develop
-
Committed by Zeng Jinle
-
Committed by zhang wenhui
Add fl_listen_and_serv op for Federated_learning; fl_distribute_transpiler adds this op to the pserver program. This op just listens on the endpoint and performs sum & scale.
-
Committed by silingtong123
* print error code if cuda related API fails
-
- 18 Aug, 2019 1 commit
-
-
Committed by gongweibao
Unset unittests http_proxy env to avoid timeout.
-
- 16 Aug, 2019 2 commits
-
-
Committed by Kaipeng Deng
* fix temporal_shift OP PADDLE_ENFORCE. test=develop * fix HasInput/HasOutput ENFORCE. test=develop
-
Committed by Zeng Jinle
-
- 15 Aug, 2019 2 commits
-
-
Committed by Adam
test=develop
-
Committed by Yibing Liu
* Add padding support for crf_decoding * Fixes in compute kernel test=develop * Update API Spec test=develop * Update API.spec test=develop * Avoid using paddle_enforce test=develop * Fix enforce test=develop
-
- 14 Aug, 2019 3 commits