- 17 9月, 2019 6 次提交
-
-
由 liym27 提交于
1. add tensor support for argument expand_times in expand op; 2. add support parameter inference when argument expand_times is a list containing integer and tensor variable; improve expand op according to reviews: 1. add doc of ExpandTimes in expand_op.cc; 2. improve the test of test_api. add stop_gradient=True when attr(expand_times) is tensor Variable, change code examples. test=develop,test=document_preview
-
由 xujiaqi01 提交于
* support preload thread * sleep before fleet wrapper exit for pslib core dump * optimize hdfs log * fix master+patch bug
-
由 Jiabin Yang 提交于
* refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * add transform_data to dygraph * test=develop, refoctor name to make it easier to understand * test=develop, refoctor name to make it easier to understand * add test and change input to const ref for safety * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ * add ut for data transform * refine ut for data_transform * test=develop, fix ut failed on parallel se-resnext * test=develop, change one more PADDLE_ENFORCE * add test_tracer on multiple devices * test=develop, change place to mutable for data transform * test=develop, add transform data on same place test and remove useless log * test=develop, Add to do for data layout and and ut for conv2d with no bias
-
由 lvmengsi 提交于
* cpu conv_grad_grad
-
由 翟飞跃 提交于
* Implement the operator with sprase matrix multiply * Update the URL of mklml library. test=develop * Disable MKLML implematation when using no-linux. test=develop * optimize bp with mkl sparse matrix test=develop * tmp add fused_emb_seq layer * Add the support of padding_idx attribute. test=develop * add padding_idx support test=develop * implement grad refer lego test=develop
-
由 chengduo 提交于
* fix example error test=develop * Remove set_desc test=develop
-
- 16 9月, 2019 4 次提交
-
-
由 ruri 提交于
* add unit test for square error cost op
-
由 zhongpu 提交于
* add kernel for squeeze_op, test=develop * delete comment, test=develop
-
由 Chen Weihang 提交于
-
由 tangwei12 提交于
fix wrong place with distributed_lookup_table
-
- 12 9月, 2019 2 次提交
-
-
由 Aurelius84 提交于
* add one_hot_v2_op to remove last_dims==1 test=develop * add api unittest code for CI_Coverage test=develop * improve CI_Coverage rate by adding test_with_depth test=develop
-
由 JesseyXujin 提交于
-
- 11 9月, 2019 10 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
由 chengduo 提交于
Fix test_parallel_executor_test_while_train
-
由 Zeng Jinle 提交于
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop
-
由 Aurelius84 提交于
* Remove constraint that last dimension is forced to be 1 in huber_loss test=develop * add y[rank-1] == 1 when x_rank=y_rank test=develop * modify into contain_unknown_dim test=develop
-
由 Youwei Song 提交于
* update dygraph api-doc and backward api-doc, test=develop * update dygraph api-doc and backward api-doc, update api.spec, test=develop * update dygraph api-doc and backward api-doc, update api.spec, test=develop * update API.spec, test=develop
-
由 Thunderbrook 提交于
-
由 Youwei Song 提交于
* fix dygraph partitial backward problem, test=develop * add unittest, fix ClearGradient. test=develop * add filter and error in python side, test=develop * rebase develop, test=develop * bug fix for list equals in py3.5, test=develop * bug fix for list equals, test=develop
-
由 Tao Luo 提交于
remove unused accuracy-diff warpctc-cudnn implementation test=develop
-
由 Bai Yifan 提交于
* split teacher checkpoints with student checkpoints, test=develop * add unittest for graph.merge(), test=develop
-
- 10 9月, 2019 4 次提交
-
-
由 Zeng Jinle 提交于
-
由 chengduo 提交于
test=develop
-
由 wangguanzhong 提交于
* merge_empty_lod_tensor, test=develop * fix multiclass_nms, test=develop * refine API.spec, test=develop * add unittest case for fetch, test=develop * add lod tensor test, test=develop * return index for multiclass_nms, test=develop * add api for multiclass_nms2 * update API.spc, test=develop * refine api doc, test=develop * fix test_detection.py, test=develop * polish code, test=develop * add more unittest case, test=develop
-
由 gongweibao 提交于
Fix float16 optimizer
-
- 09 9月, 2019 4 次提交
-
-
由 Chen Weihang 提交于
* add prune when test_program.clone is executed after optimizer.minimize * add unittest, test=develop * add resnet and transformer test case, test=develop * add regularization for optimizer & program compare function, test=develop * add lstm unittest, test=develop * polish code based on review comment, test=develop * adapt to interface change in framework._prune, test=develop * update API.spec, test=develop
-
由 zhongpu 提交于
* add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop * add kernel for unsqueeze_op, test=develop
-
由 Zeng Jinle 提交于
-
由 JesseyXujin 提交于
* delete args on linear_chain_crf_op doc * delete args on linear_chain_crf_op doc * delete args on linear_chain_crf_op doc * add code example * fix api doc * fix doc of crf * fix doc of crf * add test=develop * modify API.spec, test=develop
-
- 06 9月, 2019 6 次提交
-
-
由 123malin 提交于
* fleet api add input check, test=develop
-
由 Aurelius84 提交于
-
由 chengduo 提交于
* make test_pe_seresnext serial test=develop * Increase test_pe_seresnext time limit on MAC test=develop
-
由 chengduo 提交于
* Disable this unit test temporally test=develop * Split GPU and CPU test=develop
-
由 Chen Weihang 提交于
* delete useless raw variables in Conv2D, test=develop * adjust the vars number in test_graph_wrapper to pass unittest, test=develop
-
由 Jie Fang 提交于
init new amp, optimize inserting cast op for batchnorm
-
- 05 9月, 2019 4 次提交
-
-
由 123malin 提交于
* test=develop, communicator merge add => merge average
-
由 Jiabin Yang 提交于
* refactor dygraph,test=develop * fix failed unittest,test=develop * polish code,test=develop * check windows ci error,test=develop try to fix windows ci error by np.allclose,test=develop * polish vlog and profiler, test=develop * try to fix preceding ops order,test=develop * test transformer in windows ci, test=develop * use python c-api to speed up tracer.trace,test=develop * test=develop, fix docker with paddle nccl problem * test=develop, add ut for debug string and gradient_accumulator * test=develop, add tests for layer/gradient_accumulator/prepared_op * test=develop, fix complie error for test_prepared_op * test=develop, add more ut for dygraph * test=develop, create API.spec for dygraph api change * test=develop, refoctor name to make it easier to understand * test=develop, refoctor name to make it easier to understand * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ * test=develop, fix ut failed on parallel se-resnext * test=develop, change one more PADDLE_ENFORCE
-
由 mapingshuo 提交于
* Fix bug: add feed_vars to the prune function
-
由 whs 提交于
* Fix data type of variable in edit distance evaluator. test=develop * Add unitest for edit_distance python API. test=develop
-