- 19 12月, 2019 1 次提交
-
-
由 WangXi 提交于
-
- 18 12月, 2019 2 次提交
-
-
由 Huihuang Zheng 提交于
The fixed bugs: 1. The condition sub-graph is not pruned 2. When backward graph is extremely simple, the whole backward ops are pruned.
-
由 xujiaqi01 提交于
* fix compiled error of butil when with_pslib=on and with_testing=on * test=develop
-
- 17 12月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
-
- 16 12月, 2019 6 次提交
-
-
由 zhaoyuchen2018 提交于
* Fix softmax cuda bug * Refine multihead log and softmax logic
-
由 Kaipeng Deng 提交于
* yolo_box OP add Attr(clip_bbox). test=develop
-
由 Michał Gallus 提交于
test=develop
-
由 Leo Chen 提交于
* fix elementwise_pow bug on integer, test=develop * use llrint to support elementwise_pow_grad, test=develop * add some tests, test=develop * revert grad functor, test=develop
-
由 石晓伟 提交于
-
由 lidanqing 提交于
* fc-dequantize squash test=develop * change according to reviews test=develop * change PADDLE_ENFORCE test=develop * add second test when fc-dequant do not fuse test=develop * change all related PADDLE_ENFORCE test=develop
-
- 15 12月, 2019 2 次提交
-
-
由 Chen Weihang 提交于
* rename paddle throw error macro, test=develop * fix new error use case, test=develop
-
由 WangXi 提交于
-
- 12 12月, 2019 4 次提交
-
-
由 Leo Chen 提交于
* polish cmake, test=develop * add current directory to LD_LIBRARY_PATH, test=develop
-
由 joanna.wozna.intel 提交于
* Add reshape int8 op test=develop * Change test to CPUPlace test=develop * Correct tests test=develop
-
由 WangXi 提交于
-
由 tangwei12 提交于
* add fake init for the trainer, fix large memory hold in the trainer * do not merge recv vars from a remote endpoint, test=develop * add recv and save op, merge slice var in one op, save memory * remove hsigmoid with pull sparse, test=develop
-
- 11 12月, 2019 5 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
由 Zeng Jinle 提交于
* make OperatorWithKernel::InferShape virtual, test=develop * fix test_prepare_op by relu, test=develop
-
由 mapingshuo 提交于
* add no_need_buffer_slots interface to pybind
-
由 Zeng Jinle 提交于
-
由 GaoWei8 提交于
test=develop
-
- 10 12月, 2019 10 次提交
-
-
由 Chen Weihang 提交于
* refine dygraph dataloader & polish related code, test=develop * refine code based review comment, test=develop
-
由 wangchaochaohu 提交于
-
由 Zeng Jinle 提交于
-
由 mapingshuo 提交于
* add seed op
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
由 xujiaqi01 提交于
* fix master patch when slot is dense * test=develop
-
由 xujiaqi01 提交于
* fix code style of fleet_wrapper * test=develop
-
由 wangchaochaohu 提交于
* accelerate mean op test=develop
-
由 Leo Chen 提交于
* add op function generator, test=develop * add unittest, test=develop * follow comments, test=develop * fix windows compilation problem, test=develop
-
- 09 12月, 2019 3 次提交
-
-
由 lidanqing 提交于
* update benchmark for int8v2, QAT1, QAT2 accuracy and performance test=document_fix * change according to reviews test=develop test=document_fix * improve some descriptions and some models test=develop test=document_fix * update models benchmark data test=develop test=document_fix * update int8v2 and qat2 performance test=develop test=document_fix
-
由 Leo Chen 提交于
* refine init function, test=develop * add tests, test=develop * remove extern, which may cause symbol error in gcc-4.8, test=develop
-
由 Leo Chen 提交于
* dygraph_grad_maker supports varbase without grad_var, test=develop * fix compile, test=develop * fix test_tracer, test=develop * follow comments, test=develop
-
- 07 12月, 2019 1 次提交
-
-
由 xujiaqi01 提交于
* remove optimize_for in framework.proto * test=develop
-
- 06 12月, 2019 5 次提交
-
-
由 Zeng Jinle 提交于
* polish infer shape registry, test=develop * modify some operators registry, test=develop
-
由 Aurelius84 提交于
-
由 Zeng Jinle 提交于
-
由 Huihuang Zheng 提交于
Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests. Fix bugs: 1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op. 2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var. This PR also did some code clean up: 1. Print the var name when sgd_op catches shape error so that it is easier to debug 2. Fix a typo: dicta -> dict
-
由 hutuxian 提交于
* Add a single_process_multi_thread transpiler. * Add some UTs. * Fix some API description.
-