- 10 12月, 2019 7 次提交
-
-
由 mapingshuo 提交于
* add seed op
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
由 xujiaqi01 提交于
* fix master patch when slot is dense * test=develop
-
由 xujiaqi01 提交于
* fix code style of fleet_wrapper * test=develop
-
由 wangchaochaohu 提交于
* accelerate mean op test=develop
-
由 Leo Chen 提交于
* add op function generator, test=develop * add unittest, test=develop * follow comments, test=develop * fix windows compilation problem, test=develop
-
- 09 12月, 2019 3 次提交
-
-
由 lidanqing 提交于
* update benchmark for int8v2, QAT1, QAT2 accuracy and performance test=document_fix * change according to reviews test=develop test=document_fix * improve some descriptions and some models test=develop test=document_fix * update models benchmark data test=develop test=document_fix * update int8v2 and qat2 performance test=develop test=document_fix
-
由 Leo Chen 提交于
* refine init function, test=develop * add tests, test=develop * remove extern, which may cause symbol error in gcc-4.8, test=develop
-
由 Leo Chen 提交于
* dygraph_grad_maker supports varbase without grad_var, test=develop * fix compile, test=develop * fix test_tracer, test=develop * follow comments, test=develop
-
- 07 12月, 2019 1 次提交
-
-
由 xujiaqi01 提交于
* remove optimize_for in framework.proto * test=develop
-
- 06 12月, 2019 8 次提交
-
-
由 Zeng Jinle 提交于
* polish infer shape registry, test=develop * modify some operators registry, test=develop
-
由 Aurelius84 提交于
-
由 Zeng Jinle 提交于
-
由 Huihuang Zheng 提交于
Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests. Fix bugs: 1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op. 2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var. This PR also did some code clean up: 1. Print the var name when sgd_op catches shape error so that it is easier to debug 2. Fix a typo: dicta -> dict
-
由 hutuxian 提交于
* Add a single_process_multi_thread transpiler. * Add some UTs. * Fix some API description.
-
由 liym27 提交于
* add file check_op_desc.py and add interface to get default value. test=develop * add test for c++ coverage rate. test=develop * Correct typo. test=develop
-
由 Jacek Czaja 提交于
test=develop
-
由 Jacek Czaja 提交于
* - BAtch norm mkl-dnn NHWC test=develop - compilation fix test=develop - UT fix - cosmetics test=develop - Fix to Batch Norm MKL-DNN NHWC UT test=develop Conflicts: paddle/fluid/operators/batch_norm_op.h * - Lint fixes test=develop
-
- 05 12月, 2019 5 次提交
-
-
由 Zeng Jinle 提交于
-
由 Huihuang Zheng 提交于
As the title
-
由 Pei Yang 提交于
-
由 wangchaochaohu 提交于
* fix profiler warning message in cpu profile mode test=develop
-
由 Leo Chen 提交于
* test=develop, fix docker with paddle nccl problem * don't expose numerous Tensor.set(), test=develop * fix condition, test=develop * fix float16 bug, test=develop * feed should be Tensor or np.array, not Variable or number, test=develop * use forcecast to copy numpy slice to new array, test=develop * remove float16-uint16 hacking, test=develop * add variable method to varbase and refactor to_variable to support return varbase * support kwargs in varbase constructor * add VarBase constructor to support default python args * refine varbase initial method * reset branch * fix ut for change VarBase error info to PaddleEnforce * cherry is parameter change before * overload isinstance to replace too many change of is_variable * rm useless files * rm useless code merged by git * test=develop, fix some ut failed error * test=develop, fix test_graph_wrapper * add some tests, test=develop * refine __getitem__, test=develop * add tests, test=develop * fix err_msg, test=develop
-
- 04 12月, 2019 7 次提交
-
-
由 Youwei Song 提交于
* dygraph Embedding layer use lookuptable v2 test=develop * fix test_nce test=develop
-
由 Pei Yang 提交于
* make DisableGlogInfo able to mute all logs in inference.
-
由 wangchaochaohu 提交于
* fix fill_constant_batch_size_like_op precious problem test=develop
-
由 Zhaolong Xing 提交于
test=develop
-
由 WangXi 提交于
-
由 hong 提交于
* add overrider for virtual function; test=develop * fix layer.h OutputName bug; test=develop
-
由 Aurelius84 提交于
* add _get_all_register_op_kernels api test=develop * refine usage of check_op_register_type test=develop * add import in core test=develop
-
- 03 12月, 2019 9 次提交
-
-
由 Zeng Jinle 提交于
-
由 Jacek Czaja 提交于
-
由 GaoWei8 提交于
* add ernie-large test test=develop * add ernie large c++ inference test test=develop
-
由 zhongpu 提交于
* support SelectedRows in dygraph, test=develop * fix bug of _grad_ivar interface, test=develop * add optest for support seletedrows, test=develop * fix bug for gradient_accumulator in GPU mode, test=develop * fix error when Selectedrows addto LodTensor in sorted_gradient mdoe in dygraph, test=develop * refine and simplify gradient accumulator code, test=develop * add optest, test=develop * add optest and simplify code, test=develop * fix bug for test_imperative_selected_rows, test=develop * add optest for Coverage, test=develop * fix gradient interface and simplify code, test=develop * update api for gradient, test=develop * fix ShareDim's bug in DygraphExecutionContext class, test=develop * add optest, test=develop
-
由 Tao Luo 提交于
test=develop
-
由 lilong12 提交于
* set dim[0] to -1 if dim[0] < 0 and remove assertion to runtime, test=develop * modify ENFORCE message, test=develop * add validation for x.shape[0] > 0, test=develop * add ut, test=develop
-
由 Zhaolong Xing 提交于
* add jeston compile support test=develop * refine the cmake test=develop
-
由 Zhaolong Xing 提交于
test=develop
-
由 tangwei12 提交于
-