- 10 12月, 2019 1 次提交
-
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
- 06 12月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 05 12月, 2019 2 次提交
-
-
由 Huihuang Zheng 提交于
As the title
-
由 wangchaochaohu 提交于
* fix profiler warning message in cpu profile mode test=develop
-
- 04 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
* make DisableGlogInfo able to mute all logs in inference.
-
- 03 12月, 2019 2 次提交
-
-
由 Zhaolong Xing 提交于
* add jeston compile support test=develop * refine the cmake test=develop
-
由 Huihuang Zheng 提交于
Add warning message when initialize GLOG failed
-
- 02 12月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* fix -Wno-error=sign-compare warning in gcc8 test=develop * fix warning in distributed codes test=develop
-
- 01 12月, 2019 1 次提交
-
-
由 Jie Fang 提交于
-
- 29 11月, 2019 1 次提交
-
-
由 Jacek Czaja 提交于
-
- 28 11月, 2019 2 次提交
-
-
由 wangchaochaohu 提交于
* fix profile api high version test=develop
-
由 wangchaochaohu 提交于
-
- 25 11月, 2019 1 次提交
-
-
由 zhouwei25 提交于
-
- 24 11月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 18 11月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
* fix warnings oof gcc 8 compilation, test=develop * fix boost::bad_get, test=develop * refine PADDLE_ENFORCE, test=develop
-
由 liuwei1031 提交于
cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows
-
- 14 11月, 2019 2 次提交
-
-
由 zhaoyuchen2018 提交于
* Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
由 Chen Weihang 提交于
-
- 13 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 12 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* simplify C++ error stack by rewrite Place, test=develop * polish assignment overload func, test=develop
-
- 08 11月, 2019 2 次提交
-
-
由 joanna.wozna.intel 提交于
* Add transpose2 INT8 for mkl-dnn test=develop * Fix test_transpose_int8_mkldnn test=develop * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2" This reverts commit 34011bdb, reversing changes made to 2ce6473f. * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"" This reverts commit 23754dd7. * Add template to TransposeMKLDNNHandler test=develop * Resolve conflict test=develop * Restore get_size and refactor test=develop
-
由 Chen Weihang 提交于
* Enrich the type of error and declare the error type interfaces, test=develop * adjust tests to adapt new form, test=develop * add inference deps with error_codes.pb.h, test=develop * restore stack iter start pos, test=develop * polish code based review comments, test=develop
-
- 07 11月, 2019 1 次提交
-
-
由 Adam 提交于
* Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop
-
- 06 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 01 11月, 2019 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 31 10月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 28 10月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 25 10月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* Make formatted ENFORCE stack adapt to more situations and polish details, test=develop * restore template message position, test=develop
-
- 22 10月, 2019 1 次提交
-
-
由 Adam 提交于
test=develop
-
- 20 10月, 2019 1 次提交
-
-
由 123malin 提交于
* test=develop, communicator_is_sgd_optimizer flags
-
- 18 10月, 2019 3 次提交
- 17 10月, 2019 1 次提交
-
-
由 Jacek Czaja 提交于
* - Flushing mkl-dnn cache test=develop - Disabled clearing cache for LoadModel - Added clearing of mkl-dnn cache when Executor is created test=develop - Do not clear for GPU places test=develop - compilation fix test=develop * - Moved clearing of mkl-dnn cache in destructor of executor test=develop * - Compilation fix test=develop - Reverted conditional clearing of mkl-dnn cache in Executors's destructor test=develop - compilation fix
-
- 16 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 14 10月, 2019 1 次提交
-
-
由 633WHU 提交于
* support dlpack to tensor and implement python interface test=develop * add unittest for _to_dlpack and from_dlpack test=develop
-
- 12 10月, 2019 1 次提交
-
-
由 Wilber 提交于
enable cpu machine to run paddle model in gpu lib
-
- 11 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 30 9月, 2019 1 次提交
-
-
由 danleifeng 提交于
Improve elementwise operators performance in same dimensions
-
- 28 9月, 2019 1 次提交
-
-
由 qingqing01 提交于
* How to write custom op needs to follow framework OP spec. * Package fluid_framework.so and headers into whl. * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir. * Export some C-APIs to merge OpInfo between core.so and custom_op.so. * Add unit testing. * Update API.spec.
-