- 03 12月, 2019 1 次提交
-
-
由 bingyanghuang 提交于
-
- 02 12月, 2019 1 次提交
-
-
由 zhaoyuchen2018 提交于
* Improve topk performance. give 200000 data to compute topk, before opt: cost 1s after opt: cost 0.0028s. * Refine return value. * Add cuda util funtions. * Fix ComputeBlockSize bug & refine comments. Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 25 11月, 2019 1 次提交
-
-
由 liuwei1031 提交于
cudaStreamSynchronize randomly hang when used in multi-thread environment, replace it with cudaStreamQuery API on windows
-
- 24 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* simplify C++ error stack by rewrite Place, test=develop * polish assignment overload func, test=develop
-
- 21 11月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* delete paddle infershape enforce marco (#20832) * Polish and arrange code in enforce.h (#20901) * Enrich the type of error and declare the error type interfaces (#21024) * Enrich the type of error and declare the error type interfaces, test=develop * adjust tests to adapt new form, test=develop * add inference deps with error_codes.pb.h, test=develop * restore stack iter start pos, test=develop * polish code based review comments, test=develop * Add dependency for error_codes.proto (#21084) * fix activation_functions deps, test=develop, test=document_fix * add error_codes_proto deps, test=develop, test=document_fix * try delete enforce.h, test=develop, test=document_fix * change cuda enforce & add example (#21142) test=release/1.6
-
- 07 11月, 2019 1 次提交
-
-
由 Adam 提交于
* Add asymetric padding support for mkldnn pooling test=develop * Add asymetric padding support for mkldnn conv test=develop * Add asymetric padding support for mkldnn conv_transpose test=develop
-
- 30 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* add support to gcc8, add docker env * remove the warning issue
-
- 25 10月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
-
- 22 10月, 2019 1 次提交
-
-
由 Adam 提交于
test=develop
-
- 21 10月, 2019 1 次提交
-
-
由 WangXi 提交于
-
- 20 10月, 2019 1 次提交
-
-
由 123malin 提交于
* test=develop, communicator_is_sgd_optimizer flags
-
- 18 10月, 2019 2 次提交
-
-
由 bingyanghuang 提交于
-
由 Michał Gallus 提交于
test=release/1.6 * - Flushing mkl-dnn cache test=develop - Disabled clearing cache for LoadModel - Added clearing of mkl-dnn cache when Executor is created test=develop - Do not clear for GPU places test=develop - compilation fix test=develop * - Moved clearing of mkl-dnn cache in destructor of executor test=develop * - Compilation fix test=develop - Reverted conditional clearing of mkl-dnn cache in Executors's destructor test=develop - compilation fix
-
- 16 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 14 10月, 2019 1 次提交
-
-
由 633WHU 提交于
-
- 13 10月, 2019 1 次提交
-
-
由 Wilber 提交于
cpu机器在gpu库上运行paddle出core,原因是由于缺失显卡driver,显卡driver与cuda driver不匹配 加上driver check解决该问题
-
- 11 10月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 01 10月, 2019 1 次提交
-
-
由 danleifeng 提交于
-
- 28 9月, 2019 2 次提交
-
-
由 qingqing01 提交于
* How to write custom op needs to follow framework OP spec. * Package fluid_framework.so and headers into whl. * Add paddle.sysconfig.get_include() and paddle.sysconfig.get_lib() to get include dir and lib dir. * Export some C-APIs to merge OpInfo between core.so and custom_op.so. * Add unit testing. * Update API.spec.
-
由 liym27 提交于
* fix pool2d pool3d: 1. support asymmetric padding; 2. support padding algorithm:"SAME" and "VALID"; 3. support channel_last: data_format NHWC and NDHWC; 4. support inferring shape when input with negative dims in compile time; 5. change doc of python API and c++; 6. fix bug in cuda kernel when Attr(adaptive) is true. test=develop,test=document_preview * fix 'tensors' to 'Tensors'. test=develop,test=document_preview * add test for converage ValueError.test=develop,test=document_preview * resolve conflict in test_pool2d. test=develop
-
- 27 9月, 2019 1 次提交
-
-
由 Chen Weihang 提交于
* shape and optimize paddle error message stack, test=develop * limit exception type & add unittest, test=develop * fix multi-platform problem, test=develop * fix related unnitest failed, test=develop * add doc & fix unittest errors, test=develop * fix function name error, test=develop * update tensor test exception msg compare, test=develop * remove unittest on win32, the dir format is different, test=develop * remove useless package, test=develop * add paddle enforce handler unittest, test=develop * add exception checkout, test=develop * fix coverage failed, test=develop * fix op registry test failed, test=develop * refactor whole pr, test=develop * remove test in CMakelist, test=develop * fix coverage, test=develop
-
- 26 9月, 2019 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Fix conv2d+dequantize squash for residual fusion test=develop * Correct int8 input test=develop * Add if exclude or include padding in pool2d mkldnn test=develop
-
- 24 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
-
由 Jacek Czaja 提交于
- First implementation of BWD and FWD of pooling mkl-dnn - Compilation fix - Fix - Fix - Fix - Fix to crash - Compilation fix - Combined AcquireBacward with Fwd test=develop
-
- 23 9月, 2019 1 次提交
-
-
由 chengduo 提交于
* Add RecordHistoryLocalExecScopes test=develop
-
- 22 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* refine reallocate of workspace size, test=develop * add lock to cudnn handle calls, test=develop
-
- 20 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
-
由 Jacek Czaja 提交于
- LRN mkl-dnn kernel refactor test=develop - compilation fix - Another compilation fix - Compilation fix - another compilation fix - compilation fix - Crash fix - optional LRN mkldnn workspace - Added mid allocation - Workaround for tests - Removed gradient from is_test ut - Removed mid for inference - Reverted LRN mid removal for is_test - PADDLE_ENFORCE adjusted - Rebase to templatization commit - Compilation fix - compilation fix test=develop - lint test=develop - Fix to crash - Rebase to recent codebase - lin - lint - compilation fix
-
- 19 9月, 2019 2 次提交
-
-
由 lidanqing 提交于
* fix conflicts test=develop * change mask_bias_reorder test=develop * add ComputeMask function to make code clear test=develop * change according to reviews test=develop * change according to reviews test=develop
-
由 Adam 提交于
* Add template functions for Acquire primitive/primitive_desc test=develop * Move acquire primitive descriptor to protected section test=develop
-
- 18 9月, 2019 2 次提交
-
-
由 Zeng Jinle 提交于
-
由 Zeng Jinle 提交于
-
- 17 9月, 2019 1 次提交
-
-
由 Adam 提交于
test=develop
-
- 16 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 14 9月, 2019 2 次提交
- 12 9月, 2019 1 次提交
-
-
由 Jacek Czaja 提交于
test=develop - fix to BWD test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Huihuang Zheng 提交于
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 10 9月, 2019 2 次提交
-
-
由 Adam 提交于
* MKLDNN handler cleanup * MKLDNN handler cleanup test=develop
-
由 XiaoguangHu 提交于
Add document annotations for FLAGS that need to be open to external developers test=develop (#19692) Add document annotations for FLAGS that need to be open to external developers
-