- 12 May, 2020 1 commit
-
-
Committed by Shibo Tao
* support CUDA using cmake built-in way. test=develop * test=develop
-
- 14 January, 2020 1 commit
-
-
Committed by zhouwei25
faster build by reduce by-product, reduce linking library and fix compile warning of std=c++11 (#22164)
-
- 16 December, 2019 1 commit
-
-
Committed by xujiaqi01
* fix compile error when WITH_PSLIB=ON * test=develop
-
- 25 November, 2019 1 commit
-
-
Committed by zhouwei25
-
- 11 September, 2019 1 commit
-
-
Committed by Huihuang Zheng
TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory. We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton. Also added data_feed_proto to operator to fix CI in CPU compilation
-
- 12 July, 2019 1 commit
-
-
Committed by kh2se2013
2)add pip install coverage in Dockerfile.tmp test=develop
-
- 27 June, 2019 1 commit
-
-
Committed by kh2se2013
* add WITH_COVERAGE option, default OFF test=develop * add coverage for python sdk test=develop * fix code style * fix COVERAGE_FILE path test=develop * remove coverage package test=develop * test = develop, run coverage as module
-
- 26 June, 2019 1 commit
-
-
Committed by Tao Luo
test=develop
-
- 16 June, 2019 1 commit
-
-
Committed by Wojciech Uss
* reuse C-API INT8 unit test application test=develop * updates after review test=develop
-
- 13 June, 2019 1 commit
-
-
Committed by Michał Gallus
-
- 24 May, 2019 1 commit
-
-
Committed by Michał Gallus
* fuse mul and elementwise add to fc * Reimplement the FC forward operator * Fix FC MKLDNN integration by transposing weights * Add FC MKLDNN Pass test=develop * FC MKLDNN Pass: change memcpy to std::copy * Fix MKLDNN FC handling of mismatch input and weights dims * Lower tolerance for MKL-DNN in resnet50 test test=develop * Adjust FC to support MKLDNN Op placement test=develop * Adjust Placement Op to set use_mkldnn attribute for graph test=develop * MKLDNN FC: fix weights format so that gemm version is called test=develop * FC MKLDNN: Remove tolerance decrease from tester_helper * FC MKL-DNN: Refactor the code, change input reorder to weight reorder * MKL-DNN FC: Introduce operator caching test=develop * FC MKL-DNN: Fix the tensor type in ExpectedKernelType test=develop * FC MKL-DNN: fix style changes test=develop * FC MKL-DNN: fallback to native on non-supported dim sizes test=develop * FC MKLDNN: fix CMake paths test=develop * FC MKLDNN: Refine placement pass graph mkldnn attribute test=develop * Fix Transpiler error for fuse_conv_eltwise test=develop * Fix missing STL includes in files test=develop * FC MKL-DNN: Enable new output size computation Also, refine pass to comply with newest interface. 
test=develop * FC MKL-DNN: enable only when fc_mkldnn_pass is enabled * FC MKL-DNN: Allow Weights to use oi or io format * FC MKL-DNN: Adjust UT to work with correct dims test=develop * Enable MKL DEBUG for resnet50 analyzer test=develop * FC MKL-DNN: Improve Hashing function test=develop * FC MKL-DNN: Fix shape for fc weights in transpiler * FC MKL-DNN: Update input pointer in re-used fc primitive * Add log for not handling fc fuse for unsupported dims test=develop * FC MKL-DNN: Move transpose from pass to Op Kernel test=develop * FC MKL-DNN: Disable transpose in unit test test=develop * FC MKL-DNN: Remove fc_mkldnn_pass from default list * Correct Flag for fake data analyzer tests test=develop * FC MKL-DNN: Add comment about fc mkldnn pass disablement test=develop * FC MKL-DNN: Disable fc in int8 tests test=develop
-
- 21 May, 2019 1 commit
-
-
Committed by Tao Luo
test=develop
-
- 20 May, 2019 1 commit
-
-
Committed by wopeizl
* fix the random compilation failure on windows
-
- 03 April, 2019 1 commit
-
-
Committed by Chen Weihang
test=develop This reverts commit c38c7c56.
-
- 02 April, 2019 1 commit
-
-
Committed by Chen Weihang
* link the libwbaes.so into paddle * polish detail, test=develop * try fix mac_pr_ci error, test=develop * add compile option, test=develop * fix ci error, test=develop * ignore failed to find mac lib, test=develop * change cdn to bj, cdn can't get the latest version * trigger ci, test=develop * temporary delete win32 lib linking, test=develop * change https to http, test=develop * turn compile option on to off * turn compile option off to on, test=develop * try lib compiled by gcc4.8, test=develop * update lib version, test=develop * link other lib, test=develop * add setup config * delete false, test=develop * delete no_soname, test=develop * recover so name set * fix, test=develop * adjust make config, test=develop * remove link to wbaes, test=develop * remove useless define, test=develop
-
- 25 January, 2019 1 commit
-
-
Committed by chengduo
test=develop
-
- 21 January, 2019 2 commits
- 18 January, 2019 1 commit
-
-
Committed by Wu Yi
* fix mac cmake version 3.13 test=develop * fix again test=develop
-
- 16 January, 2019 1 commit
-
-
Committed by Yiqun Liu
* Simplify the compare op for CPU. * Use asynchronous tensor copy in reshape_op's kernel. * Optimize while_op for test, avoiding creating variables every time. test=develop * Enable the cache of kernel type and kernel function. test=develop * Enable profiling with gperftools. * Remove flags for testing, and fix the linking error. test=develop * Delete the codes of ChooseKernel. test=develop * Fix bug when preparing ExecutorPrepareContext for while_op. * Fix missing depending on grpc libraries. * Remove the redundant print. test=develop * Follow comments. * Remove the codes related to prepare the ExecutorPrepareContext for while_op. test=develop
-
- 10 January, 2019 1 commit
-
-
Committed by peizhilin
test=develop
-
- 07 January, 2019 2 commits
- 03 January, 2019 2 commits
- 18 December, 2018 1 commit
-
-
Committed by peizhilin
-
- 12 December, 2018 1 commit
-
-
Committed by minqiyang
-
- 10 December, 2018 1 commit
-
-
Committed by Yu Yang
-
- 23 November, 2018 1 commit
-
-
Committed by sabreshao
* HIP cmake. Enable whole archive build for pybind library. Disable two warning. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. fix tensor_util_test segment fault issue add more macro check in hip.cmake. we may consider refine hip.cmake to inherit all add_definitions() in parent scope, in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max. * add rocprim as cub library used by nv implementation * Reduce build time in rocprim. * Add rocprim introduction, remove useless cmake code. * Remove useless flags and format cmake file.
-
- 22 November, 2018 2 commits
-
-
Committed by peizhilin
-
Committed by wopeizl
* add recordio support * disable the openblas multi-thread on windows since no support adjust the python script * code style * code style test=develop * add create_recordio_file_reader back * fix code style test=develop * fix the gtest.cmake on windows * fix cc_test on windows * fix the win build test=develop * remove fused compile support on windows test=develop * add the jit support test=develop * add the jit support, test=develop * add the jit support, test=develop * add the jit back fix compile error on windows * rollback test=develop * test case fix * disable DSO by default on windows * exclude warpctc_op on windows * exclude the dynload_warpctc out on windows test=develop * fix the scripts error test=develop * disable avx on windows by default test=develop * re-organize the cmake file * disable mkl on windows by default * add warp_ctc back * fix the dependency * fix the dependency * fix the build issue on windows * remove unsupported flag on windows * code style * code style test=develop * fix issue * add profiler, parallel_executor back * clean up the pre-definitions on windows * fix build issue * test=develop
-
- 16 November, 2018 1 commit
-
-
Committed by peizhilin
-
- 08 November, 2018 2 commits
-
-
Committed by peizhilin
-
Committed by Zhaolong Xing
-
- 05 November, 2018 1 commit
-
-
Committed by peizhilin
-
- 01 November, 2018 1 commit
-
-
Committed by dzhwinter
-
- 24 October, 2018 1 commit
-
-
Committed by wanghaoshuang
-
- 19 October, 2018 1 commit
-
-
Committed by typhoonzero
-
- 18 October, 2018 1 commit
-
-
Committed by Xin Pan
test=develop
-
- 15 October, 2018 1 commit
-
-
Committed by dzhwinter
-