- 13 Jun, 2019 1 commit
-
-
Committed by Michał Gallus
-
- 24 May, 2019 1 commit
-
-
Committed by Michał Gallus
* fuse mul and elementwise add to fc
* Reimplement the FC forward operator
* Fix FC MKLDNN integration by transposing weights
* Add FC MKLDNN Pass test=develop
* FC MKLDNN Pass: change memcpy to std::copy
* Fix MKLDNN FC handling of mismatch input and weights dims
* Lower tolerance for MKL-DNN in resnet50 test test=develop
* Adjust FC to support MKLDNN Op placement test=develop
* Adjust Placement Op to set use_mkldnn attribute for graph test=develop
* MKLDNN FC: fix weights format so that gemm version is called test=develop
* FC MKLDNN: Remove tolerance decrease from tester_helper
* FC MKL-DNN: Refactor the code, change input reorder to weight reorder
* MKL-DNN FC: Introduce operator caching test=develop
* FC MKL-DNN: Fix the tensor type in ExpectedKernelType test=develop
* FC MKL-DNN: fix style changes test=develop
* FC MKL-DNN: fallback to native on non-supported dim sizes test=develop
* FC MKLDNN: fix CMake paths test=develop
* FC MKLDNN: Refine placement pass graph mkldnn attribute test=develop
* Fix Transpiler error for fuse_conv_eltwise test=develop
* Fix missing STL includes in files test=develop
* FC MKL-DNN: Enable new output size computation Also, refine pass to comply with newest interface. test=develop
* FC MKL-DNN: enable only when fc_mkldnn_pass is enabled
* FC MKL-DNN: Allow Weights to use oi or io format
* FC MKL-DNN: Adjust UT to work with correct dims test=develop
* Enable MKL DEBUG for resnet50 analyzer test=develop
* FC MKL-DNN: Improve Hashing function test=develop
* FC MKL-DNN: Fix shape for fc weights in transpiler
* FC MKL-DNN: Update input pointer in re-used fc primitive
* Add log for not handling fc fuse for unsupported dims test=develop
* FC MKL-DNN: Move transpose from pass to Op Kernel test=develop
* FC MKL-DNN: Disable transpose in unit test test=develop
* FC MKL-DNN: Remove fc_mkldnn_pass from default list
* Correct Flag for fake data analyzer tests test=develop
* FC MKL-DNN: Add comment about fc mkldnn pass disablement test=develop
* FC MKL-DNN: Disable fc in int8 tests test=develop
-
- 21 May, 2019 1 commit
-
-
Committed by Tao Luo
test=develop
-
- 20 May, 2019 1 commit
-
-
Committed by wopeizl
* fix the random compilation failure on windows
-
- 03 Apr, 2019 1 commit
-
-
Committed by Chen Weihang
test=develop This reverts commit c38c7c56.
-
- 02 Apr, 2019 1 commit
-
-
Committed by Chen Weihang
* link the libwbaes.so into paddle
* polish detail, test=develop
* try fix mac_pr_ci error, test=develop
* add compile option, test=develop
* fix ci error, test=develop
* ignore failed to find mac lib, test=develop
* change cdn to bj, cdn can't get the latest version
* trigger ci, test=develop
* temporary delete win32 lib linking, test=develop
* change https to http, test=develop
* turn compile option on to off
* turn compile option off to on, test=develop
* try lib compiled by gcc4.8, test=develop
* update lib version, test=develop
* link other lib, test=develop
* add setup config
* delete false, test=develop
* delete no_soname, test=develop
* recover so name set
* fix, test=develop
* adjust make config, test=develop
* remove link to wbaes, test=develop
* remove useless define, test=develop
-
- 25 Jan, 2019 1 commit
-
-
Committed by chengduo
test=develop
-
- 21 Jan, 2019 2 commits
- 18 Jan, 2019 1 commit
-
-
Committed by Wu Yi
* fix mac cmake version 3.13 test=develop
* fix again test=develop
-
- 16 Jan, 2019 1 commit
-
-
Committed by Yiqun Liu
* Simplify the compare op for CPU.
* Use asynchronous tensor copy in reshape_op's kernel.
* Optimize while_op for test, avoiding creating variables every time. test=develop
* Enable the cache of kernel type and kernel function. test=develop
* Enable profiling with gperftools.
* Remove flags for testing, and fix the linking error. test=develop
* Delete the codes of ChooseKernel. test=develop
* Fix bug when preparing ExecutorPrepareContext for while_op.
* Fix missing depending on grpc libraries.
* Remove the redundant print. test=develop
* Follow comments.
* Remove the codes related to prepare the ExecutorPrepareContext for while_op. test=develop
-
- 10 Jan, 2019 1 commit
-
-
Committed by peizhilin
test=develop
-
- 07 Jan, 2019 2 commits
- 03 Jan, 2019 2 commits
- 18 Dec, 2018 1 commit
-
-
Committed by peizhilin
-
- 12 Dec, 2018 1 commit
-
-
Committed by minqiyang
-
- 10 Dec, 2018 1 commit
-
-
Committed by Yu Yang
-
- 23 Nov, 2018 1 commit
-
-
Committed by sabreshao
* HIP cmake. Enable whole archive build for pybind library. Disable two warnings. Rollback to C++11. Link RCCL to WA gpu kernel loading issue. Update eigen to fix build failure. Add more include directories. Fix O3 build failure. Update eigen. Fix tensor_util_test segment fault issue. Add more macro check in hip.cmake. We may consider refining hip.cmake to inherit all add_definitions() in parent scope in the future. Fix rocRAND load. Update eigen to fix gru_unit_op and reduce_op. Add HIP support to testing. Update eigen to support int16 and int8 in arg min and arg max.
* add rocprim as cub library used by nv implementation
* Reduce build time in rocprim.
* Add rocprim introduction, remove useless cmake code.
* Remove useless flags and format cmake file.
-
- 22 Nov, 2018 2 commits
-
-
Committed by peizhilin
-
Committed by wopeizl
* add recordio support
* disable the openblas multi-thread on windows since no support adjust the python script
* code style
* code style test=develop
* add create_recordio_file_reader back
* fix code style test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build test=develop
* remove fused compile support on windows test=develop
* add the jit support test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows test=develop
* fix the scripts error test=develop
* disable avx on windows by default test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
-
- 16 Nov, 2018 1 commit
-
-
Committed by peizhilin
-
- 08 Nov, 2018 2 commits
-
-
Committed by peizhilin
-
Committed by Zhaolong Xing
-
- 05 Nov, 2018 1 commit
-
-
Committed by peizhilin
-
- 01 Nov, 2018 1 commit
-
-
Committed by dzhwinter
-
- 24 Oct, 2018 1 commit
-
-
Committed by wanghaoshuang
-
- 19 Oct, 2018 1 commit
-
-
Committed by typhoonzero
-
- 18 Oct, 2018 1 commit
-
-
Committed by Xin Pan
test=develop
-
- 15 Oct, 2018 2 commits
-
-
Committed by dzhwinter
-
Committed by wanghaoshuang
-
- 14 Oct, 2018 1 commit
-
-
Committed by wanghaoshuang
-
- 25 Sep, 2018 1 commit
-
-
Committed by dzhwinter
* generic cmake
* Update generic.cmake
-
- 19 Sep, 2018 1 commit
-
-
Committed by chengduo
-
- 15 Sep, 2018 1 commit
-
-
Committed by dzhwinter
-
- 30 Aug, 2018 1 commit
-
-
Committed by dzhwinter
-
- 29 Aug, 2018 1 commit
-
-
Committed by dzhwinter
-
- 28 Aug, 2018 1 commit
-
-
Committed by dzhwinter
-
- 25 Aug, 2018 1 commit
-
-
Committed by dzhwinter
-