- 22 Mar, 2019 1 commit
-
-
Committed by nhzlx
2. refine trt code test=develop
-
- 20 Mar, 2019 1 commit
-
-
Committed by nhzlx
-
- 19 Mar, 2019 1 commit
-
-
Committed by zhhsplendid
test=develop
-
- 16 Mar, 2019 1 commit
-
-
Committed by qingqing01
test=develop
-
- 15 Mar, 2019 1 commit
-
-
Committed by qingqing01
* Support Sync Batch Norm.
* Note: do not enable it on a single device.

Usage:

    build_strategy = fluid.BuildStrategy()
    build_strategy.sync_batch_norm = True
    binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name,
        build_strategy=build_strategy)
-
- 22 Feb, 2019 1 commit
-
-
Committed by Yiqun Liu
* Initialize the benchmark tester for operator. test=develop
* Rearrange the codes. test=develop
-
- 30 Jan, 2019 1 commit
-
-
Committed by xuezhong
-
- 25 Jan, 2019 1 commit
-
-
Committed by baojun
* enable ngraph_engine_op test=develop
* merge develop test=develop
* avoid const_cast test=develop
* rm ngraph_operator test=develop
* Added TODO to move EnableNgraph test=develop
* Add TODO to remove const_cast test=develop
-
- 24 Jan, 2019 1 commit
-
-
Committed by Yiqun Liu
* Refine the beam_search op and test.
* A basic CUDA implementation of beam_search for small batch_size.
* Implement CUDA kernel for beam_search_op.
* Use multiple CUDA threads in the same block to select the top beam.
* Update the python api of beam_search op.
* Enable extend function in CPU kernel of beam_search op.
* Unify the CUDA codes. test=develop
* Unify the CPU kernel of beam_search op.
* Ensure the selected items of beam_search_op's CPU kernel are sorted by scores.
* Update the description of beam_search in API.spec.
* Enable the use of CUDA kernel in beam_search op.
* Exclude the beam_search's CUDA unittest when there is no CUDA gpu, and delete some debugging statements. test=develop
* Follow comments. test=develop
* Call the CPU kernel for beam_search op when batch_size > 4. test=develop
* Remove the exception for the is_empty op in PrepareData. test=develop
-
- 18 Jan, 2019 1 commit
-
-
Committed by zhaozhehao
* refactor tree2col operator with new memory mechanism test=develop
* test=develop
* test=develop
* Modified API according to panyx0718 test=develop
* fix API change according to heavengate test=develop
* Modify API comment test=develop
-
- 29 Dec, 2018 1 commit
-
-
Committed by peizhilin
test=develop
-
- 26 Dec, 2018 1 commit
-
-
Committed by peizhilin
test=develop
-
- 18 Dec, 2018 3 commits
- 13 Dec, 2018 1 commit
-
-
Committed by sneaxiy
fix cmake again test=develop
-
- 10 Dec, 2018 2 commits
-
-
Committed by sneaxiy
-
Committed by tensor-tang
-
- 05 Dec, 2018 1 commit
-
-
Committed by tensor-tang
-
- 03 Dec, 2018 1 commit
-
-
Committed by nhzlx
-
- 28 Nov, 2018 1 commit
-
-
Committed by Qiao Longfei
-
- 25 Nov, 2018 1 commit
-
-
Committed by Qiao Longfei
-
- 22 Nov, 2018 1 commit
-
-
Committed by wopeizl
* add recordio support
* disable the openblas multi-thread on windows since no support adjust the python script
* code style
* code style test=develop
* add create_recordio_file_reader back
* fix code style test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build test=develop
* remove fused compile support on windows test=develop
* add the jit support test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows test=develop
* fix the scripts error test=develop
* disable avx on windows by default test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
-
- 21 Nov, 2018 2 commits
-
-
Committed by Yu Yang
Some operators depend on cub and xxhash through their headers. The dependency should be declared explicitly on those operators rather than on pybind. test=develop
-
Committed by Dang Qingqing
test=develop
-
- 19 Nov, 2018 7 commits
- 18 Nov, 2018 1 commit
-
-
Committed by peizhilin
fix compile error on windows
-
- 17 Nov, 2018 3 commits
- 16 Nov, 2018 3 commits
-
-
Committed by Wu Yi
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
-
Committed by whs
* Fix truncated normal.
* Fix.
* Make nce support more distribution.
* Fix API.spec.
* Fix python API.
* Fix. test=develop
* Fix API.spec test=develop
* Fix sampler.
* Fix order of arguments in python API. test=develop
-
Committed by Wu Yi
* add cudnn ctc loss
* wip add test test=develop
* wip
* wip
* done test=develop
* move include cudnn test=develop
* test test=develop
* fix build test=develop
* fix build test=develop
* fix build on cudnn5 test=develop
* fix cudnn5 build test=develop
* fix cudnn5 build test=develop
* merge develop softmax functor change test=develop
-
- 15 Nov, 2018 1 commit
-
-
Committed by peizhilin
-