- 29 May, 2019 (3 commits)
-
-
Committed by Zhaolong Xing
test=develop
-
Committed by mozga-intel
-
Committed by Yiqun Liu
Optimize the concat and split kernel for special cases when the number of inputs/outputs is 2 (#17415) * Optimize the concat and split kernel for special cases where the number of inputs/outputs is 2. test=develop * Refine codes. test=develop * Correct the condition. test=develop * Move the definition of tmp_data outside the if statement. * Print the cudnn minor version. test=develop * Fix the case when in_num/o_num is 1 in concat/split op. test=develop * Remove const_cast. test=develop
-
- 28 May, 2019 (9 commits)
-
-
Committed by 石晓伟
* update python APIs test=document_preview test=develop * update API.spec test=document_preview test=develop * update merge_selected_rows * update API.spec test=document_preview test=develop * update API.spec test=document_preview test=develop * fix the comment of less_than test=develop test=document_preview * update API.spec test=develop test=document_preview * update API.spec test=develop test=document_preview * update API.spec test=develop test=document_preview * update API.spec test=develop * update API test=develop
-
Committed by Jiabin Yang
-
Committed by lidanqing
* add INT8 conv+relu6 fuse and enable mobilenetv2 INT8 test test=develop * change false and 0.0 to fuse_brelu and brelu_threshold test=develop change the "fuse_relu||fuse_brelu" to "unsigned_output" test=develop * Use relu instead of brelu as INT8 post-op because INT8 brelu is not enabled in mkldnn v0.18 test=develop * continuous-integration fix test=develop
-
Committed by Jacek Czaja
* - changes to graph detector - Changes to pass - Added ut for new pass - use_pass - Added pass to mkldnn passes - fix to registration - improved verbose messaging for conv bias passes - Lint fixes test=develop * - Lint fixes test=develop
-
Committed by Shuai Yuan
* [DOC] Fix api docs, test=develop, test=document_preview * [DOC] Fix api annotation: fluid.layers.tensor_array_to_tensor. test=develop, test=document_preview * test=develop, test=document_preview update MD5 of tensor_array_to_tensor
-
Committed by wopeizl
* fix the api example for create_global_var, create_parameter, SGDOptimizer, RMSPropOptimizer, MomentumOptimizer, LarsMomentumOptimizer, FtrlOptimizer test=develop * add example for adamoptimizer fix API.spec test=develop * test=develop * test=develop
-
Committed by Yibing Liu
* Bias correction for exponential moving average test=develop, test=document_preview * Fix docs test=develop, test=document_preview
-
Committed by Tao Luo
test=develop
-
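The 28 May entry by Yibing Liu mentions bias correction for an exponential moving average. As a rough illustration only, the sketch below shows the standard form of that correction for a zero-initialized average: the raw average is divided by `1 - decay**t` so that early values are not pulled toward zero. The function name `ema_with_bias_correction` and its parameters are hypothetical and are not taken from the commit; the actual change in the repository may differ.

```python
def ema_with_bias_correction(values, decay=0.999):
    """Return the bias-corrected EMA after each step.

    Hypothetical helper for illustration; assumes the average starts at 0.
    """
    ema = 0.0
    corrected = []
    for step, value in enumerate(values, start=1):
        # Plain exponential moving average, biased toward the zero start.
        ema = decay * ema + (1.0 - decay) * value
        # Divide by (1 - decay^step) to undo the zero-initialization bias.
        corrected.append(ema / (1.0 - decay ** step))
    return corrected


if __name__ == "__main__":
    # For a constant input the corrected average matches the input
    # from the first step on, up to floating-point rounding.
    print(ema_with_bias_correction([5.0, 5.0, 5.0], decay=0.9))
```

Without the division, the first output for a constant input of 5.0 and decay 0.9 would be 0.5 rather than roughly 5.0, which is the bias the correction removes.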
- 27 May, 2019 (10 commits)
-
-
Committed by Krzysztof Binias
* Enable sqrt operator for the nGraph Bridge. test=develop * Update activation_op.h
-
Committed by Sylwester Fraczek
* add Concat quantization; add unit test for quantizing concat; fix for wrong value when the input is not in map of calculated scales; add use_quantizer to concat_op.cc; add scale_algo rules for concat test=develop * missing fix for multiple inputs quantize-squash * wojtuss review fix: adding comment test=develop
-
Committed by Zeng Jinle
-
Committed by gongweibao
-
Committed by Krzysztof Binias
test=develop
-
Committed by Zhen Wang
* fix the bug that sub_scope_ may be null in AnalysisPredictor::Run. * add more directions about io APIs' docs. * update the API.spec. test=develop test=document_preview
-
Committed by chengduo
* enhance print
-
Committed by Zeng Jinle
* Revert "Revert "Fix allocator bug"" This reverts commit 174d0d0b. * Revert "fix travis ci" This reverts commit 5656fa9f. test=develop * add inlined_vector.h, test=develop * add inlined_vector_test,test=develop * clean code of allocator,test=develop * delete zero_size_allocator.h,test=develop * fix failed unittest,test=develop
-
Committed by Guo Sheng
test=develop
-
Committed by Huihuang Zheng
* Add Dockerfile for cuda9 and cuda10
-
- 26 May, 2019 (1 commit)
-
-
Committed by Bai Yifan
-
- 25 May, 2019 (6 commits)
-
-
Committed by bdzhuxiaoning
-
Committed by chengduo
* add data parallel batch
-
Committed by hutuxian
* gather_op support int64_t index by adding a template typename * add UT and rename typename test=develop
-
Committed by Huihuang Zheng
* Add Dockerfile for cuda9 and cuda10. Add Dockerfile for building cuda9 and cuda10 images.
-
Committed by mozga-intel
-
Committed by Zhaolong Xing
* fluid int8 train and trt int8 predict align; trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference; enhance quant dequant fuse pass; enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. I modified the C++ interface but forgot to modify the pybind code; fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop
-
- 24 May, 2019 (11 commits)
-
-
Committed by Michał Gallus
* fuse mul and elementwise add to fc * Reimplement the FC forward operator * Fix FC MKLDNN integration by transposing weights * Add FC MKLDNN Pass test=develop * FC MKLDNN Pass: change memcpy to std::copy * Fix MKLDNN FC handling of mismatch input and weights dims * Lower tolerance for MKL-DNN in resnet50 test test=develop * Adjust FC to support MKLDNN Op placement test=develop * Adjust Placement Op to set use_mkldnn attribute for graph test=develop * MKLDNN FC: fix weights format so that gemm version is called test=develop * FC MKLDNN: Remove tolerance decrease from tester_helper * FC MKL-DNN: Refactor the code, change input reorder to weight reorder * MKL-DNN FC: Introduce operator caching test=develop * FC MKL-DNN: Fix the tensor type in ExpectedKernelType test=develop * FC MKL-DNN: fix style changes test=develop * FC MKL-DNN: fallback to native on non-supported dim sizes test=develop * FC MKLDNN: fix CMake paths test=develop * FC MKLDNN: Refine placement pass graph mkldnn attribute test=develop * Fix Transpiler error for fuse_conv_eltwise test=develop * Fix missing STL includes in files test=develop * FC MKL-DNN: Enable new output size computation Also, refine pass to comply with newest interface. 
test=develop * FC MKL-DNN: enable only when fc_mkldnn_pass is enabled * FC MKL-DNN: Allow Weights to use oi or io format * FC MKL-DNN: Adjust UT to work with correct dims test=develop * Enable MKL DEBUG for resnet50 analyzer test=develop * FC MKL-DNN: Improve Hashing function test=develop * FC MKL-DNN: Fix shape for fc weights in transpiler * FC MKL-DNN: Update input pointer in re-used fc primitive * Add log for not handling fc fuse for unsupported dims test=develop * FC MKL-DNN: Move transpose from pass to Op Kernel test=develop * FC MKL-DNN: Disable transpose in unit test test=develop * FC MKL-DNN: Remove fc_mkldnn_pass from default list * Correct Flag for fake data analyzer tests test=develop * FC MKL-DNN: Add comment about fc mkldnn pass disablement test=develop * FC MKL-DNN: Disable fc in int8 tests test=develop
-
Committed by wopeizl
* add __str__ method for tensor and lodtensor to support print test=develop
-
Committed by Krzysztof Binias
test=develop
-
Committed by Hongyu Liu
* fix example; test=develop * fix api spec; test=develop * fix api spec; test=develop * add doc check test=develop test=document_preview * test=develop,test=document_preview add blank line to fix format, add one more "import" * fix bug; test=develop * fix bug; test=develop
-
Committed by guru4elephant
add Run Prepared Ctx, fix pybind problem
-
Committed by Yibing Liu
test=develop
-
Committed by Guo Sheng
* Fix the example code in some Python API. test=develop * Fix the example code in some Python API by adding import. test=develop
-
Committed by chengduo
* This PR adds broadcast for multi-process. And it could be used in dynamic graph to broadcast parameters.
-
Committed by flame
Python examples of fluid.layers.io.double_buffer and some BuildStrategy's methods.
-
Committed by Sylwester Fraczek
* add conv_concat_relu fuse test=develop * add test code test=develop * added missing include with unordered_map test=develop * review fixes for wojtuss test=develop * remove 'should (not) be fused' comment statements; one of them was invalid anyway test=develop
-
Committed by Sylwester Fraczek
* fix quantize_squash_pass segfault when there is no tensor linked to Bias input test=develop * add googlenet test test=develop * fix concat CreateKey not using input format test=develop
-