- 20 3月, 2019 22 次提交
-
-
由 nhzlx 提交于
1. refine anakin engine 2. add data type for zero copy align dev branch and PaddlePaddle:feature/anakin-engine brach the cudnn workspace modify was not included for now, because we use a hard code way in feature/anakin-engine branch. There should be a better way to implement it, and subsequent submissions will be made. test=develop
-
由 nhzlx 提交于
-
由 nhzlx 提交于
-
由 nhzlx 提交于
-
由 nhzlx 提交于
-
由 nhzlx 提交于
support change input size
-
由 nhzlx 提交于
-
由 flame 提交于
* use anakin batch norm and scale implement fluid batch norm
-
由 flame 提交于
cherry-pick feature/anakin-engine: add anakin softmax/transpose/batch_norm/flatten/reshape op (#16020) * add anakin softmax/ flatten/reshape/transpose/batch_norm op converter
-
由 nhzlx 提交于
-
由 nhzlx 提交于
-
由 flame 提交于
* add activation op * test conv2d relu sigmoid tanh
-
由 Yan Chunwei 提交于
-
由 Tao Luo 提交于
add runtime_context_cache_pass
-
由 baojun 提交于
* Add softmax_with_cross_entropy_op test=develop * simplify implementation test=develop
-
由 ruri 提交于
update sqrt explaination
-
由 chengduo 提交于
* fuse all_reduce test=develop * add fuse_parameter_groups_size test=develop * Polish code test=develop * Fix travis-ci test=develop * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop * Add SetGroupAccordingToMemorySize test=develop * fix multi_devices_graph test=develop * reset params_grads test=develop * Polish code test=develop
-
由 Zeng Jinle 提交于
Remove unused variables in op grad maker
-
由 baojun 提交于
* take care edge cases test=develop * use pragma test=develop
-
由 Tao Luo 提交于
refine cos_sim infershape
-
由 Wu Yi 提交于
* wip allreduce in op * wip * wip * wip * wip adding test * wip for conflict with mp mode * fix tests test=develop * fix cpu build test=develop * fix travis clang format test=develop * fix cpu build test=develop * update api.spec test=develop * delete comment test=develop * fix cpplint test=develop * fix test=develop * follow comment test=develop * add file test=develop * fix build test=develop * update test=develop * to be compatible with sync_bn, and fix mp mode in develop test=develop
-
由 sneaxiy 提交于
test=develop
-
- 19 3月, 2019 15 次提交
-
-
由 luotao1 提交于
test=develop
-
由 Tao Luo 提交于
Revert "cache runtime_context"
-
由 whs 提交于
* Make step_input support custom lod level. test=develop * Fix API.spec test=develop * Fix API.spec. test=develop * Fix API.spec test=develop * Add default value in document of step_input. test=develop * Fix document. test=develop * Fix API.spec test=develop
-
由 luotao1 提交于
test=develop
-
由 Hongyu Liu 提交于
Fix concat
-
由 Tao Luo 提交于
-
由 tensor-tang 提交于
refine sequence enumerate op
-
由 Qiyang Min 提交于
Implement imperative infer var type
-
由 Zeng Jinle 提交于
test=develop
-
由 Jacek Czaja 提交于
* - Fix to crash of Transformer when mkldnn is to be used Desc: TensorCopy was not setting MKLDNN primitive descriptor when layout was to be kMKLDNN test=develop * - Enable transformer for mkl-dnn test=develo * - Compilation fix test=develop * - Removed manual selection of MKL-DNN ops to be used in Transformer test test=develop
-
由 Yibing Liu 提交于
test=develop
-
由 shippingwang 提交于
-
由 Wojciech Uss 提交于
* Add cpu_quantize_placement_pass for C-API quantization test=develop * added a comment on required pass attributes test=develop
-
由 Tao Luo 提交于
cache runtime_context
-
-
- 18 3月, 2019 3 次提交
-
-
由 minqiyang 提交于
test=develop
-
由 minqiyang 提交于
-
由 Zeng Jinle 提交于
Remove const_cast in optimizers
-