1. 08 11月, 2019 2 次提交
    • J
      Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835
      joanna.wozna.intel 提交于
      * Add transpose2 INT8 for mkl-dnn
      
      test=develop
      
      * Fix test_transpose_int8_mkldnn
      
      test=develop
      
      * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"
      
      This reverts commit 34011bdb, reversing
      changes made to 2ce6473f.
      
      * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""
      
      This reverts commit 23754dd7.
      
      * Add template to TransposeMKLDNNHandler
      
      test=develop
      
      * Resolve conflict
      
      test=develop
      
      * Restore get_size and refactor
      
      test=develop
      77c20835
    • C
      Enrich the type of error and declare the error type interfaces (#21024) · 7ee25189
      Chen Weihang 提交于
      * Enrich the type of error and declare the error type interfaces, test=develop
      
      * adjust tests to adapt new form, test=develop
      
      * add inference deps with error_codes.pb.h, test=develop
      
      * restore stack iter start pos, test=develop
      
      * polish code based review comments, test=develop
      7ee25189
  2. 05 11月, 2019 3 次提交
  3. 02 11月, 2019 1 次提交
  4. 01 11月, 2019 3 次提交
  5. 31 10月, 2019 3 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
    • T
      support dump param of model into afs (#20302) · 59bcdc8a
      Thunderbrook 提交于
      * support dump param to afs
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * dump param
      test=develop
      
      * dump param
      test=develop
      
      * dump param
      test=develop
      
      * dump param
      test=develop
      59bcdc8a
    • Y
      Refine the cache of program, context and scope in executor. (#18483) · 16e4d026
      Yiqun Liu 提交于
      * Refine the cache of program, context and scope in executor.
      test=develop
      
      * Refine the unittest test_executor_and_use_program_cache.
      
      * Add the test the PaddingRNN with use_program_cache=True.
      test=develop
      
      * Remove a check.
      test=develop
      
      * Refine the unittest to check whether it is correct when setting use_program_cache=True.
      test=develop
      16e4d026
  6. 29 10月, 2019 4 次提交
    • H
      save load problem fix and new feature add (#20823) · ff0886a9
      hong 提交于
      * fix persistable;
      
      * fix save load bugs; test=develop
      
      * fix bug; test=develop
      
      * add example for new io api; test=develop
      
      * addd example; test=develop
      ff0886a9
    • Y
      Check and correct the output's lod_level in DynamicRNN related operators (#19144) · 6fcfd32e
      Yiqun Liu 提交于
      * Refine the InferShape of ReadFrom and WriteTo op, and add comment to explain why not call ShareLoD for runtime.
      test=develop
      
      * Add comment for ReorderLoDTensorByRank op.
      
      * Add comment for lod_tensor_to_tensor_array op to explain why only call DecreaseLoDLevel for compile time.
      test=develop
      
      * ShrinkRNNMemory op should call ShareLoD for compile time.
      test=develop
      
      * Add the implementation of IncreaseLoDLevel and add the compile-time check of lod_level in InferShape of sequence_pool.
      test=develop
      
      * Refine the unittest of DynamicRNN.
      test=develop
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_NE.
      test=develop
      6fcfd32e
    • Y
      Implement a pass detect fusion group of elementwise op (#19884) · b5f3be83
      Yiqun Liu 提交于
      * Add fusion_group_pass and elementwise pattern.
      
      * Rewrite the detector of elementwise group.
      test=develop
      
      * Add a comment in codegen.
      
      * Add more unittest cases.
      test=develop
      
      * Move code_generator related code to fusion_group directory.
      
      * Correct the including path.
      
      * Add the definition of SubGraph and finish the insert of fusion_group op in pass.
      
      * Insert graph_vis_pass in tester to visualize the graph for debug.
      b5f3be83
    • H
      Add shape and type check at read_op (#20754) · 95ba4bd2
      Huihuang Zheng 提交于
      95ba4bd2
  7. 28 10月, 2019 2 次提交
  8. 25 10月, 2019 2 次提交
    • X
      fix several sparse table issuses (#20686) · 48669aa8
      xujiaqi01 提交于
      * no longer need to define all embedding layers (no one less) of all slots in each program. make trainer_param repeated in ps.proto.
      * add find_distributed_lookup_table_grads instead of hard code GRAD
      * support embedding stop gradient. push sparse has error before fix this.* 
      * fix fill sparse, skip slots which do not have embedding. each slot's embedding in a sparse table should be used in all training programs before fix this.
      * fix pull sparse, skip slots which do not have embedding.
      * fix collect feasign label info, skip slots which do not have embedding.
      * support when there are multi sparse tables in one or multi training programs, each program can pull/push its own related sparse tables instead of all sparse tables.
      * test=develop
      48669aa8
    • C
      Make formatted ENFORCE stack adapt to more situations (#20826) · 1d1552d1
      Chen Weihang 提交于
      * Make formatted ENFORCE stack adapt to more situations and polish details, test=develop
      
      * restore template message position, test=develop
      1d1552d1
  9. 24 10月, 2019 2 次提交
  10. 21 10月, 2019 1 次提交
  11. 19 10月, 2019 1 次提交
  12. 18 10月, 2019 5 次提交
  13. 17 10月, 2019 2 次提交
  14. 16 10月, 2019 1 次提交
    • C
      Add IndicateVarDataType interface to block tensor is not initialized problem... · 003f369b
      Chen Weihang 提交于
      Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
      
      * add indicate_var_data_type inferface, test=develop
      
      * add unittests & polish error message, test=develop
      
      * remove needless include, test=develop
      
      * extract public function & polish message, test=develop
      
      * delete empty var check, test=develop
      
      * change data_type to pointer parameter, test=develop
      
      * polish details, test=develop
      003f369b
  15. 15 10月, 2019 2 次提交
  16. 14 10月, 2019 5 次提交
  17. 13 10月, 2019 1 次提交
    • Z
      Add Multihead matmul fuse pass (#20167) · b8333ede
      zhaoyuchen2018 提交于
      * Add multihead fuse pass for ernie opt
      
      * Refine softmax
      
      test=develop
      
      * Refine cuda kernel
      
      * Refine cuda version
      
      * Refine cmake
      
      test=develop
      
      * refine header file
      
      * refine test case and pass
      * refine comments
      b8333ede