1. 17 9月, 2019 5 次提交
    • H
    • J
      Feature/add transform data dygraph (#19707) · cc311bdf
      Jiabin Yang 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * add transform_data to dygraph
      
      * test=develop, refoctor name to make it easier to understand
      
      * test=develop, refoctor name to make it easier to understand
      
      * add test and change input to const ref for safety
      
      * test=develop, fix multi-gpu failed problem , add Tracer tests, change PADDLEENFORCE to PADDLEENFORCE_EQ
      
      * add ut for data transform
      
      * refine ut for data_transform
      
      * test=develop, fix ut failed on parallel se-resnext
      
      * test=develop, change one more PADDLE_ENFORCE
      
      * add test_tracer on multiple devices
      
      * test=develop, change place to mutable for data transform
      
      * test=develop, add transform data on same place test and remove useless log
      
      * test=develop, Add to do for data layout and and ut for conv2d with no bias
      cc311bdf
    • L
      cpu Conv double grad (#19672) · b76343c3
      lvmengsi 提交于
      * cpu conv_grad_grad
      b76343c3
    • Z
    • Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770) · 93c85c93
      翟飞跃 提交于
      * Implement the operator with sprase matrix multiply
      
      * Update the URL of mklml library.
      
      test=develop
      
      * Disable MKLML implematation when using no-linux.
      
      test=develop
      
      * optimize bp with mkl sparse matrix
      test=develop
      
      * tmp add fused_emb_seq layer
      
      * Add the support of padding_idx attribute.
      
      test=develop
      
      * add padding_idx support
      test=develop
      
      * implement grad refer lego
      test=develop
      93c85c93
  2. 16 9月, 2019 8 次提交
  3. 14 9月, 2019 3 次提交
  4. 13 9月, 2019 1 次提交
    • C
      Open fuse all reduce option (#19765) · 056fdedd
      chengduo 提交于
      * Open fuse all reduce op
      test=develop
      
      * Add Fuse optimization op log
      
      * Add log in fuse_optimizer op pass and fuse all_reduce op pass
      
      * replace with boost::optional<bool>
      test=develop
      
      * Polish code
      test=develop
      
      * fix code coverage
      test=develop
      056fdedd
  5. 12 9月, 2019 3 次提交
  6. 11 9月, 2019 10 次提交
  7. 10 9月, 2019 7 次提交
  8. 09 9月, 2019 3 次提交