1. 30 8月, 2019 1 次提交
    • J
      [MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330
      Jacek Czaja 提交于
      - Refactor step 1
      
      - Compilation fix
      
      - Yet another compilation fix
      
      - Even more compilation fix
      
      - Lint fixes
      
      test=develop
      
      - Removed deprectaed PADDLE_ENFORCE occurance
      
      test=develop
      
      - Candidate fix to BN forward
      
      - Lint fixes
      
      test=develop
      
      - Refactoring in data_layout_transform
      
      - compilation fix
      
      - Another comppilation fix
      
      - Step further into darkness
      
      - Yet another compilation fix
      
      - Yet another compilation fix
      
      - missing header
      
      - compilation fix
      
      - Added MKLDNN -> Paddle conversion in fetch op
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - Lint
      
      test=develop
      
      - Mul fix
      
      - Fix to MKLDNN MUL op and Elementwise MUL UT
      
      test=develop
      
      - Workaround for diffrent weights with groups representation Paddle vs
        MKL-DNN.
      
      test=develop
      
      - Candidate fix for 5D convolution with groups
      
      - Refactor of fix for conv3d and conv2d in fetch op
      
      test=develop
      
      - Compilation fix
      
      - Still same compilation fix
      
      - Compilation fix
      
      - Compilation fix
      
      - Reverted refactoring of fixes
      
      - Adapted test_conv2d_int8_mkldnn so it exects data in NCHW format
        not NHWC
      
      test=develop
      
      - minor fix in UT
      
      test=develop
      
      - Lint fixes
      
      test=develop
      ecd9f330
  2. 10 5月, 2019 1 次提交
    • Q
      Double backward of conv2d. (#17211) · e32c9888
      qingqing01 提交于
      * Add conv2d_grad_grad_op
      * Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
          - Now use it in conv2d_grad_grad.
          - Will simply the searching code in conv2d and conv2d_grad in next PR.
      * Enhance and fix bug in unit testing of gradient_checker.
      * Support to fetch empty variables,return None in Python.
      e32c9888
  3. 26 11月, 2018 1 次提交
  4. 16 11月, 2018 1 次提交
    • W
      Refine operator cmake (#14413) · a2d9b344
      Wu Yi 提交于
      * wip simplify operator framework
      
      * wip
      
      * wip
      
      * done test=develop
      
      * clean test=develop
      
      * fix test=develop
      
      * fix deps test=develop
      
      * fix cpu build test=develop
      
      * fix tensorrt build test=develop
      
      * fix tests test=develop
      
      * fix test=develop
      
      * fix cpu build test=develop
      a2d9b344
  5. 08 11月, 2018 1 次提交
  6. 05 11月, 2018 1 次提交
  7. 24 10月, 2018 1 次提交
  8. 05 8月, 2018 1 次提交
  9. 08 5月, 2018 1 次提交
    • Y
      Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang 提交于
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
      0e78cb69
  10. 27 4月, 2018 1 次提交
  11. 26 4月, 2018 1 次提交
  12. 16 3月, 2018 1 次提交
  13. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  14. 12 2月, 2018 1 次提交
  15. 10 2月, 2018 2 次提交
  16. 09 2月, 2018 1 次提交
  17. 09 1月, 2018 1 次提交
  18. 08 1月, 2018 1 次提交
  19. 27 12月, 2017 4 次提交
  20. 26 12月, 2017 1 次提交
  21. 24 12月, 2017 1 次提交
    • D
      Feature/operator run place (#6783) · 735eba29
      dzhwinter 提交于
      * "change operator interface"
      
      * "move devicepool to device_context"
      
      * "fix operator test"
      
      * "fix op_registry Run interface"
      
      * "net op passed. Need to fix nccl multi-Context"
      
      * "add nccl group function"
      
      * "add nccl group function"
      
      * "fix gpu count exceed 32 error"
      
      * "fix recurrent op, nccl op"
      
      * "change the other operators interface with Place"
      
      * "fix typo"
      
      * "fix pybind"
      
      * "fix device in python side"
      
      * "fix pybind failed"
      
      * "add init for test"
      
      * "fix CI"
      735eba29
  22. 20 12月, 2017 1 次提交
  23. 26 11月, 2017 1 次提交
    • D
      Feature/copytensor (#5455) · 45062fe5
      dzhwinter 提交于
      * "make global tensor function independently"
      
      * "replace functor"
      
      * "fix inline template error"
      
      * "fix tensor array with CopyFrom"
      
      * "fix other case use CopyFrom"
      
      * "move the op interface hardly"
      
      * "fix operators"
      
      * "fix typo"
      
      * "delete dynamic recurrent rnn and fix gru_unit in debugmode"
      
      * "fix unique_ptr copy"
      
      * "fix cuda copy"
      
      * "fix namespace error"
      
      * "removed nccl python test"
      
      * "fix include error"
      
      * "fix typo"
      
      * fix copy util test
      45062fe5
  24. 04 11月, 2017 1 次提交
  25. 27 10月, 2017 1 次提交
  26. 24 10月, 2017 1 次提交
  27. 20 10月, 2017 2 次提交
    • Y
      Remove template parameter for Tensor methods (#4937) · c532b967
      Yu Yang 提交于
      * Remove template parameter for Tensor methods
      
      * Also check the type is correct when data()
      * Simplize holder_
      
      * Fix accuracy_op
      
      * Register Code
      c532b967
    • Y
      Feature/py executor test (#4922) · 3db52783
      Yu Yang 提交于
      * Implement FC layer with helper
      
      * Update LayerHelper
      
      * Add debug string for Python ProtoBuf
      
      and Rename `Sync` to `Flush`
      
      * Add check of ProtoBuf initialization
      
      * Layer wrapper for FC
      
      * Fix unittest
      
      * Fix CI
      
      * Add code generator
      
      * AttributeChecker Better error log and speicalize bool
      
      Since lots of types can be cast to bool
      
      * Complete mlp, fit_a_line
      
      * Expose get global scope
      
      * Make global scope not thread-safe
      
      1. It is no need to make global scope thread-safe, since it will be
      invoked in Python main thread.
      2. Do not free the global scope when C++ exit. Let the OS free memories,
      otherwise, we need to handle the destroy dependencies.
      
      See
      https://google.github.io/styleguide/cppguide.html#Static_and_Global_Variables
      
      * Fix
      
      * Implementation of simple conv_2d layer
      
      * Stash
      
      * Remove private data members in OpRegister
      
      * Fix bugs
      
      * Stash
      
      * Expose FeedFetchList as VarType
      
      * Change ProgramDesc not a global variable
      
      * Polish code style
      
      * Stash
      
      * Correct implement BlockDesc destructor
      
      * Correct implement BlockDesc destructor
      
      * Unify program as parameter name
      
      * Fix bugs
      
      * Add unittest
      
      * Fix unit test error
      
      * Remove unused functions
      
      * Add clone for Python Program
      
      * Working on executor
      
      * Stash
      
      * Add glog as dependencies of ops
      
      * Use VLOG to logging some information is helpful when we debug Paddle
      
      * Expose VarDesc::persistable to Python
      
      * Test executor
      
      * Complete unittest
      
      * Polish code
      
      * Fix merge error
      
      * Follow comment
      
      * Polish Python Code
      3db52783
  28. 17 10月, 2017 1 次提交
    • Y
      Rewrite feed/fetch op (#4815) · 4df6cf4d
      Yu Yang 提交于
      * Feed/Fetch op just plain operator, not a OpWithKernel
      * Do not register OpInfoMaker since Feed/Fetch will never be
        configured by users
      * Feed/Fetch op has empty gradient
      * Feed/Fetch op do not hard code `feed_variable`, `fetch_variable` as
        its input and output, make it as a plain Operator input/output
      4df6cf4d
  29. 11 10月, 2017 1 次提交
  30. 10 10月, 2017 3 次提交
  31. 07 10月, 2017 2 次提交
  32. 06 10月, 2017 1 次提交