1. 02 8月, 2019 2 次提交
    • Z
      Open gc by default (#18836) · 7ac748ad
      Zeng Jinle 提交于
      * open gc by default, test=develop
      
      * fix test_train_recognize_digits and disable gc when ngraph is enabled, test=develop
      
      * fix conditional_block op eager deletion bug, test=develop
      
      * add some comments to reviewers, test=develop
      7ac748ad
    • Fusion: seqpool_cvm_concat (#18471) · ee2f296e
      石晓伟 提交于
      * add fusion_seqpool_cvm_concat test=develop
      
      * simplify pass, test=develop
      
      * fix code style, test=develop
      ee2f296e
  2. 01 8月, 2019 3 次提交
    • W
      Add the op of unique_with_counts, expand count function of the op unique (#18720) · 3ab1866c
      wawltor 提交于
      * test=develop
      Add the op of unique_with_counts, the op is calc the unqiue input of data, and output the corresponding indices and count of data.
      
      * test=develop
      Check the input and dtype in the op of unique_with_counts
      
      * test=develop
      test=document_preview
      update the API.spec for `unique_with_counts`, at the same time, optimize the python api in the op of `unique_with_count`
      
      * test=develop
      test=document_preview
      Fix some python api problem in the op of `unique_with_counts`, and change the error messsage in this op.
      
      * Fix some API problem in the op of `unique_with_counts`
      test=develop
      test=document_preview
      
      * test=develop
      test=document_preview
      Fix the api sample of op `unique_with_counts`, and update api.spec
      3ab1866c
    • J
      - Removed passing X from FWD to GRAD via device context (#18911) · 5cf2d385
      Jacek Czaja 提交于
      test=develop
      
      - Extracted key generation from FWD and GRAD into separate function
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - another compilation
      
      test=develop
      5cf2d385
    • L
      Fix depthwise conv gpu kernel bug (#18582) · 22fa4c2d
      LielinJiang 提交于
      * fix depthwise conv gpu kernel bug, test=develop
      * add more depthwise conv test, test=develop
      22fa4c2d
  3. 31 7月, 2019 7 次提交
  4. 30 7月, 2019 2 次提交
  5. 28 7月, 2019 1 次提交
  6. 26 7月, 2019 4 次提交
  7. 25 7月, 2019 4 次提交
  8. 24 7月, 2019 2 次提交
    • B
      Extend Matmul to support matrix multiplication with multiple heads (#18570) · 220eef60
      Bob Zhu 提交于
      * extend matmul op to support multiple head multiplication
      
      With the support of multiple head, the multiplication of two big matrixes is
      split into multiplication of several (head_number) small matrixes. e.g. if
      Mat A is [3, 24] and Mat B is [24, 4], when multiple A and B with head_number
      as 4, Mat A will be split as 4 matrix of [3, 6] and Mat B will be 4 matrix of
      [6, 4]. The result of final matrix will be 4 matrix of [3, 4], i.e. [3, 16].
      220eef60
    • W
      Add python API for appending LoD level (#18702) · 075e1cf7
      whs 提交于
      * Make lod reset op support for append lod level.
      
      * Fix API.spec
      test=develop
      
      * Fix unitest.
      test=develop
      
      * Add python api for lod append.
      test=develop
      
      * Fix API.spec
      test=develop
      
      * Fix format of doc.
      test=develop
      
      * Fix unitest.
      test=develop
      
      * Fix doc.
      test=develop
      075e1cf7
  9. 23 7月, 2019 4 次提交
  10. 22 7月, 2019 4 次提交
  11. 20 7月, 2019 2 次提交
  12. 19 7月, 2019 2 次提交
  13. 18 7月, 2019 2 次提交
  14. 17 7月, 2019 1 次提交
    • Y
      Add cuda implementation for `prelu` backward pass (#18633) · ce1ec332
      Yang Zhang 提交于
      * Add GPU implementation for `prelu` backward pass
      
      test=develop
      
      * Fix logic error in `prelu` GPU backward and simplify a bit
      
      test=develop
      
      * Fix `prelu` backward CUDA implementation
      
      test=develop
      
      CPU version was not used actually, so test passed
      ce1ec332