1. 26 9月, 2021 3 次提交
  2. 25 9月, 2021 1 次提交
  3. 24 9月, 2021 9 次提交
    • J
      add gradient kernel of det op and slogdet op (#36013) · b91e8eec
      jiangcheng 提交于
      * add gradient kernel of det op and slogdet op
      
      * fix CI APPROVAL problem
      b91e8eec
    • P
      Added elementwise_sub_mkldnn operator (#35662) · 787273ed
      piotrekobiIntel 提交于
      * Add elementwise_sub_mkldnn_op without grad
      
      * Add test to static_mode_white_list
      
      * Refactor code, change license years
      
      * Remove invalid grad implementation
      
      * Fix element_wise_sub_op test
      
      * Fix CI Approval error
      
      * Remove unnecessary EltwiseSubMKLDNNGradKernel class
      
      * Fix CI Approval 2
      
      * Fix CI Approval 3
      
      * Fix CI Approval Attempt #4
      
      * Fix CI Approve Attempt #5
      
      * Fix CI Approval Attempt #6
      
      * Fix CI Approval Attemt #7
      
      * Change test names containing add to sub
      
      * Fix old tests testing add instead of sub
      
      * Copy grad implementation from elementwise_add_mkldnn
      
      * CI test fix attempt
      
      * Revert "CI test fix attempt"
      
      This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.
      
      * Fix CI attempt 2
      
      * Fix elementwise_sub tests, temporary mkldnn broadcast test disable
      
      * Add working implementation of elementwise_sub grad
      
      * Fix build errors caused by pull
      
      * Fix format error
      
      * Fix format error 2
      
      * Disable elementwise_sub_mkldnn test on GPU
      
      * Apply fix for paddle.fluid import
      
      * Revert changes of test_elementwise_sub and Fix mkldnn test
      
      * Revert "Apply fix for paddle.fluid import"
      
      This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.
      
      * fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)
      
      * Add changes suggested by reviewers
      
      * Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI
      
      * Remove check_dygraph=False to satisify CI Approval
      Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>
      787273ed
    • J
      add pool2d convert test (#35923) · 82f255d0
      JingZhuangzhuang 提交于
      * add pool2d convert test
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      82f255d0
    • J
      [oneDNN] candidate fix to #34554 (#35884) · 485b387d
      Jacek Czaja 提交于
      * - candidate fix
      
      * - More fixes to #34554
      
      * - another incosnstent fix to key
      
      * - Remvoed unneeded line
      
      * - matching the cache behaviour to other ops
      485b387d
    • W
      update lite branch (#36010) · 17adcf68
      Wilber 提交于
      17adcf68
    • W
      Add paddle.linalg.solve OP (#35715) · 8caf951c
      Weilong Wu 提交于
      * Add linalg.solve op, test=develop
      
      * Fix a bug caused by accidental deletion
      
      * updated description and fix a bug: missing a comma
      
      * Add linalg.solve op, test=develop
      
      * updated solve op backward logic
      
      * updated solve op backward logic again
      
      * Add linalg.solve Op, test=develop
      
      * Updated and modified to fit CI requirements
      
      * Fix a bug
      
      * 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code
      
      * Remove redundant comments
      
      * 1)Removed redundant code; 2)Updated to enhance code robustness
      
      * Removed redundant code
      
      * Updated API documents
      8caf951c
    • L
    • B
      add multihead_matmul trt converter test case (#36023) · fcaa64b3
      baoachun 提交于
      * add multihead_matmul trt converter test case
      
      * move attribute check to op_teller
      fcaa64b3
    • W
      add the shape check for the matmul (#35791) · 8e19d1ba
      wawltor 提交于
      * add the shape check for the matmul
      
      * remove the test case for the linear
      8e19d1ba
  4. 23 9月, 2021 6 次提交
  5. 22 9月, 2021 16 次提交
  6. 21 9月, 2021 2 次提交
    • G
      support fp16 (#35888) · 087c23a9
      Guoxia Wang 提交于
      087c23a9
    • A
      Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (#35510) · 799f3861
      Adam Osewski 提交于
      * Create stateful OneDNNAXPYHandler object.
      
      This makes it possible to call it multiple times without recreating the
      oneDNN primitives every time.
      
      * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.
      
      * OneDNN SGD kernel.
      
      * Update call to use new OneDNNAXPYHandler object api.
      
      * Setup seed in proper place.
      
      * Enable OneDNN kernel only for single case.
      
      * For dense param and sparse grad.
      
      * Small refactor.
      
      * Enable oneDNN by op attr or by cmd line flag.
      
      * Use int64_t type for number of elements.
      
      * Support dense param and grad from OneDNN kernel.
      
      * Enable SGD OneDNN kernel when use MP BF16 optimizer.
      
      * Force non-copyable/movable OneDNNAXPYHandler.
      
      * Reuse OneDNNAXPYHandler for spare tensors in SUM op.
      
      * Fix SFINAE rules.
      
      * Remove recording event inside AXPY.
      
      * Get rid of internal primitive caching.
      
      * Stop use PP cache mechanims to store mem and primitive obj.
      * Handler obj store and reuse needed desc & prim
      
      * Do not derive from MKLDNNHandlerT
      799f3861
  7. 19 9月, 2021 2 次提交
    • L
      Optimization of pool2d grad (#35389) · 86685190
      limingshu 提交于
      * Optimization of pool2d grad, first commit.
      
      * remove useless print codes
      
      * refine codes
      
      * refine codes
      
      * seal more operation into template specialization
      
      * fix template struct error in MaxPool2dGrad.
      
      * Fix header including error
      
      * refine code with comment
      
      * Seal the param-preparation codes into function for common use.
      
      * Seal the param-preparation codes into function for common use.
      
      * Seal the param-preparation into funciton and make it common for other kernels
      
      * polish code and erase useless template speicalization
      
      * Rerun triger
      
      * rerun trigger
      86685190
    • B
      add hard_sigmoid trt converter test cases (#35876) · 9f88d327
      baoachun 提交于
      9f88d327
  8. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e