1. 15 Nov, 2022 1 commit
    • mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek committed
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid dependency
  2. 08 Nov, 2022 1 commit
    • Split quant (#47449) · 130db92a
      Paulina Gacek committed
      * Split kernel registered, tests for uint/int added
      
      * Split quantized
      
      * Split output scales calculated only once
      
      * NearestInterp test fix reversed
      
      * DequantizeOutputs corrected
  3. 17 Oct, 2022 1 commit
  4. 11 Oct, 2022 1 commit
  5. 28 Sep, 2022 1 commit
    • Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang committed
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
  6. 31 Aug, 2022 1 commit
    • Fix split api bug (#45396) · 4a25b60d
      Charles-hit committed
      * fix split bug
      
      * solve function redefine
      
      * fix fluid.layers.split and add unit test
      
      * delete splitInferMeta register in unary.cc
      
      * modify test_split_op GPU unit test
      
      * modify test_split_op GPU unit test place param
      
      * refactor split op and fix infershape bugs
      
      * add () in && and ||
      
      * fix split C++ unit test
      
      * fix split infershape
  7. 26 Jun, 2022 1 commit
  8. 05 Jun, 2022 1 commit
  9. 19 Apr, 2022 1 commit
  10. 04 Mar, 2022 1 commit
  11. 01 Mar, 2022 1 commit
  12. 20 Feb, 2022 1 commit
  13. 19 Feb, 2022 1 commit
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 committed
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
  14. 14 Feb, 2022 1 commit
    • [pten] add split kernel (#39060) · d0df5632
      chentianyu03 committed
      * add split kernel
      
      * add split kernel signature
      
      * fix split bug
      
      * modify MakePtenScalarArrayFromVarList
      
      * modify MakePtenScalarArrayFromVarList
      
      * fix split windows register error
      
      * add test case for split kernel
      
      * replace raw split kernel with pten kernel
      
      * fix makeScalar/ScalarArray bug
      
      * remove debug log
      
      * remove int64_t type in buildPtcontext
      
      * update by code review
      
      * fix split dev test failed
      
      * change DenseTensorMeta to MetaTensor
      
      * change split api code from auto gen to manual
      
      * split cuda kernel support bfloat16 type
      
      * fix conflict
      
      * rm raw split kernel
      
      * merge develop branch
      
      * change to pten::errors
  15. 01 Sep, 2021 1 commit
    • Added slice BF16/FP32 FWD/BWD kernels (#34332) · 070cab11
      jakpiase committed
      * added slice FWD FP32
      
      * added tests for slice FWD FP32
      
      * added slice bwd
      
      * added bf16 tests
      
      * CI fix
      
      * CI fix
      
      * added reason to skip_if
      
      * minor change
      
      * temporary fix for failing test
      
      * temporary fix
      
      * changes after review
      
      * CI rerun
  16. 07 Jul, 2021 1 commit
  17. 23 Jun, 2021 1 commit
    • Added split op bf16/fp32 oneDNN kernel (#33584) · 68106509
      jakpiase committed
      * base changes for split op
      
      * 90% of split functionality added
      
      * full fp32 functionality
      
      * added bf16 test
      
      * added submemory caching
      
      * added bf test to static mode whitelist
      
      * minor change
      
      * enabled split op for inference
      
      * minor fix
      
      * minor fix
  18. 13 Oct, 2020 1 commit
  19. 30 Jul, 2020 1 commit
  20. 28 Feb, 2020 1 commit
  21. 31 Oct, 2019 1 commit
    • GradMaker for dygraph (#19706) · 8c4573a3
      hong committed
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix compile error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add imperative type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
  22. 29 Oct, 2019 1 commit
    • support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780) · 6802539a
      liym27 committed
      * improve split and concat op:
      1. support Tensor for argument 'dim' in split op.
      2. support Tensor for argument 'axis' in concat op.
      test=develop
      
      * redefine function GetDataFromTensor and set unknown output shape to -1.
      test=develop
      
      * add check: Attr(sections) match Input(X). test=develop
      
      * support Tensor for attr(sections) and attr(sections) can contain -1.
      add check for attr(sections).
      test=develop
      
      * modify error message for concat and call Resize only when necessary. test=develop
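The handling of -1 in the sections list described in this commit can be illustrated with a minimal sketch. This is not the code from the commit: the function name `resolve_sections` and its parameters are hypothetical, and the rule that at most one entry may be -1 is an assumption made for the illustration. The idea shown is that a -1 entry is replaced by whatever length remains along the split axis, and that the sections are checked against the input dimension.

```python
def resolve_sections(dim_size, sections):
    """Replace a single -1 entry so that the sections sum to dim_size.

    Hypothetical helper for illustration only; not the Paddle implementation.
    """
    unknown = [i for i, s in enumerate(sections) if s == -1]
    if len(unknown) > 1:
        # Assumption: only one section length may be left unspecified.
        raise ValueError("at most one section may be -1")

    known_sum = sum(s for s in sections if s != -1)
    resolved = list(sections)

    if unknown:
        if known_sum > dim_size:
            raise ValueError("known sections exceed the input dimension")
        # The unspecified section takes the remaining length.
        resolved[unknown[0]] = dim_size - known_sum
    elif known_sum != dim_size:
        # Mirrors the "sections must match Input(X)" check in spirit.
        raise ValueError("sections do not sum to the input dimension")

    return resolved
```

For example, splitting an axis of length 10 with sections `[2, -1, 3]` would resolve the middle section to 5 under these assumptions.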
  23. 27 Sep, 2019 1 commit
  24. 11 Apr, 2019 1 commit
  25. 23 Oct, 2018 1 commit
    • Refine Split op (#13967) · a7497653
      chengduo committed
      * speedup split_op
      test=develop
      
      * speedup split_op
      test=develop
      
      * rename ConcatGrad to Split
      
      * refine concat and split
      test=develop
      
      * fix compile error
  26. 14 Jun, 2018 1 commit
  27. 08 May, 2018 1 commit
    • Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang committed
      Do not use ctor
      
      * Reduce lines of code.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
  28. 16 Apr, 2018 1 commit
  29. 12 Apr, 2018 1 commit
    • remove net op and cond_op (#9663) · b26f5050
      Yang Yang(Tony) committed
      * remove net op and cond_op
      
      * fix cpplint
      
      * fix dependency
      
      * delete backward_test; fix compile
      
      * disable batch_norm backward
      
      * rm test_net.py
      
      * make batchnorm test independent of backward.cc
      
      * make test_layer_norm_op independent of backward.cc
      
      * make test_layer_norm_op independent of backward.cc
      
      * delete unused code
      
      * clean up
  30. 12 Feb, 2018 1 commit
  31. 10 Feb, 2018 2 commits
  32. 15 Jan, 2018 1 commit
  33. 21 Dec, 2017 1 commit
  34. 20 Dec, 2017 1 commit
  35. 04 Nov, 2017 1 commit
  36. 27 Oct, 2017 1 commit
    • Gradient check use graph (#5027) · be00b0c4
      Yu Yang committed
      * Simplify Gradient Check
      
      * Stash
      
      * Extract apply_backward_pass to backward.py
      
      Rename apply_backward_pass to append_backward_ops
      
      * Use graph API to check gradient
      
      * Fix ci
      
      * Fix CI
      
      * Fix backward for double precision
      
      * Stash
      
      * Fix CI
      
      * Fix ci
      
      * Ignore GRU test
      
      * Ignore xe op
      
      * Fix CI
      
      * Fix softmax with xe gradient
      
      The correct equation should be IG = OG * (d_softmax_with_xe())
      
      * Fix typo
      
      * Fix merge error
      
      * Disable LRN
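The relation IG = OG * (d_softmax_with_xe()) quoted in this commit is the chain rule: the input gradient is the upstream (output) gradient multiplied by the local derivative of softmax with cross-entropy. A minimal sketch follows, assuming a single sample with a hard integer label, for which the local derivative with respect to the logits is the well-known `softmax(logits) - one_hot(label)`. All function names here are hypothetical and are not taken from the commit.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_xe_loss(logits, label):
    """Cross-entropy of softmax(logits) against a hard integer label."""
    return -math.log(softmax(logits)[label])


def softmax_xe_input_grad(logits, label, out_grad):
    """IG = OG * d(loss)/d(logits), where d(loss)/d(logits) = softmax - one_hot."""
    probs = softmax(logits)
    return [
        out_grad * (p - (1.0 if i == label else 0.0))
        for i, p in enumerate(probs)
    ]
```

Omitting the multiplication by `out_grad` gives the right answer only when the upstream gradient happens to be 1, which is why a gradient check with a non-unit upstream gradient would expose the mistake.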
  37. 17 Oct, 2017 1 commit
  38. 07 Oct, 2017 1 commit
  39. 28 Sep, 2017 1 commit