1. 04 1月, 2023 1 次提交
    • H
      [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
      HongyuJia 提交于
      * execute use kernel_key first
      
      * change OpKernelType->KernelKey
      
      * fix py3 compile error, remove redundant header files
      
      * fix build_strategy_test
      
      * fix DataType::RAW
      
      * fix custom_type test: operator_test.cc
      
      * fix transform place
      
      * fix backends_are_same_class
      
      * try fix place TransDataDevice
      
      * support all KernelKey
      
      * fix TransformData
      
      * fix place_are_same_class
      
      * fix merge
      
      * fix test_params_no_grad
      
      * fix specific place of GetExpectedKernelType
      
      * fix specific place of GetExpectedKernelType
      
      * fix GetKernelTypeForVar
      
      * fix dtype error
      
      * fix fetch_v2
      
      * change GetKernelTypeForVar
      
      * fix interpreter
      
      * fix typo error
      
      * polish codes
      
      * polish codes
      
      * polish codes
      
      * fix conflict
      4383494f
  2. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  3. 19 9月, 2022 1 次提交
  4. 26 6月, 2022 1 次提交
  5. 05 6月, 2022 1 次提交
  6. 16 3月, 2022 1 次提交
  7. 15 3月, 2022 1 次提交
    • C
      [Phi] Move gather op kernel into phi (#40500) · 0c703fe7
      Chen Weihang 提交于
      * add phi gather kernel
      
      * update year
      
      * remove original gather opkernel
      
      * add gather grad phi kernels
      
      * remove origin gather grad kernel
      
      * fix failed npu and xpu
      
      * fix xpu compile failed
      0c703fe7
  8. 01 3月, 2022 1 次提交
    • Z
      [bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332
      zhangbo9674 提交于
      * add scale gather sum
      
      * refine CUDA_ATOMIC_WRAPPER ADD for bf16
      
      * add gather unittest
      
      * solve conflict
      
      * add scale uinttest
      
      * add sum unittest
      
      * solve conflict
      
      * refine gather unittest
      
      * refine unittest
      6d26b332
  9. 20 2月, 2022 1 次提交
  10. 19 2月, 2022 1 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
  11. 03 9月, 2021 1 次提交
    • 沉潜的鱼儿's avatar
      Gather op (#35353) · ef7bc367
      沉潜的鱼儿 提交于
      * add lookup_table_v2_op AsExtra
      
      * add grap attr asextra
      
      * add gather_op asExtra()
      ef7bc367
  12. 11 6月, 2021 1 次提交
  13. 16 11月, 2020 1 次提交
  14. 11 11月, 2020 2 次提交
  15. 09 11月, 2020 1 次提交
  16. 16 9月, 2020 1 次提交
  17. 28 8月, 2020 1 次提交
  18. 23 8月, 2020 1 次提交
  19. 27 5月, 2020 1 次提交
  20. 18 5月, 2020 1 次提交
  21. 14 5月, 2020 1 次提交
  22. 25 3月, 2020 1 次提交
  23. 09 3月, 2020 1 次提交
  24. 31 10月, 2019 1 次提交
    • H
      GradMaker for dygraph (#19706) · 8c4573a3
      hong 提交于
      * refactor dygraph,test=develop
      
      * fix failed unittest,test=develop
      
      * polish code,test=develop
      
      * check windows ci error,test=develop
      try to fix windows ci error by np.allclose,test=develop
      
      * polish vlog and profiler, test=develop
      
      * try to fix preceding ops order,test=develop
      
      * test transformer in windows ci, test=develop
      
      * use python c-api to speed up tracer.trace,test=develop
      
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, add ut for debug string and gradient_accumulator
      
      * test=develop, add tests for layer/gradient_accumulator/prepared_op
      
      * test=develop, fix complie error for test_prepared_op
      
      * test=develop, add more ut for dygraph
      
      * test=develop, create API.spec for dygraph api change
      
      * optimize grad maker; test=develop
      
      * optimize grad maker
      
      * test
      
      * grad make optim; test=develop
      
      * fix unittest bugs; test=develop
      
      * add dygraph grad op maker and split_op
      
      * grad op maker refactor; test=develop
      
      * add dygraph grad maker; test=develop
      
      * fix op deformable_conv_v1_op bug; test=develop
      
      * fix deformable_conv prroi pool bugs;
      
      * fix new op grad op maker bug; test=develop
      
      * fix split by ref bug; test=develop
      
      * fix dygraph auto prune bug; test=develop
      
      * fix test_trace bug; test=develop
      
      * fix fused emb seq pool bug; test=develop
      
      * remove useless code in op_desc file; test=develop
      
      * remove useless code, StrVarBaseNode; test=develop
      
      * fix review issues; test=develop
      
      * fix rank_loss grad maker; test=develop
      
      * remove flag in VarBase; test=develop
      
      * fix distributed_notify_op compile bug ; test=develop
      
      * fix reshape op double grad; test=develop
      
      * fix expand as op; test=develop
      
      * add impertive type_defs.h for demo_train; test=develop
      
      * fix inference lib cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix infernce_lib; test=develop
      
      * fix inference cmake; test=develop
      
      * fix inference lib; test=develop
      
      * fix inference lib; test=develop
      
      * remove condition dygraph grad maker, modify local name; test=develop
      
      * fix split grad maker bug; test=develop
      
      * fix pyramid_op bug; test=develop
      
      * change travis time out limit; test=develop
      
      * restore travis; test=develop
      
      * change timeout limit; test=develop
      8c4573a3
  25. 28 10月, 2019 1 次提交
  26. 12 6月, 2019 1 次提交
    • W
      Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3
      wawltor 提交于
      * test=develop
      The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.
      
      * test=develop
      Fix some code format problem, and the same time add the test case in gather and scatter op
      8eb134c3
  27. 25 3月, 2019 1 次提交
  28. 23 1月, 2019 1 次提交
  29. 12 12月, 2018 1 次提交
  30. 12 11月, 2018 1 次提交
    • Y
      Fix gather & stack op (#14355) · bd294378
      Yibing Liu 提交于
      * Add int type support for stack_op
      
      * Improve gather op to support index with shape N x 1
      
      test=develop
      
      * Fix stack_op kernel's registry
      
      test=develop
      bd294378
  31. 31 10月, 2018 1 次提交
  32. 30 8月, 2018 1 次提交
  33. 01 6月, 2018 1 次提交
  34. 08 5月, 2018 1 次提交
    • Y
      Clean OpProtoAndCheckerMaker · 0e78cb69
      Yu Yang 提交于
      Do not use ctor
      
      * Reduce line of codes.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
      0e78cb69
  35. 19 4月, 2018 1 次提交
  36. 17 4月, 2018 1 次提交
  37. 12 2月, 2018 1 次提交
  38. 10 2月, 2018 2 次提交