1. 19 9月, 2022 1 次提交
  2. 30 8月, 2022 1 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
  3. 11 7月, 2022 1 次提交
  4. 07 7月, 2022 1 次提交
    • A
      [IPU] support dy2static for IPU merge code (#43770) · 6984fbca
      Allen Guo 提交于
      * feat(): dynamic_to_static support for ipu.
      
      * fix(): format fix.
      
      * fix format
      
      * fix cpplint error
      
      * use phi::errors
      
      * fix format
      
      * fix format
      
      * fix(): add api to restore patched function.
      
      * fix(): identity_loss uses cpu place as expected kernel type.
      
      * doc(): add IPU dy2static related docs.
      
      * fix(): combine test cases.
      
      * fix format
      
      * fix comment
      
      * fix format
      
      * apply comment
      
      * fix compiling
      
      * fix(): align docs.
      
      * fix(): fix identity_loss function docs.
      
      * fix(): adjust mean and sum in identity_loss.
      
      * fix(): minor docs.
      
      * move API to paddle.incubate.identity_loss
      
      * fix UT
      Co-authored-by: Nzhaorui chen <zhaoruic@graphcore.ai>
      6984fbca
  5. 26 6月, 2022 1 次提交
  6. 23 6月, 2022 1 次提交
  7. 05 6月, 2022 1 次提交
  8. 16 5月, 2022 1 次提交
  9. 28 4月, 2022 1 次提交
  10. 27 4月, 2022 1 次提交
    • Z
      Optimize performance of dygraph (v4) (#42196) · 37e2f027
      zyfncg 提交于
      * optimize performance of dygraph
      
      * optimize performance of dygraph and elementwise_add
      
      * optimize the trace op
      
      * fix bug
      
      * fix bug
      
      * fix unittest bug
      
      * fix code format
      37e2f027
  11. 26 4月, 2022 1 次提交
  12. 19 4月, 2022 1 次提交
  13. 07 4月, 2022 1 次提交
  14. 23 3月, 2022 1 次提交
    • W
      [Eager] Slice (#40587) · b07d239c
      wanghuancoder 提交于
      * fix some slice bug, test=develop
      
      * eager slice, test=develop
      
      * eager slice, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug, test=develop
      
      * refine, test=develop
      
      * rename function name, test=develop
      b07d239c
  15. 15 3月, 2022 2 次提交
  16. 07 3月, 2022 1 次提交
  17. 03 3月, 2022 1 次提交
  18. 28 2月, 2022 2 次提交
  19. 22 2月, 2022 1 次提交
  20. 21 2月, 2022 1 次提交
    • C
      Update record interface using part2 (#39694) · c984cd85
      chenjian 提交于
      * fix RecordEvent interface
      
      * modify default level to 4
      
      * update interface use
      
      * add const default trace level
      
      * update record event interface using
      
      * update record event interface using
      
      * update operator.cc
      
      * update part2
      
      * update part1
      
      * fix include profiler.h header in ps server
      
      * fix include profiler.h header in ps server
      
      * fix profiler.h header
      c984cd85
  21. 20 2月, 2022 1 次提交
  22. 18 2月, 2022 1 次提交
    • Z
      [AMP] support GPU BF16 amp for dygraph (#39029) · 7d6d3848
      zhangbo9674 提交于
      * support dtype param for auto_cast
      
      * add amp_dtype for tracer
      
      * add unsupported bf16 list
      
      * support bf16 amp for O2
      
      * refine python interface for bfloat16
      
      * refine code
      
      * refine code
      
      * refine unittest
      
      * refine code
      
      * refine code
      
      * add bf16 o1
      
      * refine code by comment
      
      * add gradient accumulator
      
      * add recompute
      7d6d3848
  23. 16 2月, 2022 1 次提交
    • J
      EagerTensor to EagerVariable (#39447) · 831fd86e
      Jiabin Yang 提交于
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * refine code
      
      * Rename all EagerTensor to Tensor
      
      * Rename some EagerTensor to Tensor
      
      * rename EagerTensor to EagerVariable
      
      * add more test
      
      * merge develop and refine code
      831fd86e
  24. 15 2月, 2022 1 次提交
    • R
      [PluggableDevice] Add custom runtime support (#38740) · 3e7825f3
      ronnywang 提交于
      * [CustomRuntime] Add DeviceManager
      
      * [CustomRuntime] Add DeviceInterface
      
      * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
      
      * [CustomRuntime] Add plug-in device
      
      * [CustomRuntime] Memory module support PluggableDevice
      
      * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
      
      * update
      
      * [API] update API doc based on comments, test=develop
      Co-authored-by: Nqili93 <qili93@qq.com>
      3e7825f3
  25. 09 2月, 2022 1 次提交
    • J
      Replace EagerTensor with Tensor (#39376) · 945a3ce9
      Jiabin Yang 提交于
      * merge legacy to fluid
      
      * Remove legacy code
      
      * Remove legacy code
      
      * Remove DataType test
      
      * Using Tensor directly instead of using EagerTensor
      
      * support gradient_accumulation
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      
      * make test_imperative_lod_tensor_to_selected_rows longer
      945a3ce9
  26. 02 2月, 2022 1 次提交
  27. 26 1月, 2022 1 次提交
  28. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  29. 15 1月, 2022 1 次提交
  30. 20 12月, 2021 1 次提交
  31. 10 11月, 2021 1 次提交
    • C
      [PTen] Compatible runtime performance optimization (#36946) · 76d2fd1d
      Chen Weihang 提交于
      * resolve conflit with develop
      
      * cache kernel context in tracer for perf up
      
      * replace densetensor when build kernel context
      
      * fix detail compile error
      
      * append impl to static mode
      
      * fix conflit error
      
      * clear attrs after run kernel
      
      * fix coverage failed
      
      * fix cycle compile error
      
      * remove multi-in&out adapt code
      
      * remove tensor meta utils
      
      * clear data when throw exception
      76d2fd1d
  32. 13 10月, 2021 1 次提交
  33. 17 9月, 2021 1 次提交
    • Z
      [AMP] Support pure fp16 training mode for dygraph (#35521) · adaeee4d
      zhangbo9674 提交于
      * add pure fp16 major function in auto_cast & tracer
      
      * support master weight in dygraph for pure fp16
      
      * check mix dtype of fp16&fp32 for check_finite_and_unscale op
      
      * change pure fp16 funtion name
      
      * refine some bug in auto_cast
      
      * refine auto_cast interface logic
      
      * add param _casted_by_pure_fp16 for class Layer
      
      * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator
      
      * refine pure_fp16_decorator as decorator
      
      * add unittest
      
      * add comment
      
      * add comment
      
      * support recompute
      
      * add comment for auto_cast and decorator
      
      * support to_static_state_dict for paddle.jit.save
      
      * unlimite models num and optimizers num
      
      * add lookup_table in black_list
      
      * fix momentum and layer state_dict
      
      * fix bug in layer state_dict
      
      * fix bug in layer state_dict_helper
      
      * refine unittest
      
      * refine test_momentun_op
      
      * refine interface and some code
      
      * refine amp_decorator interface
      
      * refine pure fp16 interface
      
      * refine master weight interface
      adaeee4d
  34. 05 8月, 2021 1 次提交
  35. 30 6月, 2021 1 次提交
  36. 24 6月, 2021 1 次提交
    • H
      [NPU] support dygraph execution on npu place(#33579) · 6aea6be2
      houj04 提交于
      * in NPU environment, use CPUPlace for missing operators.
      
      * in NPU environment, use CPUPlace for missing operators.
      
      * fix TensorCopy bug and add unit test.
      
      * fix code style.
      
      * add more unit tests.
      6aea6be2
  37. 23 6月, 2021 1 次提交
    • W
      optimize attr default value (#33357) · 5d2eb678
      wanghuancoder 提交于
      * optimize attr default value, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix bug in AttrReader, test=develop
      
      * fix bug, test=develop
      
      * fix double_grad, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix checker null, test=develop
      
      * for test, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      5d2eb678
  38. 28 4月, 2021 1 次提交