1. 03 Feb, 2023 1 commit
    • 【Prim】Blacklist bwd comp (#50148) · cc8a7858
      Jiabin Yang committed
      * refactor dir for prim
      
      * support blacklist for bwd comp
      
      * fix type error
      
      * remove additional file
      
      * fix git ignore
      
      * add more test
      
      * merge develop
  2. 02 Feb, 2023 1 commit
    • 【PRIM】Support use operator's output metadata info in constructing static... · d8643cb6
      Xiaoxu Chen committed
      【PRIM】Support use operator's output metadata info in constructing static backward composite (#50043)
      
      * [prim] support custom target_gradients
      
      * support infershape after append one gradop
      
      * [prim] add simple net test
      
      * fix test_loop segmentation fault bug

      * [prim] fix infer shape segmentation fault bug when output of grad_op_desc is empty
  3. 31 Jan, 2023 1 commit
  4. 20 Jan, 2023 1 commit
  5. 17 Jan, 2023 2 commits
  6. 09 Jan, 2023 1 commit
    • Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang committed
      * prototype of composite grad in paddle

      * prototype of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify ignore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw exception when neither GradOpMaker nor GradCompOpMaker has been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singleton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove additional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: zyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: wangruting <wangruting@baidu.com>
      Co-authored-by: cxxly <chenxx_id@163.com>
      Co-authored-by: charles-hit <wanghao107@baidu.com>
      Co-authored-by: xiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
  7. 10 Nov, 2022 1 commit
    • XPU multi-card support eager mode (#47445) · 3b91f8f3
      james committed
      * XPU support eager mode
      
      * add unittest for XPU eager mode
      
      * minor bugfix
      
      * minor bugfix, test=kunlun
      
      * correct copyright info
      
      * 1. remove unused vars/funcs
      2. ProcessGroupBKCL inherits from ProcessGroupStream
      
      * bugfix for fp16 in eager mode multi-card, test=kunlun
      
      * rebase & fix a few issues
      
      * use new processgroup interface, test=kunlun
      
      * fix compile issue, test=kunlun
  8. 03 Nov, 2022 1 commit
  9. 25 Oct, 2022 1 commit
    • [CodeStyle][py2] remove `paddle.compat` (#47269) · ea8e87fa
      Nyakku Shigure committed
      * [CodeStyle][py2] remove `paddle.compat`
      
      * remove compat from `paddle.__init__`
      
      * enable_static in sample code
      
      * Revert "enable_static in sample code"
      
      This reverts commit ffccaa633900154ea5f3d056e746aae9a1927399.
      
      * enable_static in sample code
  10. 23 Oct, 2022 1 commit
  11. 20 Oct, 2022 1 commit
  12. 11 Oct, 2022 1 commit
  13. 27 Sep, 2022 1 commit
  14. 26 Sep, 2022 1 commit
    • Fix libpaddle soname mismatch error (#46344) · cdcc0013
      Chen Weihang committed
      * fix libpaddle soname mismatch error
      
      * fix windows failed
      
      * polish linux and windows make impl
      
      * unify windows lib name
      
      * fix windows error
      
      * revert copy dst change
      
      * revert naming change
      
      * revert windows change
      
      * fix gpups compile failed
  15. 16 Sep, 2022 1 commit
    • Unify core avx and core_noavx to libpaddle (#46095) · 267d71a4
      Chen Weihang committed
      * unify core_avx and core_noavx
      
      * fix except error
      
      * revert mac compile logic
      
      * revert dylib to so
      
      * add core_noavx branch
      
      * remove core_noavx
      
      * replace paddle_core by libpaddle
      
      * polish var name
      
      * replace paddle_core by libpaddle
      
      * update custom device commit
      
      * polish code by comments
  16. 05 Jun, 2022 1 commit
    • 【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Sing_chan committed
      * use yapf to format all python files

      * exclude two unittest files from yapf because they rely on writing and reading files, and formatting would break them

      * disable diff_py_file because too many diff files cause the following command to fail
  17. 25 Mar, 2022 1 commit
    • Refactor Dygraph Flags (#40786) · 3085d5e4
      Jiabin Yang committed
      * refactor eager flags
      
      * fix flags error when we switch from eager to dygraph
      
      * fix ci problem
      
      * fix ci
      
      * fix ci
      
      * merge develop and fix code style
      
      * merge develop and fix code style
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * fix op test error
      
      * merge develop
  18. 08 Mar, 2022 1 commit
    • add python profiler package (#40065) · 10325a82
      chenjian committed
      * add python profiler package
      
      * update according to review
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * add unit test
      
      * Revert "add unit test"
      
      This reverts commit 4e69ff71b0645e069afe5dd8fea0d07717852c48.
      
      * reduce for pr
      
      * add unit test
      
      * modify for pr
      
      * fix unittest
      
      * update for ci coverage
      
      * modify according to review
      
      * fix bug
      
      * improve coverage
  19. 27 Jan, 2022 1 commit
    • [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi committed
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
  20. 18 Jan, 2022 1 commit
  21. 28 Dec, 2021 1 commit
    • Support test basic of Var and Layer (#38426) · 1fb80a6a
      Jiabin Yang committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specify randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * support create varbase and fix retain grad error
      
      * fix windows error
      
      * support test code coverage
      
      * support test code coverage
      
      * support test code coverage
      Co-authored-by: jim19930609 <jim19930609@gmail.com>
      Co-authored-by: Wang Huan <wanghuan29@baidu.com>
  22. 16 Dec, 2021 1 commit
  23. 15 Sep, 2021 1 commit
  24. 27 Jul, 2021 1 commit
  25. 23 Jul, 2021 1 commit
  26. 22 Jul, 2021 2 commits
  27. 19 Jul, 2021 1 commit
    • Add Cuda event and stream API (#32460) · 9c7f6af5
      chentianyu03 committed
      * add cuda event and stream api
      
      * add cuda event and stream api
      
      * add get_current_stream api
      
      * add get_current_stream api
      
      * init streams
      
      * modify get_current_stream
      
      * modify get_current_stream
      
      * add synchronize func
      
      * add current_stream doc and test file
      
      * move get_current_stream into CUDA macro
      
      * move CudaEvent into CUDA macro
      
      * move _get_current_stream and _device_synchronize into cuda macro
      
      * modify the macro of cuda stream and event
      
      * add test case for synchronize
      
      * add paddle.devices.cuda module
      
      * event and stream support hip
      
      * add doc for stream and event class
      
      * move cuda stream and event into single pybind
      
      * add cuda_streams_py.cc to cmakelist
      
      * add _device_synchronize and _get_current_stream to core module
      
      * add test case for cudastream and cudaevent
      
      * move __all__ in streams.py
      
      * fix test fail
      
      * add cuda to devices __all__
      
      * fix current_stream doc writing error
      
      * move devices to device directory, and merge device.py into __init__.py

      * add required:gpu to sample codes

      * remove cuda directory from device/__init__.py
  28. 13 Jul, 2021 1 commit
  29. 06 Jul, 2021 1 commit
  30. 29 Jun, 2021 1 commit
  31. 09 Jun, 2021 1 commit
    • paddle.save support object save to memory. (#32999) · cdd6437a
      WeiXin committed
      * support state_dict save to memory.
      
      * Perfect unittest
      
      * perfect unittest.
      
      * support saving binary var to memory

      * polish code.

      * package save/load files into pybind/io.py
      
      * polish code .
      
      * add example for save to memory; remove useless save load function(_load_static_dict,_save_dygraph_dict)
      
      * delete _load_static/dygraph_dict;_save_static/dygraph_dict
      
      * edit example of paddle.save/load
  32. 07 May, 2021 1 commit
  33. 22 Apr, 2021 1 commit
    • support save/load binary format tensor. (#32211) · f4d9adc7
      WeiXin committed
      * support save/load binary format tensor
      
      * Fix error when create cudaplace
      
      * Fix error when create cudaplace
      
      * Fix error when create cudaplace
      
      * get device context from pool.
      
      * move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'.
      
      * improve coverage.
      
      * improve coverage.
      
      * polish API
      
      * deal with conflict
      
      * disable save/load large file in unittest

      * split unittest.
  34. 31 Mar, 2021 1 commit
  35. 14 Jan, 2021 1 commit
  36. 04 Dec, 2020 1 commit
  37. 24 Nov, 2020 1 commit
    • Upgrade string literals to raw string (#28989) · 3815d7aa
      Leo Chen committed
      * upgrade comment string to raw string
      
      * fix string in
      
      * fix string with ' '
      
      * revert update on comments
      
      * upgrade only necessary
      
      * fix sample code checker
      
      * fix comments with '''
  38. 12 Oct, 2020 1 commit