1. 13 1月, 2023 1 次提交
  2. 30 12月, 2022 1 次提交
  3. 15 12月, 2022 1 次提交
    • Y
      修改了API文档的相关内容 (#49055) · b89cea33
      yuchen202 提交于
      * 修改了API文档的相关内容
      
      对weight_norm进行修改
      
      * Update python/paddle/profiler/utils.py
      
      * Update python/paddle/utils/cpp_extension/cpp_extension.py
      
      * Update python/paddle/device/__init__.py
      
      * Update python/paddle/device/__init__.py
      
      * test=document_fix
      
      * for Hyperlink; test=document_fix
      
      * Update dlpack.py
      
      * test=document_fix
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      b89cea33
  4. 07 12月, 2022 1 次提交
    • Z
      修改了英文API文档 (#48219) · 4aad4dc5
      Zman 提交于
      * 修改paddle.nn.dynamic_decode,paddle.nn.functional.diag_embed 示例
      
      * mma qk tensor_core (#48087)
      
      * use mma for QK dot computing in fused_multi_transformer.
      * Update fused_multi_transformer_op.cu.h
      
      * remove lrn which is not used in paddle 2.0 (#47945)
      
      * replace scatter_nd and scatter_nd_add with paddle.scatter_nd and (#47960)
      
      paddle.scatter_nd_add
      
      * [PHI] Migrate mul_grad kernel (#48061)
      
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      
      * init
      
      * ExecuteMatMulV2
      
      * rm fluid kernel
      
      * matmul_grad
      
      * remove mutable_data
      
      * mul_grad
      
      * delete unnecessary shape and slice op (#48112)
      
      * 修改英文文档。
      
      * 修改segment operator等英文文档。
      
      * 重新修改了paddle.einsum,paddle.unique_consecutive,
      paddle.disable_signal_handler的英文文档格式。
      
      * 重新修改了英文文档格式。;test=docs_preview
      
      * Update extension.py
      
      * 重新修改了英文文档格式。;test=docs_preview
      
      * 重新修改了英文文档格式。
      待验收:
      - paddle.linalg.svd
      - paddle.nn.functional.diag_embed
      - paddle.set_grad_enabled
      - paddle.disable_signal_handler
      - paddle.cumprod
      - paddle.devaice.cuda.stream_guard
      
      待修改:
      - paddle.nn.dynamic_decode
      - paddle.einsum
      - paddle.unique_consecutive
      - paddle.linalg.svd
      - paddle.uncubate.segment_min
      - paddle.uncubate.segment_max
      - paddle.uncubate.segment_sum
      - paddle.uncubate.segment_mean
      
      ;test=docs_preview
      
      * 重新修改了英文文档格式。
      待验收:
      - paddle.linalg.svd
      - paddle.nn.functional.diag_embed
      - paddle.set_grad_enabled
      - paddle.disable_signal_handler
      - paddle.cumprod
      - paddle.devaice.cuda.stream_guard
      - paddle.nn.dynamic_decode
      - paddle.unique_consecutive
      - paddle.linalg.svd
      
      待修改:
      - paddle.einsum
      - paddle.incubate.segment_min
      - paddle.incubate.segment_max
      - paddle.incubate.segment_sum
      - paddle.incubate.segment_mean
      
      ;test=docs_preview
      
      * 重新修改了英文文档格式。
      待验收:
      - paddle.linalg.svd
      - paddle.nn.functional.diag_embed
      - paddle.set_grad_enabled
      - paddle.disable_signal_handler
      - paddle.cumprod
      - paddle.devaice.cuda.stream_guard
      - paddle.nn.dynamic_decode
      - paddle.unique_consecutive
      - paddle.linalg.svd
      
      待修改:
      - paddle.einsum
      - paddle.incubate.segment_min
      - paddle.incubate.segment_max
      - paddle.incubate.segment_sum
      - paddle.incubate.segment_mean
      
      ;test=docs_preview
      
      * update
      
      * test=docs_preview
      
      * update formula; test=docs_preview
      
      * update formula; test=docs_preview
      
      * remove this operator; test=docs_preview
      
      * add hyper link; test=docs_preview
      
      * add default value; test=docs_preview
      
      * update format; test=docs_preview
      
      * empty commit; test=docs_preview
      
      * fix codestyle issues; test=docs_preview
      
      * empty commit; test=docs_preview
      Co-authored-by: Nlzy <569782149@qq.com>
      Co-authored-by: NVvsmile <450864116@qq.com>
      Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>
      Co-authored-by: NRichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com>
      Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
      Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
      4aad4dc5
  5. 29 11月, 2022 1 次提交
  6. 10 11月, 2022 1 次提交
    • J
      XPU multi-card support eager mode (#47445) · 3b91f8f3
      james 提交于
      * XPU support eager mode
      
      * add unittest for XPU eager mode
      
      * minor bugfix
      
      * minor bugfix, test=kunlun
      
      * correct copyright info
      
      * 1. remove unsed vars/funcs
      2. ProcessGroupBKCL inherit from ProcessGroupStream
      
      * bugfix for fp16 in eager mode multi-card, test=kunlun
      
      * rebase & fix a few issues
      
      * use new processgroup interface, test=kunlun
      
      * fix compile issue, test=kunlun
      3b91f8f3
  7. 08 11月, 2022 1 次提交
  8. 04 11月, 2022 1 次提交
  9. 23 10月, 2022 1 次提交
  10. 12 10月, 2022 1 次提交
  11. 22 9月, 2022 1 次提交
    • Fix the En docs (delete some expression like 'This OP') (#46165) · 3a928a8c
      张春乔 提交于
      * 1. Delete some expression like 'This Op'
      2. remove import numpy as np
      
      * test=document_fix
      
      * fix eg; test=document_fix
      
      * fix 'import numpy' cases; test=document_fix
      
      * fix 'import numpy' cases; test=document_fix
      
      * fix some docs; test=document_fix
      
      * delete raise; test=document_fix
      
      * add some introduction; test=document_fix
      
      * add some introduction; test=document_fix
      
      * test=document_fix
      
      * Fix ’note‘ format; test=document_fix
      
      * Fix Returns of cholesky; test=document_fix
      
      * Fix Example format; test=document_fix
      
      * Fix det; test=document_fix
      
      * Fix eig; test=document_fix
      
      * Fix eigh; test=document_fix
      
      * Fix eigh; test=document_fix
      
      * Apply suggestions from code review;test = document_fix
      Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
      
      * Apply suggestions from code review;test = document_fix
      Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
      
      * Apply suggestions from code review;test = document_fix
      Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
      
      * test=document_fix
      
      * test=document_fix
      
      * KLDiv;test=document_fix
      
      * norm example code; test=document_fix
      
      * revert python/paddle/fluid/**/*
      
      * revert python/paddle/distributed/spawn.py
      
      * revert python/paddle/fluid/*
      
      * fix a `Note` format
      
      * Fix inv; test=document_fix
      
      * Fix lu; test=document_fix
      
      * Fix lu_unpack; test=document_fix
      
      * Fix matrix_power; test=document_fix
      
      * Fix multi_dot; test=document_fix
      
      * Fix solve; test=document_fix
      Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
      3a928a8c
  12. 14 9月, 2022 1 次提交
  13. 29 8月, 2022 1 次提交
    • Z
      [new_exe] Dy2Static support new_executor (#44450) · aba1295b
      zhangbo9674 提交于
      * add interpretercore
      
      * refine backward program id
      
      * add code
      
      * refine program
      
      * refine code
      
      * create forward/backward_program by prog2graph2prog method
      
      * test, do not care
      
      * refine code
      
      * refine code
      
      * refine code
      
      * test, do not care
      
      * add interpretorcore
      
      * add scope
      
      * refine scope create method
      
      * add jit for new_exe
      
      * solve conflict
      
      * delete unused code
      
      * polish code
      
      * polish code
      
      * refine scope in inplace
      
      * refine for datatransfer
      
      * refine _rebuild_from_desc
      
      * refine control eager deletion attr
      
      * refine used_for_jit
      
      * refine jit for infer
      
      * op size0 use ori program
      
      * polish code
      
      * refine jit
      
      * refine run_program_op ut
      
      * refine inplace
      
      * refine control
      
      * refine graph helper
      
      * refine control
      
      * refine inplace
      
      * refine buffer_share_inplace_pass
      
      * polish code
      
      * polish code
      
      * refine usage for compilerProgram
      
      * refine control
      
      * test
      
      * test core cache
      
      * refine code
      
      * refine io.py
      
      * increase test_seq2seq timeout
      
      * refine convert program
      
      * refine interpretercore_cache release
      
      * delete buildinplace
      
      * refine partial_program && io
      
      * refine code for io
      
      * test
      
      * test
      
      * test
      aba1295b
  14. 14 7月, 2022 1 次提交
  15. 27 6月, 2022 1 次提交
  16. 16 6月, 2022 1 次提交
  17. 14 6月, 2022 1 次提交
  18. 07 6月, 2022 1 次提交
  19. 05 6月, 2022 1 次提交
    • S
      【code format check upgrade】 step2:yapf (#42944) · a072fca8
      Sing_chan 提交于
      * use yapf to format all python file
      
      * yapf exclude two unittests file for they rely on writing and reading file, and format will break them
      
      * disable diff_py_file because too many diff files cause command following failed
      a072fca8
  20. 02 6月, 2022 1 次提交
  21. 27 5月, 2022 1 次提交
  22. 30 3月, 2022 1 次提交
    • F
      Add new APIs for GPU memory monitoring (max_memory_allocated,... · afe02e9d
      From00 提交于
      Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657)
      
      * Add new API memory_reserved
      
      * Add memory_allocated, max_memory_reserved and max_memory_allocater
      
      * Fix CI error
      
      * Fix CI error
      
      * Enhance UT
      
      * Add FLAGS_memory_stats_opt
      
      * Add STATS macro functions
      
      * Add StatAllocator
      
      * Fix CI errors
      
      * Add UT
      
      * Fix CI errors
      afe02e9d
  23. 15 2月, 2022 1 次提交
    • R
      [PluggableDevice] Add custom runtime support (#38740) · 3e7825f3
      ronnywang 提交于
      * [CustomRuntime] Add DeviceManager
      
      * [CustomRuntime] Add DeviceInterface
      
      * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager
      
      * [CustomRuntime] Add plug-in device
      
      * [CustomRuntime] Memory module support PluggableDevice
      
      * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option
      
      * update
      
      * [API] update API doc based on comments, test=develop
      Co-authored-by: Nqili93 <qili93@qq.com>
      3e7825f3
  24. 31 12月, 2021 1 次提交
  25. 27 12月, 2021 1 次提交
  26. 09 12月, 2021 1 次提交
  27. 07 12月, 2021 1 次提交
  28. 09 11月, 2021 1 次提交
  29. 28 10月, 2021 1 次提交
  30. 29 9月, 2021 3 次提交
    • Z
      Add basic support for CUDA Graph (#36190) · 21b93c3d
      Zeng Jinle 提交于
      * add basic support for CUDA Graph
      
      * fix ci compile error
      
      * fix LOG print, fix windows CI
      
      * follow comments and update
      
      * small fix for default ctor
      
      * fix rocm compile error
      
      * fix CPU compile error
      21b93c3d
    • H
      Add op paddle.device.cuda.get_device_name and paddle.device.cuda.get_device_capability. (#35672) · f703558d
      hlygit66666 提交于
      * add op paddle.device.cuda.get_device_name
      
      * fix some bugs
      
      * fix some bugs
      
      * fix error message bugs
      
      * fix en docs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * add error message test case
      
      * add get_device_name and get_device_capability
      
      * fix review
      
      * fix docs bug
      
      * fix docs
      
      * fix docs
      f703558d
    • Y
      fix paddle.device.cuda.get_device_properties doc (#36178) · 6d4435ac
      Yanxing Shi 提交于
      * Initial Commit
      
      * add unittest and add error information
      
      * modify doc
      
      * fix some error
      
      * fix some word
      
      * fix bug cudaDeviceProp* and modify error explanation
      
      * fix cudaDeviceProp* error and unnitest samples
      
      * fix hip error and PADDLE_WITH_HIP
      
      * update style
      
      * fix error is_compiled_with_cuda
      
      * fix paddle.device.cuda.get_device_properties
      
      * fix error for multi thread safe
      
      * update style
      
      * merge conflict
      
      * modify after mentor review
      
      * update style
      
      * delete word
      
      * fix unittest error for windows
      
      * support string input and modify some code
      
      * modify doc to support string input
      
      * fix error for express information
      
      * fix error for express information
      
      * fix unnitest for windows
      
      * fix device.startswith('gpu:')
      
      * format error and doc
      
      * fix after review
      
      * format code
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix py2 error
      
      * fix wrong words and doc
      
      * fix _gpuDeviceProperties
      
      * test=document_fix
      6d4435ac
  31. 28 9月, 2021 1 次提交
    • Y
      Add paddle.device.cuda.get_device_properties (#35661) · 4cbed9e5
      Yanxing Shi 提交于
      * Initial Commit
      
      * add unittest and add error information
      
      * modify doc
      
      * fix some error
      
      * fix some word
      
      * fix bug cudaDeviceProp* and modify error explanation
      
      * fix cudaDeviceProp* error and unnitest samples
      
      * fix hip error and PADDLE_WITH_HIP
      
      * update style
      
      * fix error is_compiled_with_cuda
      
      * fix paddle.device.cuda.get_device_properties
      
      * fix error for multi thread safe
      
      * update style
      
      * merge conflict
      
      * modify after mentor review
      
      * update style
      
      * delete word
      
      * fix unittest error for windows
      
      * support string input and modify some code
      
      * modify doc to support string input
      
      * fix error for express information
      
      * fix error for express information
      
      * fix unnitest for windows
      
      * fix device.startswith('gpu:')
      
      * format error and doc
      
      * fix after review
      
      * format code
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix py2 error
      
      * fix wrong words and doc
      
      * fix _gpuDeviceProperties
      4cbed9e5
  32. 15 9月, 2021 2 次提交
  33. 14 9月, 2021 1 次提交
    • C
      Add api paddle.device.cuda.empty_cache to release idle gpu memory hold by allocator。 (#35427) · 83932715
      chenenquan 提交于
      * Add empty_cache api to release idle gpu memory hold by allocator,test=develop
      
      * Add empty_cache api to release idle gpu memory hold by allocator,test=develop
      
      * Add empty_cache api to release idle gpu memory hold by allocator,test=develop
      
      * Fix test coverage problem for empty_cache
      
      * delete redundant check for empty_cache
      
      * fix the problem of empty_cache's doc
      
      * delete the nvidia-smi comment in doc of empty_cache, test=document_fix
      83932715
  34. 23 8月, 2021 1 次提交
  35. 19 7月, 2021 1 次提交
    • C
      Add Cuda event and stream API (#32460) · 9c7f6af5
      chentianyu03 提交于
      * add cuda event and stream api
      
      * add cuda event and stream api
      
      * add get_current_stream api
      
      * add get_current_stream api
      
      * init streams
      
      * modify get_current_stream
      
      * modify get_cuttent_stream
      
      * add synchronize func
      
      * add current_stream doc and test file
      
      * move get_current_stream into CUDA macro
      
      * move CudaEvent into CUDA macro
      
      * move _get_current_stream and _device_synchronize into cuda macro
      
      * modify the macro of cuda stream and event
      
      * add test case for synchronize
      
      * add paddle.devices.cuda module
      
      * event and stream support hip
      
      * add doc for stream and event class
      
      * move cuda stream and event into single pybind
      
      * add cuda_streams_py.cc to cmakelist
      
      * add _device_synchronize and _get_current_stream to core module
      
      * add test case for cudastream and cudaevent
      
      * move __all__ in streams.py
      
      * fix test fail
      
      * add cuda to devices __all__
      
      * fix current_stream doc writing error
      
      * move devices to device direction, and merge device.py into __init__.py
      
      * add required:gpu to sample codes
      
      * remove cuda direction from device/__init__.py
      9c7f6af5