- 31 1月, 2023 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom device api * update * update * test=document_fix * update * update * add examples
-
- 13 1月, 2023 1 次提交
-
-
由 duanyanhui 提交于
* update get_device to custom * add custom_device api * rm is_compiled_with_custom_device from framework * add todo comments
-
- 30 12月, 2022 1 次提交
-
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
- 15 12月, 2022 1 次提交
-
-
由 yuchen202 提交于
* 修改了API文档的相关内容 对weight_norm进行修改 * Update python/paddle/profiler/utils.py * Update python/paddle/utils/cpp_extension/cpp_extension.py * Update python/paddle/device/__init__.py * Update python/paddle/device/__init__.py * test=document_fix * for Hyperlink; test=document_fix * Update dlpack.py * test=document_fix Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
-
- 07 12月, 2022 1 次提交
-
-
由 Zman 提交于
* 修改paddle.nn.dynamic_decode,paddle.nn.functional.diag_embed 示例 * mma qk tensor_core (#48087) * use mma for QK dot computing in fused_multi_transformer. * Update fused_multi_transformer_op.cu.h * remove lrn which is not used in paddle 2.0 (#47945) * replace scatter_nd and scatter_nd_add with paddle.scatter_nd and (#47960) paddle.scatter_nd_add * [PHI] Migrate mul_grad kernel (#48061) * cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency * init * ExecuteMatMulV2 * rm fluid kernel * matmul_grad * remove mutable_data * mul_grad * delete unnecessary shape and slice op (#48112) * 修改英文文档。 * 修改segment operator等英文文档。 * 重新修改了paddle.einsum,paddle.unique_consecutive, paddle.disable_signal_handler的英文文档格式。 * 重新修改了英文文档格式。;test=docs_preview * Update extension.py * 重新修改了英文文档格式。;test=docs_preview * 重新修改了英文文档格式。 待验收: - paddle.linalg.svd - paddle.nn.functional.diag_embed - paddle.set_grad_enabled - paddle.disable_signal_handler - paddle.cumprod - paddle.devaice.cuda.stream_guard 待修改: - paddle.nn.dynamic_decode - paddle.einsum - paddle.unique_consecutive - paddle.linalg.svd - paddle.uncubate.segment_min - paddle.uncubate.segment_max - paddle.uncubate.segment_sum - paddle.uncubate.segment_mean ;test=docs_preview * 重新修改了英文文档格式。 待验收: - paddle.linalg.svd - paddle.nn.functional.diag_embed - paddle.set_grad_enabled - paddle.disable_signal_handler - paddle.cumprod - paddle.devaice.cuda.stream_guard - paddle.nn.dynamic_decode - paddle.unique_consecutive - paddle.linalg.svd 待修改: - paddle.einsum - paddle.incubate.segment_min - paddle.incubate.segment_max - paddle.incubate.segment_sum - paddle.incubate.segment_mean ;test=docs_preview * 重新修改了英文文档格式。 待验收: - paddle.linalg.svd - paddle.nn.functional.diag_embed - paddle.set_grad_enabled - paddle.disable_signal_handler - paddle.cumprod - paddle.devaice.cuda.stream_guard - paddle.nn.dynamic_decode - paddle.unique_consecutive - paddle.linalg.svd 待修改: - paddle.einsum - paddle.incubate.segment_min - paddle.incubate.segment_max - paddle.incubate.segment_sum - paddle.incubate.segment_mean ;test=docs_preview * update * test=docs_preview * update formula; test=docs_preview * update formula; test=docs_preview * remove this operator; test=docs_preview * add hyper link; test=docs_preview * add default value; test=docs_preview * update format; test=docs_preview * empty commit; test=docs_preview * fix codestyle issues; test=docs_preview * empty commit; test=docs_preview Co-authored-by: Nlzy <569782149@qq.com> Co-authored-by: NVvsmile <450864116@qq.com> Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com> Co-authored-by: NRichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com> Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
-
- 29 11月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* isort all files * revert conflicting files * revert conflicting files * revert conflicting files
-
- 10 11月, 2022 1 次提交
-
-
由 james 提交于
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unsed vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
- 08 11月, 2022 1 次提交
-
-
由 shentanyue 提交于
-
- 04 11月, 2022 1 次提交
-
-
由 shentanyue 提交于
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 12 10月, 2022 1 次提交
-
-
由 Shuangchi He 提交于
[CodeStyle][F401] remove unused imports in python_paddle/inference_device_profiler_text_metric_incubate_quantization_libs_audio_amp_jit. (#46762)
-
- 22 9月, 2022 1 次提交
-
-
由 张春乔 提交于
* 1. Delete some expression like 'This Op' 2. remove import numpy as np * test=document_fix * fix eg; test=document_fix * fix 'import numpy' cases; test=document_fix * fix 'import numpy' cases; test=document_fix * fix some docs; test=document_fix * delete raise; test=document_fix * add some introduction; test=document_fix * add some introduction; test=document_fix * test=document_fix * Fix ’note‘ format; test=document_fix * Fix Returns of cholesky; test=document_fix * Fix Example format; test=document_fix * Fix det; test=document_fix * Fix eig; test=document_fix * Fix eigh; test=document_fix * Fix eigh; test=document_fix * Apply suggestions from code review;test = document_fix Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * Apply suggestions from code review;test = document_fix Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * Apply suggestions from code review;test = document_fix Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com> * test=document_fix * test=document_fix * KLDiv;test=document_fix * norm example code; test=document_fix * revert python/paddle/fluid/**/* * revert python/paddle/distributed/spawn.py * revert python/paddle/fluid/* * fix a `Note` format * Fix inv; test=document_fix * Fix lu; test=document_fix * Fix lu_unpack; test=document_fix * Fix matrix_power; test=document_fix * Fix multi_dot; test=document_fix * Fix solve; test=document_fix Co-authored-by: NNyakku Shigure <sigure.qaq@gmail.com>
-
- 14 9月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* trim trailing whitespace * fix `.cmake-format.py` * revert npu ut changes, avoid npu ci error
-
- 29 8月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add interpretercore * refine backward program id * add code * refine program * refine code * create forward/backward_program by prog2graph2prog method * test, do not care * refine code * refine code * refine code * test, do not care * add interpretorcore * add scope * refine scope create method * add jit for new_exe * solve conflict * delete unused code * polish code * polish code * refine scope in inplace * refine for datatransfer * refine _rebuild_from_desc * refine control eager deletion attr * refine used_for_jit * refine jit for infer * op size0 use ori program * polish code * refine jit * refine run_program_op ut * refine inplace * refine control * refine graph helper * refine control * refine inplace * refine buffer_share_inplace_pass * polish code * polish code * refine usage for compilerProgram * refine control * test * test core cache * refine code * refine io.py * increase test_seq2seq timeout * refine convert program * refine interpretercore_cache release * delete buildinplace * refine partial_program && io * refine code for io * test * test * test
-
- 14 7月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom ccl api * add ut
-
- 27 6月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [CustomDevice]add custom place supports * sync format
-
- 16 6月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 14 6月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 07 6月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 02 6月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* support CUDAGraph for partial graph * add ut * fix ci * fix ut again because of eager mode * fix kunlun ci * fix win ci
-
- 27 5月, 2022 1 次提交
-
-
由 Ruibiao Chen 提交于
* Support memory stats for CPU * Add UTs * Fix typos * Fix typos
-
- 30 3月, 2022 1 次提交
-
-
由 From00 提交于
Add new APIs for GPU memory monitoring (max_memory_allocated, max_memory_reserved, memory_allocated, memory_reserved) (#38657) * Add new API memory_reserved * Add memory_allocated, max_memory_reserved and max_memory_allocater * Fix CI error * Fix CI error * Enhance UT * Add FLAGS_memory_stats_opt * Add STATS macro functions * Add StatAllocator * Fix CI errors * Add UT * Fix CI errors
-
- 15 2月, 2022 1 次提交
-
-
由 ronnywang 提交于
* [CustomRuntime] Add DeviceManager * [CustomRuntime] Add DeviceInterface * [CustomRuntime] Add Stream, Event, DeviceGuard, CallbackManager * [CustomRuntime] Add plug-in device * [CustomRuntime] Memory module support PluggableDevice * [CustomRuntime] Add WITH_PLUGGABLE_DEVICE cmake option * update * [API] update API doc based on comments, test=develop Co-authored-by: Nqili93 <qili93@qq.com>
-
- 31 12月, 2021 1 次提交
-
-
由 fwenguang 提交于
* [MLU]support calling mlu op from python interface * [MLU]fix * fix * [mlu]fix mlu_places * [mlu]fix required mlu * fix * [MLU]fix tensor copy * [mlu] fix MLUPlace call path
-
- 27 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
-
- 09 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
-
- 07 12月, 2021 1 次提交
-
-
由 Huihuang Zheng 提交于
Paddle don't have to set runtime_include_dir during run CINN.
-
- 09 11月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
* try to fix CUDA Graph H2D copy bug * remove useless code * fix ci * fix ROCM CI * fix CUDA_VERSION * improve CI coverage
-
- 28 10月, 2021 1 次提交
-
-
由 Ligoml 提交于
* fix device docs;test=document_fix * update __init__.py
-
- 29 9月, 2021 3 次提交
-
-
由 Zeng Jinle 提交于
* add basic support for CUDA Graph * fix ci compile error * fix LOG print, fix windows CI * follow comments and update * small fix for default ctor * fix rocm compile error * fix CPU compile error
-
由 hlygit66666 提交于
* add op paddle.device.cuda.get_device_name * fix some bugs * fix some bugs * fix error message bugs * fix en docs * fix bugs * fix bugs * fix bugs * add error message test case * add get_device_name and get_device_capability * fix review * fix docs bug * fix docs * fix docs
-
由 Yanxing Shi 提交于
* Initial Commit * add unittest and add error information * modify doc * fix some error * fix some word * fix bug cudaDeviceProp* and modify error explanation * fix cudaDeviceProp* error and unnitest samples * fix hip error and PADDLE_WITH_HIP * update style * fix error is_compiled_with_cuda * fix paddle.device.cuda.get_device_properties * fix error for multi thread safe * update style * merge conflict * modify after mentor review * update style * delete word * fix unittest error for windows * support string input and modify some code * modify doc to support string input * fix error for express information * fix error for express information * fix unnitest for windows * fix device.startswith('gpu:') * format error and doc * fix after review * format code * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix py2 error * fix wrong words and doc * fix _gpuDeviceProperties * test=document_fix
-
- 28 9月, 2021 1 次提交
-
-
由 Yanxing Shi 提交于
* Initial Commit * add unittest and add error information * modify doc * fix some error * fix some word * fix bug cudaDeviceProp* and modify error explanation * fix cudaDeviceProp* error and unnitest samples * fix hip error and PADDLE_WITH_HIP * update style * fix error is_compiled_with_cuda * fix paddle.device.cuda.get_device_properties * fix error for multi thread safe * update style * merge conflict * modify after mentor review * update style * delete word * fix unittest error for windows * support string input and modify some code * modify doc to support string input * fix error for express information * fix error for express information * fix unnitest for windows * fix device.startswith('gpu:') * format error and doc * fix after review * format code * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix error for doc compile * fix py2 error * fix wrong words and doc * fix _gpuDeviceProperties
-
- 15 9月, 2021 2 次提交
-
-
由 Siming Dai 提交于
Fix docs in stream_guard API
-
由 Siming Dai 提交于
Add paddle.cuda.device.stream_guard API
-
- 14 9月, 2021 1 次提交
-
-
由 chenenquan 提交于
* Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Fix test coverage problem for empty_cache * delete redundant check for empty_cache * fix the problem of empty_cache's doc * delete the nvidia-smi comment in doc of empty_cache, test=document_fix
-
- 23 8月, 2021 1 次提交
-
-
由 Linjie Chen 提交于
* Add cuda device count api * update coda format * fix unittest error * update code format * update comment
-
- 19 7月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add cuda event and stream api * add cuda event and stream api * add get_current_stream api * add get_current_stream api * init streams * modify get_current_stream * modify get_cuttent_stream * add synchronize func * add current_stream doc and test file * move get_current_stream into CUDA macro * move CudaEvent into CUDA macro * move _get_current_stream and _device_synchronize into cuda macro * modify the macro of cuda stream and event * add test case for synchronize * add paddle.devices.cuda module * event and stream support hip * add doc for stream and event class * move cuda stream and event into single pybind * add cuda_streams_py.cc to cmakelist * add _device_synchronize and _get_current_stream to core module * add test case for cudastream and cudaevent * move __all__ in streams.py * fix test fail * add cuda to devices __all__ * fix current_stream doc writing error * move devices to device direction, and merge device.py into __init__.py * add required:gpu to sample codes * remove cuda direction from device/__init__.py
-