- 02 2月, 2023 1 次提交
-
-
由 Xiaoxu Chen 提交于
【PRIM】Support use operator's output metadata info in constructing static backward composite (#50043) * [prim] support custom target_gradients * support infershape after append one gradop * [prim] add simple net test * fix test_loop segment fault bug * [prim] fix infer shape segment fault bug when output of grad_op_desc is empty
-
- 31 1月, 2023 1 次提交
-
-
由 ronnywang 提交于
* [CustomDevice] add custom device api * update * update * test=document_fix * update * update * add examples
-
- 20 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
-
- 17 1月, 2023 2 次提交
-
-
由 zhangbo9674 提交于
* refine munmap freq for ref_cnt_mmap_allocator * add shm reuse logic * fix compile bug * fix compile bug * fix bug of file refcount * fix compile bug * fix compile bug * refine code for delete shm case * polish code * refine shm cache pool size setting logic * set buffer is 2 * refine shm cache size logic * refine max shm cache * refine shm cache size
-
由 cyber-pioneer 提交于
* support @to_static+to_prime+cinn * fix code logic * debug4 * debug5 * debug6 * debug7 * debug 8 * debug 9 * debug10 * debug11 * debug11 * debug 12 Co-authored-by: NAurelius84 <zhangliujie@baidu.com>
-
- 09 1月, 2023 1 次提交
-
-
由 Jiabin Yang 提交于
* proto type of composite grad in paddle * proto type of composite grad in paddle * refactor composite api with phi * fix compile error * support static graph code-gen for squeeze op * generate static graph code of unsqueeze * refine op name * fix compile error * add extra output in op_compat * remove debug log * fix clang compile error * support prim switch flag * support prim switch flag * fix dygraph error * merge develop * add code_gen * add necessary files without codegen * fix code_gen bug * add deps * modify igmnore * add ignore * delete std cout * add composite logic for backward.py * add tanh first order grad composite * support enable_prim flag for static graph * throw expection when both GrapOpMaker and GradCompOpMaker not been registered * reorganize the directory of prim api tests * fix windows error * add eager_utils * add eager_utils * modify code gen * add composite parse * add unittest for get_grad_op_desc * code optimize * fix static test on windows * support generate static graph code for imag and real op * fix windows compile error in test_static_prim * merge develop * disable test eager in inference * prim code gen * disable eager compile in inference * rm other file * rm gitignore file * code_style * add eager test * code_style * merge develop * remove useless files * modify static test * support bool flag from singlton * merge develop * recover git ignore * fix conflict * recover git ignore for generated op * fix test compile error * remove some tests * add python test * fix some name issue * add composite code gen * modify backward yaml * fix static composite grad maker code gen * remove addtional files * add some static funcs unit test * fix some bugs * fix composite grad maker register code gen * optimize some functions Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com> Co-authored-by: Nwangruting <wangruting@baidu.com> Co-authored-by: Ncxxly <chenxx_id@163.com> Co-authored-by: Ncharles-hit <wanghao107@baidu.com> Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
-
- 10 11月, 2022 1 次提交
-
-
由 james 提交于
* XPU support eager mode * add unittest for XPU eager mode * minor bugfix * minor bugfix, test=kunlun * correct copyright info * 1. remove unsed vars/funcs 2. ProcessGroupBKCL inherit from ProcessGroupStream * bugfix for fp16 in eager mode multi-card, test=kunlun * rebase & fix a few issues * use new processgroup interface, test=kunlun * fix compile issue, test=kunlun
-
- 03 11月, 2022 1 次提交
-
-
由 zlsh80826 提交于
-
- 25 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle][py2] remove `paddle.compat` * remove compat from `paddle.__init__` * enable_static in sample code * Revert "enable_static in sample code" This reverts commit ffccaa633900154ea5f3d056e746aae9a1927399. * enable_static in sample code
-
- 23 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* update config * re-blacken python code * temporarily disable date and diff_py_file * skip a format
-
- 20 10月, 2022 1 次提交
-
-
由 JingZhuangzhuang 提交于
* add _get_phi_kernel_name interface * remove inference interface * Revert "remove inference interface" This reverts commit 784a8a6c51fa2dc49a01c8699525298ac21b178f.
-
- 11 10月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
[CodeStyle] remove compat module (long_type, int_type, get_exception_message, floor_division) (#46686)
-
- 27 9月, 2022 1 次提交
-
-
由 Nyakku Shigure 提交于
* [CodeStyle] remove all future import * revert test_error.py * restore future import in example code
-
- 26 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* fix libpaddle soname mismatch error * fix windows failed * polish linux and windows make impl * unify winddows lib name * fix windows error * revert copy dst change * revert naming change * revert windows change * fix gpups compile failed
-
- 16 9月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* unify core_avx and core_noavx * fix except error * revert mac compile logic * revert dylib to so * add core_noavx branch * remove core_noavx * replace paddle_core by lib paddle * polish var name * replace paddle_core by libpaddle * update custom device commit * polish code by comments
-
- 05 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* use yapf to format all python file * yapf exclude two unittests file for they rely on writing and reading file, and format will break them * disable diff_py_file because too many diff files cause command following failed
-
- 25 3月, 2022 1 次提交
-
-
由 Jiabin Yang 提交于
* refactor eager flags * fix flags error when we switch from eager to dygraph * fix ci problem * fix ci * fix ci * merge develop and fix code style * merge develop and fix code style * fix op test error * fix op test error * fix op test error * fix op test error * fix op test error * merge develop
-
- 08 3月, 2022 1 次提交
-
-
由 chenjian 提交于
* add python profiler package * update according to review * fix bug * fix bug * fix bug * add unit test * Revert "add unit test" This reverts commit 4e69ff71b0645e069afe5dd8fea0d07717852c48. * reduce for pr * add unit test * modify for pr * fix unittest * update for ci coverage * modify according to review * fix bug * improve coverage
-
- 27 1月, 2022 1 次提交
-
-
由 Aganlengzi 提交于
* [Demo] custom kernel based on pten kernel * merge and npu custom work well * del comments * delete other code * fix CUDAContext * fix not found small_vector.h * support NPU * fix NPUContext * fix DeviceContext support * add UT * fix call * add UT * fix * fix for comments and ut * add MACRO control * fix multi input output * support env CUSTOM_DEVICE_ROOT * deal with special cases * fix for Windows * try coverage with test_custom_kernel_dot.py * fix test_custom_kernel_dot * fix test_custom_kernel_dot * fix merge * fix merge * fix CI * update * merge and fix * remove WITH_CUSTOM_KERNEL * fix merge * merge and fix * fix ut * fix ut for mac * add more UT * add more UT * fix
-
- 18 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
-
- 28 12月, 2021 1 次提交
-
-
由 Jiabin Yang 提交于
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen * Enabled generation for Operators without Grad/Inputs/Outputs * Resolved operators without input * Fixed merge conflicts * Enabled Eager AutoCodeGen for 10+ more operators * Refactored Eager AutoCodeGen with more organized helper objects * Enabled Eager AutoCodeGen for operators with multiple OpBases * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen * Adjusted function generation/call between Python-C API & Dygraph API * Synchronized auto-generated Python-C API with Dygraph Forward Functions * support more eager tensor api * fix merge compile error * fix compile error and fit develop code * support pure CPU * fix some logic error in eager_mode * support _varbase_creator in eager mode * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs * for eager mode * refine * support multiple constructor for eager tensor * add place related code * polish code * specific randint with dtype of int64 * Support pure cpu test * eager logic * refine test in pure cpu * eager logic * eager logic * eager logic, test=develop * skip core.eager when in inference, test=develop * refine, test=develop * refine, test=develop * call RetainGrad after run forward kernel, test=develop * refine, test=develop * support dygraph util, meta, guard test * support inference test * refine test and fix initializer failed * support create varbase and fix retain grad error * fix windows error * support test code coverage * support test code coverage * support test code coverage Co-authored-by: Njim19930609 <jim19930609@gmail.com> Co-authored-by: NWang Huan <wanghuan29@baidu.com>
-
- 16 12月, 2021 1 次提交
-
-
由 Jiabin Yang 提交于
* support eager switch system * polish code
-
- 15 9月, 2021 1 次提交
-
-
由 Siming Dai 提交于
Add paddle.cuda.device.stream_guard API
-
- 27 7月, 2021 1 次提交
-
-
由 Aurelius84 提交于
Revert "Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348)" (#34384) This reverts commit 577fdde5.
-
- 23 7月, 2021 1 次提交
-
-
由 Aurelius84 提交于
Revert "[Dy2Stat] Refactor ExecutorCache logic and pre-support BuildStrategy for pass (#34181)" (#34348) This reverts commit 609f8225.
-
- 22 7月, 2021 2 次提交
-
-
由 Aurelius84 提交于
* modify into program_id * fix cache_info declare problem * fix python int to C long problem * modify point to reference * add ENVS
-
由 tianshuo78520a 提交于
-
- 19 7月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add cuda event and stream api * add cuda event and stream api * add get_current_stream api * add get_current_stream api * init streams * modify get_current_stream * modify get_cuttent_stream * add synchronize func * add current_stream doc and test file * move get_current_stream into CUDA macro * move CudaEvent into CUDA macro * move _get_current_stream and _device_synchronize into cuda macro * modify the macro of cuda stream and event * add test case for synchronize * add paddle.devices.cuda module * event and stream support hip * add doc for stream and event class * move cuda stream and event into single pybind * add cuda_streams_py.cc to cmakelist * add _device_synchronize and _get_current_stream to core module * add test case for cudastream and cudaevent * move __all__ in streams.py * fix test fail * add cuda to devices __all__ * fix current_stream doc writing error * move devices to device direction, and merge device.py into __init__.py * add required:gpu to sample codes * remove cuda direction from device/__init__.py
-
- 13 7月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 06 7月, 2021 1 次提交
-
-
由 danleifeng 提交于
* pipeline adaptive for heterps;test=develop * fix finalize hang;test=develop * add is_compiled_with_heterps for dataset;test=develop * fix hashtable core when pass ins_num=0;test=develop
-
- 29 6月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
-
- 09 6月, 2021 1 次提交
-
-
由 WeiXin 提交于
* support state_dict save to memory. * Perfect unittest * perfect unittest. * suport saving binary var to memory * polish code. * packag save/load files into pybind/io.py * polish code . * add example for save to memory; remove useless save load function(_load_static_dict,_save_dygraph_dict) * delete _load_static/dygraph_dict;_save_static/dygraph_dict * edit example of paddle.save/load
-
- 07 5月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
* Remove paddle_custom_op dynamic libraries, change link to FLUID_CORE on windows, and check copy_to * fix CI
-
- 22 4月, 2021 1 次提交
-
-
由 WeiXin 提交于
* support save/load binary format tensor * Fix error when create cudaplace * Fix error when create cudaplace * Fix error when create cudaplace * get devive context from pool. * move define of 'SerializeToStream' and 'DeserializeFromStream' to 'lod_tensor.cc' and 'selected_rows.cc'. * improve coverage. * improve coverage. * polish API * deal with conflict * disable save/load large file in unnittest * split unnittest.
-
- 31 3月, 2021 1 次提交
-
-
由 Kaipeng Deng 提交于
* polish tensor pipeline. test=develop
-
- 14 1月, 2021 1 次提交
-
-
由 Zhou Wei 提交于
-
- 04 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* basic impl of type promote * add comment & another testcase * fix complex bugs & support python op promote type * fix failed unittests & polish code * add unittest for coverage * change to only promote complex type * polish code details * polish several comments
-
- 24 11月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* upgrade comment string to raw string * fix string in * fix string with ' ' * revert update on comments * upgrade only necessary * fix sample code checker * fix comments with '''
-
- 12 10月, 2020 1 次提交
-
-
由 chen.zhiyu 提交于
-
- 28 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* add paddle.fluid._cuda_synchronize, test=develop * fix bug about core_avx core_noavx, test=develop * delete CPUPlace and XPUPlace, test=develop
-