- 20 1月, 2022 2 次提交
-
-
由 minghaoBD 提交于
* [Paddle-ASP]Make test_asp_sharding running on non-mac platform * syntax check * syntax check
-
由 wanghuancoder 提交于
* Rearranged Eager AutoCodeGen directory structure * Removed USE_OP in Eager AutoCodeGen * Enabled generation for Operators without Grad/Inputs/Outputs * Resolved operators without input * Fixed merge conflicts * Enabled Eager AutoCodeGen for 10+ more operators * Refactored Eager AutoCodeGen with more organized helper objects * Enabled Eager AutoCodeGen for operators with multiple OpBases * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen * Adjusted function generation/call between Python-C API & Dygraph API * Synchronized auto-generated Python-C API with Dygraph Forward Functions * support more eager tensor api * fix merge compile error * fix compile error and fit develop code * support pure CPU * fix some logic error in eager_mode * support _varbase_creator in eager mode * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs * for eager mode * refine * support multiple constructor for eager tensor * add place related code * polish code * specific randint with dtype of int64 * Support pure cpu test * eager logic * refine test in pure cpu * eager logic * eager logic * eager logic, test=develop * skip core.eager when in inference, test=develop * refine, test=develop * refine, test=develop * call RetainGrad after run forward kernel, test=develop * refine, test=develop * support dygraph util, meta, guard test * eager test case * support inference test * refine test and fix initializer failed * modify eagertensor patch method * add eagertensor.clear_grandint, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * support create varbase and fix retain grad error * call monkey_patch_varbase in _test_eager_guard, test=develop * fix windows error * split clear_gradient to clear_gradient and zero_grads, test=develop * refine, test=develop * refine, test=develop * support test_imperative_basic test in eager mode * remove additional log in variable.h * remove additional log in variable.h * remove additional code create in merge * eager * fix some eager logic, test=develop * refine, test=develop * refine, test=develop * refine, test=develop * patch_tensor_method_func, test=develop * refine, test=develop * eager test case, test=develop * refine, test=develop * eager, test=develop * eager, test=develop * eager optimizer, test=develop * eager optimizer, test=develop * eager test_imperative_optimizer_v2, test=develop * eager, test=develop * refine, test=develop * refine, test=develop * eager, test=develop * add resize in share buffer to, test=develop * eager, test=develop * fix _share_buffer_to, test=develop * refine, test=develop * refine, test=develop * support eager for dataloader,test=develop Co-authored-by: Njim19930609 <jim19930609@gmail.com> Co-authored-by: NJiabinYang <360788950@qq.com>
-
- 19 1月, 2022 3 次提交
-
-
由 jianghaicheng 提交于
* ipu_commit_tests p1 * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * update lint and ipustrategy introduction * update ipu_config * update __init__ of static * update doc * update doc 2 * update doc 3 * update doc 4 * update doc 5 * update doc 5 * update doc 6 * update lint * update lint 2 * update ipustrategy * add IpuStrategy to all * update ipustrategy * update ipu_shard_guard * update ipu_shard_guard 2 Co-authored-by: Nyaozhixin <522190855@qq.com>
-
由 wuhuachaocoding 提交于
-
由 zhangyikun02 提交于
-
- 18 1月, 2022 7 次提交
-
-
由 Sławomir Siwek 提交于
* Mish * Change exp() library * mish fuse pass * mish attrs * fixes * mishop maker * remove attrs * mish kernal for bf16 * fc+mish fuse * fix code format error * Resolve merge conflicts * Update mish operator version * update mish variable to new naming convention
-
由 zhouweiwei2014 提交于
* change CUDA implementaion of uniform/gaussian OP * fix unittest
-
由 wawltor 提交于
* add the uva api for the tensor * fix the compiler problem for the uva * fix the example for the _uva * fix the compile problem in the pten library * update the enviroment support for the uva * use the make_shared replace the shared_ptr
-
由 JingZhuangzhuang 提交于
* fix trt convert conv2d skip * fix trt convert conv2d skip
-
由 wenbin 提交于
* modify params check * correct compile
-
由 zhaoyingli 提交于
* [AutoParallel] Recompute Pass * update unittest * reshard for amp * add comment
-
由 sneaxiy 提交于
* speedup gelu using fast math * add bwd part
-
- 17 1月, 2022 7 次提交
-
-
由 jakpiase 提交于
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
由 sneaxiy 提交于
* add no reduce mode for pe * add NoReduce ut
-
由 sneaxiy 提交于
-
由 ronnywang 提交于
* fix paddle.where torch diff * update
-
由 0x45f 提交于
* close enable_inplace PASS for PE, and test dy2st pure fp16 training stability * add some comment * enlarge atol
-
由 Jiabin Yang 提交于
* support test_auto_prune_partial * support rest of autoprune strategy in eager mode
-
- 14 1月, 2022 4 次提交
-
-
由 heliqi 提交于
* add trt_convert_flatten_contiguous_rang op * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * test cast add trt version >=7 skip
-
由 Zhangjingyu06 提交于
* [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun,*test=kunlun * [XPU]add stack_grad op for kunlun2,*test=kunlun Co-authored-by: NQingshuChen <chenqingshu@baidu.com>
-
由 Baibaifan 提交于
-
由 qipengh 提交于
* [MLU]: add mean and reduce mean op * [MLU]add mlu pytest dir in CMakeLists.txt * [MLU]fix tensor data * [MLU]fix TensorToPyArray and license
-
- 13 1月, 2022 6 次提交
-
-
由 furnace 提交于
[NPU] fix tril_triu
-
由 furnace 提交于
* [NPU] fix expand op * [NPU] optimize codes * [NPU] optimize codes
-
由 wenbin 提交于
roi_align aligned supported
-
由 jakpiase 提交于
* base changes for mul reimplementation * empty commit * tmp save * full implementation of mul bf16/fp32 fwd bwd * CI fix * CI rerun * changed unity build cmake to avoid gpu issues * removed mul mkldnn from unity build * added skipping tests if not cpu_bf16 * CI fix * CI fix * CI fix
-
由 Chen Weihang 提交于
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
由 Weilong Wu 提交于
* Support test_imperative using_non_zero_gpu and Add a TODO comment * Change GPU number to 0 * Modify the cuda device selection method
-
- 12 1月, 2022 4 次提交
-
-
由 Sylwester Fraczek 提交于
* fix conv act int8 scale * add unit test for conv+hard_swish
-
由 xiaoting 提交于
* support 5d for nearest * update nearest3d unittest, test=develop * fix approve ci, test=develop * fix approve ci, test=develop
-
由 JZ-LIANG 提交于
* auto parallel sharding base * chmod * add unitest * set unitest cmake dist label * revise code according to rewiew * chmod * bugfix for grad_clip and param broadcast * chmod * update unitest * chmod * add clip * chmod * add amp pass * chmod * add unitest * remove grad update * fixed bug * fixed bug * fixed typose * fixed typoes
-
由 Jiabin Yang 提交于
-
- 11 1月, 2022 4 次提交
-
-
由 Weilong Wu 提交于
-
由 caozhou 提交于
* update dist tensor * add unitest * update unitest * refactor dist tensor * update dist tensor and unitest
-
由 zhangbo9674 提交于
* check amp.decorate and DataParallel * refine coverage * fix layer dtype * refine code
-
由 Ming-Xu Huang 提交于
* Pre-save hooks of jit.save 1. Added pre_save_hooks features to jit.save. 2. Added related unittests * Added jit pre_save_hooks functions's alias to paddle.jit and copyright. * Make jit.save_pre_hook style be consisent with Paddle's rule. * Fixed arguments passing bug in run_save_pre_hooks * Added API Documents * Move clear and run_pre_save_hooks as internal methonds only. * Made register_save_pre_hook as an internal function.
-
- 10 1月, 2022 3 次提交
-
-
由 baoachun 提交于
-
由 Haohongxiang 提交于
* add lstsq gpu kernel * update * add docs_en * modify ut * fix bugs * modify example in docs_en * remove lstsq_op.cu from ROCM cmake * modify docs_en * modify docs_en * modify docs_en * remove unneccessary TensorCopy
-
由 Yulong Ao 提交于
* Add the backward support for QR * Remove unnecessary comments
-