- 18 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 15 1月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* remove cached kernel context * revert dataloader format change
-
- 13 1月, 2022 2 次提交
-
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
由 Chen Weihang 提交于
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
- 11 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor matmul directory in pten * fix merge conflict * add dot_grad kernel * add dot_grad kernel in pten * add matmul_grad kernel * update the code * delete useless code in fluid * fix some bug of running matmul grad kernel * fix merge conflict * refactor some code * refactor code
-
- 10 1月, 2022 3 次提交
-
-
由 Chen Weihang 提交于
-
由 Zhanlue Yang 提交于
* Added shared_ptr<Allocation> member & corresponding interfaces to Storage * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly * Fixed issues with storage offset * Used place to malloc allocation for TensorStorage * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor * Fixed issues with place * Added comments * Moved mutable_data with stream argument to DenseTensor * Added set_offset interface * Fixed CI issues,test=allcases * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor * Modified framework::Tensor to inherit from DenseTensor * Reverted changes too pten_layout() interface * Removed friend classes * Rearranged cfunction calls from tensor.data<void>() to tensor.data() * Fixed CI issues * Fixed lite issues * Fixed data() interface issues,test=allcases * Resolved IsInitialized() issues * Fixed ResetHolder() issues * Fixed MKLDNN & Storage issues * Resolved ShareBufferWith() issues * Fixed LoD issues
-
由 Chen Weihang 提交于
* unify infer_shape func calling * support set grad infer shape fn for custom op * unify infershape in new executor and eager * remove todo comment * revert infershape in operator
-
- 31 12月, 2021 1 次提交
-
-
由 fwenguang 提交于
* [MLU]support calling mlu op from python interface * [MLU]fix * fix * [mlu]fix mlu_places * [mlu]fix required mlu * fix * [MLU]fix tensor copy * [mlu] fix MLUPlace call path
-
- 30 12月, 2021 1 次提交
-
-
由 Feng Xing 提交于
This PR adds runtime flags run_kp_kernel, which choose which op to run for xpu2. There are two: dynamic linked and built from kp.
-
- 27 12月, 2021 2 次提交
- 23 12月, 2021 1 次提交
-
-
由 zhouweiwei2014 提交于
* add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector * fix comment
-
- 20 12月, 2021 1 次提交
-
-
由 fwenguang 提交于
-
- 16 12月, 2021 2 次提交
-
-
由 chentianyu03 提交于
-
由 chentianyu03 提交于
* Revert "Revert "pylayer support tuple/list type args (#37727)" (#37956)" This reverts commit d848ff04. * move check args,kwargs before forward execute
-
- 14 12月, 2021 3 次提交
-
-
由 Aurelius84 提交于
-
由 YuanRisheng 提交于
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile
-
- 10 12月, 2021 3 次提交
-
-
由 pangyoki 提交于
-
由 kuizhiqing 提交于
-
由 Leo Chen 提交于
-
- 09 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
-
- 08 12月, 2021 2 次提交
-
-
由 chentianyu03 提交于
This reverts commit a73064f2.
-
由 chentianyu03 提交于
-
- 07 12月, 2021 2 次提交
-
-
由 Zhanlue Yang 提交于
* Debug * Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation * Rearranged interfaces * Fixed ci issues
-
由 Leo Chen 提交于
-
- 06 12月, 2021 3 次提交
-
-
由 chentianyu03 提交于
-
由 ronnywang 提交于
-
由 kuizhiqing 提交于
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 02 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* add inplace op adaptation * optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor> * refactor logic that transform variable to densetensor * update func name
-
- 01 12月, 2021 1 次提交
-
-
由 Jiabin Yang 提交于
* optimizer __call__ to make dygraph faster * fix return type * remove cpp Layer
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 26 11月, 2021 1 次提交
-
-
由 Zhanlue Yang 提交于
reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care. This is essentially a hack interface to satisfy one specific request
-
- 25 11月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add scalar and scalar_array * remove DenseTensor include from Scalar and ScalarArray * remove inner header from scalar_array * refactor the method of fill_constant and add some comment * add fill_constant kernel using ScalarArray * modify some prompt * remove fill_constant kernel with no shape
-
- 24 11月, 2021 1 次提交
-
-
由 0x45f 提交于
* run dy2stat pure fp16 in Linear model * no use self._pure_fp16_inputs * add test and fix Adam error in dy2stat pure fp16 training * use paddle.optimizer.Adam * run test in gpu * change test time for CI * enlarge atol for test_resnet_pure_fp16 * refine code and enlarge atol * make custom_white_list and custom_black_list take effect for AMP and pure fp16 * check tracer is not None * use default atol * change filter_size * change atol and add some NOTE
-
- 23 11月, 2021 2 次提交