- 19 2月, 2022 1 次提交
-
-
由 chenjian 提交于
* fix RecordEvent interface * modify default level to 4 * update interface use * add const default trace level * update record event interface using * update operator.cc * update part1 * fix include profiler.h header in ps server * fix include profiler.h header in ps server
-
- 15 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 10 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Added shared_ptr<Allocation> member & corresponding interfaces to Storage * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly * Fixed issues with storage offset * Used place to malloc allocation for TensorStorage * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor * Fixed issues with place * Added comments * Moved mutable_data with stream argument to DenseTensor * Added set_offset interface * Fixed CI issues,test=allcases * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor * Modified framework::Tensor to inherit from DenseTensor * Reverted changes too pten_layout() interface * Removed friend classes * Rearranged cfunction calls from tensor.data<void>() to tensor.data() * Fixed CI issues * Fixed lite issues * Fixed data() interface issues,test=allcases * Resolved IsInitialized() issues * Fixed ResetHolder() issues * Fixed MKLDNN & Storage issues * Resolved ShareBufferWith() issues * Fixed LoD issues
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 25 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 26 12月, 2020 1 次提交
-
-
由 liuyuhui 提交于
-
- 27 9月, 2020 1 次提交
-
-
由 Leo Chen 提交于
* refine broadcast_op_handle * refine some error messages * refine some files * fix bug * fix bug * fix bug * follow comments * follow comments
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 05 2月, 2020 1 次提交
-
-
由 Wilber 提交于
cmake选项中添加了WITH_NCCL,显示指定是否编译NCCL的部分代码,WITH_NCCL默认打开,但如果WITH_GPU为OFF,则关闭WITH_NCCL 添加了PADDLE_WITH_NCCL定义 单机单卡能够关闭NCCL编译,多卡的话需要默认打开NCCL,如果关闭NCCL,则只能使用单卡 Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 20 9月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 11 9月, 2019 1 次提交
-
-
由 chengduo 提交于
* fix vlog level and fuse option type test=develop
-
- 11 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop
-
- 28 3月, 2019 1 次提交
-
-
由 chengduo 提交于
* fuse optimizer
-
- 21 2月, 2019 1 次提交
-
-
由 Dun 提交于
* refine profiler && add runtime tracer * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * fix bug && test=develop * add thread id map && test=develop * test=develop * testing * bug fix * remove cuda event && refine code && test=develop * test=develop * test=develop * test=develop * fix windows temp file && test=develop * test=develop * fix windows bug && test=develop * fix start up issue && test=develop * code polish && test=develop * remove unused code && test=develop * add some cupti cbid && test=develop * add FLAGS_multiple_of_cupti_buffer_size && test=develop * fix compile error && test=develop * add keyword && test=develop * fix && test=develop * code polish && test=develop
-
- 19 2月, 2019 1 次提交
-
-
由 tensor-tang 提交于
* fix warnings test=develop * fix enforce test test=develop
-
- 17 1月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 26 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
test=develop
-
- 22 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 minqiyang 提交于
Fix code to support cpplint syntax check test=develop
-
- 29 10月, 2018 2 次提交
-
-
由 Qiao Longfei 提交于
-
由 Wu Yi 提交于
* fix nccl2 lars dist support * put lars in momentum op * add tests lars * fix ci * fix cpu kernel * soft warning * remove lars in test_recognize_digits.py * move to another op * add file * update api.spec test=develop * update test=develop * fix api.spec test=develop * wip * wip, finish grad merge ops * wip, finish graph build * wip test running * work on 1 gpu * workable version * update * fix tests * fuse broadcast op * fix compile failed * refine * add batch merge test mnist * fix CI test=develop * fix build * use independent bn params for batch merge test=develop * update api.spec * follow comments and for test * wip * refine tests test=develop * follow comments test=develop * remove startup bn modify test=develop * follow comments test=develop * fix merge test=develop
-
- 27 10月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 13 9月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 12 9月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 22 6月, 2018 1 次提交
-
-
由 chengduo 提交于
-
- 21 6月, 2018 3 次提交
-
-
由 chengduo 提交于
* Fix Parallel Exe(VarHandel's version) * Fix broadcast * enhance ParallelExecutor stable
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 09 5月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 chengduoZH 提交于
-
- 05 5月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 04 5月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 02 5月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 20 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 18 4月, 2018 2 次提交
-
-
由 chengduoZH 提交于
-
由 Yu Yang 提交于
-
- 17 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 16 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-
- 13 4月, 2018 1 次提交
-
-
由 chengduoZH 提交于
-