- 27 12月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* add device-agnostic stream class * add stream.h * fix ut * fix cpu compile
-
- 07 12月, 2021 1 次提交
-
-
由 Zhanlue Yang 提交于
* Debug * Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation * Rearranged interfaces * Fixed ci issues
-
- 03 12月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* use sync h2d copy * use stream safe allocator in memcpy_h2d * remove wait * add guard
-
- 01 12月, 2021 1 次提交
-
-
由 sneaxiy 提交于
* fix inplace addto pass * update * fix ut * improve ci coverage * fix musl ci compile error
-
- 24 11月, 2021 1 次提交
-
-
由 piotrekobiIntel 提交于
* Add second batch of deprecated mkldnn namespace and macro changes * Unlock CI * Fix temporary namespace alias placing
-
- 22 11月, 2021 2 次提交
-
-
由 Feiyu Chan 提交于
* disable copying of datatype when sharing buffer between two tensors. * fix for mkldnn operator kernels (elementwise_add, sum, softplus, softmax, scale, activation), mannually set the data type when reusing memory by ShareBufferWith.
-
由 chentianyu03 提交于
* add cast kernel * add cast cuda kernel * add cast kernel * make cast kernel output dtype undefined * get cast dtype from vardesc * move cast to manipulation and add test case * add castinfershape * avoid reinitilaze variable * InitializeVariable support datatype * merge develop branch * fix merge bug * revert modify initializeVariable * revert modify on InitializeVariable * revert modify on InitializeVariable * mutable support reset dtype * enable make pten tensor from variable when def_arg.type is undefined * fix build pten ctx start_idx error * copy pten out tensor to variable * merge develop branch * fix non pten kernel cast failed * add reset allocation place for remake tensor * fix inplace realloc error * add mutable on pten kernles and remove unused cast files * rename function names * fix output type error * fix conflict with develop branch * set data type to variable with pten's dtype * fix test_cast_api type mismatch * densorTensro mutable_data support 0 bytes value * fix the inplace bug of reshape kernel * fix pten.backend != variable.place when moving storage, palce mismatch bug * fix conflict with develop branch * Fix bug of paddle::experimental::MovesStorage * fix ReMakePtenDenseTensor place mismatch bug * Revert "fix ReMakePtenDenseTensor place mismatch bug" This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a. * fix ReMakePtenDenseTensor place mismatch bug * reverts the set_lod interface, test=develop * modify by the review options * modify error message * add & for const input arguments * add reference in params * elementwise_sub add mutable_data * fix ResetHolderWithType check size bug * add dependence pten_tensor to test_cast_api object * remove unused code to pass ci coverage Co-authored-by: NChen Weihang <chenweihang@baidu.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 28 6月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] fix RNN miopen as weight need to permuted, test=develop * [ROCM] fix data share when is_test, test=develop * update, test=develop
-
- 27 12月, 2020 1 次提交
-
- 25 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* try to add fwd op input dtypes * refactor base impl * return tmp_ins after dygraph prepare data * fix typo found in debug * polish comment & add complex net test * revert detail change * fix unittest failed * add complex kernel condition control * fix xpu test failed & polish comment * polish details by review comments
-
- 30 11月, 2020 1 次提交
-
-
由 liym27 提交于
* Add a class TensorInplaceVersion to count the inplace version and put it in framework::Tensor instead of Allocation or Variable. * Add a new attribute `_inplace_version` for VarBase. * Raise exception if an inplace operation can result in incorrect gradient computation. * Add a new interface _bump_inplace_version() for VarBase to bump the version whenever the Tensor is modified through an inplace operation. * For api assign, call _bump_inplace_version() when it's an inplace operation inn dynamic mode. * Use original var_wrapper if the inplace_version is not changed. * Replace SnapshotVarWrapperList with SnapshotVarWrapper to optimize performane.
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 08 9月, 2020 1 次提交
-
-
由 WeiXin 提交于
* polish one line error message in tensor.cc * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h * polish error messages in tensor.cc,tensor.h tensor_impl.h * polish error messages in tensor.cc tensor.h tensor_impl.h tensor_test.cc * polish error messages in tensor.cc tensor.h tensor_impl.h
-
- 03 6月, 2020 1 次提交
-
-
由 Michał Gallus 提交于
-
- 25 3月, 2020 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 10 12月, 2019 1 次提交
-
-
由 Adam 提交于
* MKLDNN v1.0 rebase to Paddle 1.6 test=develop * Add hacky paddle::string::to_string() implementation * vectorize<int64-t>() -> vectorize() cleanup test=develop * PADDLE_ENFORCE and void_cast fixes test=develop * Rebase changes test=develop * Cosmetics test=develop * Delete MKL from mkldnn.cmake test=develop * CMake debug commands test=develop * Delete MKLDNN_VERBOSE and rebase fixes test=develop * Rebase fixes test=develop * Temporarily disable int8 resnet101 vgg16 and vgg19 tests test=develop * Add libmkldnn.so.1 to python setup test=develop * Add libmkldnn.so.1 to inference_lib cmake after rebase test=develop * Post rebase fixes + FC int8 changes test=develop * Fix LRN NHWC test=develop * Fix NHWC conv3d test=develop * Windows build fix + next conv3d fix test=develop * Fix conv2d on AVX2 machines test=develop
-
- 27 11月, 2019 1 次提交
-
-
由 Youwei Song 提交于
* add numpy bridge * fix template compile * add unittest, add default test=develop * fix unittest test=develop * fix unittest test=develop * zero_copy=True for to_variable, test=develop * bug fix test=develop * disable deprecated NumPy API test=develop * use better design of NumpyAllocator test=develop * fix Py_None check test=develop * reset c++ tracer when jump out dygraph guard test=develop * refine PADDLE_ENFORCE_xx format test=develop * bug fix of tracer switch test=develop * update decref test=develop
-
- 19 11月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 11 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop
-
- 10 6月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* remove attribute in Allocator::Allocate, test=develop * fix travis ci error, test=develop
-
- 28 3月, 2019 2 次提交
-
-
由 chengduo 提交于
* fuse optimizer
-
由 Jacek Czaja 提交于
* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)" This reverts commit 13816dd4. Apart from enabling transformer for MKL-DNN * Revert "- MKL-DNN pooling updated to set_prim_desc" This reverts commit c63f6b20. Conflicts: paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc * Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)" test=develop This reverts commit dec9cf53. * - concat compilation fix - lint test=develop - Lint fixes test=develop - Lint fixes test=develop - Fix Transpose MKLDNN op test=develop
-
- 25 2月, 2019 1 次提交
-
-
由 Jacek Czaja 提交于
* - Implemented draft of primitive desc keeping in Tensor test=develop - TransposeMKLDNNHandler::AcquireSrcMemory was reimplemented - Added nchw and nc formats setting for sake of compatiblity Fixed unit tests - Worakaround to problem with 5D data in conv - Added 3D and 1D MKL-DNN formats for name handles for tensor test=develop - Fix to UTs test=develop - Conv fp32 op was updated Cosmetic fixes test=develop - tensor mkldnn cosmetics test=develop - Moved most of mkl-dnn specific code from Tensor to mkl-dnn utils * - Lint fixes test=develop * - setting prim dec in Tensor , sets also layout to kMKLDNN test=develop * - Moved creation of prim desc totally out of Tensor test=develop * - Cosmetic fixes adter review test=develop
-
- 25 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine tensor test=develop * refine tensor test=develop * fix device_context log test=develop
-
- 21 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* Add Temporal Allocator * add Temporay Allocator to DeviceContext test=develop * code refine test=develop * fix mean_iou test=develop * Add DeviceTemporaryAllocator test=develop * fix conv_op bug test=develop * small fix test=develop * code refine test=develop * log refine test=develop * fix unit test test=develop * move double check * refine concat_and_split test=develop * add limit_of_temporary_allocation test=develop * fix name test=develop
-
- 17 12月, 2018 1 次提交
-
-
由 Yu Yang 提交于
test=develop
-
- 13 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 12 12月, 2018 1 次提交
-
-
由 Yu Yang 提交于
test=develop
-
- 07 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
add unittest model containing while_op remove unnecessary codes test=develop
-
- 04 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 03 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 08 11月, 2018 1 次提交
-
-
由 Zhaolong Xing 提交于
-
- 25 10月, 2018 1 次提交
-
-
由 Yu Yang 提交于
test=develop
-
- 23 10月, 2018 1 次提交
-
-
由 Yu Yang 提交于
test=develop
-
- 29 9月, 2018 1 次提交
-
-
由 Yu Yang 提交于
-
- 28 9月, 2018 1 次提交
-
-
由 Yu Yang 提交于
Use OO style to rewrite memory allocation.
-
- 15 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 12 9月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 02 9月, 2018 1 次提交
-
-
由 dzhwinter 提交于
-
- 17 8月, 2018 1 次提交
-
-
由 Michal Gallus 提交于
Also replace memory types from int64_t to size_t
-