- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 15 1月, 2022 1 次提交
-
-
由 Zhanlue Yang 提交于
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Fixed example code failure * Polished function names, removed duplicated forward declarations
-
- 13 1月, 2022 1 次提交
-
-
由 石晓伟 提交于
-
- 15 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
* add ipu_inference * resovle commments * resolve comments * add EnableIpu introduction * rm line * restore npu update * add ernie and resnet50 test * fix copyright time Co-authored-by: Nyaozhixin <522190855@qq.com>
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 20 10月, 2021 1 次提交
-
-
由 Steffy-zxf 提交于
Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent. * support the text string as an input Tensor * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization. * It first applies basic tokenization, followed by wordpiece tokenization.
-
- 31 8月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* Revert "Revert "Add copy from tensor (#34406)" (#35173)" This reverts commit 32c1ec42. * add template instantiation
-
- 27 8月, 2021 1 次提交
-
-
由 zhangchunle 提交于
This reverts commit ac33c0ca.
-
- 26 8月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* add api * temp save * revert * copytocpu async ok * fix style * copy sync ok * fix compile error * fix compile error * api done * update python async api * fix compile * remove async python api; add c++ async unittest * remove python async api * update unittest * update unittest * add C++ unittest for copytensor * add unittest * update namespace utils to class TensorUtils * add unittest * update unittest * update unittest * update code style * update code style * update unittest
-
- 12 8月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 06 8月, 2021 1 次提交
-
-
由 houj04 提交于
-
- 05 8月, 2021 1 次提交
-
-
由 houj04 提交于
* add NPU support for zero_copy_tensor. * revert unnesessary codes. * revert unnesessary codes.
-
- 14 7月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 17 6月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 25 5月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 08 3月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 20 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 07 2月, 2021 1 次提交
-
-
由 石晓伟 提交于
* bug fix of xpu lite engine, test=develop * xpu zero copy tensor, test=develop * revert paddle/fluid/inference/tests/api/CMakeLists.txt
-
- 04 2月, 2021 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include second time, test=develop
-
- 04 1月, 2021 1 次提交
-
-
由 cc 提交于
* zero_copy_tensor supports int8_t
-
- 28 12月, 2020 1 次提交
-
-
由 石晓伟 提交于
* fix a bug in reset_tensor_array, test=develop * ci coverage, test=develop
-
- 24 9月, 2020 1 次提交
-
-
由 wanghuancoder 提交于
* use iwyu clean include, test=develop, test=win * compilation error, test=develop * fix compilation error2, test=develop * fix compilation error3, test=develop * fix compilation error4, test=develop * fix compilation error5, test=develop * fix compilation error6, test=develop * fix compilation error7, test=develop * fix compilation error8, test=develop * fix compilation error8, test=develop * fix compilation error10, test=develop * fix compilation error11, test=develop
-
- 11 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 01 6月, 2020 1 次提交
-
-
由 silingtong123 提交于
* add SetCommandLineOption * add the print_FLAGS function * remove the test demo * modify the location of macro * add the 'WITH_STATIC_LIB' option on windows * modify the macro of PD_INFER_DECL * modify the the fuction name * modify the unittest * modify the code style
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 05 2月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add mutex for trt engine test=develop * add the test for copy_to_cpu test=develop
-
- 10 12月, 2019 1 次提交
-
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
- 17 9月, 2019 1 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
- 09 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* paddle::framework::vectorize() templatization test=develop * update pybind/imperative.cc test=develop * revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc test=develop
-
- 26 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 22 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
2. refine trt code test=develop
-
- 20 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
1. refine anakin engine 2. add data type for zero copy align dev branch and PaddlePaddle:feature/anakin-engine brach the cudnn workspace modify was not included for now, because we use a hard code way in feature/anakin-engine branch. There should be a better way to implement it, and subsequent submissions will be made. test=develop
-
- 11 3月, 2019 1 次提交
-
-
由 luotao1 提交于
test=develop
-
- 08 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
update trt in docker file test=develop
-
- 01 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
update trt in docker file test=develop
-
- 17 1月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 16 1月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Simplify the compare op for CPU. * Use asynchronous tensor copy in reshape_op's kernel. * Optimize while_op for test, avoiding creating variables every time. test=develop * Enable the cache of kernel type and kernel function. test=develop * Enable profiling with gperftools. * Remove flags for testing, and fix the linking error. test=develop * Delete the codes of ChooseKernel. test=develop * Fix bug when preparing ExecutorPrepareContext for while_op. * Fix missing depending on grpc libraries. * Remove the redundant print. test=develop * Follow comments. * Remove the codes related to prepare the ExecutorPrepareContext for while_op. test=develop
-
- 19 12月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 29 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-