- 13 1月, 2022 1 次提交
-
-
由 石晓伟 提交于
-
- 15 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
* add ipu_inference * resovle commments * resolve comments * add EnableIpu introduction * rm line * restore npu update * add ernie and resnet50 test * fix copyright time Co-authored-by: Nyaozhixin <522190855@qq.com>
-
- 27 11月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] reorganization for device API abstraction * [NPU] delete old files * [NPU] fix npu_collective_helper * [NPU] fix collective_helper * [NPU] fix ut * [NPU] mod memory allocation and hccl_helper * [NPU] fix place_type * [NPU] split enfoce.h * move acl* call into npu_info * merge conflict * fix merge * merge conflict * merge conflict
-
- 20 10月, 2021 1 次提交
-
-
由 Steffy-zxf 提交于
Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent. * support the text string as an input Tensor * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization. * It first applies basic tokenization, followed by wordpiece tokenization.
-
- 31 8月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* Revert "Revert "Add copy from tensor (#34406)" (#35173)" This reverts commit 32c1ec42. * add template instantiation
-
- 27 8月, 2021 1 次提交
-
-
由 zhangchunle 提交于
This reverts commit ac33c0ca.
-
- 26 8月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* add api * temp save * revert * copytocpu async ok * fix style * copy sync ok * fix compile error * fix compile error * api done * update python async api * fix compile * remove async python api; add c++ async unittest * remove python async api * update unittest * update unittest * add C++ unittest for copytensor * add unittest * update namespace utils to class TensorUtils * add unittest * update unittest * update unittest * update code style * update code style * update unittest
-
- 12 8月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 05 8月, 2021 1 次提交
-
-
由 houj04 提交于
* add NPU support for zero_copy_tensor. * revert unnesessary codes. * revert unnesessary codes.
-
- 14 7月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 17 6月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 25 5月, 2021 1 次提交
-
-
由 jakpiase 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 20 2月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 07 2月, 2021 1 次提交
-
-
由 石晓伟 提交于
* bug fix of xpu lite engine, test=develop * xpu zero copy tensor, test=develop * revert paddle/fluid/inference/tests/api/CMakeLists.txt
-
- 04 1月, 2021 1 次提交
-
-
由 cc 提交于
* zero_copy_tensor supports int8_t
-
- 11 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 01 6月, 2020 1 次提交
-
-
由 silingtong123 提交于
* add SetCommandLineOption * add the print_FLAGS function * remove the test demo * modify the location of macro * add the 'WITH_STATIC_LIB' option on windows * modify the macro of PD_INFER_DECL * modify the the fuction name * modify the unittest * modify the code style
-
- 11 5月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* add new macro BOOST_GET_SAFELY & unittests, test=develop * add different macro type, test=develop * fix get macro type in executor, test=develop * four macro part change backup * using one macro for all case, test=develop * revert attribute change, test=develop * change to three func to solve gcc4.8 bug, test=develop * polish some details, test=develop
-
- 05 2月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add mutex for trt engine test=develop * add the test for copy_to_cpu test=develop
-
- 10 12月, 2019 1 次提交
-
-
由 rensilin 提交于
* ZeroCopyTensor::mutable_data in the right device, test=develop * add unittest for zerocopy, test=develop
-
- 17 9月, 2019 1 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
- 09 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* paddle::framework::vectorize() templatization test=develop * update pybind/imperative.cc test=develop * revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc test=develop
-
- 26 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 22 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
2. refine trt code test=develop
-
- 20 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
1. refine anakin engine 2. add data type for zero copy align dev branch and PaddlePaddle:feature/anakin-engine brach the cudnn workspace modify was not included for now, because we use a hard code way in feature/anakin-engine branch. There should be a better way to implement it, and subsequent submissions will be made. test=develop
-
- 11 3月, 2019 1 次提交
-
-
由 luotao1 提交于
test=develop
-
- 08 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
update trt in docker file test=develop
-
- 01 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
update trt in docker file test=develop
-
- 17 1月, 2019 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 14 11月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
-
- 28 9月, 2018 1 次提交
-
-
由 Yan Chunwei 提交于
- add naive executor - fix concurrency performance issue
-