- 11 Feb, 2022 1 commit
  - Committed by Zhang Ting
    * improve backward performance
    * support different dtypes for elementwise ops
- 08 Feb, 2022 1 commit
  - Committed by Yiqun Liu
- 06 Feb, 2022 1 commit
  - Committed by Wilber
- 26 Jan, 2022 1 commit
  - Committed by Leo Chen
    * update cmake file to remove fluid kernel
    * add pten declaration.h to where pybind.h used
    * fix sync_bn and tensorrt_engine
    * refine detection_library
    * fix interpreter_core
    * support eager legacy
    * fit eager legacy for pten
    * fall back to cpu if not found kernel
    * fix compile problem
    * fix compile problem
    * refine fallback logic
    * fit operator.run()
    * fix xpu compile
    * fit for new_exec
    * add REGISTER_OP_WITHOUT_GRADIENT
    * un-cache pt_kernel_context
    * fix compile
    * fix cudnn
    * fix compiling with on_infer
    * fix mkldnn
    * fix isfinite_v2
    * fix xpu problem
    * fix op_device
    * refine fallback for xpu
    * fix xpu compile
    * merge develop
    * refine code format
    * fix compile
    * fix compile
    * add data_transfer
    * fix PreparePtenData
    * fix cpu context
    * merge develop
    * fix compile
    * fix error device context
    * fix xpu
    * fix dev_ctx
- 24 Jan, 2022 1 commit
  - Committed by Zhang Ting
- 20 Jan, 2022 1 commit
  - Committed by Aurelius84
    * Migrate bfloat16/float16/complex from platform into pten::common
    * fix typo
    * fix code style
- 18 Jan, 2022 2 commits
  - Committed by Zhanlue Yang
    * Merged LoDTensor with Tensor,test=allcases
    * Patched python level LoDTensor
    * Patched python level LoDTensor
    * Merge Tensor into DenseTensor
    * Fixed namespace issues,test=allcases
    * Fixed merge issues
    * Fixed inference issues
    * Fixed NPU test issues
    * Fixed merge issues
  - Committed by Yiqun Liu
- 17 Jan, 2022 1 commit
  - Committed by Wilber
    * add pten::Place data structure.
    * update ci problem
    * fix ci problem
    * update
    * using platform::Place=pten::Place
    * remove BOOST_GET_CONST for CPUPlace and GPUPlace
    * compile pass 25%.
    * compile pass 45%
    * compile pass 60%
    * remove boost_get for xpu npu mlu and ipu
    * compile pass on cpu and gpu.
    * fix compile problem
    * fix compile error.
    * update
    * fix ci problem
    * update
    * ci approve
    * fix ci problem
    * fix ci eager test problem
    * remove BOOST_GET_CONST
    * fix npu compile
- 15 Jan, 2022 1 commit
  - Committed by Zhanlue Yang
    * Merged LoDTensor with Tensor,test=allcases
    * Patched python level LoDTensor
    * Fixed example code failure
    * Polished function names, removed duplicated forward declarations
- 12 Jan, 2022 1 commit
  - Committed by Zhang Ting
    * code clean
    * [part 3]change type of function args
- 17 Dec, 2021 1 commit
  - Committed by niuliling123
- 03 Dec, 2021 1 commit
  - Committed by ronnywang
    * refine structure for cuda and rocm
    * update
    * update
    * update
    * update
- 27 Nov, 2021 1 commit
  - Committed by Aganlengzi
    * [NPU] reorganization for device API abstraction
    * [NPU] delete old files
    * [NPU] fix npu_collective_helper
    * [NPU] fix collective_helper
    * [NPU] fix ut
    * [NPU] mod memory allocation and hccl_helper
    * [NPU] fix place_type
    * [NPU] split enfoce.h
    * move acl* call into npu_info
    * merge conflict
    * fix merge
    * merge conflict
    * merge conflict
- 24 Nov, 2021 1 commit
  - Committed by Aurelius84
- 22 Nov, 2021 1 commit
  - Committed by Leo Chen
- 02 Nov, 2021 1 commit
  - Committed by wanghuancoder
- 29 Oct, 2021 1 commit
  - Committed by wanghuancoder
    * fix some bug in new executor, test=develop
    * fix error message, test=develop
- 25 Oct, 2021 1 commit
  - Committed by TTerror
    * add some ops to train ssd on kunlun
    * add some ops to train ssd on kunlun
    * add some ops to train ssd on kunlun
    * update cast op unittest
    * update cast op unittest
    * update cast op unittest
    * update xpu cmake
    * update cast unittest
- 20 Oct, 2021 1 commit
  - Committed by Steffy-zxf
    Add Tokenizer-related functionality for the Transformer model so that preprocessing is consistent between training and prediction.
    * support the text string as an input Tensor
    * support the "VOCAB" unordered_map<wstring, int> as an input Tensor to lookup tokens
    * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
    * It first applies basic tokenization, followed by wordpiece tokenization.
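The two-stage scheme named in this commit message (basic tokenization, then wordpiece tokenization) can be illustrated with a minimal Python sketch. It is not the code from this commit: the function names, the `TOY_VOCAB` dictionary, the `##` continuation prefix, and the `[UNK]` token are assumptions borrowed from common BERT conventions, and the basic tokenizer here only lowercases and splits on whitespace and punctuation.

```python
import re

# Hypothetical toy vocabulary; the real "VOCAB" input maps wide strings to ints.
TOY_VOCAB = {"[UNK]": 0, "un": 1, "##aff": 2, "##able": 3, "hello": 4, ",": 5}


def basic_tokenize(text):
    """Lowercase the text and split it into word and punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", text.lower())


def wordpiece_tokenize(word, vocab, unk_token="[UNK]"):
    """Split one word into vocabulary pieces by greedy longest-match-first."""
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        # Shrink the candidate from the right until it is found in the vocab.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # mark a non-initial piece
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            # No prefix matched: the whole word maps to the unknown token.
            return [unk_token]
        pieces.append(piece)
        start = end
    return pieces


def tokenize(text, vocab, unk_token="[UNK]"):
    """Run basic tokenization, then wordpiece, and look up token ids."""
    ids = []
    for word in basic_tokenize(text):
        for piece in wordpiece_tokenize(word, vocab, unk_token):
            ids.append(vocab[piece])
    return ids
```

With the toy vocabulary, `wordpiece_tokenize("unaffable", TOY_VOCAB)` yields `["un", "##aff", "##able"]`, and a word with no matching prefix falls back to `[UNK]`. The sketch assumes the unknown token itself is present in the vocabulary.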
- 14 Sep, 2021 1 commit
  - Committed by Yiqun Liu
    Implement FunctionTraits to support two kinds of elementwise functor and remove some old codes for broadcast. (#35688)
- 13 Sep, 2021 2 commits
- 08 Sep, 2021 1 commit
  - Committed by CtfGo
    * mark WhileOp AsExtra attribute
    * revert kX and kOutputs
- 01 Sep, 2021 1 commit
  - Committed by wanghuancoder
    * modify fetch logic, use D2H Stream, test=develop
    * refine, test=develop
- 31 Aug, 2021 1 commit
  - Committed by Huihuang Zheng
    As the title, see details at the PR description.
- 24 Aug, 2021 1 commit
  - Committed by wanghuancoder
    * add fetch, test=develop
    * fix fetch2op, test=develop
    * fix fetch2op, test=develop
    * refine, test=develop
    * fix fetch ctx, test=develop
    * add wait, test=develop
    * rename fetch2 to fetch_v2, test=develop
    * merge, test=develop
- 11 Aug, 2021 1 commit
  - Committed by pangyoki
    * add while read_from_array write_to_array npu op
    * optimize unittest
- 05 Aug, 2021 1 commit
  - Committed by baoachun
    * add not_equal NPU op
    * add not_equal NPU op
    * add not_equal NPU op
    * add not_equal NPU op
- 28 Jul, 2021 1 commit
  - Committed by Aganlengzi
    * [NPU] add NPU ops&uts of compare, test=develop
    * testing
    * try style-format
    * [NPU] update compare_op_npu uts
    * [NPU] fix code sytle of test_compare_op_npu.py
- 26 Jul, 2021 1 commit
  - Committed by Qi Li
- 23 Jul, 2021 1 commit
  - Committed by will-jl944
    * logical ops support int8, int16, int32, int64, float, double
    * update docs of logical ops
    * fix npu and xpu logical ops
    * fix npu and xpu logical ops
    * fix bug in xpu logical op code
    * update test_logical_op_npu and test_logical_op_xpu
    * correct error type
- 15 Jul, 2021 2 commits
  - Committed by Qi Li
  - Committed by Aurelius84
    * Refine Constructor logic of ParallelExecutor
    * Replace executor into ParallelExecutor in run_program_op
- 05 Jul, 2021 1 commit
  - Committed by limingshu
- 29 Jun, 2021 1 commit
  - Committed by Leo Chen
- 16 Jun, 2021 1 commit
  - Committed by Zhou Wei
- 15 Jun, 2021 1 commit
  - Committed by wawltor
    add the support for the bool in compare ops
- 04 Jun, 2021 2 commits