- 01 3月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* add layer norm * add p norm * add reduce sum * refine layer norm register bf16 for cudnn811 * add bf16 cast for hip * add unittest * refine rocm * refine layer_norm unittest * refine reduce op * refine unittest * enhance atol for reduce unittest
-
- 26 1月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* update cmake file to remove fluid kernel * add pten declaration.h to where pybind.h used * fix sync_bn and tensorrt_engine * refine detection_library * fix interpreter_core * support eager legacy * fit eager legacy for pten * fall back to cpu if not found kernel * fix compile problem * fix compile problem * refine fallback logic * fit operator.run() * fix xpu compile * fit for new_exec * add REGISTER_OP_WITHOUT_GRADIENT * un-cache pt_kernel_context * fix compile * fix cudnn * fix compiling with on_infer * fix mkldnn * fix isfinite_v2 * fix xpu problem * fix op_device * refine fallback for xpu * fix xpu compile * merge develop * refine code format * fix compile * fix compile * add data_transfer * fix PreparePtenData * fix cpu context * merge develop * fix compile * fix error device context * fix xpu * fix dev_ctx
-
- 15 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* replace with pten kernel in cast cuda compute and remove unused codes * rm unused header file * replace CastCUDAOpKernel with CastOpKernel
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 23 11月, 2021 1 次提交
-
-
由 Aurelius84 提交于
* Add transfer_layout/dtype op * clean useless codes * fix unused var * add optest in white.txt * split into data_transfer.cc * fix cmake * modify according reviewer comment * replace cast_op with transfer_dtype_op
-
- 09 10月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 08 10月, 2021 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 03 9月, 2021 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 26 8月, 2021 1 次提交
-
-
由 duanboqiang 提交于
-
- 20 7月, 2021 1 次提交
-
-
由 李季 提交于
* fix cast
-
- 21 5月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* replace complex in set tensor from and to numpy * replace complex template in cast op
-
- 18 5月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
-
- 15 1月, 2021 1 次提交
-
-
由 Zhang Ting 提交于
-
- 11 12月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 04 12月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* basic impl of type promote * add comment & another testcase * fix complex bugs & support python op promote type * fix failed unittests & polish code * add unittest for coverage * change to only promote complex type * polish code details * polish several comments
-
- 20 11月, 2020 1 次提交
-
-
由 Zhang Ting 提交于
-
- 29 5月, 2018 1 次提交
-
-
由 fengjiayi 提交于
-
- 16 3月, 2018 1 次提交
-
-
由 Kexin Zhao 提交于
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 26 12月, 2017 1 次提交
-
-
由 Luo Tao 提交于
-
- 13 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
Also add `elemwise_sub/mul/abs/clip` layers
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 29 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Cast Operator Cast input variable to other data type * Fix compile error * Add cast op * Follow comments
-
- 08 8月, 2017 2 次提交
-
-
由 dongzhihong 提交于
-
由 dzhwinter 提交于
-
- 07 8月, 2017 1 次提交
-
-
由 dongzhihong 提交于
-
- 04 8月, 2017 2 次提交
- 26 7月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-