- 02 Mar, 2022 1 commit
-
-
Committed by sneaxiy
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library
* fix CI
* fix rocm ci
-
- 01 Mar, 2022 1 commit
-
-
Committed by zhangbo9674
* add scale gather sum
* refine CUDA_ATOMIC_WRAPPER ADD for bf16
* add gather unittest
* solve conflict
* add scale unittest
* add sum unittest
* solve conflict
* refine gather unittest
* refine unittest
-
- 15 Feb, 2022 1 commit
-
-
Committed by Aurelius84
* #1 migrate dist-related type() -> dtype()
* move datatype function from pten -> fluid/framework
* change type() in imperative into convert(dtype())
* modify xx_tensor->type into xx_tensor->dtype
* change the set_type interface and the caller
* modify xx_tensor.type into xx_tensor.dtype
* fix mutable_data(place, dtype())
* change caller of mutable_data in pten and distributed
* change the caller of mutable_data in fluid/framework
* change the caller of mutable_data in imperative directory
* mutable_data: inference
* update the call of mutable_data
* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
* pass compilation; the next step is to remove VarType in Pten
* fix all and remove VarType from pten; succeeds on Linux; next task is other platforms
* fix conflict with develop
* fix compile error
* Fix reset conversion
* fix conflict
* fix compile problem
* fix typo
* Fix << in tensor_utils.cc
* fix type->dtype
* fix unittest
* fix tensor init constructor
* fix DataTypeSize for BFloat16
* fix code style
* fix npu compile error
* fix npu
* compile npu successfully
* fix conflict
* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 11 Jun, 2021 1 commit
-
-
Committed by ShenLiang
* fix gather shape bug
* fix None
* fix topo
-
- 23 Aug, 2020 1 commit
-
-
Committed by wangchaochaohu
-
- 14 May, 2020 1 commit
-
-
Committed by ShenLiang
-
- 12 Jun, 2019 1 commit
-
-
Committed by wawltor
* test=develop The scatter op computed wrong results when the indices contained duplicates, because it used overwrite mode for repeated indices. Fix this by using accumulate mode for repeated indices. The gather op had the same bug when computing its gradient. Also use OpenBLAS and Eigen to reduce the time cost of accumulate mode.
* test=develop Fix some code format problems, and add test cases for the gather and scatter ops.
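The duplicate-index problem described in this commit can be illustrated with a minimal, framework-free sketch. It is not the Paddle kernel: the function names and the plain Python lists are hypothetical stand-ins, chosen only to show why the gradient of a gather must accumulate rather than overwrite when an index repeats.

```python
def gather(x, index):
    """Forward pass: pick x[i] for every i in index (duplicates allowed)."""
    return [x[i] for i in index]


def gather_grad_overwrite(n, index, grad_out):
    """Buggy backward pass: a repeated index keeps only its last gradient."""
    grad_x = [0.0] * n
    for i, g in zip(index, grad_out):
        grad_x[i] = g  # overwrite: earlier contributions are lost
    return grad_x


def gather_grad_accumulate(n, index, grad_out):
    """Fixed backward pass: contributions for a repeated index are summed."""
    grad_x = [0.0] * n
    for i, g in zip(index, grad_out):
        grad_x[i] += g  # accumulate: every use of x[i] contributes
    return grad_x


# Hypothetical example data: element 0 is gathered twice.
index = [0, 2, 0]
grad_out = [1.0, 1.0, 1.0]
overwritten = gather_grad_overwrite(3, index, grad_out)    # [1.0, 0.0, 1.0]
accumulated = gather_grad_accumulate(3, index, grad_out)   # [2.0, 0.0, 1.0]
```

Because element 0 feeds two outputs, its gradient should be 2.0; the overwrite variant reports 1.0, which is the kind of error the commit message describes.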
-
- 25 May, 2019 1 commit
-
-
Committed by hutuxian
* gather_op support int64_t index by adding a template typename
* add UT and rename typename test=develop
-
- 30 Jan, 2019 2 commits
-
-
Committed by Yibing Liu
* Some improvements to support bert mixed precision training test=develop
* Revert the cast in layer_norm test=develop
-
Committed by Yiqun Liu
* Refine beam_search_op to output an extra parent_idx tensor. test=develop
* Fix the unittest test_beam_search_op. test=develop
* Fix the merging mistake. test=develop
-
- 31 Oct, 2018 1 commit
-
-
Committed by chengduo
test=develop
-
- 19 Apr, 2018 1 commit
-
-
Committed by Abhinav Arora
-
- 12 Feb, 2018 1 commit
-
-
Committed by qingqing01
-
- 10 Feb, 2018 2 commits
- 26 Dec, 2017 1 commit
-
-
Committed by Luo Tao
-
- 12 Dec, 2017 1 commit
-
-
Committed by QI JUN
The main fixes are:
  - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
  - remove `eigen_device` interface in base class `DeviceContext`
  - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
  - remove unused `platform::EigenDeviceConverter`
  - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
  - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 04 Oct, 2017 1 commit
-
-
Committed by zchen0211
-
- 03 Oct, 2017 1 commit
-
-
Committed by zchen0211
-
- 29 Sep, 2017 3 commits
- 04 Sep, 2017 1 commit
-
-
Committed by Luo Tao
-
- 16 Aug, 2017 1 commit
-
-
Committed by zchen0211
-
- 10 Aug, 2017 3 commits
-
-
Committed by qijun
-
Committed by qijun
-
Committed by dongzhihong
-
- 09 Aug, 2017 1 commit
-
-
Committed by qijun
-
- 08 Aug, 2017 1 commit
-
-
Committed by dongzhihong
-
- 07 Aug, 2017 2 commits
-
-
Committed by dongzhihong
-
Committed by dongzhihong
-
- 04 Aug, 2017 1 commit
-
-
Committed by liaogang
-
- 03 Aug, 2017 1 commit
-
-
Committed by qijun
-
- 31 Jul, 2017 1 commit
-
-
Committed by qijun
-
- 25 Jul, 2017 1 commit
-
-
Committed by Yu Yang
Make implementing an operator less noisy.
-
- 19 Jul, 2017 1 commit
-
-
Committed by qijun
-
- 18 Jul, 2017 1 commit
-
-
Committed by qijun
-
- 17 Jul, 2017 3 commits
-
-
Committed by Yu Yang
* Implement InferShape and register them, give a stub Kernel method by LOG(INFO)
-
Committed by Yu Yang
-
Committed by Yan Chunwei
* add NDEBUG switch to PADDLE_ENFORCE
-