- 02 Mar, 2022 1 commit
Committed by sneaxiy
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library
* fix CI
* fix rocm ci
-
- 15 Feb, 2022 1 commit
Committed by Aurelius84
* #1 migrate dist-related type() -> dtype()
* move datatype function from pten -> fluid/framework
* change type() in imperative into convert(dtype())
* modify xx_tensor->type into xx_tensor->dtype
* change the set_type interface and the caller
* modify xx_tensor.type into xx_tensor.dtype
* fix mutable_data(place, dtype())
* change caller of mutable_data in pten and distributed
* change the caller of mutable_data in fluid/framework
* change the caller of mutable_data in imperative directory
* mutable_data: inference
* update the call of mutable_data
* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
* pass the compile. the next step is remove VarType in Pten
* fix all and remove VarType from pten. success in linux. Next task is other platform
* fix conflict with develop
* fix compiled error
* Fix reset conversion
* fix conflict
* fix compiled problem
* fix typo
* Fix << in tensor_utils.cc
* fix type->dtype
* fix unittest
* fix tensor init constructor
* fix DataTypeSize for BFloat16
* fix code style
* fix npu compiled error
* fix npu
* compile npu successfully
* fix conflict
* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 11 Jun, 2021 1 commit
Committed by ShenLiang
* fix gather shape bug
* fix None
* fix topo
-
- 23 Aug, 2020 1 commit
Committed by wangchaochaohu
-
- 14 May, 2020 1 commit
Committed by ShenLiang
-
- 12 Jun, 2019 1 commit
Committed by wawltor
* test=develop The scatter op has a calculation bug when the indices contain duplicate entries: it uses overwrite mode for a repeated index. Fix this by using accumulate mode for repeated indices. The gather op has the same bug when it computes the gradient. The OpenBLAS and Eigen libraries are used to reduce the time cost of accumulate mode.
* test=develop Fix some code format problems and add test cases for the gather and scatter ops
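The difference between overwrite and accumulate mode for a repeated index can be illustrated with a minimal pure-Python sketch of a 1-D gather gradient. This is not the Paddle kernel; the function name `gather_grad` and its parameters are hypothetical and chosen only for illustration.

```python
def gather_grad(index, grad_out, size, accumulate=True):
    """Scatter grad_out back to the input positions of a 1-D gather.

    Hypothetical helper for illustration only, not Paddle's implementation.
    index[k] is the input position that produced output element k.
    """
    grad_in = [0.0] * size
    for pos, i in enumerate(index):
        if accumulate:
            # Accumulate mode: contributions from repeated indices are summed.
            grad_in[i] += grad_out[pos]
        else:
            # Overwrite mode: the last write to a repeated index wins.
            grad_in[i] = grad_out[pos]
    return grad_in


# Index 2 appears twice, so it receives two gradient contributions.
index = [0, 2, 2]
grad_out = [1.0, 2.0, 3.0]
accumulated = gather_grad(index, grad_out, size=4, accumulate=True)
overwritten = gather_grad(index, grad_out, size=4, accumulate=False)
```

With the repeated index, accumulate mode sums both contributions at position 2, while overwrite mode keeps only the last one, which is the behavior the commit message describes as the bug.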
-
- 25 May, 2019 1 commit
Committed by hutuxian
* gather_op support int64_t index by adding a template typename
* add UT and rename typename test=develop
-
- 30 Jan, 2019 1 commit
Committed by Yiqun Liu
* Refine beam_search_op to output an extra parent_idx tensor. test=develop
* Fix the unittest test_beam_search_op. test=develop
* Fix the merging mistake. test=develop
-
- 19 Apr, 2018 1 commit
Committed by Abhinav Arora
-
- 12 Feb, 2018 1 commit
Committed by qingqing01
-
- 10 Feb, 2018 2 commits
- 12 Dec, 2017 1 commit
Committed by QI JUN
The main fixes are:
* take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
* remove `eigen_device` interface in base class `DeviceContext`
* remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
* remove unused `platform::EigenDeviceConverter`
* rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
* rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 04 Oct, 2017 1 commit
Committed by zchen0211
-
- 03 Oct, 2017 1 commit
Committed by zchen0211
-
- 29 Sep, 2017 2 commits
- 28 Sep, 2017 1 commit
Committed by Yu Yang
-
- 22 Aug, 2017 1 commit
Committed by zchen0211
-
- 15 Aug, 2017 1 commit
Committed by zchen0211
-
- 07 Aug, 2017 1 commit
Committed by dongzhihong
-
- 05 Aug, 2017 1 commit
Committed by Yi Wang
-
- 02 Aug, 2017 2 commits
- 01 Aug, 2017 1 commit
Committed by Qiao Longfei
* use operator context
* optimize code
* update net infershape
* update InferShape
* disable override InferShape(scope) in OperatorBase
* change InferShapeImpl to InferShape
* add template to OperatorContext Input/Output
* merge Input InputVar, Output OutputVar
* change Inputs to MultiInput
* fix conflict
* fix MultiInput bugs and add unit test
* rename KernelContext to ExecutionContext
* clean code
* change InferShape to protected
* fix template bug
* refine code
* use InputVar instead of Input<Variable>
* typo
* optimize code
-
- 25 Jul, 2017 1 commit
Committed by Yu Yang
Make implementing an operator less noisy.
-
- 19 Jul, 2017 4 commits
Committed by qiaolongfei
-
Committed by Qiao Longfei
* a simplest SGD op
-
Committed by qijun
-
Committed by Yi Wang
-
- 17 Jul, 2017 3 commits
Committed by qijun
-
Committed by Yan Chunwei
* add inputs
* add ut for multiple inputs
* fix AddToLayer
* op_desc -> op_proto
* CreateArgumentOffsetMap -> CreateInOutOffsetMap
* move CreateInOutOffsetMap from OperatorBase to op registry
* arg_idxs_ -> in_out_idxs_
-
Committed by qijun
-
- 14 Jul, 2017 1 commit
Committed by qijun
-
- 13 Jul, 2017 2 commits
Committed by Yu Yang
* Convert `op` --> `operators`
* Remove AddType in OpProtoMaker, because type is part of registry.
* Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.
-
Committed by Yu Yang
* Refine register methods so that Op can get rid of whole-archive
* `USE_OP` before an op is used.
* Add unittest for add_op.
-