Commits · 9f74b84eea01c9286640a8be79190a628abd9eed · Crayon鑫 / Paddle

02 Mar, 2022 1 commit
- Move gather.h/gather.cu.h/scatter.h/scatter.cu.h to the phi library (#40043) · 09258040
  Committed by sneaxiy on Mar 02, 2022
```
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library

* fix CI

* fix rocm ci
```
01 Mar, 2022 1 commit

[bf16] add bf16 kernel: scale gather sum (#39683) · 6d26b332

Committed by zhangbo9674 on Mar 01, 2022

* add scale gather sum

* refine CUDA_ATOMIC_WRAPPER ADD for bf16

* add gather unittest

* solve conflict

* add scale uinttest

* add sum unittest

* solve conflict

* refine gather unittest

* refine unittest

15 Feb, 2022 1 commit

[PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404

Committed by Aurelius84 on Feb 15, 2022

* #1 migrate dist-related type()-> dtype()

* move datatype function from pten -> fluid/framework

* change type() in imperative into convert(dtype())

* modify xx_tensor->type into xx_tensor->dtype

* change the set_type interface and the caller

* modify xx_tensor.type into xx_tensor.dtype

* fix mutable_data(place, dtype())

* change caller of mutable_data in pten and distributed

* change the caller of mutable_data in fluid/framework

* change the caller of mutable_data in imperative directory

* mutable_data: inference

* update the call of mutable_data

* transfer MakePenScalarArray MakePtenScalar ResetHolderWithType

* pass the compile. the next step is remove VarType in Pten

* fix all and remove VarType from pten. success in linux. Next task is other platform

* fix conflict with develop

* fix compiled error

* Fix reset conversion

* fix conflict

* fix compiled problem

* fix typo

* Fix << in tensor_utils.cc

* fix type->dtype

* fix unittest

* fix tensor init constructor

* fix DataTypeSize for BFloat16

* fix code style

* fix npu compiled error

* fix npu

* compile npu sucessfully

* fix conflict

* fix conflict
Co-authored-by: Nxiongkun <xiongkun03@baidu.com>

11 Jun, 2021 1 commit
- Fix gather infer shape using axis (#33413) · abc17ef7
  Committed by ShenLiang on Jun 11, 2021
```
* fix gather shape bug

* fix None

* fix topo
```
23 Aug, 2020 1 commit
- add paddle.gather for API2.0 (#26455) · ebf9b212
  Committed by wangchaochaohu on Aug 23, 2020
14 May, 2020 1 commit
- fix error message, test=develop (#24425) · 53e3c534
  Committed by ShenLiang on May 14, 2020
12 Jun, 2019 1 commit

Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3

Committed by wawltor on Jun 12, 2019

* test=develop
The scatter op has a calc bug when the indices has same index, the scatter op use overwrite mode to calculate the same index, fix this bug by using the accumulate mode to calculate the same index.At the same time, the gather op has the same bug when the op calc the grad. And we use the lib of open-blas and eigen to optimize the time cost in accumulate mode.

* test=develop
Fix some code format problem, and the same time add the test case in gather and scatter op
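
The duplicate-index behaviour described in this commit message can be illustrated with a small sketch. It is plain Python rather than the C++/CUDA kernels in the repository, and the names `scatter` and `gather_grad` and the `overwrite` flag are hypothetical helpers for illustration, not Paddle's actual functions. It also assumes that accumulate mode clears each targeted slot before summing, which the commit message does not state.

```python
def scatter(dst, index, updates, overwrite=True):
    """Write updates[i] into position index[i] of a copy of dst.

    overwrite=True : the last update for a repeated index wins.
    overwrite=False: updates for a repeated index are summed
                     (assumption: targeted slots are zeroed first).
    """
    out = list(dst)
    if overwrite:
        for i, idx in enumerate(index):
            out[idx] = updates[i]
    else:
        # Clear every slot that will receive at least one update.
        for idx in set(index):
            out[idx] = 0.0
        # Accumulate all updates, including those sharing an index.
        for i, idx in enumerate(index):
            out[idx] += updates[i]
    return out


def gather_grad(grad_out, index, input_size):
    """Gradient of gather w.r.t. its input.

    Forward gather reads input[index[i]] for each i, so a repeated
    index must receive the sum of the matching output gradients.
    """
    grad_in = [0.0] * input_size
    for i, idx in enumerate(index):
        grad_in[idx] += grad_out[i]
    return grad_in


dst = [1.0, 1.0, 1.0]
index = [0, 2, 0]            # index 0 appears twice
updates = [10.0, 20.0, 30.0]

overwritten = scatter(dst, index, updates, overwrite=True)
accumulated = scatter(dst, index, updates, overwrite=False)
grad = gather_grad([1.0, 2.0, 3.0], index, input_size=3)
```

With the repeated index 0, overwrite mode keeps only the last update for that slot, while accumulate mode keeps the sum; the gather gradient needs the same summation for the same reason.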

25 May, 2019 1 commit

Gather Op Index Support int64_t datatype (#17610) · 1670db5e

Committed by hutuxian on May 25, 2019

* gather_op support int64_t index by adding a template typename

* add UT and rename typename

test=develop

30 Jan, 2019 2 commits

Some improvements to support bert mixed precision training (#15585) · 170842cb
Committed by Yibing Liu on Jan 30, 2019
```
* Some improvements to support bert mixed precision training

test=develop

* Revert the cast in layer_norm

test=develop
```

Return parent_idx in beam_search op (#15520) · 16d54f7f

Committed by Yiqun Liu on Jan 30, 2019

* Refine beam_search_op to output an extra parent_idx tensor.
test=develop

* Fix the unittest test_beam_search_op.
test=develop

* Fix the merging mistake.
test=develop

31 Oct, 2018 1 commit
- add int and int64 dtype for gather_op (#14175) · b73708d2
  Committed by chengduo on Oct 31, 2018
```
test=develop
```
19 Apr, 2018 1 commit
- Fix CPPLint issues in expand_op, gather_op and get_places_op (#10000) · 9ca578d4
  Committed by Abhinav Arora on Apr 18, 2018
12 Feb, 2018 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
26 Dec, 2017 1 commit
- unify the indentation of license · 761b3297
  Committed by Luo Tao on Dec 26, 2017
12 Dec, 2017 1 commit

Refine device context (#6433) · 61ec0b95

Committed by QI JUN on Dec 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

04 Oct, 2017 1 commit
- gather scatter fix according to google style · 2d876b86
  Committed by zchen0211 on Oct 03, 2017
03 Oct, 2017 1 commit
- gather scatter with cuda streams · 84b8baf1
  Committed by zchen0211 on Oct 02, 2017
29 Sep, 2017 3 commits
- 1 api · 78808b20
  Committed by zchen0211 on Sep 28, 2017
- merge new op grammar · b851515b
  Committed by zchen0211 on Sep 28, 2017
- scatter gather gpu · 88a8eedd
  Committed by zchen0211 on Sep 28, 2017
```
gather scatter gpu
```
04 Sep, 2017 1 commit
- remove scatter_op.cu/gather_op.cu as they support only_cpu now · 740c8ba1
  Committed by Luo Tao on Sep 04, 2017
16 Aug, 2017 1 commit
- gather op added with python unittest · 323d4233
  Committed by zchen0211 on Aug 15, 2017
10 Aug, 2017 3 commits
- set gemm support continuous memory now · de967fce
  Committed by qijun on Aug 10, 2017
- disable gpu implementation temporarily · 8de4e3bd
  Committed by qijun on Aug 10, 2017
- "on hold" · 2ddb1122
  Committed by dongzhihong on Aug 10, 2017
09 Aug, 2017 1 commit
- fix gpu build error · 7307b439
  Committed by qijun on Aug 09, 2017
08 Aug, 2017 1 commit
- "fix clang format" · 22f03c39
  Committed by dongzhihong on Aug 08, 2017
07 Aug, 2017 2 commits
- "fix cuda error" · 79e76ea1
  Committed by dongzhihong on Aug 07, 2017
- "remove alias to more operators" · 6b23b91c
  Committed by dongzhihong on Aug 07, 2017
04 Aug, 2017 1 commit
- Add cpplint for *.h and cuda *.cu · b58725bd
  Committed by liaogang on Aug 04, 2017
03 Aug, 2017 1 commit
- add gemm for both cpu and gpu · 22dac40c
  Committed by qijun on Aug 03, 2017
31 Jul, 2017 1 commit
- add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  Committed by qijun on Jul 31, 2017
25 Jul, 2017 1 commit
- Add type_alias to import framework into ops · efc119b4
  Committed by Yu Yang on Jul 25, 2017
```
Make implement an operator less noisy.
```
19 Jul, 2017 1 commit
- replace Tensor::tensor to EigenTensor::From · 736d078c
  Committed by qijun on Jul 19, 2017
18 Jul, 2017 1 commit
- implement some basic OpKernel · b6c07552
  Committed by qijun on Jul 18, 2017
17 Jul, 2017 3 commits
- Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops · 1ed237c1
  Committed by Yu Yang on Jul 17, 2017
```
* Implement InferShape and register them, give a stub Kernel method
  by LOG(INFO)
```
- Refine CMake dependencies graph · 38310f93
  Committed by Yu Yang on Jul 17, 2017
- Add enforce switch for convient develop (#2850) · cdec5634
  Committed by Yan Chunwei on Jul 17, 2017
```
* add NDEBUG switch to PADDLE_ENFORCE
```

Crayon鑫 / Paddle: in sync with the fork source project