提交 · ea9684f10617f21dd8dabc3a21402b5255f89353 · 机器未来 / Paddle

26 3月, 2022 1 次提交
- Y
  
  Optmize the CPU -> GPU memcpy and avoid explit sync in some operators. (#40933) · ea9684f1
  由 Yiqun Liu 提交于 3月 26, 2022
  
  ea9684f1
25 3月, 2022 1 次提交
- F
  add maximum limit for grid of reduce, elementwise, gather and scatter (#40813) · 608a5f55
  由 FlyingQianMM 提交于 3月 25, 2022
```
* add maximum limit for grid of reduce, elementwise and gather

* add {} after if
```
  608a5f55
02 3月, 2022 1 次提交
- S
  Move gather.h/gather.cu.h/scatter.h/scatter.cu.h to the phi library (#40043) · 09258040
  由 sneaxiy 提交于 3月 02, 2022
```
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library

* fix CI

* fix rocm ci
```
  09258040
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

19 2月, 2022 1 次提交

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

由 Aurelius84 提交于 2月 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile sucessfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix tesst file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

2fe04264

11 2月, 2022 1 次提交
- F
  [Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
  由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
  d25a7f9e
17 1月, 2022 1 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

03 12月, 2021 1 次提交
- R
  refine structure for cuda and rocm (#37202) · a6d2fddb
  由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
  a6d2fddb
10 9月, 2021 1 次提交
- Z
  Fix scatter and gather bug (#35595) · 6f7aca9e
  由 Zeng Jinle 提交于 9月 10, 2021
```
* fix scatter gather bug:

* fix windows ci
```
  6f7aca9e
08 9月, 2021 1 次提交
- Z
  Fix scatter_nd_add and gather bug (#35544) · 3c457a38
  由 Zeng Jinle 提交于 9月 08, 2021
```
* fix scatter_add_nd and gather bug

* fix gather compile error
```
  3c457a38
16 7月, 2021 1 次提交
- H
  
  Fix gather_op to avoid cudaErrorLaunchFailure for solov2, test=develop (#34200) · 380bc4e6
  由 Haohongxiang 提交于 7月 15, 2021
  
  380bc4e6
13 7月, 2021 1 次提交
- H
  Fix gather_op by adding OurOfRangeCheck for param[Index], test=develop (#34096) · 64b9065d
  由 Haohongxiang 提交于 7月 13, 2021
```
* Fix gather_op by adding OurOfRangeCheck for param[Index]

* Code Optimization
```
  64b9065d
11 6月, 2021 1 次提交
- S
  Fix gather infer shape using axis (#33413) · abc17ef7
  由 ShenLiang 提交于 6月 11, 2021
```
* fix gather shape bug

* fix None

* fix topo
```
  abc17ef7
10 12月, 2020 1 次提交
- S
  
  fix error message of gather nd (#29521) · d8391a19
  由 ShenLiang 提交于 12月 10, 2020
  
  d8391a19
09 11月, 2020 1 次提交
- W
  
  refine the performance of gather Op (#28458) · e14ed71c
  由 wangchaochaohu 提交于 11月 09, 2020
  
  e14ed71c
23 8月, 2020 1 次提交
- W
  
  add paddle.gather for API2.0 (#26455) · ebf9b212
  由 wangchaochaohu 提交于 8月 23, 2020
  
  ebf9b212
11 7月, 2020 1 次提交

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

由 Chen Weihang 提交于 7月 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

0b54d54f

14 5月, 2020 1 次提交
- S
  
  fix error message, test=develop (#24425) · 53e3c534
  由 ShenLiang 提交于 5月 14, 2020
  
  53e3c534
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

30 8月, 2019 1 次提交
- S
  add gather_nd op and unit test (#19366) · 85914f7a
  由 ShenLiang 提交于 8月 30, 2019
```
* fixed the code for coverage

* fixed the document,test=document_preview test=develop
```
  85914f7a
14 8月, 2019 1 次提交
- C
  Fix gather op bug (#19168) · b5ba801e
  由 chengduo 提交于 8月 14, 2019
```
* fix gather op bug
test=develop
```
  b5ba801e
25 5月, 2019 1 次提交

Gather Op Index Support int64_t datatype (#17610) · 1670db5e

由 hutuxian 提交于 5月 25, 2019

* gather_op support int64_t index by adding a template typename

* add UT and rename typename

test=develop

1670db5e

26 3月, 2019 1 次提交
- X
  update DeepCF model · 1f89249a
  由 Xin Pan 提交于 3月 26, 2019
```
test=develop
```
  1f89249a
12 11月, 2018 1 次提交

Fix gather & stack op (#14355) · bd294378

由 Yibing Liu 提交于 11月 12, 2018

* Add int type support for stack_op

* Improve gather op to support index with shape N x 1

test=develop

* Fix stack_op kernel's registry

test=develop

bd294378

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

04 10月, 2017 1 次提交
- Z
  
  gather scatter fix according to google style · 2d876b86
  由 zchen0211 提交于 10月 03, 2017
  
  2d876b86
03 10月, 2017 1 次提交
- Z
  
  gather scatter with cuda streams · 84b8baf1
  由 zchen0211 提交于 10月 02, 2017
  
  84b8baf1
29 9月, 2017 2 次提交
- Z
  
  1 api · 78808b20
  由 zchen0211 提交于 9月 28, 2017
  
  78808b20
- Z
  scatter gather gpu · 88a8eedd
  由 zchen0211 提交于 9月 28, 2017
```
gather scatter gpu
```
  88a8eedd

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致