Commits · 9adca1e73e13049d82f667571e3f236372c1ef31 · BaiXuePrincess / Paddle

16 Nov, 2022 · 1 commit
- move "gpu_primitives.h" to phi (#48015) · 9adca1e7
  Committed by Wang Xin on Nov 16, 2022
10 Nov, 2022 · 1 commit
- [PHI decoupling] remove "paddle/fluid/platform/device/gpu/gpu_launch_config.h" in phi (#47808) · 40a9b488
  Committed by PuQing on Nov 10, 2022
```
* rm fluid gpu_launch_config

* fix type
```
05 Jun, 2022 · 1 commit
- 【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  Committed by Sing_chan on Jun 05, 2022
12 Apr, 2022 · 1 commit

add a inner loop for index_select_grad_init() in index_select op when dealing with large-shape data (#41563) · bc01242b

Committed by FlyingQianMM on Apr 12, 2022

* replace for with CUDA_KERNEL_LOOP for index_select_grad_init() in index_select op

* use CUDA_KERNEL_LOOP_TYPE

* fix code style

* replace index_select_grad_init with SetConstant

26 Mar, 2022 · 1 commit
- Optimize the CPU -> GPU memcpy and avoid explicit sync in some operators. (#40933) · ea9684f1
  Committed by Yiqun Liu on Mar 26, 2022
25 Mar, 2022 · 1 commit
- add maximum limit for grid of reduce, elementwise, gather and scatter (#40813) · 608a5f55
  Committed by FlyingQianMM on Mar 25, 2022
```
* add maximum limit for grid of reduce, elementwise and gather

* add {} after if
```
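To illustrate the grid limit named in this commit, here is a minimal Python simulation (not Paddle code). It assumes the common pattern of clamping the grid size and letting each thread walk a grid-stride loop so that no element is skipped. `launch_config`, `covered_indices`, and the block and grid values are hypothetical names and numbers chosen for the example.

```python
def launch_config(n, block=512, max_grid=65535):
    """Return (grid, block) with the grid clamped to a maximum size."""
    needed = (n + block - 1) // block  # blocks needed for one element per thread
    return min(needed, max_grid), block


def covered_indices(n, grid, block):
    """Simulate a grid-stride loop and list every index that gets visited."""
    stride = grid * block
    visited = []
    for tid in range(stride):  # one pass per simulated thread
        visited.extend(range(tid, n, stride))
    return visited


# 10,000 elements would need 157 blocks of 64 threads; the grid is clamped to 8.
grid, block = launch_config(n=10_000, block=64, max_grid=8)
visited = covered_indices(10_000, grid, block)
```

With the clamped grid each simulated thread handles several elements, and every index is still visited exactly once.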
02 Mar, 2022 · 1 commit
- Move gather.h/gather.cu.h/scatter.h/scatter.cu.h to the phi library (#40043) · 09258040
  Committed by sneaxiy on Mar 02, 2022
```
* move gather.h gather.cu.h scatter.h scatter.cu.h to phi library

* fix CI

* fix rocm ci
```
20 Feb, 2022 · 1 commit

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

Committed by Chen Weihang on Feb 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

19 Feb, 2022 · 1 commit

[Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264

Committed by Aurelius84 on Feb 19, 2022

* Unify paddle/pten::framework::ddim into pten::ddim

* fix paddle namespace

* compile successfully

* fix npu src file

* fix conflict

* fix conflict

* fix tensorrt compiler error

* fix conflict

* fix conflict

* fix test file conflict

* fix conflict

* fix mlu file conflict

* fix mlu file conflict

* fix cinn header file conflict

* fix conflict

* fix conflict

* fix conflict

* fix conflict

11 Feb, 2022 · 1 commit
- [Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
  Committed by Feiyu Chan on Feb 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
06 Feb, 2022 · 1 commit
- [PTEN] Add Gpu context (#39305) · a821c4a9
  Committed by Wilber on Feb 06, 2022
17 Jan, 2022 · 1 commit

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

Committed by Wilber on Jan 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

03 Dec, 2021 · 1 commit
- refine structure for cuda and rocm (#37202) · a6d2fddb
  Committed by ronnywang on Dec 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
10 Sep, 2021 · 1 commit
- Fix scatter and gather bug (#35595) · 6f7aca9e
  Committed by Zeng Jinle on Sep 10, 2021
```
* fix scatter gather bug:

* fix windows ci
```
01 Jul, 2021 · 1 commit
- fix safe bug of scatter/scatter_nd (#33858) · c522530a
  Committed by ShenLiang on Jul 01, 2021
```
* fix safe bug of scatter/scatter_nd
```
22 Jan, 2021 · 1 commit
- Fix scatter grad bug (#30604) · 9514b4aa
  Committed by ShenLiang on Jan 22, 2021
11 Jul, 2020 · 1 commit

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

Committed by Chen Weihang on Jul 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

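The kind of overflow this commit title refers to can be pictured with a small Python simulation (a sketch, not the Paddle macro). It assumes a grid-stride loop whose index is a 32-bit int. `wrap_int32` imitates two's-complement wrap-around, which is what is typically observed in practice even though signed overflow is undefined behavior in C++. The stride and element count are made-up values.

```python
INT32_MAX = 2**31 - 1


def wrap_int32(value):
    """Reduce a Python int to the value a 32-bit two's-complement int would hold."""
    return (value + 2**31) % 2**32 - 2**31


def next_index_int32(i, stride):
    """Loop increment done in 32-bit arithmetic (simulated wrap-around)."""
    return wrap_int32(i + stride)


def next_index_int64(i, stride):
    """Loop increment done in a wider type; Python ints never overflow."""
    return i + stride


n = INT32_MAX            # element count close to the int32 limit
stride = 1024 * 65535    # hypothetical blockDim.x * gridDim.x
i = n - 1                # last valid index handled by some thread

bad = next_index_int32(i, stride)    # wraps to a negative number
good = next_index_int64(i, stride)   # exceeds n, so the loop would stop
```

Because the wrapped index is negative, the loop condition `i < n` would still hold and the kernel would go on to use an invalid index. Computing the index in a 64-bit type avoids this.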

13 May, 2020 · 1 commit
- fix error message, test=develop (#24447) · 05c3bc3b
  Committed by ForFishes on May 13, 2020
```
fix scatter and scatter_nd op error message
```
11 May, 2020 · 1 commit

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

Committed by Chen Weihang on May 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

11 Sep, 2019 · 1 commit

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

Committed by Huihuang Zheng on Sep 11, 2019

TemporaryAllocator is a singleton used for allocating memory for cuDNN. Since it is a singleton, we can delete it for better memory performance.

We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream, so no singleton is needed.

Also added data_feed_proto to operator to fix CI in CPU compilation.

04 Sep, 2019 · 2 commits

refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607) · 0a46d345
Committed by Tao Luo on Sep 04, 2019
```
test=develop
```

add scatter_nd op and scatter_nd_add op (#19571) · 2cd3fa3e

Committed by ShenLiang on Sep 04, 2019

* add scatter_nd op, test=document_preview test=develop

* fixed the document, test=document_preview test=develop

* modify the notes, test=document_preview test=develop

* remove the ShareDataWith, test=develop

30 Aug, 2019 · 1 commit
- add gather_nd op and unit test (#19366) · 85914f7a
  Committed by ShenLiang on Aug 30, 2019
```
* fixed the code for coverage

* fixed the document,test=document_preview test=develop
```
12 Jun, 2019 · 1 commit

Fix scatter and gather op when has duplicate index (#17952) · 8eb134c3

Committed by wawltor on Jun 12, 2019

* test=develop
The scatter op had a calculation bug when the indices contained duplicates: it used overwrite mode for repeated indices. This is fixed by using accumulate mode for repeated indices. The gather op had the same bug when computing its gradient. The OpenBLAS and Eigen libraries are used to reduce the time cost of accumulate mode.

* test=develop
Fix some code format problems and add test cases for the gather and scatter ops.

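The difference between the two modes named in this commit message can be sketched in plain Python (an illustration only, not the Paddle kernels). The function names are hypothetical, and zeroing the target slots before accumulating is an assumption about how the accumulate mode behaves.

```python
def scatter_overwrite(dst, index, updates):
    """Overwrite mode: for a repeated index, the last update wins."""
    out = list(dst)
    for i, u in zip(index, updates):
        out[i] = u
    return out


def scatter_accumulate(dst, index, updates):
    """Accumulate mode: updates that share an index are summed."""
    out = list(dst)
    # Assumption: target slots are zeroed first, so only the updates contribute.
    for i in set(index):
        out[i] = 0.0
    for i, u in zip(index, updates):
        out[i] += u
    return out


def gather_grad(num_rows, index, grad_out):
    """Gradient of gather: rows picked more than once receive summed gradients."""
    grad_in = [0.0] * num_rows
    for i, g in zip(index, grad_out):
        grad_in[i] += g
    return grad_in


dst = [1.0, 1.0, 1.0]
index = [1, 1, 2]               # index 1 appears twice
updates = [10.0, 20.0, 30.0]

overwritten = scatter_overwrite(dst, index, updates)
accumulated = scatter_accumulate(dst, index, updates)
grad = gather_grad(3, index, updates)
```

In overwrite mode the first update for index 1 is lost, while accumulate mode keeps the contribution of every update. The same summation is what a gather gradient needs when a row is selected more than once.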

25 May, 2019 · 1 commit

Gather Op Index Support int64_t datatype (#17610) · 1670db5e

Committed by hutuxian on May 25, 2019

* gather_op support int64_t index by adding a template typename

* add UT and rename typename

test=develop

12 Nov, 2018 · 1 commit

Fix gather & stack op (#14355) · bd294378

Committed by Yibing Liu on Nov 12, 2018

* Add int type support for stack_op

* Improve gather op to support index with shape N x 1

test=develop

* Fix stack_op kernel's registry

test=develop

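A minimal Python sketch of what accepting an index of shape N x 1 could look like: the index is flattened before rows are gathered. The helper names are hypothetical, and this is not the Paddle implementation.

```python
def flatten_index(index):
    """Accept an index of shape [N] or [N, 1] and return a flat list of N ints."""
    flat = []
    for item in index:
        if isinstance(item, (list, tuple)):
            if len(item) != 1:
                raise ValueError("index must have shape [N] or [N, 1]")
            flat.append(item[0])
        else:
            flat.append(item)
    return flat


def gather(src, index):
    """Pick rows of src (along axis 0) in the order given by index."""
    return [src[i] for i in flatten_index(index)]


src = [[1, 2], [3, 4], [5, 6]]
rows_flat = gather(src, [2, 0])        # index with shape [2]
rows_col = gather(src, [[2], [0]])     # index with shape [2, 1]
```

Both index layouts select the same rows, so callers do not need to reshape the index themselves.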

12 Feb, 2018 · 1 commit
- Fix the grammar in copyright. (#8403) · 24509f4a
  Committed by qingqing01 on Feb 12, 2018
10 Feb, 2018 · 2 commits
- Correct #include path · fc374821
  Committed by Yi Wang on Feb 09, 2018
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
26 Dec, 2017 · 1 commit
- unify the indentation of license · 761b3297
  Committed by Luo Tao on Dec 26, 2017
04 Oct, 2017 · 1 commit
- gather scatter fix according to google style · 2d876b86
  Committed by zchen0211 on Oct 03, 2017
03 Oct, 2017 · 1 commit
- gather scatter with cuda streams · 84b8baf1
  Committed by zchen0211 on Oct 02, 2017
29 Sep, 2017 · 2 commits
- 1 api · 78808b20
  Committed by zchen0211 on Sep 28, 2017
- scatter gather gpu · 88a8eedd
  Committed by zchen0211 on Sep 28, 2017
```
gather scatter gpu
```

BaiXuePrincess / Paddle · in sync with the fork's source project