提交 · 39de9b8ab638ac2902b082ece3b58d215eb4f7d9 · BaiXuePrincess / Paddle

12 3月, 2022 1 次提交
- Z
  [PHI] Move forward kernel of roi_align into phi (#40382) · 39de9b8a
  由 zyfncg 提交于 3月 12, 2022
```
* move roi_align kernel to phi

* fix bug of roi_align xpu
```
  39de9b8a
20 2月, 2022 1 次提交

[PTen->Phi PR1] Change pten dirname and namespace to phi (#39748) · dcfe1986

由 Chen Weihang 提交于 2月 20, 2022

* rename pten dir to phi

* rename namespace to phi

* rename infrt pten dir to phi

* resolve conflict

* rename pten to phi in cmake

* revert all infrt change

* change needed files

* fix infrt failed

* fix inference failed

dcfe1986

11 2月, 2022 1 次提交
- F
  [Pten] move operators/math/math_function_* to pten/kernels/func (#39300) · d25a7f9e
  由 Feiyu Chan 提交于 2月 11, 2022
```
* move operators/math/math_function_* to pten/kernels/func
* namespace from `paddle::operators::math` to `pten::funcs`
```
  d25a7f9e
17 1月, 2022 1 次提交

[Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5

由 Wilber 提交于 1月 17, 2022

* add pten::Place data structure.

* update ci problem

* fix ci problem

* update

* using platform::Place=pten::Place

* remove BOOST_GET_CONST for CPUPlace and GPUPlace

* compile pass 25%.

* compile pass 45%

* compile pass 60%

* remove boost_get for xpu npu mlu and ipu

* compile pass on cpu and gpu.

* fix compile problem

* fix compile error.

* update

* fix ci problem

* update

* ci approve

* fix ci problem

* fix ci eager test problem

* remove BOOST_GET_CONST

* fix npu compile

c48a9ad5

16 12月, 2021 1 次提交

Faster implementation of CPU kernel for ROI Align operator (#37848) · 023ff4f5

由 Tomasz Socha 提交于 12月 16, 2021

* Faster implementation of CPU kernel for ROI_ALIGN Operator

* Add missing variable to CUDA roi_align_op

* Style

* Fix boundaries

* Rename variables for indexes calculation

* Remove unnecessary emplace

* Revert "Remove unnecessary emplace"

This reverts commit c10e87f7fb812f1a672fde32f2690a97d47e2f5a.

* Style

023ff4f5

03 12月, 2021 1 次提交
- R
  refine structure for cuda and rocm (#37202) · a6d2fddb
  由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
  a6d2fddb
08 9月, 2021 1 次提交

merge CMakeList.txt manual (#35378) · c4a3e8b4

由 feng_shuai 提交于 9月 08, 2021

* merge CMakeList.txt manual

* add platform for changethreadnum

* repair some bugs according to make error

* do nothing just flush CI

* forget change thread num

* add inplace_atol param for check_output_with_place

* Windows

* std:min and std::max should be change because of windows

c4a3e8b4

10 6月, 2021 1 次提交
- W
  
  fix aligned in roi_align (#33444) · e19736d7
  由 wangguanzhong 提交于 6月 10, 2021
  
  e19736d7
09 3月, 2021 1 次提交
- W
  
  fix roi_align, test=develop (#31479) · 50af0c2c
  由 wangguanzhong 提交于 3月 09, 2021
  
  50af0c2c
19 2月, 2021 1 次提交
- G
  add offset parameter in roi_align,generate_proposals.etc ops (#30864) · 5b267474
  由 Guanghua Yu 提交于 2月 19, 2021
```
* add  parameter in roi_align op
```
  5b267474
08 9月, 2020 1 次提交

Enhance ops to support LoD as input for dygraph detection models. (#25316) · a28ae86e

由 wangguanzhong 提交于 9月 08, 2020

* enhance collect_op for dygraph, test=develop

* enhance detection ops with lod, test=develop

* support none bbox left in generate_proposals, test=develop

* unfiy MultiLevelRoisNum, test=develop

* update core.ops, test=develop

* add op register for new input & output, test=develop

a28ae86e

11 7月, 2020 1 次提交

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

由 Chen Weihang 提交于 7月 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

0b54d54f

12 5月, 2020 1 次提交
- W
  
  optimize error message, test=develop (#24420) · cd327e66
  由 wangguanzhong 提交于 5月 12, 2020
  
  cd327e66
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

11 4月, 2020 1 次提交
- F
  modify some op for dyg rcnn (#23648) · 0a878be8
  由 FDInSky 提交于 4月 11, 2020
```
* test=develop modify some op for dyg rcnn
```
  0a878be8
10 4月, 2020 1 次提交
- W
  
  enhance the error message of roi_align, test=develop (#23649) · c2f5a3ad
  由 wangguanzhong 提交于 4月 10, 2020
  
  c2f5a3ad
10 10月, 2019 1 次提交
- W
  
  enhance input check for roi_align, test=develop (#20238) · 6fbf4410
  由 wangguanzhong 提交于 10月 10, 2019
  
  6fbf4410
11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

23 1月, 2019 1 次提交

Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) · 07dc5a15

由 qingqing01 提交于 1月 23, 2019

* Add generate_mask_labels_op to support Mask-RCNN.
* Refine sigmoid_cross_entropy to support nomalize mode.
* Fix generator_proposals_label.
* Use DeviceTemporaryAllocator in roi_pool and roi_algin.
* Remove shape check in data_feeder.

07dc5a15

18 10月, 2018 1 次提交
- J
  
  test=develop · 9a14ca91
  由 jerrywgz 提交于 10月 18, 2018
  
  9a14ca91
16 10月, 2018 2 次提交
- J
  
  roi_align for gpu · 8c79071d
  由 jerrywgz 提交于 10月 16, 2018
  
  8c79071d
- J
  
  roi_align for gpu · c9d2046f
  由 jerrywgz 提交于 10月 16, 2018
  
  c9d2046f
15 8月, 2018 1 次提交

Add flatten op interface and enhance APIs about detection to support... · 9333a627

由 Bai Yifan 提交于 8月 15, 2018

Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422)

* add flatten api&enhance detection api

* unify shape_op data type

* update API.spec

9333a627

01 6月, 2018 1 次提交

Add shape op to get the shape of variable. (#11048) · 28dc9ba3

由 whs 提交于 6月 01, 2018

* Add shape op to get the shape of variable.

* Rename get_shape to shape.

* Add checker for output and fix comments.

28dc9ba3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
22 12月, 2017 1 次提交
- Q
  add data layout (#6832) · 6b475981
  由 QI JUN 提交于 12月 22, 2017
```
* add data layout

* fix ci
```
  6b475981
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

25 10月, 2017 1 次提交

CPU Batch Norm Op (#4964) · ee998a9c

由 Qiao Longfei 提交于 10月 24, 2017

* init batch norm op

* prepare input output

* compute mean_out var_out save_mean save_var on CPU

* active is test

* use eigen to do computation

* complete batch norm forward

* set default momentum to 0.9

* add batch norm grad op in CPU

* add tensor_format and NHWC support, add python test

* add test training

* add batch norm gradient test

* improve comment, fix foward Python UnitTest

* add gradient test

* fix eigen warning

* follow name style

* fix a bug

* change float to T

* add simple forward test

* test with different place

* add backward test

* refine python test

* remove old python test code

* code clean

* follow code style

* update comment

ee998a9c

10 10月, 2017 1 次提交
- A
  
  Implementing the fill constant op for the executor · 6efacc14
  由 Abhinav Arora 提交于 10月 09, 2017
  
  6efacc14
28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
20 9月, 2017 1 次提交
- D
  
  Share LoD between input and output of each opeators. · b65709e4
  由 dangqingqing 提交于 9月 19, 2017
  
  b65709e4
23 8月, 2017 1 次提交
- D
  
  Remove set functor and add comapre_grad test · f188e22b
  由 dangqingqing 提交于 8月 23, 2017
  
  f188e22b
11 8月, 2017 1 次提交
- Y
  
  Fix python unit tests · c99f84ac
  由 Yu Yang 提交于 8月 11, 2017
  
  c99f84ac
08 8月, 2017 1 次提交
- F
  
  fix bug · 28476676
  由 fengjiayi 提交于 8月 07, 2017
  
  28476676
07 8月, 2017 1 次提交
- D
  
  "remove type alias done." · 72fb86a2
  由 dongzhihong 提交于 8月 07, 2017
  
  72fb86a2
05 8月, 2017 1 次提交
- Y
  
  Reformat paddle/operators/* strictly following Google Style Guide · 9620df44
  由 Yi Wang 提交于 8月 04, 2017
  
  9620df44
02 8月, 2017 1 次提交
- F
  
  Add unittest for `FillZerosLikeOp` · 8bd73159
  由 fengjiayi 提交于 8月 01, 2017
  
  8bd73159
01 8月, 2017 1 次提交
- Y
  
  Follow comments and merge develop · e2fd2bd0
  由 Yu Yang 提交于 8月 01, 2017
  
  e2fd2bd0

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致