提交 · 75f81233aeeef200cd600c262651d9c76479f180 · BaiXuePrincess / Paddle

08 9月, 2020 1 次提交

Enhance ops to support LoD as input for dygraph detection models. (#25316) · a28ae86e

由 wangguanzhong 提交于 9月 08, 2020

* enhance collect_op for dygraph, test=develop

* enhance detection ops with lod, test=develop

* support none bbox left in generate_proposals, test=develop

* unfiy MultiLevelRoisNum, test=develop

* update core.ops, test=develop

* add op register for new input & output, test=develop

a28ae86e

11 7月, 2020 1 次提交

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

由 Chen Weihang 提交于 7月 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

0b54d54f

12 5月, 2020 1 次提交
- W
  
  optimize error message, test=develop (#24420) · cd327e66
  由 wangguanzhong 提交于 5月 12, 2020
  
  cd327e66
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

11 4月, 2020 1 次提交
- F
  modify some op for dyg rcnn (#23648) · 0a878be8
  由 FDInSky 提交于 4月 11, 2020
```
* test=develop modify some op for dyg rcnn
```
  0a878be8
10 4月, 2020 1 次提交
- W
  
  enhance the error message of roi_align, test=develop (#23649) · c2f5a3ad
  由 wangguanzhong 提交于 4月 10, 2020
  
  c2f5a3ad
10 10月, 2019 1 次提交
- W
  
  enhance input check for roi_align, test=develop (#20238) · 6fbf4410
  由 wangguanzhong 提交于 10月 10, 2019
  
  6fbf4410
11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

23 1月, 2019 1 次提交

Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371) · 07dc5a15

由 qingqing01 提交于 1月 23, 2019

* Add generate_mask_labels_op to support Mask-RCNN.
* Refine sigmoid_cross_entropy to support nomalize mode.
* Fix generator_proposals_label.
* Use DeviceTemporaryAllocator in roi_pool and roi_algin.
* Remove shape check in data_feeder.

07dc5a15

18 10月, 2018 1 次提交
- J
  
  test=develop · 9a14ca91
  由 jerrywgz 提交于 10月 18, 2018
  
  9a14ca91
16 10月, 2018 2 次提交
- J
  
  roi_align for gpu · 8c79071d
  由 jerrywgz 提交于 10月 16, 2018
  
  8c79071d
- J
  
  roi_align for gpu · c9d2046f
  由 jerrywgz 提交于 10月 16, 2018
  
  c9d2046f
15 8月, 2018 1 次提交

Add flatten op interface and enhance APIs about detection to support... · 9333a627

由 Bai Yifan 提交于 8月 15, 2018

Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422)

* add flatten api&enhance detection api

* unify shape_op data type

* update API.spec

9333a627

01 6月, 2018 1 次提交

Add shape op to get the shape of variable. (#11048) · 28dc9ba3

由 whs 提交于 6月 01, 2018

* Add shape op to get the shape of variable.

* Rename get_shape to shape.

* Add checker for output and fix comments.

28dc9ba3

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
22 12月, 2017 1 次提交
- Q
  add data layout (#6832) · 6b475981
  由 QI JUN 提交于 12月 22, 2017
```
* add data layout

* fix ci
```
  6b475981
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

25 10月, 2017 1 次提交

CPU Batch Norm Op (#4964) · ee998a9c

由 Qiao Longfei 提交于 10月 24, 2017

* init batch norm op

* prepare input output

* compute mean_out var_out save_mean save_var on CPU

* active is test

* use eigen to do computation

* complete batch norm forward

* set default momentum to 0.9

* add batch norm grad op in CPU

* add tensor_format and NHWC support, add python test

* add test training

* add batch norm gradient test

* improve comment, fix foward Python UnitTest

* add gradient test

* fix eigen warning

* follow name style

* fix a bug

* change float to T

* add simple forward test

* test with different place

* add backward test

* refine python test

* remove old python test code

* code clean

* follow code style

* update comment

ee998a9c

10 10月, 2017 1 次提交
- A
  
  Implementing the fill constant op for the executor · 6efacc14
  由 Abhinav Arora 提交于 10月 09, 2017
  
  6efacc14
28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
20 9月, 2017 1 次提交
- D
  
  Share LoD between input and output of each opeators. · b65709e4
  由 dangqingqing 提交于 9月 19, 2017
  
  b65709e4
23 8月, 2017 1 次提交
- D
  
  Remove set functor and add comapre_grad test · f188e22b
  由 dangqingqing 提交于 8月 23, 2017
  
  f188e22b
11 8月, 2017 1 次提交
- Y
  
  Fix python unit tests · c99f84ac
  由 Yu Yang 提交于 8月 11, 2017
  
  c99f84ac
08 8月, 2017 1 次提交
- F
  
  fix bug · 28476676
  由 fengjiayi 提交于 8月 07, 2017
  
  28476676
07 8月, 2017 1 次提交
- D
  
  "remove type alias done." · 72fb86a2
  由 dongzhihong 提交于 8月 07, 2017
  
  72fb86a2
05 8月, 2017 1 次提交
- Y
  
  Reformat paddle/operators/* strictly following Google Style Guide · 9620df44
  由 Yi Wang 提交于 8月 04, 2017
  
  9620df44
02 8月, 2017 1 次提交
- F
  
  Add unittest for `FillZerosLikeOp` · 8bd73159
  由 fengjiayi 提交于 8月 01, 2017
  
  8bd73159
01 8月, 2017 1 次提交
- Y
  
  Follow comments and merge develop · e2fd2bd0
  由 Yu Yang 提交于 8月 01, 2017
  
  e2fd2bd0
26 7月, 2017 1 次提交
- F
  
  Add fill_zeros_like op · a2dc9614
  由 fengjiayi 提交于 7月 26, 2017
  
  a2dc9614
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4
19 7月, 2017 2 次提交
- Q
  
  add Flatten method to EigenVector · d9fa6159
  由 qijun 提交于 7月 19, 2017
  
  d9fa6159
- Y
  
  Update · 00ed5643
  由 Yi Wang 提交于 7月 18, 2017
  
  00ed5643
17 7月, 2017 3 次提交
- Q
  
  set correct place for output tensor · 2a03e380
  由 qijun 提交于 7月 17, 2017
  
  2a03e380
- Y
  Op varient inputs (#2901) · a0caf234
  由 Yan Chunwei 提交于 7月 17, 2017
```
* add inputs

* add ut for multiple inputs

* fix AddToLayer

* op_desc -> op_proto

* CreateArgumentOffsetMap -> CreateInOutOffsetMap

* move CreateInOutOffsetMap from OperatorBase to op registry

* arg_idxs_ -> in_out_idxs_
```
  a0caf234
- Q
  
  implement add_op kernel · d649dbf4
  由 qijun 提交于 7月 17, 2017
  
  d649dbf4
14 7月, 2017 1 次提交
- Q
  
  add_op kernel implementation · bac1426d
  由 qijun 提交于 7月 14, 2017
  
  bac1426d
13 7月, 2017 2 次提交

Follow comments · 79b70c2d

由 Yu Yang 提交于 7月 13, 2017

* Convert `op` --> `operators`
* Remove AddType in OpProtoMaker, because type is part of registry.
* Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.

79b70c2d

Add a sample op, `add_op` · a0aaafe9

由 Yu Yang 提交于 7月 13, 2017

* Refine register methods, make Op can get rid of whole-archieve
* `USE_OP` before a op is used.
* Add unittest for add_op.

a0aaafe9

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致