提交 · df0430dc2fbb2ea12590b7dc7aedb6ed4f887f90 · BaiXuePrincess / Paddle

17 12月, 2020 1 次提交

[cherry-pick]fix matmulv2 bug & add rebuild group & fix bug of download (#29726) · df0430dc

由 ShenLiang 提交于 12月 17, 2020

* Fix the dowanload bug in the case of multiple machines (#29551)

* fix the dowanload bug
* add sort for ips

* Fix bug of matmul_v2 for broadcast case (#29599)

* fix bug of matmul_v2 for broadcast

* Rebuild group automatically in dynamic graph distributed (#29255)

* add tensor_indices in AssignGroupBySize

* add rebuild group in reducer

* fix error message of gather nd (#29521)

df0430dc

09 11月, 2020 1 次提交
- W
  
  refine the performance of gather Op (#28458) · e14ed71c
  由 wangchaochaohu 提交于 11月 09, 2020
  
  e14ed71c
23 8月, 2020 1 次提交
- W
  
  add paddle.gather for API2.0 (#26455) · ebf9b212
  由 wangchaochaohu 提交于 8月 23, 2020
  
  ebf9b212
11 7月, 2020 1 次提交

Fix index overflow bug of the CUDA kernel loop increment (#25435) · 0b54d54f

由 Chen Weihang 提交于 7月 11, 2020

* fix softmax_with_cross_entropy cuda kernel overflow bug, test=develop

* replace old macro & for condition, test=develop

* polish details, test=develop

0b54d54f

14 5月, 2020 1 次提交
- S
  
  fix error message, test=develop (#24425) · 53e3c534
  由 ShenLiang 提交于 5月 14, 2020
  
  53e3c534
11 5月, 2020 1 次提交

Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f

由 Chen Weihang 提交于 5月 11, 2020

* add new macro BOOST_GET_SAFELY & unittests, test=develop

* add different macro type, test=develop

* fix get macro type in executor, test=develop

* four macro part change backup

* using one macro for all case, test=develop

* revert attribute change, test=develop

* change to three func to solve gcc4.8 bug, test=develop

* polish some details, test=develop

aa0f254f

11 9月, 2019 1 次提交

Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320

由 Huihuang Zheng 提交于 9月 11, 2019

TemporaryAllocator is a singleton used for allocating memory for Cudnn. Since it is a singleton, we can delete it for better performance in memory.

We replace TemporaryAllocator by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which uses stream callback to delete the memory allocated for the stream to avoid singleton.

Also added data_feed_proto to operator to fix CI in CPU compilation

12542320

30 8月, 2019 1 次提交
- S
  add gather_nd op and unit test (#19366) · 85914f7a
  由 ShenLiang 提交于 8月 30, 2019
```
* fixed the code for coverage

* fixed the document,test=document_preview test=develop
```
  85914f7a
14 8月, 2019 1 次提交
- C
  Fix gather op bug (#19168) · b5ba801e
  由 chengduo 提交于 8月 14, 2019
```
* fix gather op bug
test=develop
```
  b5ba801e
25 5月, 2019 1 次提交

Gather Op Index Support int64_t datatype (#17610) · 1670db5e

由 hutuxian 提交于 5月 25, 2019

* gather_op support int64_t index by adding a template typename

* add UT and rename typename

test=develop

1670db5e

26 3月, 2019 1 次提交
- X
  update DeepCF model · 1f89249a
  由 Xin Pan 提交于 3月 26, 2019
```
test=develop
```
  1f89249a
12 11月, 2018 1 次提交

Fix gather & stack op (#14355) · bd294378

由 Yibing Liu 提交于 11月 12, 2018

* Add int type support for stack_op

* Improve gather op to support index with shape N x 1

test=develop

* Fix stack_op kernel's registry

test=develop

bd294378

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

04 10月, 2017 1 次提交
- Z
  
  gather scatter fix according to google style · 2d876b86
  由 zchen0211 提交于 10月 03, 2017
  
  2d876b86
03 10月, 2017 1 次提交
- Z
  
  gather scatter with cuda streams · 84b8baf1
  由 zchen0211 提交于 10月 02, 2017
  
  84b8baf1
29 9月, 2017 2 次提交
- Z
  
  1 api · 78808b20
  由 zchen0211 提交于 9月 28, 2017
  
  78808b20
- Z
  scatter gather gpu · 88a8eedd
  由 zchen0211 提交于 9月 28, 2017
```
gather scatter gpu
```
  88a8eedd

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致