提交 · 6baeb2d1066b58be0f64d3f864b6e3aea0f5974d · PaddlePaddle / Paddle

17 10月, 2022 1 次提交

Support BF16 training for sharding (#46846) · 0b39b244

由 Ghost Screaming 提交于 10月 17, 2022

* Fix bug of reduce_sum op. When input.numel() > INT32_MAX, its result
is wrong.

* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* Support bfloat16 type for reducer and sharding.

* Fix some bug.

* Polish code.

* Polise code.

* Add bfloat16 datatype in fill_grad kernels.
Co-authored-by: Nsneaxiy <sneaxiy@126.com>

0b39b244

01 9月, 2022 1 次提交
- S
  Lazy initialize dense_contents_ in reducer (#45631) · 196b0187
  由 sneaxiy 提交于 9月 01, 2022
```
* make dense_contents_ lazy init

* update legacy dygraph

* fix legacy dygraph bug
```
  196b0187
03 8月, 2022 1 次提交
- R
  [CustomDevice] add custom ccl 2/2 (#44650) · 80ca78a2
  由 ronnywang 提交于 8月 03, 2022
```
* [CustomDevice] add custom ccl 2/2

* update

* update

* update launch
```
  80ca78a2
01 8月, 2022 1 次提交

unify gpu context (#44740) · 86763023

由 Leo Chen 提交于 8月 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

29 7月, 2022 1 次提交
- J
  
  Support backward final hook (#44686) · 8c43c0fe
  由 Jiabin Yang 提交于 7月 29, 2022
  
  8c43c0fe
02 7月, 2022 1 次提交

unify cpu context, part2 (#44012) · 755438a7

由 Leo Chen 提交于 7月 02, 2022

* fix init()

* delete test_device_context

* replace CPUDeviceContext with CPUContext

* fix test_scalar

* remove dot_op.cc

* fix compile

755438a7

26 6月, 2022 1 次提交
- S
  
  format all files in fluid using new config (#43776) · 576236a0
  由 Sing_chan 提交于 6月 26, 2022
  
  576236a0
07 6月, 2022 1 次提交
- H
  [Dygraph] Fix bugs of EagerReducer for complex control flows (#43252) · 2922985a
  由 Haohongxiang 提交于 6月 07, 2022
```
* fix bugs of reducer

* update

* update
```
  2922985a
05 6月, 2022 1 次提交
- S
  
  【code format check upgrade】 step2：clang-format (#42840) · a3730dc8
  由 Sing_chan 提交于 6月 05, 2022
  
  a3730dc8
11 5月, 2022 1 次提交
- H
  [Dygraph] Support diff batch for sparse of EagerReducer (#42646) · c5232b4b
  由 Haohongxiang 提交于 5月 11, 2022
```
* support diff batch for sparse of eagerreducer

* fix
```
  c5232b4b
29 4月, 2022 1 次提交
- J
  
  Using small vector for slot and merge edge into grad_slot_meta (#42350) · 2bee99df
  由 Jiabin Yang 提交于 4月 29, 2022
  
  2bee99df
14 4月, 2022 1 次提交
- C
  
  remove all is initialized using (#41766) · 4733fe60
  由 Chen Weihang 提交于 4月 14, 2022
  
  4733fe60
13 4月, 2022 2 次提交
- L
  
  Use densetensor instead of Tensor for ProcessGroup (#41403) · 1e56ca8a
  由 lilong12 提交于 4月 13, 2022
  
  1e56ca8a
- C
  [Phi&CustomOp] Remove deprecated enum PlaceType for custom op & add warning (#41647) · 78ef1071
  由 Chen Weihang 提交于 4月 13, 2022
```
* remove old custom op placetype

* replace dist  placetype using

* add with gpu macro

* fix mutable_data error

* fix set value error

* add comment
```
  78ef1071
04 4月, 2022 1 次提交
- H
  [Dygraph] Support sparse tensor in refactored reducer (#40836) · 1b031987
  由 Haohongxiang 提交于 4月 04, 2022
```
* [Dygraph] Support sparse tensor in refactored reducer

* add uts

* refactor

* update

* fix bugs
```
  1b031987
31 3月, 2022 1 次提交
- Z
  [Phi] Rename ScalarArray to IntArray (#40975) · e559fe41
  由 zyfncg 提交于 3月 31, 2022
```
* rename scalar_array to int_array

* update cmake

* fix conflict

* remove useless log
```
  e559fe41
22 3月, 2022 1 次提交
- Z
  [Phi] Replace Backend by Place in C++ API (#40732) · 5b7fadec
  由 zyfncg 提交于 3月 22, 2022
```
* replace Backend by Place in C++ API

* fix left code

* fix test_to_api bug
```
  5b7fadec
18 3月, 2022 1 次提交
- S
  [DataParallel]Support control flow in new DP (#40593) · 984eacb3
  由 ShenLiang 提交于 3月 18, 2022
```
* fix bug

* fix bug
```
  984eacb3
15 3月, 2022 1 次提交

[Dygraph] Refactoring of reducer in DataParallel (#40389) · 1a32391c

由 Haohongxiang 提交于 3月 15, 2022

* refactor reducer

* modify cmakelists

* solve conflicts

* rename group and update process_group

* fix bugs of ProcessGroupNCCL

* modify for CIs

* refactoring reducer

1a32391c

01 3月, 2022 1 次提交
- S
  [DP] Construct reducer group (#39987) · 4da841e0
  由 ShenLiang 提交于 3月 01, 2022
```
* add reducer
```
  4da841e0

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功