Commits · d618de2dcd5197e822848f4431270adff762537f · BaiXuePrincess / Paddle

25 Aug, 2021 1 commit

SGD BF16 functional test. (#34648) · d618de2d

Committed by Adam Osewski on Aug 25, 2021

* Enable BF16 for creating global tensor and reduce_mean.

* Functional test with small model.

d618de2d

21 Jun, 2021 1 commit

Add AXPY oneDNN handler (#33632) · 773aabc7

Committed by lidanqing on Jun 21, 2021

* Add oneDNN AXPY handler.

* Add fallback for small tensors.

* Fix ifdefs.

* Remove unnecessary namespace prefixes and add missing headers.

* Guard handler_axpy with proper ifdefs.

* Compilation of this function is possible only when Paddle is built with neither CUDA nor HIP.

* Move AXPY handler code to separate files.

* Use oneDNN AXPY handler in SGD op.

* Use axpy handler only when Paddle is built with oneDNN.

* Add test for SUM BF16 with big rows.

* Fix SFINAE rules for elementwise_add_to.

* Add test case for SGD with big rows.

* update

* update

Co-authored-by: Adam Osewski <adam.osewski@intel.com>

773aabc7

06 May, 2021 1 commit

Sum kernel for CPU supporting BF16 and SelectedRows (#32631) · 9599c3b3

Committed by Adam Osewski on May 06, 2021

9599c3b3

14 Apr, 2021 1 commit

adds new CPU kernel for SGD op supporting BF16 data type (#32162) · 3ac6c189

Committed by Adam Osewski on Apr 14, 2021

* Initial draft for SGD BF16 kernel.

* Unit tests for SGD with BF16 data type.

* Add VLOG message to SGD BF16 op CPU kernel.

* Enhance error messages and error types.

* Refactor SGD op kernels to leverage some common code.

* Make it easier to add new kernel invoke code.

* Fix SGD op kernel for sparse grad.

* Unify quotes style.

* Fix error for ROCM compilation.

* Use specialized PADDLE_ENFORCE_xx functions.

3ac6c189

BaiXuePrincess / Paddle is in sync with the fork's source project