[Do not merge] Sparse sgd without atomicAdd (!11229) · 合并请求 · PaddlePaddle / Paddle

[Do not merge] Sparse sgd without atomicAdd !11229

Created by: sidgoyal78

This PR has an alternate implementation of the SparseSGD kernel without cuda atomics. It is not the most efficient implementation. It partitions work along the input dimension, so that each thread can work parallely on a given dimension across the sparse table.

PaddlePaddle / Paddle 大约 2 年 前同步成功

[Do not merge] Sparse sgd without atomicAdd !11229

PaddlePaddle / Paddle
大约 2 年前同步成功