Add correct memory-allocation at DeepSpeed-Attention (#2474)
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Connor Holmes <connorholmes@microsoft.com>