• J
    correct sync behavior for XPU distributed training (#47882) · aafa9820
    james 提交于
    * correct sync behavior for XPU distributed training
    
    XPU support event mechanism similar to cuda event, so it is advisable to
    use an event to sync compute/comm streams for performance. However this
    mechanism is never fully tested, and inconsistent loss/ending_epochs are
    reported. Therefore, this PR replaces event sync with stream waiting as
    a temporary solution.
    
    * remove compile warning
    aafa9820
BKCLTools.h 3.4 KB